FS#53663 - [linux] frequent f2fs lockups
Attached to Project:
Arch Linux
Opened by Rulatir (Rulatir) - Thursday, 13 April 2017, 13:12 GMT
Last edited by Toolybird (Toolybird) - Saturday, 27 May 2023, 08:05 GMT
Details
Description:
Additional info:
* package version(s)
* config and/or log files etc.

Steps to reproduce:
Programs that use the f2fs partition a lot (including systemd-journal) get randomly stuck in Disk Sleep forever and remain unkillable until reboot.

Examples from dmesg:
[ 9340.673596] INFO: task systemd-journal:18653 blocked for more than 120 seconds.
[ 9340.673604] Tainted: P O 4.10.9-1-ARCH #1
[ 9340.673607] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 9340.673611] systemd-journal D 0 18653 1 0x00000104
[ 9340.673615] Call Trace:
[ 9340.673626] __schedule+0x22f/0x700
[ 9340.673631] ? resched_curr+0xc3/0xd0
[ 9340.673635] schedule+0x3d/0x90
[ 9340.673639] schedule_timeout+0x243/0x3d0
[ 9340.673642] ? ttwu_do_activate+0x6f/0x80
[ 9340.673646] ? try_to_wake_up+0x18d/0x3c0
[ 9340.673650] wait_for_common+0xbe/0x180
[ 9340.673653] ? wake_up_q+0x80/0x80
[ 9340.673657] wait_for_completion+0x1d/0x20
[ 9340.673670] f2fs_issue_flush+0x160/0x1b0 [f2fs]
[ 9340.673679] f2fs_do_sync_file+0x46f/0x730 [f2fs]
[ 9340.673688] f2fs_sync_file+0x11/0x20 [f2fs]
[ 9340.673692] vfs_fsync_range+0x4b/0xb0
[ 9340.673695] do_fsync+0x3d/0x70
[ 9340.673698] SyS_fsync+0x10/0x20
[ 9340.673701] do_syscall_64+0x54/0xc0
[ 9340.673704] entry_SYSCALL64_slow_path+0x25/0x25
[ 9340.673706] RIP: 0033:0x7f6b338b54cd
[ 9340.673708] RSP: 002b:00007f6b30d82e50 EFLAGS: 00000293 ORIG_RAX: 000000000000004a
[ 9340.673711] RAX: ffffffffffffffda RBX: 0000555d2e0d6060 RCX: 00007f6b338b54cd
[ 9340.673713] RDX: 00007f6b31584000 RSI: 0000000000000000 RDI: 0000000000000027
[ 9340.673715] RBP: 00007f6b33e26b40 R08: 00007f6b30d83700 R09: 00007f6b30d83700
[ 9340.673716] R10: 00007f6b30d839d0 R11: 0000000000000293 R12: 0000000000000002
[ 9340.673718] R13: 00007ffed627b30f R14: 0000555d2e0c6880 R15: 00007f6b30d83700
[ 9340.673771] INFO: task firefox:2654 blocked for more than 120 seconds.
[ 9340.673775] Tainted: P O 4.10.9-1-ARCH #1
[ 9340.673777] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 9340.673780] firefox D 0 2654 1012 0x00000004
[ 9340.673783] Call Trace:
[ 9340.673787] __schedule+0x22f/0x700
[ 9340.673792] ? native_smp_send_reschedule+0x27/0x40
[ 9340.673795] schedule+0x3d/0x90
[ 9340.673799] schedule_timeout+0x243/0x3d0
[ 9340.673802] ? ttwu_do_activate+0x6f/0x80
[ 9340.673806] ? try_to_wake_up+0x18d/0x3c0
[ 9340.673810] wait_for_common+0xbe/0x180
[ 9340.673813] ? wake_up_q+0x80/0x80
[ 9340.673817] wait_for_completion+0x1d/0x20
[ 9340.673826] f2fs_issue_flush+0x160/0x1b0 [f2fs]
[ 9340.673835] f2fs_do_sync_file+0x46f/0x730 [f2fs]
[ 9340.673844] f2fs_sync_file+0x11/0x20 [f2fs]
[ 9340.673848] vfs_fsync_range+0x4b/0xb0
[ 9340.673850] do_fsync+0x3d/0x70
[ 9340.673853] SyS_fsync+0x10/0x20
[ 9340.673856] entry_SYSCALL_64_fastpath+0x1a/0xa9
[ 9340.673858] RIP: 0033:0x7f41c1d0d4cd
[ 9340.673860] RSP: 002b:00007fff2e816a30 EFLAGS: 00000293 ORIG_RAX: 000000000000004a
[ 9340.673862] RAX: ffffffffffffffda RBX: 000000000000001c RCX: 00007f41c1d0d4cd
[ 9340.673864] RDX: 0000000000000000 RSI: 0000000000000001 RDI: 00000000000000b3
[ 9340.673866] RBP: 00007f41a8fc60c0 R08: 0000000000000000 R09: 00007f41c0d00300
[ 9340.673867] R10: 00007f41c0d00300 R11: 0000000000000293 R12: 00007fff2e8168b0
[ 9340.673869] R13: 0000000000000034 R14: 00007fff2e816930 R15: 00007f41b41f8ff4

In case it's relevant, I think this started when I started running two vagrant boxes simultaneously, but these are host processes that get stuck.
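For anyone trying to confirm the same symptom, a rough way to see which processes are currently in Disk Sleep (state D) is to read the state field from /proc/<pid>/stat. The sketch below is not from the original report. The helper names parse_stat_line and list_d_state are made up for illustration, and it assumes a Linux /proc layout. A process that shows up in D only briefly is normal; what matters here is the same PID staying in D across repeated runs.

```python
import os

def parse_stat_line(line):
    """Parse one /proc/<pid>/stat line into (pid, comm, state)."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split on the first '(' and the last ')'.
    open_idx = line.index("(")
    close_idx = line.rindex(")")
    pid = int(line[:open_idx].strip())
    comm = line[open_idx + 1:close_idx]
    state = line[close_idx + 1:].split()[0]
    return pid, comm, state

def list_d_state(proc_root="/proc"):
    """Return (pid, comm) for every process currently in state D."""
    stuck = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries
        try:
            with open(os.path.join(proc_root, entry, "stat")) as f:
                pid, comm, state = parse_stat_line(f.read())
        except (OSError, ValueError, IndexError):
            continue  # process exited while reading, or the line was unreadable
        if state == "D":
            stuck.append((pid, comm))
    return stuck

if __name__ == "__main__":
    for pid, comm in list_d_state():
        print(pid, comm)
```

Running it a few times a minute or so apart and comparing the output should show whether the same processes (for example systemd-journal or firefox, as in the traces above) stay stuck.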
Closed by Toolybird (Toolybird)
Saturday, 27 May 2023, 08:05 GMT
Reason for closing: No response
Additional comments about closing: Plus it's old and stale. If still an issue, please report upstream.
I don't experience any actual complete lock-ups, but some processes that hit this issue seem to become zombie processes and can't be gotten rid of without a reboot. In my case, a frequently recurring and noticeable victim of this seems to be dockerd, which just freezes up randomly during various operations (see also the forum thread [1]).
[1] https://bbs.archlinux.org/viewtopic.php?pid=1715404
I'm curious to see whether this will fix my other issues as well. That doesn't make this issue any less problematic, though: sooner or later lts will be bumped to a newer version that includes it, unless it is fixed before then.