linux

mirror of https://github.com/hardkernel/linux.git synced 2026-06-06 10:58:48 +09:00

Author	SHA1	Message	Date
Stefan Metzmacher	827f8fcb29	UPSTREAM: io_uring/net: fix fast_iov assignment in io_setup_async_msg() commit `3e4cb6ebbb` upstream. I hit a very bad problem during my tests of SENDMSG_ZC. BUG(); in first_iovec_segment() triggered very easily. The problem was io_setup_async_msg() in the partial retry case, which seems to happen more often with _ZC. iov_iter_iovec_advance() may change i->iov in order to have i->iov_offset being only relative to the first element. Which means kmsg->msg.msg_iter.iov is no longer the same as kmsg->fast_iov. But this would rewind the copy to be the start of async_msg->fast_iov, which means the internal state of sync_msg->msg.msg_iter is inconsistent. I tested with 5 vectors with length like this 4, 0, 64, 20, 8388608 and got short writes with: - ret=2675244 min_ret=8388692 => remaining 5713448 sr->done_io=2675244 - ret=-EAGAIN => io_uring_poll_arm - ret=4911225 min_ret=5713448 => remaining 802223 sr->done_io=7586469 - ret=-EAGAIN => io_uring_poll_arm - ret=802223 min_ret=802223 => res=8388692 While this was easily triggered with SENDMSG_ZC (queued for 6.1), it was a potential problem starting with `7ba89d2af1` in 5.18 for IORING_OP_RECVMSG. And also with `4c3c09439c` in 5.19 for IORING_OP_SENDMSG. However `257e84a537` introduced the critical code into io_setup_async_msg() in 5.11. 
Fixes: `7ba89d2af1` ("io_uring: ensure recv and recvmsg handle MSG_WAITALL correctly") Fixes: `257e84a537` ("io_uring: refactor sendmsg/recvmsg iov managing") Cc: stable@vger.kernel.org Change-Id: I72c459fdbae2938d176126ed2f17eea990c42d49 Signed-off-by: Stefan Metzmacher <metze@samba.org> Reviewed-by: Pavel Begunkov <asml.silence@gmail.com> Link: https://lore.kernel.org/r/b2e7be246e2fb173520862b0c7098e55767567a2.1664436949.git.metze@samba.org Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Bug: 268174392 (cherry picked from commit `fc2491562a`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 18:01:23 +00:00
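The partial-retry problem described in the io_setup_async_msg() fix above can be modelled in user space. The following is a minimal Python sketch, not kernel code: the `IovIter` class, its fields, and `is_consistent()` are hypothetical stand-ins for `struct iov_iter` and `fast_iov`. It reuses the five segment lengths and the first short-write size from the message; every other value is computed by the sketch itself. It illustrates why pointing the iterator back at the first segment while keeping the advanced offset leaves an inconsistent state.

```python
from dataclasses import dataclass


@dataclass
class IovIter:
    """Toy model of a vectored I/O iterator (hypothetical, not kernel code)."""
    lengths: list      # segment lengths, standing in for the iov_len fields
    first: int = 0     # index of the first unconsumed segment
    offset: int = 0    # byte offset relative to segment `first`

    def remaining(self) -> int:
        """Bytes still to transfer."""
        return sum(self.lengths[self.first:]) - self.offset

    def advance(self, nbytes: int) -> None:
        """Consume nbytes, skipping segments that are fully consumed."""
        if nbytes > self.remaining():
            raise ValueError("cannot advance past the end of the vectors")
        self.offset += nbytes
        while (self.first < len(self.lengths)
               and self.offset >= self.lengths[self.first]):
            self.offset -= self.lengths[self.first]
            self.first += 1

    def is_consistent(self) -> bool:
        """The offset must lie inside the segment that `first` points at."""
        if self.first == len(self.lengths):
            return self.offset == 0
        return self.offset <= self.lengths[self.first]


if __name__ == "__main__":
    it = IovIter([4, 0, 64, 20, 8388608])
    it.advance(2675244)                      # first short write
    print("after advance:", it.first, it.offset, it.remaining())

    # Modelled bug: the copy points back at segment 0 but keeps the offset.
    rewound = IovIter(it.lengths, first=0, offset=it.offset)
    print("rewound copy consistent?", rewound.is_consistent())
```

In this model the advanced iterator points at the last segment, so an offset that is valid there is far larger than the 4-byte first segment once the segment index is reset.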
Jens Axboe	403642c036	UPSTREAM: io_uring: io_kiocb_update_pos() should not touch file for non -1 offset commit `6f83ab22ad` upstream. -1 tells us to use the current position, but we check if the file is a stream regardless of that. Fix up io_kiocb_update_pos() to only dip into file if we need to. This is both more efficient and also drops 12 bytes of text on aarch64 and 64 bytes on x86-64. Fixes: `b4aec40015` ("io_uring: do not recalculate ppos unnecessarily") Change-Id: I5c22ce8122b0e1f0ad423a5b3aa520ee416feff1 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Bug: 268174392 (cherry picked from commit `89a77271d2`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:57:16 +00:00
Jens Axboe	0c50a117bf	UPSTREAM: io_uring/rw: defer fsnotify calls to task context commit `b000145e99` upstream. We can't call these off the kiocb completion as that might be off soft/hard irq context. Defer the calls to when we process the task_work for this request. That avoids valid complaints like: stack backtrace: CPU: 1 PID: 0 Comm: swapper/1 Not tainted 6.0.0-rc6-syzkaller-00321-g105a36f3694e #0 Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 08/26/2022 Call Trace: <IRQ> __dump_stack lib/dump_stack.c:88 [inline] dump_stack_lvl+0xcd/0x134 lib/dump_stack.c:106 print_usage_bug kernel/locking/lockdep.c:3961 [inline] valid_state kernel/locking/lockdep.c:3973 [inline] mark_lock_irq kernel/locking/lockdep.c:4176 [inline] mark_lock.part.0.cold+0x18/0xd8 kernel/locking/lockdep.c:4632 mark_lock kernel/locking/lockdep.c:4596 [inline] mark_usage kernel/locking/lockdep.c:4527 [inline] __lock_acquire+0x11d9/0x56d0 kernel/locking/lockdep.c:5007 lock_acquire kernel/locking/lockdep.c:5666 [inline] lock_acquire+0x1ab/0x570 kernel/locking/lockdep.c:5631 __fs_reclaim_acquire mm/page_alloc.c:4674 [inline] fs_reclaim_acquire+0x115/0x160 mm/page_alloc.c:4688 might_alloc include/linux/sched/mm.h:271 [inline] slab_pre_alloc_hook mm/slab.h:700 [inline] slab_alloc mm/slab.c:3278 [inline] __kmem_cache_alloc_lru mm/slab.c:3471 [inline] kmem_cache_alloc+0x39/0x520 mm/slab.c:3491 fanotify_alloc_fid_event fs/notify/fanotify/fanotify.c:580 [inline] fanotify_alloc_event fs/notify/fanotify/fanotify.c:813 [inline] fanotify_handle_event+0x1130/0x3f40 fs/notify/fanotify/fanotify.c:948 send_to_group fs/notify/fsnotify.c:360 [inline] fsnotify+0xafb/0x1680 fs/notify/fsnotify.c:570 __fsnotify_parent+0x62f/0xa60 fs/notify/fsnotify.c:230 fsnotify_parent include/linux/fsnotify.h:77 [inline] fsnotify_file include/linux/fsnotify.h:99 [inline] fsnotify_access include/linux/fsnotify.h:309 [inline] __io_complete_rw_common+0x485/0x720 io_uring/rw.c:195 io_complete_rw+0x1a/0x1f0 
io_uring/rw.c:228 iomap_dio_complete_work fs/iomap/direct-io.c:144 [inline] iomap_dio_bio_end_io+0x438/0x5e0 fs/iomap/direct-io.c:178 bio_endio+0x5f9/0x780 block/bio.c:1564 req_bio_endio block/blk-mq.c:695 [inline] blk_update_request+0x3fc/0x1300 block/blk-mq.c:825 scsi_end_request+0x7a/0x9a0 drivers/scsi/scsi_lib.c:541 scsi_io_completion+0x173/0x1f70 drivers/scsi/scsi_lib.c:971 scsi_complete+0x122/0x3b0 drivers/scsi/scsi_lib.c:1438 blk_complete_reqs+0xad/0xe0 block/blk-mq.c:1022 __do_softirq+0x1d3/0x9c6 kernel/softirq.c:571 invoke_softirq kernel/softirq.c:445 [inline] __irq_exit_rcu+0x123/0x180 kernel/softirq.c:650 irq_exit_rcu+0x5/0x20 kernel/softirq.c:662 common_interrupt+0xa9/0xc0 arch/x86/kernel/irq.c:240 Fixes: `f63cf5192f` ("io_uring: ensure that fsnotify is always called") Link: https://lore.kernel.org/all/20220929135627.ykivmdks2w5vzrwg@quack3/ Reported-by: syzbot+dfcc5f4da15868df7d4d@syzkaller.appspotmail.com Reported-by: Jan Kara <jack@suse.cz> Change-Id: Ia16078bdf53c6b2536cacb7aafa03a4ec1079a94 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `ea2e6286e3`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:57:16 +00:00
Dylan Yudaken	b29c357309	UPSTREAM: io_uring: do not recalculate ppos unnecessarily commit `b4aec40015` upstream. There is a slight optimisation to be had by calculating the correct pos pointer inside io_kiocb_update_pos and then using that later. It seems code size drops by a bit: 000000000000a1b0 0000000000000400 t io_read 000000000000a5b0 0000000000000319 t io_write vs 000000000000a1b0 00000000000003f6 t io_read 000000000000a5b0 0000000000000310 t io_write Change-Id: I19d8cdb6ea88d8fc4625e521363d5a8f638dfdcb Signed-off-by: Dylan Yudaken <dylany@fb.com> Reviewed-by: Pavel Begunkov <asml.silence@gmail.com> Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `e90cfb9699`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:57:16 +00:00
Dylan Yudaken	84e34d2ef5	UPSTREAM: io_uring: update kiocb->ki_pos at execution time commit `d34e1e5b39` upstream. Update kiocb->ki_pos at execution time rather than in io_prep_rw(). io_prep_rw() happens before the job is enqueued to a worker and so the offset might be read multiple times before being executed once. Ensures that the file position in a set of _linked_ SQEs will be only obtained after earlier SQEs have completed, and so will include their incremented file position. Change-Id: I3c5abbf6a337ec1958fd6600c5feb44fb61a5772 Signed-off-by: Dylan Yudaken <dylany@fb.com> Reviewed-by: Pavel Begunkov <asml.silence@gmail.com> Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `ea528ecac3`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:57:16 +00:00
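The effect of reading the file position at execution time rather than at prep time can be illustrated with a small model. This is a hedged Python sketch, not the kernel implementation: `File` and `run_chain` are hypothetical names, and it assumes every request in a linked chain is prepared before any of them executes and that each one asks for the current position (offset -1).

```python
class File:
    """Toy file that only tracks a current position (hypothetical model)."""

    def __init__(self):
        self.pos = 0


def run_chain(file, sizes, read_pos_at_prep):
    """Run linked writes that all request the current file position.

    Returns the list of offsets each write actually used.
    """
    # Prep phase: runs for every request before any of them executes.
    prepped = [file.pos if read_pos_at_prep else None for _ in sizes]

    used = []
    for size, prep_pos in zip(sizes, prepped):
        # Execution phase: either reuse the stale prep-time value or
        # read the position now, after earlier requests have completed.
        pos = prep_pos if read_pos_at_prep else file.pos
        used.append(pos)
        file.pos = pos + size  # completion advances the file position
    return used


if __name__ == "__main__":
    print("prep-time read:     ", run_chain(File(), [10, 10, 10], True))
    print("execution-time read:", run_chain(File(), [10, 10, 10], False))
```

With the prep-time read, every write in the chain targets the same offset; with the execution-time read, each write starts where the previous one finished.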
Dylan Yudaken	b543e0d210	UPSTREAM: io_uring: remove duplicated calls to io_kiocb_ppos commit `af9c45eceb` upstream. io_kiocb_ppos is called in both branches, and it seems that the compiler does not fuse this. Fusing removes a few bytes from loop_rw_iter. Before: $ nm -S fs/io_uring.o | grep loop_rw_iter 0000000000002430 0000000000000124 t loop_rw_iter After: $ nm -S fs/io_uring.o | grep loop_rw_iter 0000000000002430 000000000000010d t loop_rw_iter Change-Id: Ibd662d59697d9cb1e484319050f6e5f960f6ac5c Signed-off-by: Dylan Yudaken <dylany@fb.com> Reviewed-by: Pavel Begunkov <asml.silence@gmail.com> Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `076f872314`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Jens Axboe	9166f5418a	UPSTREAM: io_uring: ensure that cached task references are always put on exit commit `e775f93f2a` upstream. io_uring caches task references to avoid doing atomics for each of them per request. If a request is put from the same task that allocated it, then we can maintain a per-ctx cache of them. This obviously relies on io_uring always pruning caches in a reliable way, and there's currently a case off io_uring fd release where we can miss that. One example is a ring setup with IOPOLL, which relies on the task polling for completions, which will free them. However, if such a task submits a request and then exits or closes the ring without reaping the completion, then ring release will reap and put. If release happens from that very same task, the completed request task refs will get put back into the cache pool. This is problematic, as we're now beyond the point of pruning caches. Manually drop these caches after doing an IOPOLL reap. This releases references from the current task, which is enough. If another task happens to be doing the release, then the caching will not be triggered and there's no issue. Cc: stable@vger.kernel.org Fixes: `e98e49b2bb` ("io_uring: extend task put optimisations") Reported-by: Homin Rhee <hominlab@gmail.com> Change-Id: I9495121af065424141fa9c39840ab9aa91f45c72 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `e9c6556708`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Pavel Begunkov	fee5372abf	UPSTREAM: io_uring: fix CQ waiting timeout handling commit `12521a5d5c` upstream. Jiffy to ktime CQ waiting conversion broke how we treat timeouts, in particular we rearm it anew every time we get into io_cqring_wait_schedule() without adjusting the timeout. Waiting for 2 CQEs and getting a task_work in the middle may double the timeout value, or even worse in some cases task may wait indefinitely. Cc: stable@vger.kernel.org Fixes: `228339662b` ("io_uring: don't convert to jiffies for waiting on timeouts") Change-Id: If8605a13266ae2b49b1f7d7cd5ee092f9ffd2805 Signed-off-by: Pavel Begunkov <asml.silence@gmail.com> Link: https://lore.kernel.org/r/f7bffddd71b08f28a877d44d37ac953ddb01590d.1672915663.git.asml.silence@gmail.com Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `e0140e9da3`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
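The timeout problem described in the CQ waiting fix above comes down to re-arming a relative timeout instead of waiting against a fixed deadline. A minimal Python sketch in abstract time units follows, assuming no completion ever arrives and that `wakeups` lists how long each sleep lasts before an unrelated wakeup (for example task_work) interrupts it. Both function names are hypothetical; this is not the kernel's ktime or hrtimer code.

```python
def total_wait_rearm(timeout, wakeups):
    """Modelled bug: each wakeup re-arms the full relative timeout."""
    elapsed = 0
    for slept in wakeups:
        if slept >= timeout:
            # The timer fires before the wakeup would have happened.
            return elapsed + timeout
        elapsed += slept          # woken early, then the wait starts over
    return elapsed + timeout      # the last sleep runs the full timeout again


def total_wait_deadline(timeout, wakeups):
    """Modelled fix: the deadline is computed once and never extended."""
    deadline = timeout
    now = 0
    for slept in wakeups:
        left = deadline - now
        if slept >= left:
            return deadline       # the timer fires at the original deadline
        now += slept              # woken early, only the remainder is left
    return deadline


if __name__ == "__main__":
    print("re-arming:", total_wait_rearm(100, [90]))
    print("deadline: ", total_wait_deadline(100, [90]))
```

In the re-arming model a single wakeup late in the wait nearly doubles the total, and a steady stream of wakeups shorter than the timeout keeps the waiter asleep for as long as they continue.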
Pavel Begunkov	a4d056e350	UPSTREAM: io_uring: lock overflowing for IOPOLL commit `544d163d65` upstream. syzbot reports an issue with overflow filling for IOPOLL: WARNING: CPU: 0 PID: 28 at io_uring/io_uring.c:734 io_cqring_event_overflow+0x1c0/0x230 io_uring/io_uring.c:734 CPU: 0 PID: 28 Comm: kworker/u4:1 Not tainted 6.2.0-rc3-syzkaller-16369-g358a161a6a9e #0 Workqueue: events_unbound io_ring_exit_work Call trace: io_cqring_event_overflow+0x1c0/0x230 io_uring/io_uring.c:734 io_req_cqe_overflow+0x5c/0x70 io_uring/io_uring.c:773 io_fill_cqe_req io_uring/io_uring.h:168 [inline] io_do_iopoll+0x474/0x62c io_uring/rw.c:1065 io_iopoll_try_reap_events+0x6c/0x108 io_uring/io_uring.c:1513 io_uring_try_cancel_requests+0x13c/0x258 io_uring/io_uring.c:3056 io_ring_exit_work+0xec/0x390 io_uring/io_uring.c:2869 process_one_work+0x2d8/0x504 kernel/workqueue.c:2289 worker_thread+0x340/0x610 kernel/workqueue.c:2436 kthread+0x12c/0x158 kernel/kthread.c:376 ret_from_fork+0x10/0x20 arch/arm64/kernel/entry.S:863 There is no real problem for normal IOPOLL as flush is also called with uring_lock taken, but it's getting more complicated for IOPOLL|SQPOLL, for which __io_cqring_overflow_flush() happens from the CQ waiting path. Reported-and-tested-by: syzbot+6805087452d72929404e@syzkaller.appspotmail.com Cc: stable@vger.kernel.org # 5.10+ Change-Id: I3449b2ea1b71ff2f04f119741751b42870386923 Signed-off-by: Pavel Begunkov <asml.silence@gmail.com> Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `de77faee28`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Jens Axboe	0dfe72e890	UPSTREAM: io_uring: check for valid register opcode earlier [ Upstream commit `343190841a` ] We only check the register opcode value inside the restricted ring section, move it into the main io_uring_register() function instead and check it up front. Change-Id: I4b5f782dad48eb0e7f04d5956cc087494e02b2ec Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `78e8151f04`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Dylan Yudaken	1b735b5eb2	UPSTREAM: io_uring: fix async accept on O_NONBLOCK sockets commit `a73825ba70` upstream. Do not set REQ_F_NOWAIT if the socket is non blocking. When enabled this causes the accept to immediately post a CQE with EAGAIN, which means you cannot perform an accept SQE on a NONBLOCK socket asynchronously. By removing the flag if there is no pending accept then poll is armed as usual and when a connection comes in the CQE is posted. Change-Id: I0fae3f75c7fbbf44f85da7d83f48c4cfed1fcae9 Signed-off-by: Dylan Yudaken <dylany@fb.com> Link: https://lore.kernel.org/r/20220324143435.2875844-1-dylany@fb.com Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `aa4c9b3e45`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Jens Axboe	63bf975936	UPSTREAM: io_uring: allow re-poll if we made progress commit `10c873334f` upstream. We currently check REQ_F_POLLED before arming async poll for a notification to retry. If it's set, then we don't allow poll and will punt to io-wq instead. This is done to prevent a situation where a buggy driver will repeatedly return that there's space/data available yet we get -EAGAIN. However, if we already transferred data, then it should be safe to rely on poll again. Gate the check on whether or not REQ_F_PARTIAL_IO is also set. Change-Id: I36b6d16ac43202fdf9ae5eea64f9dfbcfbe7fee5 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `4bc17e6381`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Jens Axboe	a64d6ea01b	UPSTREAM: io_uring: support MSG_WAITALL for IORING_OP_SEND(MSG) commit `4c3c09439c` upstream. Like commit `7ba89d2af1` for recv/recvmsg, support MSG_WAITALL for the send side. If this flag is set and we do a short send, retry for a stream or seqpacket socket. Change-Id: If67a4462576af1b683d53d2dc0d46e44c9dd8863 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `f901b4bfd0`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Jens Axboe	cf7ef78842	UPSTREAM: io_uring: add flag for disabling provided buffer recycling commit `8a3e8ee564` upstream. If we need to continue doing this IO, then we don't want a potentially selected buffer recycled. Add a flag for that. Set this for recv/recvmsg if they do partial IO. Change-Id: If9381bd6a5695c8c85c7a51c3adccc0dc09f8999 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `96ccba4a1a`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Jens Axboe	45b2a34e21	UPSTREAM: io_uring: ensure recv and recvmsg handle MSG_WAITALL correctly commit `7ba89d2af1` upstream. We currently don't attempt to get the full asked for length even if MSG_WAITALL is set, if we get a partial receive. If we do see a partial receive, then just note how many bytes we did and return -EAGAIN to get it retried. The iov is advanced appropriately for the vector based case, and we manually bump the buffer and remainder for the non-vector case. Cc: stable@vger.kernel.org Reported-by: Constantine Gavrilov <constantine.gavrilov@gmail.com> Change-Id: I618bde7c86b29f6053dd8cd19682f2916e57dd54 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `aadd9b0930`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Pavel Begunkov	4b912a635e	UPSTREAM: io_uring: improve send/recv error handling commit `7297ce3d59` upstream. Hide all error handling under common if block, removes two extra ifs on the success path and keeps the handling more condensed. Change-Id: If6864c8ddd06bc853cef6b543fc06cf99d9ad147 Signed-off-by: Pavel Begunkov <asml.silence@gmail.com> Link: https://lore.kernel.org/r/5761545158a12968f3caf30f747eea65ed75dfc1.1637524285.git.asml.silence@gmail.com Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `abdc16c836`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Jens Axboe	ef0c71d0f1	UPSTREAM: io_uring: don't gate task_work run on TIF_NOTIFY_SIGNAL commit `46a525e199` upstream. This isn't a reliable mechanism to tell if we have task_work pending, we really should be looking at whether we have any items queued. This is problematic if forward progress is gated on running said task_work. One such example is reading from a pipe, where the write side has been closed right before the read is started. The fput() of the file queues TWA_RESUME task_work, and we need that task_work to be run before ->release() is called for the pipe. If ->release() isn't called, then the read will sit forever waiting on data that will never arise. Fix this by changing io_run_task_work() so it checks if we have task_work pending rather than rely on TIF_NOTIFY_SIGNAL for that. The latter obviously doesn't work for task_work that is queued without TWA_SIGNAL. Reported-by: Christiano Haesbaert <haesbaert@haesbaert.org> Cc: stable@vger.kernel.org Link: https://github.com/axboe/liburing/issues/665 Change-Id: I042b07491afac06692639d91bdf7dd21a2405651 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org> Bug: 268174392 (cherry picked from commit `2fd232bbd6`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-17 17:56:56 +00:00
Robin Murphy	1531e1fb8d	BACKPORT: iommu: Avoid races around device probe We currently have 3 different ways that __iommu_probe_device() may be called, but no real guarantee that multiple callers can't tread on each other, especially once asynchronous driver probe gets involved. It would likely have taken a fair bit of luck to hit this previously, but commit `57365a04c9` ("iommu: Move bus setup to IOMMU device registration") ups the odds since now it's not just omap-iommu that may trigger multiple bus_iommu_probe() calls in parallel if probing asynchronously. Add a lock to ensure we can't try to double-probe a device, and also close some possible race windows to make sure we're truly robust against trying to double-initialise a group via two different member devices. Reported-by: Brian Norris <briannorris@chromium.org> Signed-off-by: Robin Murphy <robin.murphy@arm.com> Tested-by: Brian Norris <briannorris@chromium.org> Fixes: `57365a04c9` ("iommu: Move bus setup to IOMMU device registration") Link: https://lore.kernel.org/r/1946ef9f774851732eed78760a78ec40dbc6d178.1667591503.git.robin.murphy@arm.com Signed-off-by: Joerg Roedel <jroedel@suse.de> Bug: 269232600 (cherry picked from commit `01657bc14a`) Change-Id: Ie87f8f7a7b90431c3a2682923961885ce7b239f3 Signed-off-by: Zhenhua Huang <quic_zhenhuah@quicinc.com>	2023-02-17 16:22:47 +00:00
Jens Axboe	60944bdddc	UPSTREAM: io_uring/io-wq: only free worker if it was allocated for creation commit `e6db6f9398` upstream. We have two types of task_work based creation, one is using an existing worker to setup a new one (eg when going to sleep and we have no free workers), and the other is allocating a new worker. Only the latter should be freed when we cancel task_work creation for a new worker. Fixes: `af82425c6a` ("io_uring/io-wq: free worker if task_work creation is canceled") Reported-by: syzbot+d56ec896af3637bdb7e4@syzkaller.appspotmail.com Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Bug: 268174392 (cherry picked from commit `a88a0d16e1`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: I75c9b22dce02151b2687cf90d6c5b74c08d0f04b	2023-02-17 12:39:32 +00:00
Jens Axboe	ac06912075	UPSTREAM: io_uring/io-wq: free worker if task_work creation is canceled commit `af82425c6a` upstream. If we cancel the task_work, the worker will never come into existence. As this is the last reference to it, ensure that we get it freed appropriately. Cc: stable@vger.kernel.org Reported-by: 진호 <wnwlsgh98@gmail.com> Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Bug: 268174392 (cherry picked from commit `b912ed1363`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: Iacfd7a5db15c417fd1f02c85e414e3137e8729ec	2023-02-17 12:38:37 +00:00
Harshit Mogalapalli	98a15feed0	UPSTREAM: io_uring: Fix unsigned 'res' comparison with zero in io_fixup_rw_res() Smatch warning: io_fixup_rw_res() warn: unsigned 'res' is never less than zero. Change type of 'res' from unsigned to long. Fixes: `d6b7efc722` ("io_uring/rw: fix error'ed retry return values") Signed-off-by: Harshit Mogalapalli <harshit.m.mogalapalli@oracle.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Bug: 268174392 (cherry picked from commit `07b3672c40`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: I3534398af5e77e92a1ac48170e3ae4dffa42463b	2023-02-17 12:37:43 +00:00
Andy Shevchenko	a234cc4e55	UPSTREAM: um: Increase stack frame size threshold for signal.c The signal.c can't use heap for bit data located on stack. However, by default a compiler warns us about overstepping stack frame size threshold: arch/um/os-Linux/signal.c: In function ‘sig_handler_common’: arch/um/os-Linux/signal.c:51:1: warning: the frame size of 2960 bytes is larger than 2048 bytes [-Wframe-larger-than=] 51 | } | ^ arch/um/os-Linux/signal.c: In function ‘timer_real_alarm_handler’: arch/um/os-Linux/signal.c:95:1: warning: the frame size of 2960 bytes is larger than 2048 bytes [-Wframe-larger-than=] 95 | } | ^ Due to above increase stack frame size threshold explicitly for signal.c to avoid unnecessary warning. Bug: 269057599 Change-Id: Ib7474bddfefa97f9c60087db6a607a111e4d23bc Signed-off-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com> Tested-by: David Gow <davidgow@google.com> Signed-off-by: Richard Weinberger <richard@nod.at> (cherry picked from commit `517f60206e`) Signed-off-by: Srinivasarao Pathipati <quic_spathi@quicinc.com>	2023-02-15 17:04:02 +05:30
Beata Michalska	d40d310e5e	ANDROID: GKI: Enable ARM64_ERRATUM_2454944 Enable workaround for Cortex-A510 erratum 2454944. Bug: 223346425 Change-Id: Ieb60640b26cd2093702045670890b6a204277cca Signed-off-by: Beata Michalska <beata.michalska@arm.com>	2023-02-09 18:53:48 +00:00
Beata Michalska	9d2ec2e0b6	ANDROID: dma-ops: Add restricted vendor hook Add a vendor hook to arch_setup_dma_ops to allow vendors to perform any necessary post-actions on setting up DMA ops for a given device, focusing mainly on enabling those to opt-in for the Cortex-A510 erratum 2454944. Bug: 263236925 Change-Id: I6fd4d3a30829437fc113ec15ca2e5d060a38e60c Signed-off-by: Beata Michalska <beata.michalska@arm.com>	2023-02-09 18:53:48 +00:00
Robin Murphy	3c75a6fb7f	ANDROID: arm64: Work around Cortex-A510 erratum 2454944 Cortex-A510 erratum 2454944 may cause clean cache lines to be erroneously written back to memory, breaking the assumptions we rely on for non-coherent DMA. Try to mitigate this by implementing special DMA ops that do their best to avoid cacheable aliases via a combination of bounce-buffering and manipulating the linear map directly, to minimise the chance of DMA-mapped pages being speculated back into caches. The other main concern is initial entry, where cache lines covering the kernel image might potentially become affected between being cleaned by the bootloader and the kernel being called, which might require additional cache maintenance from the bootloader to be safe in that regard too. Cortex-A510 supports S2FWB, so KVM should be unaffected. For the workaround to be applied, it needs to be explicitly requested through dedicated arm64_noalias_setup_dma_ops callback. Bug: 223346425 (cherry picked from commit 683efc5fc6eeb653caf85c33a2fb92a33c8faa75 https://git.gitlab.arm.com/linux-arm/linux-rm.git arm64/2454944-dev) Change-Id: If76b97dc39c278edb80f9b750129975ab2ac563e Signed-off-by: Robin Murphy <robin.murphy@arm.com> [BM: Stripping-down the original solution by removing support for cpu capabilities and amending relevant bits, with the final version being reduced to dedicated DMA ops with dependencies on rodata_full being enabled (CONFIG_RODATA_FULL_DEFAULT_ENABLED), swiotlb late init and disabling lazy tlb flushing. Also, as a consequence, reducing debugging support.] Signed-off-by: Beata Michalska <beata.michalska@arm.com>	2023-02-09 18:53:48 +00:00
Robin Murphy	865f370bf9	ANDROID: mm/vmalloc: Add override for lazy vunmap Add an interface to disable lazy vunmap by forcing the threshold to zero. This might be interesting for debugging/testing in general, but primarily helps a horrible situation which needs to guarantee that vmalloc aliases are up-to-date from atomic context, wherein the only practical solution is to never let them get stale in the first place. Bug: 223346425 (cherry picked from commit 2a34c1503b85f49dd472dfd932dfcd16cab8ee8a https://git.gitlab.arm.com/linux-arm/linux-rm.git arm64/2454944-dev) Change-Id: I12fbbe3903f76a028ceea91ed078f0de2abe3815 Signed-off-by: Robin Murphy <robin.murphy@arm.com> [BM: Convert to a flag that can be explicitly modified at runtime instead of relying on arch specific bits] Signed-off-by: Beata Michalska <beata.michalska@arm.com>	2023-02-09 18:53:48 +00:00
Maulik Shah	1eb5992d60	ANDROID: cpuidle-psci: Fix suspicious RCU usage This change fixes suspicious RCU usage warnings from vendor hook. ============================= WARNING: suspicious RCU usage 5.15.41-debug-gc1163f69ba3b-dirty #1 Not tainted ----------------------------- include/trace/events/lock.h:37 suspicious rcu_dereference_check() usage! other info that might help us debug this: rcu_scheduler_active = 2, debug_locks = 1 RCU used illegally from extended quiescent state! no locks held by swapper/0/0. stack backtrace: CPU: 0 PID: 0 Comm: swapper/0 Not tainted 5.15.41-debug-gc1163f69ba3b-dirty #1 Call trace: dump_backtrace+0x0/0x1d8 dump_stack+0x1c/0x4c .. .. _printk+0x58/0x84 lockdep_rcu_suspicious+0x44/0x15c trace_android_vh_printk_caller_id+0xc4/0x13c vprintk_store+0x54/0x59c vprintk_emit+0x8c/0x130 vprintk_default+0x48/0x74 vprintk+0xf8/0x13c _printk+0x58/0x84 lockdep_rcu_suspicious+0x44/0x15c trace_android_vh_cpuidle_psci_enter+0xc4/0x144 __psci_enter_domain_idle_state+0x64/0x118 psci_enter_domain_idle_state+0x1c/0x2c cpuidle_enter_state+0x14c/0x2fc cpuidle_enter+0x3c/0x58 Bug: 267847290 Fixes: `3567f51602` ("ANDROID: cpuidle-psci: Add vendor hook for cpuidle psci enter and exit") Change-Id: I910a6a0595c3a79b75e581297eb56d512ce5885c Signed-off-by: Maulik Shah <quic_mkshah@quicinc.com>	2023-02-09 18:20:25 +00:00
Woogeun Lee	d6b2899ce6	ANDROID: ABI: update allowed list for galaxy 1 Added function: [A] 'function void _trace_android_vh_record_pcpu_rwsem_starttime(task_struct*, unsigned long int)' Bug: 262423323 Change-Id: I4ebef8d03a3c030da6eac2f4d857ce889005d5ec Signed-off-by: Woogeun Lee <woogeun.lee@samsung.com>	2023-02-09 18:11:20 +00:00
qixiaoyu1	3fcc69ca4d	FROMGIT: f2fs: add sysfs nodes to set last_age_weight Bug: 267580491 (cherry picked from commit `d23be468ea` https://git.kernel.org/pub/scm/linux/kernel/git/jaegeuk/f2fs.git dev) Signed-off-by: qixiaoyu1 <qixiaoyu1@xiaomi.com> Signed-off-by: xiongping1 <xiongping1@xiaomi.com> Reviewed-by: Chao Yu <chao@kernel.org> Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org> Change-Id: I88b795ec90f4589676daed4919db31b26574c84b	2023-02-09 01:00:54 +00:00
qixiaoyu1	899476c3af	FROMGIT: f2fs: fix wrong calculation of block age Currently we wrongly calculate the new block age to old * LAST_AGE_WEIGHT / 100. Fix it to new * (100 - LAST_AGE_WEIGHT) / 100 + old * LAST_AGE_WEIGHT / 100. Bug: 267580491 (cherry picked from commit `b03a41a495` https://git.kernel.org/pub/scm/linux/kernel/git/jaegeuk/f2fs.git dev) Signed-off-by: qixiaoyu1 <qixiaoyu1@xiaomi.com> Signed-off-by: xiongping1 <xiongping1@xiaomi.com> Reviewed-by: Chao Yu <chao@kernel.org> Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org> Change-Id: If06f04c63f9ed0de4e1d734936d9ea9a6c613d64	2023-02-09 01:00:54 +00:00
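The two block-age formulas quoted in the f2fs fix above can be compared directly. This is a small Python sketch under stated assumptions: the function names are hypothetical, the weight of 30 is an arbitrary illustrative value rather than a documented default, and plain integer floor division stands in for whatever arithmetic the kernel actually uses.

```python
def block_age_buggy(old, new, weight):
    """Formula described as wrong: the newly measured age is ignored."""
    return old * weight // 100


def block_age_fixed(old, new, weight):
    """Weighted average of the new and old ages, per the commit message."""
    return new * (100 - weight) // 100 + old * weight // 100


if __name__ == "__main__":
    old_age, new_age, last_age_weight = 1000, 200, 30
    print("buggy:", block_age_buggy(old_age, new_age, last_age_weight))
    print("fixed:", block_age_fixed(old_age, new_age, last_age_weight))
```

With a weight of 0 the fixed formula returns the new age unchanged, and with a weight of 100 it returns the old age, which is the behavior expected from a weighted average.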
Greg Kroah-Hartman	d0f788b8fa	ANDROID: struct io_uring ABI preservation hack for 5.10.162 changes In the 5.10.162 release, the io_uring code was synced with the version that is in the 5.15.y kernel tree in order to resolve a huge number of potential, and known, problems with the codebase. This makes for a more secure and easier-to-update-and-maintain 5.10.y kernel tree, so this is a great thing, however this caused some issues when it comes to the Android KABI preservation and checking tools. A number of the io_uring structures get used in other core kernel structures, only as "opaque" pointers, so there is not any real ABI breakage. But, due to the visibility of the structures going away, the CRC values of many scheduler variables and functions were changed. In order to preserve the CRC values, to prevent all device kernels from being forced to rebuild for no reason whatsoever from a functional point of view, we need to keep around the "old" io_uring structures for the CRC calculation only. This is done by the following definitions of struct io_identity and struct io_uring_task which will only be visible when the CRC calculation build happens, not in any functional kernel build. Yes, this all is a horrible hack, and these really are not the true structures that any code uses, but such is life in the world of stable APIs. Bug: 161946584 Bug: 268174392 Fixes: `788d082426` ("io_uring: import 5.15-stable io_uring") Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: I2294f220ae78fe9aa32ee25b81829ae765e9deb2	2023-02-07 13:38:16 +00:00
Greg Kroah-Hartman	fef924db72	ANDROID: fix up struct task_struct ABI change in 5.10.162 In commit `788d082426` ("io_uring: import 5.15-stable io_uring"), a new field was added to struct task_struct. Move it to the proper location and macro in order to preserve the kernel ABI. Bug: 161946584 Bug: 268174392 Fixes: `788d082426` ("io_uring: import 5.15-stable io_uring") Signed-off-by: Greg Kroah-Hartman <gregkh@google.com> Change-Id: Ib2f65b7c1a973794b7ab525a9304f666ffebc9ee	2023-02-07 13:38:16 +00:00
Greg Kroah-Hartman	d369ac0b2a	ANDROID: add flags variable back to struct proto_ops In commit `a3025359ff` ("net: remove cmsg restriction from io_uring based send/recvmsg calls") the flags variable was removed from struct proto_ops as it is no longer needed. But the ABI signatures break, so put it back to preserve this, there's no functional change here. Bug: 161946584 Bug: 268174392 Fixes: `a3025359ff` ("net: remove cmsg restriction from io_uring based send/recvmsg calls") Change-Id: Ic6a868f038701a61c993e18b44cdd8ec8b0a4d58 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:16 +00:00
Jens Axboe	5756328b3f	UPSTREAM: io_uring: pass in EPOLL_URING_WAKE for eventfd signaling and wakeups [ Upstream commit `4464853277` ] Pass in EPOLL_URING_WAKE when signaling eventfd or doing poll related wakeups, so that we can check for a circular event dependency between eventfd and epoll. If this flag is set when our wakeup handlers are called, then we know we have a dependency that needs to terminate multishot requests. eventfd and epoll are the only such possible dependencies. Bug: 268174392 Cc: stable@vger.kernel.org # 6.0 Change-Id: I6e45fa1484657bd5caad007783785c2ee97a9929 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `189556b05e`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:16 +00:00
Jens Axboe	72d1c48675	UPSTREAM: eventfd: provide a eventfd_signal_mask() helper [ Upstream commit `03e02acda8` ] This is identical to eventfd_signal(), but it allows the caller to pass in a mask to be used for the poll wakeup key. The use case is avoiding repeated multishot triggers if we have a dependency between eventfd and io_uring. If we setup an eventfd context and register that as the io_uring eventfd, and at the same time queue a multishot poll request for the eventfd context, then any CQE posted will repeatedly trigger the multishot request until it terminates when the CQ ring overflows. In preparation for io_uring detecting this circular dependency, add the mentioned helper so that io_uring can pass in EPOLL_URING as part of the poll wakeup key. Cc: stable@vger.kernel.org # 6.0 [axboe: fold in !CONFIG_EVENTFD fix from Zhang Qilong] Change-Id: I0c38a56887777f85cb10673b7ca3b5ca4d70c61b Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `4ef66581d7`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:16 +00:00
Jens Axboe	d7a47b29d5	UPSTREAM: eventpoll: add EPOLL_URING_WAKE poll wakeup flag [ Upstream commit `caf1aeaffc` ] We can have dependencies between epoll and io_uring. Consider an epoll context, identified by the epfd file descriptor, and an io_uring file descriptor identified by iofd. If we add iofd to the epfd context, and arm a multishot poll request for epfd with iofd, then the multishot poll request will repeatedly trigger and generate events until terminated by CQ ring overflow. This isn't a desired behavior. Add EPOLL_URING so that io_uring can pass it in as part of the poll wakeup key, and io_uring can check for that to detect a potential recursive invocation. Cc: stable@vger.kernel.org # 6.0 Change-Id: Ifafcb236b2cfe3ca3e7254a0155625fce00fd038 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `2f09377502`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:16 +00:00
Jens Axboe	7c9f38c09b	UPSTREAM: Revert "proc: don't allow async path resolution of /proc/self components" [ Upstream commit `9e8d9e829c` ] This reverts commit `8d4c3e76e3`. No longer needed, as the io-wq worker threads have the right identity. Change-Id: I6c12f6f957e1c789f4fd5b21379d167f17feb3ea Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `b76c5373f0`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:15 +00:00
Jens Axboe	498b35b3c4	UPSTREAM: Revert "proc: don't allow async path resolution of /proc/thread-self components" [ Upstream commit `2587890b5e` ] This reverts commit `0d4370cfe3`. No longer needed, as the io-wq worker threads have the right identity. Change-Id: I7a28e02a0a1911555853cf4046e3a09c7e36d4a2 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `87cb08dc6b`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:15 +00:00
Jens Axboe	4b17dea786	UPSTREAM: net: remove cmsg restriction from io_uring based send/recvmsg calls [ Upstream commit `e54937963f` ] No need to restrict these anymore, as the worker threads are direct clones of the original task. Hence we know for a fact that we can support anything that the regular task can. Since the only user of proto_ops->flags was to flag PROTO_CMSG_DATA_ONLY, kill the member and the flag definition too. Change-Id: Ie87e4ff3c621cf53a8e9589a7689e62d759de983 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `a3025359ff`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:15 +00:00
Jens Axboe	d10f30da0d	UPSTREAM: task_work: unconditionally run task_work from get_signal() [ Upstream commit `35d0b389f3` ] Song reported a boot regression in a kvm image with 5.11-rc, and bisected it down to the below patch. Debugging this issue, turns out that the boot stalled when a task is waiting on a pipe being released. As we no longer run task_work from get_signal() unless it's queued with TWA_SIGNAL, the task goes idle without running the task_work. This prevents ->release() from being called on the pipe, which another boot task is waiting on. For now, re-instate the unconditional task_work run from get_signal(). For 5.12, we'll collapse TWA_RESUME and TWA_SIGNAL, as it no longer makes sense to have a distinction between the two. This will turn task_work notification into a simple boolean, whether to notify or not. Fixes: `98b89b649f` ("signal: kill JOBCTL_TASK_WORK") Reported-by: Song Liu <songliubraving@fb.com> Tested-by: John Stultz <john.stultz@linaro.org> Tested-by: Douglas Anderson <dianders@chromium.org> Tested-by: Sedat Dilek <sedat.dilek@gmail.com> # LLVM/Clang version 11.0.1 Change-Id: Id5ce292120cafff9ede9bb7421cde3aaf4e56924 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `6ef2b4728a`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:15 +00:00
Jens Axboe	62822bf630	UPSTREAM: signal: kill JOBCTL_TASK_WORK [ Upstream commit `98b89b649f` ] It's no longer used, get rid of it. Change-Id: Id14379554f3e1085c63ac4d044618f609ebc2f9f Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `c91ab04781`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:15 +00:00
Jens Axboe	5e6347b586	UPSTREAM: io_uring: import 5.15-stable io_uring No upstream commit exists. This imports the io_uring codebase from 5.15.85, wholesale. Changes from that code base: - Drop IOCB_ALLOC_CACHE, we don't have that in 5.10. - Drop MKDIRAT/SYMLINKAT/LINKAT. Would require further VFS backports, and we don't support these in 5.10 to begin with. - sock_from_file() old style calling convention. - Use compat_get_bitmap() only for CONFIG_COMPAT=y Change-Id: I7ce5226d6b39763ffc246fd6357cece9aafd4b59 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `788d082426`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:15 +00:00
Jens Axboe	518e02ed06	UPSTREAM: task_work: add helper for more targeted task_work canceling [ Upstream commit `c7aab1a7c5` ] The only exported helper we have right now is task_work_cancel(), which cancels any task_work from a given task where func matches the queued work item. This is a bit too coarse for some use cases. Add a task_work_cancel_match() that allows to more specifically target individual work items outside of purely the callback function used. task_work_cancel() can be trivially implemented on top of that, hence do so. Reviewed-by: Oleg Nesterov <oleg@redhat.com> Change-Id: Ia33480d209b26d433a3ca196972d6931aa4f8dde Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `ed30050329`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:15 +00:00
Jens Axboe	86acb6a529	UPSTREAM: kernel: don't call do_exit() for PF_IO_WORKER threads [ Upstream commit `10442994ba` ] Right now we're never calling get_signal() from PF_IO_WORKER threads, but in preparation for doing so, don't handle a fatal signal for them. The workers have state they need to cleanup when exiting, so just return instead of calling do_exit() on their behalf. The threads themselves will detect a fatal signal and do proper shutdown. Change-Id: Iedc3fae8cb496d003852c87fdefacc1ad7601cc5 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `831cb78a2a`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:14 +00:00
Jens Axboe	52f564e57b	UPSTREAM: kernel: stop masking signals in create_io_thread() [ Upstream commit `b16b3855d8` ] This is racy - move the blocking into when the task is created and we're marking it as PF_IO_WORKER anyway. The IO threads are now prepared to handle signals like SIGSTOP as well, so clear that from the mask to allow proper stopping of IO threads. Acked-by: "Eric W. Biederman" <ebiederm@xmission.com> Reported-by: Oleg Nesterov <oleg@redhat.com> Change-Id: I6317c88e0723c6c97555f8ceacfee3692372ac4c Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `9ded44b69c`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:14 +00:00
Stefan Metzmacher	bcb749b0b1	UPSTREAM: x86/process: setup io_threads more like normal user space threads [ Upstream commit `50b7b6f29d` ] As io_threads are fully set up USER threads it's clearer to separate the code path from the KTHREAD logic. The only remaining difference to user space threads is that io_threads never return to user space again. Instead they loop within the given worker function. The fact that they never return to user space means they don't have an user space thread stack. In order to indicate that to tools like gdb we reset the stack and instruction pointers to 0. This allows gdb attach to user space processes using io-uring, which like means that they have io_threads, without printing worrying message like this: warning: Selected architecture i386:x86-64 is not compatible with reported target architecture i386 warning: Architecture rejected target-supplied description The output will be something like this: (gdb) info threads Id Target Id Frame * 1 LWP 4863 "io_uring-cp-for" syscall () at ../sysdeps/unix/sysv/linux/x86_64/syscall.S:38 2 LWP 4864 "iou-mgr-4863" 0x0000000000000000 in ?? () 3 LWP 4865 "iou-wrk-4863" 0x0000000000000000 in ?? () (gdb) thread 3 [Switching to thread 3 (LWP 4865)] #0 0x0000000000000000 in ?? () (gdb) bt #0 0x0000000000000000 in ?? () Backtrace stopped: Cannot access memory at address 0x0 Fixes: `4727dc20e0` ("arch: setup PF_IO_WORKER threads like PF_KTHREAD") Link: https://lore.kernel.org/io-uring/044d0bad-6888-a211-e1d3-159a4aeed52d@polymtl.ca/T/#m1bbf5727e3d4e839603f6ec7ed79c7eebfba6267 Change-Id: I83793e9a4fbc5f9024c9aeace0640043c81a93b0 Signed-off-by: Stefan Metzmacher <metze@samba.org> cc: Linus Torvalds <torvalds@linux-foundation.org> cc: Jens Axboe <axboe@kernel.dk> cc: Andy Lutomirski <luto@kernel.org> cc: linux-kernel@vger.kernel.org cc: io-uring@vger.kernel.org cc: x86@kernel.org Link: https://lore.kernel.org/r/20210505110310.237537-1-metze@samba.org Reviewed-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `f0a5f0dc01`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:14 +00:00
Jens Axboe	1f4eb35546	UPSTREAM: arch: ensure parisc/powerpc handle PF_IO_WORKER in copy_thread() [ Upstream commit `0100e6bbdb` ] In the arch addition of PF_IO_WORKER, I missed parisc and powerpc for some reason. Fix that up, ensuring they handle PF_IO_WORKER like they do PF_KTHREAD in copy_thread(). Reported-by: Bruno Goncalves <bgoncalv@redhat.com> Fixes: `4727dc20e0` ("arch: setup PF_IO_WORKER threads like PF_KTHREAD") Change-Id: I3d0289912eb9e4545fc0b680df6890b6b837ebdd Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `dd26e2cec7`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:14 +00:00
Jens Axboe	150dea15cb	UPSTREAM: arch: setup PF_IO_WORKER threads like PF_KTHREAD [ Upstream commit `4727dc20e0` ] PF_IO_WORKER are kernel threads too, but they aren't PF_KTHREAD in the sense that we don't assign ->set_child_tid with our own structure. Just ensure that every arch sets up the PF_IO_WORKER threads like kthreads in the arch implementation of copy_thread(). Change-Id: Iec4a3c42a39f016b323476d7238f3d36aaf0e6cf Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `320c8057ec`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:14 +00:00
Seth Forshee	cf487d3c6a	UPSTREAM: entry/kvm: Exit to user mode when TIF_NOTIFY_SIGNAL is set [ Upstream commit `3e684903a8` ] A livepatch transition may stall indefinitely when a kvm vCPU is heavily loaded. To the host, the vCPU task is a user thread which is spending a very long time in the ioctl(KVM_RUN) syscall. During livepatch transition, set_notify_signal() will be called on such tasks to interrupt the syscall so that the task can be transitioned. This interrupts guest execution, but when xfer_to_guest_mode_work() sees that TIF_NOTIFY_SIGNAL is set but not TIF_SIGPENDING it concludes that an exit to user mode is unnecessary, and guest execution is resumed without transitioning the task for the livepatch. This handling of TIF_NOTIFY_SIGNAL is incorrect, as set_notify_signal() is expected to break tasks out of interruptible kernel loops and cause them to return to userspace. Change xfer_to_guest_mode_work() to handle TIF_NOTIFY_SIGNAL the same as TIF_SIGPENDING, signaling to the vCPU run loop that an exit to userspace is needed. Any pending task_work will be run when get_signal() is called from exit_to_user_mode_loop(), so there is no longer any need to run task work from xfer_to_guest_mode_work(). Suggested-by: "Eric W. Biederman" <ebiederm@xmission.com> Cc: Petr Mladek <pmladek@suse.com> Change-Id: If14e86a516403671ccb122cea32cc704f774e8ce Signed-off-by: Seth Forshee <sforshee@digitalocean.com> Message-Id: <20220504180840.2907296-1-sforshee@digitalocean.com> Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `000de389ad`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:14 +00:00
Jens Axboe	6e4362caf9	UPSTREAM: kernel: allow fork with TIF_NOTIFY_SIGNAL pending [ Upstream commit `66ae0d1e2d` ] fork() fails if signal_pending() is true, but there are two conditions that can lead to that: 1) An actual signal is pending. We want fork to fail for that one, like we always have. 2) TIF_NOTIFY_SIGNAL is pending, because the task has pending task_work. We don't need to make it fail for that case. Allow fork() to proceed if just task_work is pending, by changing the signal_pending() check to task_sigpending(). Change-Id: Iec007746b42f5d62581a8b5f6cca4006e707b8e3 Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `0f735cf52b`) Bug: 268174392 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-02-07 13:38:14 +00:00

993433 Commits