linux

mirror of https://github.com/hardkernel/linux.git synced 2026-06-08 20:07:46 +09:00

Author	SHA1	Message	Date
Stefano Garzarella	3f46cd62c8	UPSTREAM: virtio: use virtio_device_ready() in virtio_device_restore() After waking up a suspended VM, the kernel prints the following trace for virtio drivers which do not directly call virtio_device_ready() in the .restore: PM: suspend exit irq 22: nobody cared (try booting with the "irqpoll" option) Call Trace: <IRQ> dump_stack_lvl+0x38/0x49 dump_stack+0x10/0x12 __report_bad_irq+0x3a/0xaf note_interrupt.cold+0xb/0x60 handle_irq_event+0x71/0x80 handle_fasteoi_irq+0x95/0x1e0 __common_interrupt+0x6b/0x110 common_interrupt+0x63/0xe0 asm_common_interrupt+0x1e/0x40 ? __do_softirq+0x75/0x2f3 irq_exit_rcu+0x93/0xe0 sysvec_apic_timer_interrupt+0xac/0xd0 </IRQ> <TASK> asm_sysvec_apic_timer_interrupt+0x12/0x20 arch_cpu_idle+0x12/0x20 default_idle_call+0x39/0xf0 do_idle+0x1b5/0x210 cpu_startup_entry+0x20/0x30 start_secondary+0xf3/0x100 secondary_startup_64_no_verify+0xc3/0xcb </TASK> handlers: [<000000008f9bac49>] vp_interrupt [<000000008f9bac49>] vp_interrupt Disabling IRQ #22 This happens because we don't invoke .enable_cbs callback in virtio_device_restore(). That callback is used by some transports (e.g. virtio-pci) to enable interrupts. Let's fix it, by calling virtio_device_ready() as we do in virtio_dev_probe(). This function calls .enable_cts callback and sets DRIVER_OK status bit. This fix also avoids setting DRIVER_OK twice for those drivers that call virtio_device_ready() in the .restore. Bug: 254441685 Fixes: `d50497eb4e` ("virtio_config: introduce a new .enable_cbs method") Signed-off-by: Stefano Garzarella <sgarzare@redhat.com> Link: https://lore.kernel.org/r/20220322114313.116516-1-sgarzare@redhat.com Signed-off-by: Michael S. Tsirkin <mst@redhat.com> (cherry picked from commit `8d65bc9a5b`) Signed-off-by: Lee Jones <joneslee@google.com> Change-Id: I1a16d4e905ed3929ecdd87c3a7852c0906611ff3	2022-12-19 16:30:28 +00:00
Dongliang Mu	816e125540	UPSTREAM: fs: erofs: add sanity check for kobject in erofs_unregister_sysfs Syzkaller hit 'WARNING: kobject bug in erofs_unregister_sysfs'. This bug is triggered by injecting fault in kobject_init_and_add of erofs_unregister_sysfs. Fix this by adding sanity check for kobject in erofs_unregister_sysfs Note that I've tested the patch and the crash does not occur any more. Bug: 254441685 Link: https://lore.kernel.org/r/20220315132814.12332-1-dzm91@hust.edu.cn Signed-off-by: Dongliang Mu <mudongliangabcd@gmail.com> Fixes: `168e9a7620` ("erofs: add sysfs interface") Reviewed-by: Gao Xiang <hsiangkao@linux.alibaba.com> Reviewed-by: Chao Yu <chao@kernel.org> Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com> (cherry picked from commit `a942da24ab`) Signed-off-by: Lee Jones <joneslee@google.com> Change-Id: I31de79f830b63fc4b1357e04036a0f3f3b12903e	2022-12-19 16:30:28 +00:00
Suren Baghdasaryan	9b71fb2a76	UPSTREAM: mm: fix use-after-free bug when mm->mmap is reused after being freed 65;7000;1coom reaping (__oom_reap_task_mm) relies on a 2 way synchronization with exit_mmap. First it relies on the mmap_lock to exclude from unlock path[1], page tables tear down (free_pgtables) and vma destruction. This alone is not sufficient because mm->mmap is never reset. For historical reasons[2] the lock is taken there is also MMF_OOM_SKIP set for oom victims before. The oom reaper only ever looks at oom victims so the whole scheme works properly but process_mrelease can opearate on any task (with fatal signals pending) which doesn't really imply oom victims. That means that the MMF_OOM_SKIP part of the synchronization doesn't work and it can see a task after the whole address space has been demolished and traverse an already released mm->mmap list. This leads to use after free as properly caught up by KASAN report. Fix the issue by reseting mm->mmap so that MMF_OOM_SKIP synchronization is not needed anymore. The MMF_OOM_SKIP is not removed from exit_mmap yet but it acts mostly as an optimization now. [1] `27ae357fa8` ("mm, oom: fix concurrent munlock and oom reaper unmap, v3") [2] `2129258024` ("mm: oom: let oom_reap_task and exit_mmap run concurrently") [mhocko@suse.com: changelog rewrite] Bug: 254441685 Link: https://lore.kernel.org/all/00000000000072ef2c05d7f81950@google.com/ Link: https://lkml.kernel.org/r/20220215201922.1908156-1-surenb@google.com Fixes: `64591e8605` ("mm: protect free_pgtables with mmap_lock write lock in exit_mmap") Signed-off-by: Suren Baghdasaryan <surenb@google.com> Reported-by: syzbot+2ccf63a4bd07cf39cab0@syzkaller.appspotmail.com Suggested-by: Michal Hocko <mhocko@suse.com> Reviewed-by: Rik van Riel <riel@surriel.com> Reviewed-by: Yang Shi <shy828301@gmail.com> Acked-by: Michal Hocko <mhocko@suse.com> Cc: David Rientjes <rientjes@google.com> Cc: Matthew Wilcox <willy@infradead.org> Cc: Johannes Weiner <hannes@cmpxchg.org> Cc: Roman Gushchin <roman.gushchin@linux.dev> Cc: Rik van Riel <riel@surriel.com> Cc: Minchan Kim <minchan@kernel.org> Cc: Kirill A. Shutemov <kirill@shutemov.name> Cc: Andrea Arcangeli <aarcange@redhat.com> Cc: Christian Brauner <brauner@kernel.org> Cc: Christoph Hellwig <hch@infradead.org> Cc: Oleg Nesterov <oleg@redhat.com> Cc: David Hildenbrand <david@redhat.com> Cc: Jann Horn <jannh@google.com> Cc: Shakeel Butt <shakeelb@google.com> Cc: Andy Lutomirski <luto@kernel.org> Cc: Christian Brauner <christian.brauner@ubuntu.com> Cc: Florian Weimer <fweimer@redhat.com> Cc: Jan Engelhardt <jengelh@inai.de> Cc: Tim Murray <timmurray@google.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> (cherry picked from commit `f798a1d4f9`) Signed-off-by: Lee Jones <joneslee@google.com> Change-Id: I84fdfe87bab3df5e0ee6f1444e606b353f2e4662	2022-12-19 16:30:28 +00:00
Greg Kroah-Hartman	5a91f1aa85	Merge 5.15.84 into android14-5.15 Changes in 5.15.84 x86/vdso: Conditionally export __vdso_sgx_enter_enclave() vfs: fix copy_file_range() averts filesystem freeze protection nfp: fix use-after-free in area_cache_get() ASoC: fsl_micfil: explicitly clear software reset bit ASoC: fsl_micfil: explicitly clear CHnF flags ASoC: ops: Check bounds for second channel in snd_soc_put_volsw_sx() libbpf: Use page size as max_entries when probing ring buffer map pinctrl: meditatek: Startup with the IRQs disabled can: sja1000: fix size of OCR_MODE_MASK define can: mcba_usb: Fix termination command argument net: fec: don't reset irq coalesce settings to defaults on "ip link up" ASoC: cs42l51: Correct PGA Volume minimum value perf: Fix perf_pending_task() UaF nvme-pci: clear the prp2 field when not used ASoC: ops: Correct bounds check for second channel on SX controls net: fec: properly guard irq coalesce setup Linux 5.15.84 Change-Id: I34ef5e73fca9da9a77c89b1f0c7ad4af37b63a79 Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2022-12-19 17:29:08 +01:00
Andrey Konovalov	68f55096aa	UPSTREAM: kasan: test: prevent cache merging in kmem_cache_double_destroy With HW_TAGS KASAN and kasan.stacktrace=off, the cache created in the kmem_cache_double_destroy() test might get merged with an existing one. Thus, the first kmem_cache_destroy() call won't actually destroy it but will only decrease the refcount. This causes the test to fail. Provide an empty constructor for the created cache to prevent the cache from getting merged. Bug: 254441685 Link: https://lkml.kernel.org/r/b597bd434c49591d8af00ee3993a42c609dc9a59.1644346040.git.andreyknvl@google.com Fixes: `f98f966cd7` ("kasan: test: add test case for double-kmem_cache_destroy()") Signed-off-by: Andrey Konovalov <andreyknvl@google.com> Reviewed-by: Marco Elver <elver@google.com> Cc: Alexander Potapenko <glider@google.com> Cc: Dmitry Vyukov <dvyukov@google.com> Cc: Andrey Ryabinin <ryabinin.a.a@gmail.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org> (cherry picked from commit `70effdc375`) Signed-off-by: Lee Jones <joneslee@google.com> Change-Id: Iadceef74f2239975ec6abd934776ecd5a6d39943	2022-12-19 16:21:56 +00:00
Catalin Marinas	17954359cf	UPSTREAM: arm64: Ensure that the 'bti' macro is defined where linkage.h is included Not all .S files include asm/assembler.h, however the SYM_FUNC_* definitions invoke the 'bti' macro. Include asm/assembler.h in asm/linkage.h. Bug: 254441685 Fixes: `9be34be87c` ("arm64: Add macro version of the BTI instruction") Signed-off-by: Catalin Marinas <catalin.marinas@arm.com> (cherry picked from commit `dd73d18e7f`) Signed-off-by: Lee Jones <joneslee@google.com> Change-Id: I5dc6693315e56c36bd5c597a3b0de1655e11c7ba	2022-12-19 16:16:37 +00:00
Wenbin Mei	033dde8261	UPSTREAM: mmc: mediatek: free the ext_csd when mmc_get_ext_csd success If mmc_get_ext_csd success, the ext_csd are not freed. Add the missing kfree() calls. Bug: 254441685 Signed-off-by: Wenbin Mei <wenbin.mei@mediatek.com> Fixes: `c4ac38c653` ("mmc: mtk-sd: Add HS400 online tuning support") Link: https://lore.kernel.org/r/20211207075013.22911-1-wenbin.mei@mediatek.com Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org> (cherry picked from commit `d594b35d3b`) Signed-off-by: Lee Jones <joneslee@google.com> Change-Id: I5da7d65f841e84ea861dcf2213953a007d8a8892	2022-12-19 16:16:37 +00:00
Ramji Jiyani	a6eaf3db80	ANDROID: GKI: Only protect exports if KMI symbols are present Only enforce export protection if there are symbols in the unprotected list for the Kernel Module Interface (KMI). This is only relevant for targets like arm64 that have defined ABI symbol lists. This allows non-GKI targets like arm and x86 to continue using GKI source code without disabling the feature for those targets. Bug: 232430739 Test: TH Fixes: `fd1e768866` ("ANDROID: GKI: Protect exports of protected GKI modules") Change-Id: Ie89e8f63eda99d9b7aacd1bb76d036b3ff4ba37c Signed-off-by: Ramji Jiyani <ramjiyani@google.com>	2022-12-19 16:16:00 +00:00
Ramji Jiyani	16c63232db	ANDROID: GKI: Update GKI modules protected exports Update protected export symbols list with exports from list of protected modules at android/gki_protected_modules. It includes symbols from every GKI modules except zram & zsmalloc; and serves as a baseline. Bug: 232430739 Test: TH Change-Id: Iec33dfe093b4e9e0281b910b2b3bf998cef55394 Signed-off-by: Ramji Jiyani <ramjiyani@google.com>	2022-12-19 16:16:00 +00:00
Lee Jones	d19f8758ae	ANDROID: Revert "ANDROID: allmodconfig: disable WERROR" This reverts commit `eb57c31115`. This branch looks clean of WERROR warnings. Let's try to re-enable it. Fixes: `eb57c31115` ("ANDROID: allmodconfig: disable WERROR") Signed-off-by: Lee Jones <joneslee@google.com> Change-Id: I0106dcd43d7e4b4e20ac768f3faac40285bc837b Signed-off-by: Lee Jones <joneslee@google.com>	2022-12-19 14:11:45 +00:00
Greg Kroah-Hartman	d68f50bfb0	Linux 5.15.84 Link: https://lore.kernel.org/r/20221215172906.338769943@linuxfoundation.org Tested-by: Shuah Khan <skhan@linuxfoundation.org> Tested-by: Bagas Sanjaya <bagasdotme@gmail.com> Tested-by: Allen Pais <apais@linux.microsoft.com> Tested-by: Linux Kernel Functional Testing <lkft@linaro.org> Tested-by: Jon Hunter <jonathanh@nvidia.com> Tested-by: Sudip Mukherjee <sudip.mukherjee@codethink.co.uk> Tested-by: Florian Fainelli <f.fainelli@gmail.com> Tested-by: Guenter Roeck <linux@roeck-us.net> Tested-by: Ron Economos <re@w6rz.net> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2022-12-19 12:36:45 +01:00
Rasmus Villemoes	972707bae3	net: fec: properly guard irq coalesce setup commit `7e6303567c` upstream. Prior to the Fixes: commit, the initialization code went through the same fec_enet_set_coalesce() function as used by ethtool, and that function correctly checks whether the current variant has support for irq coalescing. Now that the initialization code instead calls fec_enet_itr_coal_set() directly, that call needs to be guarded by a check for the FEC_QUIRK_HAS_COALESCE bit. Fixes: `df727d4547` (net: fec: don't reset irq coalesce settings to defaults on "ip link up") Reported-by: Greg Ungerer <gregungerer@westnet.com.au> Signed-off-by: Rasmus Villemoes <linux@rasmusvillemoes.dk> Reviewed-by: Florian Fainelli <f.fainelli@gmail.com> Link: https://lore.kernel.org/r/20221205204604.869853-1-linux@rasmusvillemoes.dk Signed-off-by: Jakub Kicinski <kuba@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2022-12-19 12:36:45 +01:00
Charles Keepax	289721fe09	ASoC: ops: Correct bounds check for second channel on SX controls commit `f33bcc5060` upstream. Currently the check against the max value for the control is being applied after the value has had the minimum applied and been masked. But the max value simply indicates the number of volume levels on an SX control, and as such should just be applied on the raw value. Fixes: `97eea946b9` ("ASoC: ops: Check bounds for second channel in snd_soc_put_volsw_sx()") Signed-off-by: Charles Keepax <ckeepax@opensource.cirrus.com> Link: https://lore.kernel.org/r/20221125162348.1288005-1-ckeepax@opensource.cirrus.com Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2022-12-19 12:36:44 +01:00
Lei Rao	de0866b94a	nvme-pci: clear the prp2 field when not used [ Upstream commit `a56ea6147f` ] If the prp2 field is not filled in nvme_setup_prp_simple(), the prp2 field is garbage data. According to nvme spec, the prp2 is reserved if the data transfer does not cross a memory page boundary, so clear it to zero if it is not used. Signed-off-by: Lei Rao <lei.rao@intel.com> Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:44 +01:00
Peter Zijlstra	8bffa95ac1	perf: Fix perf_pending_task() UaF [ Upstream commit `517e6a301f` ] Per syzbot it is possible for perf_pending_task() to run after the event is free()'d. There are two related but distinct cases: - the task_work was already queued before destroying the event; - destroying the event itself queues the task_work. The first cannot be solved using task_work_cancel() since perf_release() itself might be called from a task_work (____fput), which means the current->task_works list is already empty and task_work_cancel() won't be able to find the perf_pending_task() entry. The simplest alternative is extending the perf_event lifetime to cover the task_work. The second is just silly, queueing a task_work while you know the event is going away makes no sense and is easily avoided by re-arranging how the event is marked STATE_DEAD and ensuring it goes through STATE_OFF on the way down. Reported-by: syzbot+9228d6098455bb209ec8@syzkaller.appspotmail.com Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Tested-by: Marco Elver <elver@google.com> Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:43 +01:00
Charles Keepax	825bd2af42	ASoC: cs42l51: Correct PGA Volume minimum value [ Upstream commit `3d1bb6cc1a` ] The table in the datasheet actually shows the volume values in the wrong order, with the two -3dB values being reversed. This appears to have caused the lower of the two values to be used in the driver when the higher should have been, correct this mixup. Signed-off-by: Charles Keepax <ckeepax@opensource.cirrus.com> Link: https://lore.kernel.org/r/20221125162348.1288005-2-ckeepax@opensource.cirrus.com Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:43 +01:00
Rasmus Villemoes	91582b3a1a	net: fec: don't reset irq coalesce settings to defaults on "ip link up" [ Upstream commit `df727d4547` ] Currently, when a FEC device is brought up, the irq coalesce settings are reset to their default values (1000us, 200 frames). That's unexpected, and breaks for example use of an appropriate .link file to make systemd-udev apply the desired settings (https://www.freedesktop.org/software/systemd/man/systemd.link.html), or any other method that would do a one-time setup during early boot. Refactor the code so that fec_restart() instead uses fec_enet_itr_coal_set(), which simply applies the settings that are stored in the private data, and initialize that private data with the default values. Signed-off-by: Rasmus Villemoes <linux@rasmusvillemoes.dk> Reviewed-by: Andrew Lunn <andrew@lunn.ch> Signed-off-by: David S. Miller <davem@davemloft.net> Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:42 +01:00
Yasushi SHOJI	c772dab247	can: mcba_usb: Fix termination command argument [ Upstream commit `1a8e3bd25f` ] Microchip USB Analyzer can activate the internal termination resistors by setting the "termination" option ON, or OFF to to deactivate them. As I've observed, both with my oscilloscope and captured USB packets below, you must send "0" to turn it ON, and "1" to turn it OFF. From the schematics in the user's guide, I can confirm that you must drive the CAN_RES signal LOW "0" to activate the resistors. Reverse the argument value of usb_msg.termination to fix this. These are the two commands sequence, ON then OFF. > No. Time Source Destination Protocol Length Info > 1 0.000000 host 1.3.1 USB 46 URB_BULK out > > Frame 1: 46 bytes on wire (368 bits), 46 bytes captured (368 bits) > USB URB > Leftover Capture Data: a80000000000000000000000000000000000a8 > > No. Time Source Destination Protocol Length Info > 2 4.372547 host 1.3.1 USB 46 URB_BULK out > > Frame 2: 46 bytes on wire (368 bits), 46 bytes captured (368 bits) > USB URB > Leftover Capture Data: a80100000000000000000000000000000000a9 Signed-off-by: Yasushi SHOJI <yashi@spacecubics.com> Link: https://lore.kernel.org/all/20221124152504.125994-1-yashi@spacecubics.com Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de> Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:42 +01:00
Heiko Schocher	aa822de7de	can: sja1000: fix size of OCR_MODE_MASK define [ Upstream commit `26e8f6a752` ] bitfield mode in ocr register has only 2 bits not 3, so correct the OCR_MODE_MASK define. Signed-off-by: Heiko Schocher <hs@denx.de> Link: https://lore.kernel.org/all/20221123071636.2407823-1-hs@denx.de Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de> Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:42 +01:00
Ricardo Ribalda	09e08740d7	pinctrl: meditatek: Startup with the IRQs disabled [ Upstream commit `11780e3756` ] If the system is restarted via kexec(), the peripherals do not start with a known state. If the previous system had enabled an IRQs we will receive unexected IRQs that can lock the system. [ 28.109251] watchdog: BUG: soft lockup - CPU#0 stuck for 26s! [swapper/0:0] [ 28.109263] Modules linked in: [ 28.109273] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 5.15.79-14458-g4b9edf7b1ac6 #1 9f2e76613148af94acccd64c609a552fb4b4354b [ 28.109284] Hardware name: Google Elm (DT) [ 28.109290] pstate: 40400005 (nZcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--) [ 28.109298] pc : __do_softirq+0xa0/0x388 [ 28.109309] lr : __do_softirq+0x70/0x388 [ 28.109316] sp : ffffffc008003ee0 [ 28.109321] x29: ffffffc008003f00 x28: 000000000000000a x27: 0000000000000080 [ 28.109334] x26: 0000000000000001 x25: ffffffefa7b350c0 x24: ffffffefa7b47480 [ 28.109346] x23: ffffffefa7b3d000 x22: 0000000000000000 x21: ffffffefa7b0fa40 [ 28.109358] x20: ffffffefa7b005b0 x19: ffffffefa7b47480 x18: 0000000000065b6b [ 28.109370] x17: ffffffefa749c8b0 x16: 000000000000018c x15: 00000000000001b8 [ 28.109382] x14: 00000000000d3b6b x13: 0000000000000006 x12: 0000000000057e91 [ 28.109394] x11: 0000000000000000 x10: 0000000000000000 x9 : ffffffefa7b47480 [ 28.109406] x8 : 00000000000000e0 x7 : 000000000f424000 x6 : 0000000000000000 [ 28.109418] x5 : ffffffefa7dfaca0 x4 : ffffffefa7dfadf0 x3 : 000000000000000f [ 28.109429] x2 : 0000000000000000 x1 : 0000000000000100 x0 : 0000000001ac65c5 [ 28.109441] Call trace: [ 28.109447] __do_softirq+0xa0/0x388 [ 28.109454] irq_exit+0xc0/0xe0 [ 28.109464] handle_domain_irq+0x68/0x90 [ 28.109473] gic_handle_irq+0xac/0xf0 [ 28.109480] call_on_irq_stack+0x28/0x50 [ 28.109488] do_interrupt_handler+0x44/0x58 [ 28.109496] el1_interrupt+0x30/0x58 [ 28.109506] el1h_64_irq_handler+0x18/0x24 [ 28.109512] el1h_64_irq+0x7c/0x80 [ 28.109519] arch_local_irq_enable+0xc/0x18 [ 28.109529] default_idle_call+0x40/0x140 [ 28.109539] do_idle+0x108/0x290 [ 28.109547] cpu_startup_entry+0x2c/0x30 [ 28.109554] rest_init+0xe8/0xf8 [ 28.109562] arch_call_rest_init+0x18/0x24 [ 28.109571] start_kernel+0x338/0x42c [ 28.109578] __primary_switched+0xbc/0xc4 [ 28.109588] Kernel panic - not syncing: softlockup: hung tasks Signed-off-by: Ricardo Ribalda <ribalda@chromium.org> Link: https://lore.kernel.org/r/20221122-mtk-pinctrl-v1-1-bedf5655a3d2@chromium.org Reviewed-by: AngeloGioacchino Del Regno <angelogioacchino.delregno@collabora.com> Reviewed-by: Matthias Brugger <matthias.bgg@gmail.com> Signed-off-by: Linus Walleij <linus.walleij@linaro.org> Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:41 +01:00
Hou Tao	172a95026f	libbpf: Use page size as max_entries when probing ring buffer map [ Upstream commit `689eb2f1ba` ] Using page size as max_entries when probing ring buffer map, else the probe may fail on host with 64KB page size (e.g., an ARM64 host). After the fix, the output of "bpftool feature" on above host will be correct. Before : eBPF map_type ringbuf is NOT available eBPF map_type user_ringbuf is NOT available After : eBPF map_type ringbuf is available eBPF map_type user_ringbuf is available Signed-off-by: Hou Tao <houtao1@huawei.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20221116072351.1168938-2-houtao@huaweicloud.com Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:41 +01:00
Mark Brown	cf611d7867	ASoC: ops: Check bounds for second channel in snd_soc_put_volsw_sx() [ Upstream commit `97eea946b9` ] The bounds checks in snd_soc_put_volsw_sx() are only being applied to the first channel, meaning it is possible to write out of bounds values to the second channel in stereo controls. Add appropriate checks. Signed-off-by: Mark Brown <broonie@kernel.org> Link: https://lore.kernel.org/r/20220511134137.169575-2-broonie@kernel.org Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:40 +01:00
Shengjiu Wang	a74b88e170	ASoC: fsl_micfil: explicitly clear CHnF flags [ Upstream commit `b776c4a461` ] There may be failure when start 1 channel recording after 8 channels recording. The reason is that the CHnF flags are not cleared successfully by software reset. This issue is triggerred by the change of clearing software reset bit. CHnF flags are write 1 clear bits. Clear them by force write. Signed-off-by: Shengjiu Wang <shengjiu.wang@nxp.com> Link: https://lore.kernel.org/r/1651925654-32060-2-git-send-email-shengjiu.wang@nxp.com Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:40 +01:00
Shengjiu Wang	afac1e7d78	ASoC: fsl_micfil: explicitly clear software reset bit [ Upstream commit `292709b9cf` ] SRES is self-cleared bit, but REG_MICFIL_CTRL1 is defined as non volatile register, it still remain in regmap cache after set, then every update of REG_MICFIL_CTRL1, software reset happens. to avoid this, clear it explicitly. Signed-off-by: Shengjiu Wang <shengjiu.wang@nxp.com> Link: https://lore.kernel.org/r/1651925654-32060-1-git-send-email-shengjiu.wang@nxp.com Signed-off-by: Mark Brown <broonie@kernel.org> Signed-off-by: Sasha Levin <sashal@kernel.org>	2022-12-19 12:36:40 +01:00
Jialiang Wang	9d933af8fe	nfp: fix use-after-free in area_cache_get() commit `02e1a114fd` upstream. area_cache_get() is used to distribute cache->area and set cache->id, and if cache->id is not 0 and cache->area->kref refcount is 0, it will release the cache->area by nfp_cpp_area_release(). area_cache_get() set cache->id before cpp->op->area_init() and nfp_cpp_area_acquire(). But if area_init() or nfp_cpp_area_acquire() fails, the cache->id is is already set but the refcount is not increased as expected. At this time, calling the nfp_cpp_area_release() will cause use-after-free. To avoid the use-after-free, set cache->id after area_init() and nfp_cpp_area_acquire() complete successfully. Note: This vulnerability is triggerable by providing emulated device equipped with specified configuration. BUG: KASAN: use-after-free in nfp6000_area_init (drivers/net/ethernet/netronome/nfp/nfpcore/nfp6000_pcie.c:760) Write of size 4 at addr ffff888005b7f4a0 by task swapper/0/1 Call Trace: <TASK> nfp6000_area_init (drivers/net/ethernet/netronome/nfp/nfpcore/nfp6000_pcie.c:760) area_cache_get.constprop.8 (drivers/net/ethernet/netronome/nfp/nfpcore/nfp_cppcore.c:884) Allocated by task 1: nfp_cpp_area_alloc_with_name (drivers/net/ethernet/netronome/nfp/nfpcore/nfp_cppcore.c:303) nfp_cpp_area_cache_add (drivers/net/ethernet/netronome/nfp/nfpcore/nfp_cppcore.c:802) nfp6000_init (drivers/net/ethernet/netronome/nfp/nfpcore/nfp6000_pcie.c:1230) nfp_cpp_from_operations (drivers/net/ethernet/netronome/nfp/nfpcore/nfp_cppcore.c:1215) nfp_pci_probe (drivers/net/ethernet/netronome/nfp/nfp_main.c:744) Freed by task 1: kfree (mm/slub.c:4562) area_cache_get.constprop.8 (drivers/net/ethernet/netronome/nfp/nfpcore/nfp_cppcore.c:873) nfp_cpp_read (drivers/net/ethernet/netronome/nfp/nfpcore/nfp_cppcore.c:924 drivers/net/ethernet/netronome/nfp/nfpcore/nfp_cppcore.c:973) nfp_cpp_readl (drivers/net/ethernet/netronome/nfp/nfpcore/nfp_cpplib.c:48) Signed-off-by: Jialiang Wang <wangjialiang0806@163.com> Reviewed-by: Yinjun Zhang <yinjun.zhang@corigine.com> Acked-by: Simon Horman <simon.horman@corigine.com> Link: https://lore.kernel.org/r/20220810073057.4032-1-wangjialiang0806@163.com Signed-off-by: Jakub Kicinski <kuba@kernel.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2022-12-19 12:36:39 +01:00
Amir Goldstein	e1a4f5880d	vfs: fix copy_file_range() averts filesystem freeze protection commit `10bc8e4af6` upstream. Commit `868f9f2f8e` ("vfs: fix copy_file_range() regression in cross-fs copies") removed fallback to generic_copy_file_range() for cross-fs cases inside vfs_copy_file_range(). To preserve behavior of nfsd and ksmbd server-side-copy, the fallback to generic_copy_file_range() was added in nfsd and ksmbd code, but that call is missing sb_start_write(), fsnotify hooks and more. Ideally, nfsd and ksmbd would pass a flag to vfs_copy_file_range() that will take care of the fallback, but that code would be subtle and we got vfs_copy_file_range() logic wrong too many times already. Instead, add a flag to explicitly request vfs_copy_file_range() to perform only generic_copy_file_range() and let nfsd and ksmbd use this flag only in the fallback path. This choise keeps the logic changes to minimum in the non-nfsd/ksmbd code paths to reduce the risk of further regressions. Fixes: `868f9f2f8e` ("vfs: fix copy_file_range() regression in cross-fs copies") Tested-by: Namjae Jeon <linkinjeon@kernel.org> Tested-by: Luis Henriques <lhenriques@suse.de> Signed-off-by: Amir Goldstein <amir73il@gmail.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> [backport comments for v5.15: - sb_write_started() is missing - assert was dropped ] Signed-off-by: Amir Goldstein <amir73il@gmail.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2022-12-19 12:36:39 +01:00
Nathan Chancellor	86e28ed25b	x86/vdso: Conditionally export __vdso_sgx_enter_enclave() commit `45be2ad007` upstream. Recently, ld.lld moved from '--undefined-version' to '--no-undefined-version' as the default, which breaks building the vDSO when CONFIG_X86_SGX is not set: ld.lld: error: version script assignment of 'LINUX_2.6' to symbol '__vdso_sgx_enter_enclave' failed: symbol not defined __vdso_sgx_enter_enclave is only included in the vDSO when CONFIG_X86_SGX is set. Only export it if it will be present in the final object, which clears up the error. Fixes: `8466436952` ("x86/vdso: Implement a vDSO for Intel SGX enclave call") Signed-off-by: Nathan Chancellor <nathan@kernel.org> Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Reviewed-by: Nick Desaulniers <ndesaulniers@google.com> Link: https://github.com/ClangBuiltLinux/linux/issues/1756 Link: https://lore.kernel.org/r/20221109000306.1407357-1-nathan@kernel.org Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2022-12-19 12:36:38 +01:00
Alexander Stein	aba68815d5	UPSTREAM: extcon: Deduplicate code in extcon_set_state_sync() Finding the cable index and checking for changed status is also done in extcon_set_state(). So calling extcon_set_state_sync() will do these checks twice. Remove them and use these checks from extcon_set_state(). Bug: 253534975 Bug: 260915739 Change-Id: Iaff09f32e237751c2a94fdd6a50dbf20d9c9d321 Signed-off-by: Alexander Stein <alexander.stein@ew.tq-group.com> Signed-off-by: Chanwoo Choi <cw00.choi@samsung.com> Link: https://lore.kernel.org/all/20211123145301.778629-1-alexander.stein@ew.tq-group.com/T/ (cherry picked from commit `2da3db7f49`)	2022-12-17 08:50:35 +00:00
Hans de Goede	21f5612164	UPSTREAM: usb: typec: altmodes/displayport: Make dp_altmode_notify() more generic Make dp_altmode_notify() handle the dp->data.conf == 0 case too, rather then having separate code-paths for this in various places which call it. Bug: 253534975 Bug: 260915739 Reviewed-by: Heikki Krogerus <heikki.krogerus@linux.intel.com> Tested-by: Heikki Krogerus <heikki.krogerus@linux.intel.com> Change-Id: I621216ad55cf9b0298b3124520128e4d0e67b378 Signed-off-by: Hans de Goede <hdegoede@redhat.com> Reviewed-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Reviewed-by: Lyude Paul <lyude@redhat.com> Link: https://lore.kernel.org/r/20210817215201.795062-8-hdegoede@redhat.com (cherry picked from commit `fc27e04630`)	2022-12-17 08:50:35 +00:00
Eric Biggers	0210faf748	UPSTREAM: crypto: algboss - compile out test-related code when tests disabled When CONFIG_CRYPTO_MANAGER_DISABLE_TESTS is set, the code in algboss.c that handles CRYPTO_MSG_ALG_REGISTER is unnecessary, so make it be compiled out. Signed-off-by: Eric Biggers <ebiggers@google.com> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Bug: 256875295 (cherry picked from commit `441cb1b730`) Change-Id: I11ebf60e1915ad5d13bd16a26d6c2c0944b4c401 Signed-off-by: Eric Biggers <ebiggers@google.com>	2022-12-16 18:58:21 +00:00
Eric Biggers	118fe0a09c	UPSTREAM: crypto: api - compile out crypto_boot_test_finished when tests disabled The crypto_boot_test_finished static key is unnecessary when self-tests are disabled in the kconfig, so optimize it out accordingly, along with the entirety of crypto_start_tests(). This mainly avoids the overhead of an unnecessary static_branch_enable() on every boot. Signed-off-by: Eric Biggers <ebiggers@google.com> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Bug: 256875295 (cherry picked from commit `06bd9c967e`) Change-Id: I68eff9772dc219a8786bf410cb4e946052ea7811 Signed-off-by: Eric Biggers <ebiggers@google.com>	2022-12-16 18:58:21 +00:00
Eric Biggers	749d7493ad	UPSTREAM: crypto: algboss - optimize registration of internal algorithms Since algboss always skips testing of algorithms with the CRYPTO_ALG_INTERNAL flag, there is no need to go through the dance of creating the test kthread, which creates a lot of overhead. Instead, we can just directly finish the algorithm registration, like is now done when self-tests are disabled entirely. Signed-off-by: Eric Biggers <ebiggers@google.com> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Bug: 256875295 (cherry picked from commit `9cadd73ade`) Change-Id: I10f814cd6903d41265f69297d8568b43ec30012e Signed-off-by: Eric Biggers <ebiggers@google.com>	2022-12-16 18:58:21 +00:00
Eric Biggers	f342a2c751	BACKPORT: crypto: api - optimize algorithm registration when self-tests disabled Currently, registering an algorithm with the crypto API always causes a notification to be posted to the "cryptomgr", which then creates a kthread to self-test the algorithm. However, if self-tests are disabled in the kconfig (as is the default option), then this kthread just notifies waiters that the algorithm has been tested, then exits. This causes a significant amount of overhead, especially in the kthread creation and destruction, which is not necessary at all. For example, in a quick test I found that booting a "minimum" x86_64 kernel with all the crypto options enabled (except for the self-tests) takes about 400ms until PID 1 can start. Of that, a full 13ms is spent just doing this pointless dance, involving a kthread being created, run, and destroyed over 200 times. That's over 3% of the entire kernel start time. Fix this by just skipping the creation of the test larval and the posting of the registration notification entirely, when self-tests are disabled. Signed-off-by: Eric Biggers <ebiggers@google.com> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Bug: 256875295 (cherry picked from commit `a7008584ab`) (Resolved trivial conflict due to missing upstream commit `d6097b8d5d`) Change-Id: Ia6be068618e9286c1be01415a6766ba2fa94fc0d Signed-off-by: Eric Biggers <ebiggers@google.com>	2022-12-16 18:58:21 +00:00
Herbert Xu	21f8e3133a	UPSTREAM: crypto: api - Fix boot-up crash when crypto manager is disabled When the crypto manager is disabled, we need to explicitly set the crypto algorithms' tested status so that they can be used. Fixes: `cad439fc04` ("crypto: api - Do not create test larvals if...") Reported-by: Geert Uytterhoeven <geert@linux-m68k.org> Reported-by: Ido Schimmel <idosch@idosch.org> Reported-by: Guenter Roeck <linux@roeck-us.net> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Tested-by: Ido Schimmel <idosch@nvidia.com> Tested-by: Geert Uytterhoeven <geert@linux-m68k.org> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Bug: 256875295 (cherry picked from commit `beaaaa37c6`) Change-Id: I6cb42580e4774fbfd075497468b488be3447b7a9 Signed-off-by: Eric Biggers <ebiggers@google.com>	2022-12-16 18:58:21 +00:00
Herbert Xu	cd504c8ec9	UPSTREAM: crypto: api - Do not create test larvals if manager is disabled The delayed boot-time testing patch created a dependency loop between api.c and algapi.c because it added a crypto_alg_tested call to the former when the crypto manager is disabled. We could instead avoid creating the test larvals if the crypto manager is disabled. This avoids the dependency loop as well as saving some unnecessary work, albeit in a very unlikely case. Reported-by: Nathan Chancellor <nathan@kernel.org> Reported-by: Naresh Kamboju <naresh.kamboju@linaro.org> Reported-by: kernel test robot <lkp@intel.com> Fixes: `adad556efc` ("crypto: api - Fix built-in testing dependency failures") Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Bug: 256875295 (cherry picked from commit `cad439fc04`) Change-Id: I4e0e0b2022dc060fc1d84744e04beae411165ad0 Signed-off-by: Eric Biggers <ebiggers@google.com>	2022-12-16 18:58:21 +00:00
Herbert Xu	6be352041a	UPSTREAM: crypto: api - Export crypto_boot_test_finished We need to export crypto_boot_test_finished in case api.c is built-in while algapi.c is built as a module. Fixes: `adad556efc` ("crypto: api - Fix built-in testing dependency failures") Reported-by: Stephen Rothwell <sfr@canb.auug.org.au> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Tested-by: Stephen Rothwell <sfr@canb.auug.org.au> # ppc32 build Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Bug: 256875295 (cherry picked from commit `e42dff467e`) Change-Id: Iefc190f29539084e7c84e23120e861de2e0b9351 Signed-off-by: Eric Biggers <ebiggers@google.com>	2022-12-16 18:58:21 +00:00
Herbert Xu	9a70f42d47	UPSTREAM: crypto: api - Fix built-in testing dependency failures When complex algorithms that depend on other algorithms are built into the kernel, the order of registration must be done such that the underlying algorithms are ready before the ones on top are registered. As otherwise they would fail during the self-test which is required during registration. In the past we have used subsystem initialisation ordering to guarantee this. The number of such precedence levels are limited and they may cause ripple effects in other subsystems. This patch solves this problem by delaying all self-tests during boot-up for built-in algorithms. They will be tested either when something else in the kernel requests for them, or when we have finished registering all built-in algorithms, whichever comes earlier. Reported-by: Vladis Dronov <vdronov@redhat.com> Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Bug: 256875295 (cherry picked from commit `adad556efc`) Change-Id: I9cb048ffe0ce7e471cc6e71904f1b2c462b57be4 Signed-off-by: Eric Biggers <ebiggers@google.com>	2022-12-16 18:58:21 +00:00
Ramji Jiyani	90ff743687	ANDROID: GKI: Add list of protected GKI modules android/gki_protected_modules serves as a running list of protected GKI modules. This list is being used as an input to generate list of protected GKI modules exports at android/abi_gki_protected_exports All GKI modules are protected except zram.ko & zsmalloc.ko as baseline in this list. Bug: 232430739 Test: TH Change-Id: I0c993769b9d07543755fd056199b0e4d10d27f77 Signed-off-by: Ramji Jiyani <ramjiyani@google.com>	2022-12-16 17:32:15 +00:00
Ramji Jiyani	fd1e768866	ANDROID: GKI: Protect exports of protected GKI modules Implement support for protecting the exported symbols of protected GKI modules. Only signed GKI modules are permitted to export symbols listed in the android/abi_gki_protected_exports file. Attempting to export these symbols from an unsigned module will result in the module failing to load, with a 'Permission denied' error message. Bug: 232430739 Test: TH Change-Id: I3e8b330938e116bb2e022d356ac0d55108a84a01 Signed-off-by: Ramji Jiyani <ramjiyani@google.com>	2022-12-16 16:44:54 +00:00
Will Deacon	5e28b84896	ANDROID: KVM: arm64: Add support for non-cacheable mappings Hypervisor vendor modules may need to create non-cacheable mappings in the hypervisor stage-1 for interacting with devices such as IOMMUs. Add support for this memory type to the KVM pgtable API and implement it for both stage-1 and stage-2. Bug: 244373730 Signed-off-by: Will Deacon <willdeacon@google.com> Change-Id: I2f88db7fe47e16366018e3e48f30d09b299ae6e4	2022-12-16 10:14:43 +00:00
Eric Biggers	679bf6a591	ANDROID: crypto: testmgr - add back deleted hctr2 test vectors The merge of 5.15.61 into this branch incorrectly deleted the test vectors that were added by the following commits: commit `0035442093` ("UPSTREAM: crypto: xctr - Add XCTR support") commit `e3efa8253b` ("UPSTREAM: crypto: polyval - Add POLYVAL support") commit `d672bb9c20` ("UPSTREAM: crypto: hctr2 - Add HCTR2 support") This causes a build error when CONFIG_CRYPTO_MANAGER_DISABLE_TESTS is not set. Fix this by adding back the test vectors. Bug: 233652475 Fixes: `47c7e57022` ("Merge 5.15.61 into android14-5.15") Change-Id: I7dce7570d51a97b88ae751046443df6f0a9038b2 Signed-off-by: Eric Biggers <ebiggers@google.com>	2022-12-16 08:10:27 +00:00
Daniel Rosenberg	b289d1706b	ANDROID: fsnotify: Notify lower fs of open If the filesystem being watched supports d_canonical_path, notify the lower filesystem of the open as well. Fixes: `f37e05049b` ("ANDROID: vfs: d_canonical_path for stacked FS") Test: atest CtsOsTestCases:android.os.cts.FileObserverTest Bug: 70706497 Signed-off-by: Daniel Rosenberg <drosen@google.com> Signed-off-by: Paul Lawrence <paullawrence@google.com> Signed-off-by: Alessio Balsini <balsini@google.com> Change-Id: I7c9d210e8e6ee99928ad9db0b41ffc3ac3371dc0	2022-12-16 02:20:17 +00:00
Jaegeuk Kim	085d7798c4	Merge remote-tracking branch 'aosp/upstream-f2fs-stable-linux-5.15.y' into android14-5.15 * aosp/upstream-f2fs-stable-linux-5.15.y: f2fs: reset wait_ms to default if any of the victims have been selected f2fs: fix some format WARNING in debug.c and sysfs.c f2fs: don't call f2fs_issue_discard_timeout() when discard_cmd_cnt is 0 in f2fs_put_super() f2fs: fix iostat parameter for discard f2fs: Fix spelling mistake in label: free_bio_enrty_cache -> free_bio_entry_cache f2fs: add block_age-based extent cache f2fs: allocate the extent_cache by default f2fs: refactor extent_cache to support for read and more f2fs: remove unnecessary __init_extent_tree f2fs: move internal functions into extent_cache.c f2fs: specify extent cache for read explicitly f2fs: introduce f2fs_is_readonly() for readability f2fs: remove F2FS_SET_FEATURE() and F2FS_CLEAR_FEATURE() macro f2fs: do some cleanup for f2fs module init MAINTAINERS: Add f2fs bug tracker link f2fs: remove the unused flush argument to change_curseg f2fs: open code allocate_segment_by_default f2fs: remove struct segment_allocation default_salloc_ops f2fs: introduce discard_urgent_util sysfs node f2fs: define MIN_DISCARD_GRANULARITY macro f2fs: init discard policy after thread wakeup f2fs: avoid victim selection from previous victim section f2fs: truncate blocks in batch in __complete_revoke_list() f2fs: make __queue_discard_cmd() return void f2fs: fix description about discard_granularity node f2fs: move set_file_temperature into f2fs_new_inode f2fs: fix to enable compress for newly created file if extension matches f2fs: change type for 'sbi->readdir_ra' f2fs: cleanup for 'f2fs_tuning_parameters' function f2fs: fix to alloc_mode changed after remount on a small volume device f2fs: remove submit label in __submit_discard_cmd() f2fs: fix to do sanity check on i_extra_isize in is_alive() f2fs: introduce F2FS_IOC_START_ATOMIC_REPLACE f2fs: fix to set flush_merge opt and show noflush_merge f2fs: initialize locks earlier in f2fs_fill_super() f2fs: optimize iteration over sparse directories f2fs: fix to avoid accessing uninitialized spinlock f2fs: correct i_size change for atomic writes f2fs: add proc entry to show discard_plist info f2fs: allow to read node block after shutdown f2fs: replace ternary operator with max() f2fs: replace gc_urgent_high_remaining with gc_remaining_trials f2fs: add missing bracket in doc f2fs: use sysfs_emit instead of sprintf f2fs: introduce gc_mode sysfs node f2fs: fix to destroy sbi->post_read_wq in error path of f2fs_fill_super() f2fs: fix return val in f2fs_start_ckpt_thread() f2fs: fix the msg data type f2fs: fix the assign logic of iocb f2fs: Fix typo in comments f2fs: introduce max_ordered_discard sysfs node f2fs: allow to set compression for inlined file f2fs: add barrier mount option f2fs: fix normal discard process f2fs: cleanup in f2fs_create_flush_cmd_control() f2fs: fix gc mode when gc_urgent_high_remaining is 1 f2fs: remove batched_trim_sections node f2fs: support fault injection for f2fs_is_valid_blkaddr() f2fs: fix to invalidate dcc->f2fs_issue_discard in error path f2fs: Fix the race condition of resize flag between resizefs f2fs: let's avoid to get cp_rwsem twice by f2fs_evict_inode by d_invalidate f2fs: should put a page when checking the summary info fscrypt: fix keyring memory leak on mount failure Bug: 256243893 Signed-off-by: Jaegeuk Kim <jaegeuk@google.com> Change-Id: I1755d4a31521e16602673d1327e2494cb0b84fdf	2022-12-15 11:40:35 -08:00
Will Deacon	cef0d08d97	ANDROID: KVM: arm64: Don't filter out KVM_FUNC_MMIO_GUARD_MAP hypercalls If a KVM_FUNC_MMIO_GUARD_MAP hypercall from a protected guest fails at EL2 due to running out of page-table memory, the call is forwarded to the host so that additional memory can be donated using the vCPU's memcache. Unfortunately, the host filters out these calls the hypervisor will replay the guest's HVC instruction forever, making no progress because it will fail each time. Avoid filtering out KVM_FUNC_MMIO_GUARD_MAP, in the same way as we handle the SHARE and UNSHARE hypercalls. Bug: 262700476 Cc: Keir Fraser <keirf@google.com> Signed-off-by: Will Deacon <willdeacon@google.com> Change-Id: Idd14c6bc08a4232939676e3566b79cbc7c927a3a	2022-12-15 17:06:21 +00:00
Sebastian Ene	aeac190b3b	ANDROID: KVM: arm64: Coalesce host stage2 entries on ownership reclaim This optimization allows us to re-create higher order block mappings in the host stage2 pagetables after we teardown a guest VM. When the host reclaims ownership during guest teardown, the page table walker drops the refcount of the counted entries and clears out unreferenced entries (refcount == 1). Clearing out the entry installs a zero PTE. When the host stage2 receives a data abort because there is no mapping associated, it will try to create the largest possible block mapping from the founded leaf entry. With the current patch, we increase the chances of finding a leaf entry that has level < 3 if the requested region comes from a reclaimed torned down VM memory. This has the advantage of reducing the TLB pressure at host stage2. To increase the coalescing chances, we modify the way we refcount page table descriptors for host stage2: - non-zero invalid PTEs - any of the reserved-high bits(58-55) toogled - non-default attribute mappings - page table descriptors Bug: 222044487 Test: dump the host stage2 pagetables and view the mapping Signed-off-by: Sebastian Ene <sebastianene@google.com> Change-Id: I90ff4ec2185e9a76d7ad17e77ef9bdd8ce3e8698	2022-12-15 09:17:55 +00:00
Sebastian Ene	8796cf595b	ANDROID: KVM: arm64: Move kvm_pte_table to the common header In preparation for the coalescing algorithm implementation, move the function which verifies if a page table entry is a tabel to the common header. Bug: 222044487 Change-Id: I4124b7727e91f61b8f0a7e44cd91403d09d83c3c Signed-off-by: Sebastian Ene <sebastianene@google.com>	2022-12-15 09:17:55 +00:00
Sebastian Ene	4e68fbd326	ANDROID: KVM: arm64: Have different callbacks for PTE manipulation Move the host specific code for PTE reference counting out of the pagetable code and define a new structure that wraps all the PTE manipulation callbacks. This structure will be passed during the pagetable code initialization and it allows to register different callback for [guest\|host]. Bug: 222044487 Signed-off-by: Sebastian Ene <sebastianene@google.com> Change-Id: I116e8322935762df2f2be6e8d51a3f0c140b3d36	2022-12-15 09:17:55 +00:00
Sebastian Ene	92222130c1	ANDROID: KVM: arm64: Move PTE attributes definitions to the common header Make PTE attribute definitions available from kvm_pgtable.h and take them out of the pagetable code. These attributes will be used later in mem_protect.c to construct different masks during the PTE manipulation callbacks. Bug: 222044487 Signed-off-by: Sebastian Ene <sebastianene@google.com> Change-Id: I2f7108815ef0fa536e7f3314762a412119400fe9	2022-12-15 09:17:55 +00:00
Sebastian Ene	393afc04df	ANDROID: KVM: arm64: Split stage2_put_pte function Refactor the code and add stage2_clear_pte(..) which removes the PTE without dropping the refcount for an entry. Bug: 222044487 Signed-off-by: Sebastian Ene <sebastianene@google.com> Change-Id: Ia2cb47f2ffad6faa5c6b4ec8a37bcbe61be0bc2f	2022-12-15 09:17:55 +00:00
Sebastian Ene	90048d36dc	ANDROID: KVM: arm64: Pass the pagetable struct as an argument to the freewalker Extend the scope of the stage2_freewalker by passing the pgt instead of the mm_ops callbacks. This will later be used by the stage2_pte_is_counted function. Bug: 222044487 Signed-off-by: Sebastian Ene <sebastianene@google.com> Change-Id: I390661eb106cbdb863cbb1832e39ec155c439091	2022-12-15 09:17:55 +00:00

... 5 6 7 8 9 ...

1064108 Commits