linux

mirror of https://github.com/hardkernel/linux.git synced 2026-06-06 10:58:48 +09:00

Author	SHA1	Message	Date
Liam R. Howlett	c5c507cfec	FROMGIT: maple_tree: try harder to keep active node with mas_prev() Keep a reference to the node when possible with mas_prev(). This will avoid re-walking the tree. In keeping a reference to the node, keep the last/index accurate to the range being referenced. This means the limit may be within the range, but the range may extend outside of the limit. Also fix the single entry tree to respect the range (of 0), or set the node to MAS_NONE in the case of shifting beyond 0. Link: https://lkml.kernel.org/r/20230518145544.1722059-25-Liam.Howlett@oracle.com Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Cc: David Binderman <dcb314@hotmail.com> Cc: Peng Zhang <zhangpeng.00@bytedance.com> Cc: Sergey Senozhatsky <senozhatsky@chromium.org> Cc: Vernon Yang <vernon2gm@gmail.com> Cc: Wei Yang <richard.weiyang@gmail.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> (cherry picked from commit 20e9433710317ab0278c1d76821e213fb2d11e19 git://git.kernel.org/pub/scm/linux/kernel/git/akpm/mm mm-unstable) Bug: 274059236 Change-Id: If0b40925884dac6e334474249098d03175ba6dd6 Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Liam R. Howlett	cb6d9fa6ad	FROMGIT: maple_tree: try harder to keep active node after mas_next() Clean up the mas_next() call to try and keep a node reference when possible. This will avoid re-walking the tree in most cases. Also clean up the single entry tree handling to ensure index/last are consistent with what one would expect. (returning NULL with limit of 1-oo). Link: https://lkml.kernel.org/r/20230518145544.1722059-24-Liam.Howlett@oracle.com Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Cc: David Binderman <dcb314@hotmail.com> Cc: Peng Zhang <zhangpeng.00@bytedance.com> Cc: Sergey Senozhatsky <senozhatsky@chromium.org> Cc: Vernon Yang <vernon2gm@gmail.com> Cc: Wei Yang <richard.weiyang@gmail.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> (cherry picked from commit f7741cbb138e4cd8586e45806313561cec44f9b6 git://git.kernel.org/pub/scm/linux/kernel/git/akpm/mm mm-unstable) Bug: 274059236 Change-Id: I61c7e9e1575b5f5400f9fc2eec08ae4a1eaefa5e Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Liam R. Howlett	5ff9438fe1	FROMGIT: BACKPORT: mm/mmap: change do_vmi_align_munmap() for maple tree iterator changes The maple tree iterator clean up is incompatible with the way do_vmi_align_munmap() expects it to behave. Update the expected behaviour to map now since the change will work currently. Link: https://lkml.kernel.org/r/20230518145544.1722059-23-Liam.Howlett@oracle.com Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Cc: David Binderman <dcb314@hotmail.com> Cc: Peng Zhang <zhangpeng.00@bytedance.com> Cc: Sergey Senozhatsky <senozhatsky@chromium.org> Cc: Vernon Yang <vernon2gm@gmail.com> Cc: Wei Yang <richard.weiyang@gmail.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> (cherry picked from commit a4d5b9fbaf42d668c1b5c7f231f79776a9419a91 git://git.kernel.org/pub/scm/linux/kernel/git/akpm/mm mm-unstable) [surenb: adjust for missing vma_iter_load] Bug: 274059236 Change-Id: Id05ab617a3539f885a32c7d3031098a8c005fff8 Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Liam R. Howlett	133fbad5bd	FROMLIST: BACKPORT: maple_tree: Refine mas_preallocate() node calculations Calculate the number of nodes based on the pending write action instead of assuming the worst case. This addresses a performance regression introduced in platforms that have longer allocation timing. Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Link: https://lore.kernel.org/lkml/20230601021605.2823123-14-Liam.Howlett@oracle.com/ [surenb: adjust node_size calculation, allow to store a slot when possible] Bug: 274059236 Change-Id: I1db402fb463ee1e391081d2d81c34619f15713ac Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Liam R. Howlett	ce9ebd83aa	ANDROID: maple_tree: Move mas_wr_modify node size calculation to mas_wr_node_size() Create a new function to get the size of the mas_wr_node_size() since it will be used elsewhere soon. Drop the incrementing of the node size if this is the left-most node. Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Link: `d781921bc8` [surenb: due to the differences with upstream kernel where the node size can be obtained using mas_wr_new_end() function, this patch is not applicable upstream. The patch was obtained from the author's tree] Bug: 274059236 Change-Id: I9f0b5238294d0842b4c2717437ed7288b17c7617 Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Liam R. Howlett	b6734cb2ce	FROMLIST: BACKPORT: maple_tree: Move mas_wr_end_piv() below mas_wr_extend_null() Relocate it and call mas_wr_extend_null() from within mas_wr_end_piv(). Extending the NULL may affect the end pivot value so call mas_wr_endtend_null() from within mas_wr_end_piv() to keep it all together. Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Link: https://lore.kernel.org/lkml/20230601021605.2823123-12-Liam.Howlett@oracle.com/ [surenb: moved additional wr_mas->end_piv assignment missing in later kernel versions] Bug: 274059236 Change-Id: I55c5843273e7a679aef918e66d4b4ed034d493da Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Liam R. Howlett	aede79b81e	ANDROID: mm: Fix __vma_adjust() writes for the maple tree Only write when necessary to the maple tree. This should only occur when the VMA changes. In the __vma_adjust() case, it is either the vma when it is expanded, the next vma when the boundary expands into 'vma', writing the 'insert', or when vma expands/shrinks for shift_arg_pages(). The mas_preallocate() setup should track the intended write to ensure the correct number of nodes are preallocated for the pending write. Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Link: `61b337f650` [surenb: __vma_adjust was removed in 6.3, therefore these fixes are not applicable upstream anymore. The patch was obtained from the author's tree] Bug: 274059236 Change-Id: I69d68a5b4ff11c40985f7b03b31eec4bb24dcbb6 Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Liam R. Howlett	b802573f44	FROMLIST: BACKPORT: mm: Set up vma iterator for vma_iter_prealloc() calls Set the correct limits for vma_iter_prealloc() calls so that the maple tree can be smarter about how many nodes are needed. Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Link: https://lore.kernel.org/lkml/20230601021605.2823123-11-Liam.Howlett@oracle.com/ [surenb: remove vma_iter-related changes not present in 6.1 kernel] Bug: 274059236 Change-Id: I05d1989e35b2e72b9346743f290da66739b3ee59 Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Liam R. Howlett	c3118993c9	FROMGIT: maple_tree: avoid unnecessary ascending The maple tree node limits are implied by the parent. When walking up the tree, the limit may not be known until a slot that does not have implied limits are encountered. However, if the node is the left-most or right-most node, the walking up to find that limit can be skipped. This commit also fixes the debug/testing code that was not setting the limit on walking down the tree as that optimization is not compatible with this change. Link: https://lkml.kernel.org/r/20230518145544.1722059-4-Liam.Howlett@oracle.com Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Reviewed-by: Peng Zhang <zhangpeng.00@bytedance.com> Cc: David Binderman <dcb314@hotmail.com> Cc: Sergey Senozhatsky <senozhatsky@chromium.org> Cc: Vernon Yang <vernon2gm@gmail.com> Cc: Wei Yang <richard.weiyang@gmail.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> (cherry picked from commit 0f4e7f5fc2122534ae0573b37224ddfa367fa7ac git://git.kernel.org/pub/scm/linux/kernel/git/akpm/mm mm-unstable) Bug: 274059236 Change-Id: I4a5e852906692b27ea598fdf38eba8e1a69355d9 Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Liam R. Howlett	e9fdabfc2a	FROMLIST: BACKPORT: mm: Change do_vmi_align_munmap() side tree index The majority of the calls to munmap a VMA is for a single vma. The maple tree is able to store a single entry at 0, with a size of 1 as a pointer and avoid any allocations. Change do_vmi_align_munmap() to store the VMAs being munmap()'ed into a tree indexed by the count. This will leverage the ability to store the first entry without a node allocation. Storing the entries into a tree by the count and not the vma start and end means changing the functions which iterate over the entries. Update unmap_vmas() and free_pgtables() to take a maple state and a tree end address to support this functionality. Passing through the same maple state to unmap_vmas() and free_pgtables() means the state needs to be reset between calls. This happens in the static unmap_region() and exit_mmap(). Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Link: https://lore.kernel.org/lkml/20230601021605.2823123-5-Liam.Howlett@oracle.com/ [surenb: skip changes passing maple state to unmap_vmas() and free_pgtables()] Bug: 274059236 Change-Id: If38cfecd51da884bcfdbdfdfbf955a0b338d3d60 Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Liam R. Howlett	25bed2fdbc	UPSTREAM: mm/mmap: remove preallocation from do_mas_align_munmap() In preparation of passing the vma state through split, the pre-allocation that occurs before the split has to be moved to after. Since the preallocation would then live right next to the store, just call store instead of preallocating. This effectively restores the potential error path of splitting and not munmap'ing which pre-dates the maple tree. Link: https://lkml.kernel.org/r/20230120162650.984577-12-Liam.Howlett@oracle.com Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> (cherry picked from commit `0378c0a0e9`) Bug: 274059236 Change-Id: I3539fb3a08043dae1bc8aaa6c7f285711a0b5548 Signed-off-by: Suren Baghdasaryan <surenb@google.com>	2023-06-06 20:05:25 +00:00
Huang Yiwei	312dfb3b7e	ANDROID: abi_gki_aarch64_qcom: Update QCOM symbol list Update QCOM symbol list for MPAM vendor hook. Symbols added: __traceiter_android_vh_mpam_set __tracepoint_android_vh_mpam_set Bug: 285984666 Change-Id: I31e6875e95f4cc39b327ab190ef50d3bab88b57b Signed-off-by: Huang Yiwei <quic_hyiwei@quicinc.com>	2023-06-06 14:36:09 +08:00
xiaofeng	6b3daa3bba	ANDROID: GKI: Update symbol list for xiaomi 1 function symbol(s) added 'int __traceiter_android_vh_mmput(struct mm_struct *mm)' 1 variable symbol(s) added 'struct tracepoint __tracepoint_android_vh_mmput' Bug: 284059793 Change-Id: I6468b53e5c708a7e04c472c69210956e63147251 Signed-off-by: xiaofeng <xiaofeng5@xiaomi.com>	2023-06-05 23:15:22 +00:00
xiaofeng	ec196511bf	ANDROID: vendor_hooks:vendor hook for mmput add vendor hook in mmput while mm_users decreased to 0. Bug: 238821038 Change-Id: I42a717cbeeb3176bac14b4b2391fdb2366c972d3 Signed-off-by: xiaofeng <xiaofeng5@xiaomi.com>	2023-06-05 23:15:22 +00:00
Sooyong Suk	571c04e945	ANDROID: ABI: update symbol list for galaxy 5 function symbol(s) added 'int __traceiter_android_vh_madvise_pageout_swap_entry(void, swp_entry_t, int)' 'int __traceiter_android_vh_madvise_swapin_walk_pmd_entry(void, swp_entry_t)' 'int __traceiter_android_vh_process_madvise_end(void, int, ssize_t)' 'int __traceiter_android_vh_show_smap(void, struct seq_file, unsigned long, unsigned long, unsigned long)' 'int __traceiter_android_vh_smaps_pte_entry(void, swp_entry_t, unsigned long, unsigned long, unsigned long)' 5 variable symbol(s) added 'struct tracepoint __tracepoint_android_vh_madvise_pageout_swap_entry' 'struct tracepoint __tracepoint_android_vh_madvise_swapin_walk_pmd_entry' 'struct tracepoint __tracepoint_android_vh_process_madvise_end' 'struct tracepoint __tracepoint_android_vh_show_smap' 'struct tracepoint __tracepoint_android_vh_smaps_pte_entry' Bug: 284059805 Change-Id: I3ea820f19eac3b0f053bac0830625891e70c1b71 Signed-off-by: Sooyong Suk <s.suk@samsung.com>	2023-06-05 23:12:28 +00:00
Sooyong Suk	847b3f6c96	ANDROID: task_mmu: add vendor hook for swap entry Add vendor hook in smaps_pte_entry for swap entry - android_vh_smaps_pte_entry - android_vh_show_smap This vendor hook is to show more information for swap entries of a process based on the characteristics, such as written-back, same-filled or huge (uncompressed). Bug: 284059805 Change-Id: Ie4a48ae42212c056992d34a10b026b60439d0012 Signed-off-by: Sooyong Suk <s.suk@samsung.com>	2023-06-05 23:12:28 +00:00
Sooyong Suk	aee36dd530	ANDROID: mm: add vendor hooks in madvise for swap entry Add vendor hooks in madvise for swap entry - android_vh_madvise_pageout_swap_entry - android_vh_madvise_swapin_walk_pmd_entry - android_vh_process_madvise_end Bug: 284059805 Change-Id: Ic389244e343737a583286c20cadb6774efd8890c Signed-off-by: Sooyong Suk <s.suk@samsung.com>	2023-06-05 23:12:28 +00:00
Peter Collingbourne	c0cfeeaa88	BACKPORT: FROMLIST: arm64: mte: Simplify swap tag restoration logic As a result of the previous two patches, there are no circumstances in which a swapped-in page is installed in a page table without first having arch_swap_restore() called on it. Therefore, we no longer need the logic in set_pte_at() that restores the tags, so remove it. Signed-off-by: Peter Collingbourne <pcc@google.com> Link: https://linux-review.googlesource.com/id/I8ad54476f3b2d0144ccd8ce0c1d7a2963e5ff6f3 Reviewed-by: Steven Price <steven.price@arm.com> Reviewed-by: Catalin Marinas <catalin.marinas@arm.com> Link: https://lore.kernel.org/all/20230523004312.1807357-4-pcc@google.com/ Change-Id: I8ad54476f3b2d0144ccd8ce0c1d7a2963e5ff6f3 [pcc: resolved merge conflict] Bug: 274890466	2023-06-05 21:53:19 +00:00
Peter Collingbourne	131714e34b	FROMLIST: mm: Call arch_swap_restore() from unuse_pte() We would like to move away from requiring architectures to restore metadata from swap in the set_pte_at() implementation, as this is not only error-prone but adds complexity to the arch-specific code. This requires us to call arch_swap_restore() before calling swap_free() whenever pages are restored from swap. We are currently doing so everywhere except in unuse_pte(); do so there as well. Signed-off-by: Peter Collingbourne <pcc@google.com> Link: https://linux-review.googlesource.com/id/I68276653e612d64cde271ce1b5a99ae05d6bbc4f Suggested-by: David Hildenbrand <david@redhat.com> Acked-by: David Hildenbrand <david@redhat.com> Acked-by: "Huang, Ying" <ying.huang@intel.com> Reviewed-by: Steven Price <steven.price@arm.com> Acked-by: Catalin Marinas <catalin.marinas@arm.com> Link: https://lore.kernel.org/all/20230523004312.1807357-3-pcc@google.com/ Change-Id: I68276653e612d64cde271ce1b5a99ae05d6bbc4f Bug: 274890466	2023-06-05 21:53:19 +00:00
Peter Collingbourne	3805b879f5	FROMLIST: mm: Call arch_swap_restore() from do_swap_page() Commit `c145e0b47c` ("mm: streamline COW logic in do_swap_page()") moved the call to swap_free() before the call to set_pte_at(), which meant that the MTE tags could end up being freed before set_pte_at() had a chance to restore them. Fix it by adding a call to the arch_swap_restore() hook before the call to swap_free(). Signed-off-by: Peter Collingbourne <pcc@google.com> Link: https://linux-review.googlesource.com/id/I6470efa669e8bd2f841049b8c61020c510678965 Cc: <stable@vger.kernel.org> # 6.1 Fixes: `c145e0b47c` ("mm: streamline COW logic in do_swap_page()") Reported-by: Qun-wei Lin (林群崴) <Qun-wei.Lin@mediatek.com> Closes: https://lore.kernel.org/all/5050805753ac469e8d727c797c2218a9d780d434.camel@mediatek.com/ Acked-by: David Hildenbrand <david@redhat.com> Acked-by: "Huang, Ying" <ying.huang@intel.com> Reviewed-by: Steven Price <steven.price@arm.com> Acked-by: Catalin Marinas <catalin.marinas@arm.com> Link: https://lore.kernel.org/all/20230523004312.1807357-2-pcc@google.com/ Change-Id: I6470efa669e8bd2f841049b8c61020c510678965 Bug: 274890466	2023-06-05 21:53:19 +00:00
Sachin Gupta	098028adf7	ANDROID: abi_gki_aarch64_qcom: Update symbol list Symbols added: sdhci_dumpregs Bug: 285546222 Change-Id: I18fe46273b13f21e59fd4f556efbe560f581139d Signed-off-by: Sachin Gupta <quic_sachgupt@quicinc.com>	2023-06-05 20:44:15 +00:00
xiaofeng	71844b8ed9	ANDROID: GKI: Update symbol list for xiaomi 2 function symbol(s) added 'int __traceiter_android_vh_alloc_pages_reclaim_bypass(gfp_t gfp_mask, int order, int alloc_flags, int migratetype, struct page page)' 'int __traceiter_android_vh_alloc_pages_failure_bypass(gfp_t gfp_mask, int order, int alloc_flags, int migratetype, struct page page)' 2 variable symbol(s) added 'struct tracepoint __tracepoint_android_vh_alloc_pages_reclaim_bypass' 'struct tracepoint __tracepoint_android_vh_alloc_pages_failure_bypass' Bug: 284059793 Change-Id: I766d37e4f4cea8c3ce6e925e95ab920152eebbb1 Signed-off-by: xiaofeng <xiaofeng5@xiaomi.com>	2023-06-05 16:38:22 +00:00
xiaofeng	025b5a487b	ANDROID: vendor_hooks:vendor hook for __alloc_pages_slowpath. add vendor hook in __alloc_pages_slowpath ahead of __alloc_pages_direct_reclaim and warn_alloc. Bug: 243629905 Change-Id: Ieacc6cf79823c0bfacfdeec9afb55ed66f40d0b0 Signed-off-by: xiaofeng <xiaofeng5@xiaomi.com>	2023-06-05 16:38:22 +00:00
Dezhi Huang	60b0f85e24	ANDROID: ABI: Update honor symbol list 3 function symbol(s) added 'int __traceiter_android_vh_file_is_tiny_bypass(void, bool, bool)' 'int __traceiter_android_vh_modify_scan_control(void, u64, unsigned long, struct mem_cgroup, bool, bool)' 'int __traceiter_android_vh_should_continue_reclaim(u64, unsigned long, unsigned long, bool)' 3 variable symbol(s) added 'struct tracepoint __tracepoint_android_vh_file_is_tiny_bypass' 'struct tracepoint __tracepoint_android_vh_modify_scan_control' 'struct tracepoint __tracepoint_android_vh_should_continue_reclaim' Bug: 279793370 Change-Id: Ieb2a90f1317453b982341f06765bb2625daa645a Signed-off-by: Dezhi Huang <huangdezhi@hihonor.com>	2023-06-05 16:31:49 +00:00
Dezhi Huang	3e2dc32f59	ANDROID: mm: create vendor hooks for memory reclaim we try to adjust page reclaim operations based on the running task and kernel memory pressure. Thus, we want to create some vendor hooks into kernel6.1. Firstly, we add ADNRROID_VENDOR_DATA into the struct scan_control, special operations would be performed based on this special scan option. We measure the importance of the current process in the system and obtain its weight, which is recorded in ANDROID_VENDOR_DATA. The hook function: trace_android_vh_modify_scan_control is added inside of the function modify_scan_control() to adjust reclaim operations based on memory pressure. The hook function: trace_android_vh_should_continue_reclaim is added inside of the function shrink_node() to decide if page_reclaim would continue or not based on memory pressure. The hook function: trace_android_vh_file_is_tiny_bypass is added into the function prepare_scan_count() to decide if the file pages should be skipped in condition to file refualts and memory pressure. Bug: 279793370 Change-Id: I1efe9d3e866f37b0295c7cd94ec8ca0117a9bd4a Signed-off-by: Dezhi Huang <huangdezhi@hihonor.com>	2023-06-05 16:31:49 +00:00
杨辉	8e6a28c815	UPSTREAM: kcsan: Avoid READ_ONCE() in read_instrumented_memory() Haibo Li reported: \| Unable to handle kernel paging request at virtual address \| ffffff802a0d8d7171 \| Mem abort info⭕ \| ESR = 0x9600002121 \| EC = 0x25: DABT (current EL), IL = 32 bitsts \| SET = 0, FnV = 0 0 \| EA = 0, S1PTW = 0 0 \| FSC = 0x21: alignment fault \| Data abort info⭕ \| ISV = 0, ISS = 0x0000002121 \| CM = 0, WnR = 0 0 \| swapper pgtable: 4k pages, 39-bit VAs, pgdp=000000002835200000 \| [ffffff802a0d8d71] pgd=180000005fbf9003, p4d=180000005fbf9003, \| pud=180000005fbf9003, pmd=180000005fbe8003, pte=006800002a0d8707 \| Internal error: Oops: 96000021 [#1] PREEMPT SMP \| Modules linked in: \| CPU: 2 PID: 45 Comm: kworker/u8:2 Not tainted \| 5.15.78-android13-8-g63561175bbda-dirty #1 \| ... \| pc : kcsan_setup_watchpoint+0x26c/0x6bc \| lr : kcsan_setup_watchpoint+0x88/0x6bc \| sp : ffffffc00ab4b7f0 \| x29: ffffffc00ab4b800 x28: ffffff80294fe588 x27: 0000000000000001 \| x26: 0000000000000019 x25: 0000000000000001 x24: ffffff80294fdb80 \| x23: 0000000000000000 x22: ffffffc00a70fb68 x21: ffffff802a0d8d71 \| x20: 0000000000000002 x19: 0000000000000000 x18: ffffffc00a9bd060 \| x17: 0000000000000001 x16: 0000000000000000 x15: ffffffc00a59f000 \| x14: 0000000000000001 x13: 0000000000000000 x12: ffffffc00a70faa0 \| x11: 00000000aaaaaaab x10: 0000000000000054 x9 : ffffffc00839adf8 \| x8 : ffffffc009b4cf00 x7 : 0000000000000000 x6 : 0000000000000007 \| x5 : 0000000000000000 x4 : 0000000000000000 x3 : ffffffc00a70fb70 \| x2 : 0005ff802a0d8d71 x1 : 0000000000000000 x0 : 0000000000000000 \| Call trace: \| kcsan_setup_watchpoint+0x26c/0x6bc \| __tsan_read2+0x1f0/0x234 \| inflate_fast+0x498/0x750 \| zlib_inflate+0x1304/0x2384 \| __gunzip+0x3a0/0x45c \| gunzip+0x20/0x30 \| unpack_to_rootfs+0x2a8/0x3fc \| do_populate_rootfs+0xe8/0x11c \| async_run_entry_fn+0x58/0x1bc \| process_one_work+0x3ec/0x738 \| worker_thread+0x4c4/0x838 \| kthread+0x20c/0x258 \| ret_from_fork+0x10/0x20 \| Code: b8bfc2a8 2a0803f7 14000007 d503249f (78bfc2a8) ) \| ---[ end trace 613a943cb0a572b6 ]----- The reason for this is that on certain arm64 configuration since `e35123d83e` ("arm64: lto: Strengthen READ_ONCE() to acquire when CONFIG_LTO=y"), READ_ONCE() may be promoted to a full atomic acquire instruction which cannot be used on unaligned addresses. Fix it by avoiding READ_ONCE() in read_instrumented_memory(), and simply forcing the compiler to do the required access by casting to the appropriate volatile type. In terms of generated code this currently only affects architectures that do not use the default READ_ONCE() implementation. The only downside is that we are not guaranteed atomicity of the access itself, although on most architectures a plain load up to machine word size should still be atomic (a fact the default READ_ONCE() still relies on itself). BUG: 285794521 (cherry picked from commit `8dec88070d`) Reported-by: Haibo Li <haibo.li@mediatek.com> Tested-by: Haibo Li <haibo.li@mediatek.com> Cc: <stable@vger.kernel.org> # 5.17+ Change-Id: I16c9f83c3b4e28021a936249cafb1501760aa59d Signed-off-by: Marco Elver <elver@google.com> Signed-off-by: Paul E. McKenney <paulmck@kernel.org> Signed-off-by: 杨辉 <yanghui10@xiaomi.corp-partner.google.com>	2023-06-05 15:02:47 +00:00
jianzhou	675bc3a00e	ANDROID: abi_gki_aarch64_qcom: update symbol list Symbols added: page_pinner_inited __page_pinner_put_page _trace_android_vh_record_pcpu_rwsem_starttime Bug: 285243673 Change-Id: I7cf6ca8ff637f3d7de9daba597b09ca27b813e48 Signed-off-by: jianzhou <quic_jianzhou@quicinc.com>	2023-06-05 14:45:20 +00:00
Todd Kjos	7b14897460	ANDROID: Update ABI as part of crash fix Ok to commit this before KMI update since CRC change only affects the broken hooks which are only used by the partner that introduced the hooks. INFO: variable symbol 'struct tracepoint __tracepoint_android_rvh_psci_cpu_suspend' changed CRC changed from 0x4628ef5b to 0xf9b81cca variable symbol 'struct tracepoint __tracepoint_android_rvh_psci_tos_resident_on' changed CRC changed from 0x477813d5 to 0xb163a362 Fixes: `b7a7fd15ed` ("ANDROID: vendor_hooks: psci: add hook to check if cpu is allowed to power off") Bug: 285477556 Change-Id: I0539ac8ff1d26a6ba8dd0f13fc09b53f5ee0335b Signed-off-by: Todd Kjos <tkjos@google.com>	2023-06-02 20:46:44 +00:00
Todd Kjos	9e2fa0a396	ANDROID: Fix incorrect hook declaration Two hooks that need to be restricted were correctly named with "_rvh_" but were incorrectly declared as normal hooks. This resulted in crashes for at least 1 partner: ------------[ cut here ]------------ WARNING: CPU: 0 PID: 0 at include/trace/hooks/psci.h:19 psci_0_2_cpu_suspend+0x124/0x1d8 Modules linked in: CPU: 0 PID: 0 Comm: swapper/0 Tainted: G S W 6.1.25-android14-7-00072-gf10e53af33a0 #1 Hardware name: Samsung ERD9945 board based on S5E9945 (DT) pstate: 624003c5 (nZCv DAIF +PAN -UAO +TCO -DIT -SSBS BTYPE=--) pc : psci_0_2_cpu_suspend+0x124/0x1d8 lr : psci_0_2_cpu_suspend+0x88/0x1d8 sp : ffffffd00b1f7b20 x29: ffffffd00b1f7b30 [0: swapper/0: 0] x28: ffffffd00b217be4 x27: 0000000000000001 x26: 0000000000000000 x25: ffffff8915b689fc x24: 93ffff8837750100 x23: 000000008cc8e544 x22: 00000000000000c0 x21: 0000000000010000 x20: 000000008147a038 x19: efffffc000000000 x18: 0000000000000000 x17: 0000000000000000 x16: 00000000000000ff x15: 0000000000000000 x14: 0000000000000000 x13: ffffffd00b22ae00 x12: ffffffb90a98d000 x11: ffffffd00b1d9850 x10: 0000000100000001 x9 : efffffc000000000 x8 : 0000000100000001 [0: swapper/0: 0] x7 : 015001f2b5593519 [0: swapper/0: 0] x6 : 0000000000310000 x5 : 0000000000000000 x4 : 0000000000000000 x3 : 0000000000000000 [0: swapper/0: 0] x2 : 0000000000000000 x1 : ffffff8915b66850 x0 : 0000000000000000 Call trace: psci_0_2_cpu_suspend+0x124/0x1d8 psci_suspend_finisher+0x2c/0x38 cpu_suspend+0x8c/0x16c psci_cpu_suspend_enter+0x54/0x7c psci_enter_idle_state+0x64/0x94 cpuidle_enter_state+0x1dc/0x9b8 cpuidle_enter+0x58/0x7c call_cpuidle+0x30/0x58 [0: swapper/0: 0] do_idle+0x214/0x2b8 cpu_startup_entry+0x2c/0x30 kernel_init+0x0/0x180 start_kernel+0x0/0x444 start_kernel+0x368/0x444 __primary_switched+0xc0/0xc8 Kernel panic - not syncing: kernel: panic_on_warn set ... Fixes: `b7a7fd15ed` ("ANDROID: vendor_hooks: psci: add hook to check if cpu is allowed to power off") Bug: 285477556 Change-Id: I44ca332dc61dab025a0e33c94e8ad2f5eaffb6f8 Signed-off-by: Todd Kjos <tkjos@google.com>	2023-06-02 20:46:44 +00:00
Nagireddy Annem	e57fe10b5a	ANDROID: abi_gki_aarch64_qcom: Add GIC and hibernation APIs Add below fnctions and symbols to support GIC Deepsleep and Hibernation feature. 4 function symbol(s) added 'int __traceiter_android_vh_gic_v3_suspend(void, struct gic_chip_data_v3)' 'void gic_v3_cpu_init()' 'void gic_v3_dist_init()' 'void gic_v3_dist_wait_for_rwp()' 1 variable symbol(s) added 'struct tracepoint __tracepoint_android_vh_gic_v3_suspend' Bug: 279879797 Change-Id: I96e439ef537e5dfc4e16c76fe6dd91bd5f13d6dd Signed-off-by: Nagireddy Annem <quic_nannem@quicinc.com> Signed-off-by: Darshankumar Jagdishchandra Thakkar <quic_djagdish@quicinc.com> Signed-off-by: kamasali Satyanarayan <quic_kamasali@quicinc.com>	2023-06-02 17:11:39 +00:00
Maulik Shah	227d23d61d	ANDROID: gic-v3: Export gic_v3_resume() for vendor GIC enhancements syscore ops in gic-v3 takes care of invoking gic_v3_resume() when exiting from "deep" suspend. However for "s2idle" suspend syscore ops will not get invoked. Vendor modules can register for s2idle notifications and invoke gic_v3_resume() when the first cpu is waking up from s2idle. Bug: 279879797 Change-Id: Ifd48d676a5bc907eb957c2002934e18bd1c9c095 Signed-off-by: Maulik Shah <mkshah@codeaurora.org> Signed-off-by: Shreyas K K <quic_shrekk@quicinc.com>	2023-06-02 17:02:43 +00:00
Nagireddy Annem	275c8705e5	ANDROID: irqchip/irq-gic-v3: Add vendor hook for gic suspend This change adds vendor hook for gic suspend syscore ops callback. And it is invoked during deepsleep and hibernation to store gic register snapshot. Bug: 279879797 Change-Id: I4e3729afa4daf18d73e00ee9601b6da72a578b4a Signed-off-by: Nagireddy Annem <quic_nannem@quicinc.com> Signed-off-by: Shreyas K K <quic_shrekk@quicinc.com>	2023-06-02 17:02:43 +00:00
Mao Jinlong	c9539979a9	ANDROID: abi_gki_aarch64_qcom: Update abi_gki_aarch64_qcom for DMA Add dma_alloc_noncontiguous, dma_free_noncontiguous, dma_vmap_noncontiguous and dma_vunmap_noncontiguous symbols. Symbols added: dma_alloc_noncontiguous dma_free_noncontiguous dma_vmap_noncontiguous dma_vunmap_noncontiguous Bug: 284818225 Change-Id: Ifb8238071fbd15b2d27d1cfc33b856ae4c18c3f1 Signed-off-by: Chetan C R <quic_cchinnad@quicinc.com> Signed-off-by: Mao Jinlong <quic_jinlmao@quicinc.com> Signed-off-by: Zhenhua Huang <quic_zhenhuah@quicinc.com> (cherry picked from commit b3bb41cebdeb0688b508df20f0db5f55a87e46e8)	2023-06-02 16:59:18 +00:00
Weichao Guo	6da02f9101	ANDROID: GKI: Update symbols to abi_gki_aarch64_oplus for extend copy & fbarrier feature 8 function symbol(s) added 'int __blkdev_issue_discard(struct block_device, sector_t, sector_t, gfp_t, struct bio)' 'unsigned long __page_file_index(struct page)' 'void address_space_init_once(struct address_space)' 'void blk_finish_plug(struct blk_plug)' 'void blk_start_plug(struct blk_plug)' 'bool prepare_to_wait_exclusive(struct wait_queue_head, struct wait_queue_entry, int)' 'void* radix_tree_lookup_slot(const struct xarray, unsigned long)' 'void radix_tree_replace_slot(struct xarray, void*, void)' Bug: 283021230 Change-Id: Iec663ed6ed23c8c3245b609c3d8748919fa34471 Signed-off-by: Weichao Guo <guoweichao@oppo.corp-partner.google.com>	2023-06-01 20:55:23 +00:00
Sarthak Garg	87b384408e	ANDROID: abi_gki_aarch64_qcom: Update symbol list Symbols added: dev_pm_opp_remove __mmc_claim_host mmc_execute_tuning mmc_get_card mmc_get_ext_csd mmc_hs200_tuning mmc_issue_type __mmc_poll_for_busy mmc_prepare_busy_cmd mmc_put_card mmc_release_host mmc_retune_hold mmc_retune_release mmc_select_bus_width mmc_select_card mmc_select_hs400 mmc_select_hs mmc_select_hs_ddr mmc_select_timing mmc_send_status mmc_set_bus_mode mmc_set_bus_width mmc_set_clock mmc_set_initial_state mmc_set_timing mmc_wait_for_cmd __traceiter_android_rvh_mmc_resume __traceiter_android_rvh_mmc_suspend __tracepoint_android_rvh_mmc_resume __tracepoint_android_rvh_mmc_suspend Bug: 283922495 Change-Id: I9d3ff4fbdf6eb5df5798302cbe3409592b4c91c6 Signed-off-by: Sarthak Garg <quic_sartgarg@quicinc.com>	2023-06-01 18:00:57 +00:00
Sarthak Garg	a3a743e67f	ANDROID: mmc: core: Export core functions for kernel modules usage Export core functions for kernel modules usage. Bug: 283922495 Link: https://patchwork.kernel.org/project/linux-mmc/patch/20230401165723.19762-3-quic_sartgarg@quicinc.com/ Change-Id: Ia7904a5da3207e6f39590e092a7805e5260cd752 Signed-off-by: Sarthak Garg <quic_sartgarg@quicinc.com>	2023-06-01 18:00:57 +00:00
Sarthak Garg	631a2db5a3	ANDROID: vendor_hooks: Define new hooks in _mmc_suspend/resume Define new hooks in _mmc_suspend/resume to control few things in card suspend/resume paths which further allows to enable some additional steps in mmc_suspend/resume paths as per host specific requirements. Bug: 283922495 Link: https://patchwork.kernel.org/project/linux-mmc/patch/20230401165723.19762-2-quic_sartgarg@quicinc.com/ Change-Id: Ief52d1dc6b01e9866f004b46687dffa4eb1e7bc1 Signed-off-by: Sarthak Garg <quic_sartgarg@quicinc.com>	2023-06-01 18:00:57 +00:00
Cixi Geng	e82e89e170	ANDROID: update symbol for unisoc vendor_hooks Add the psci_cpu_suspend and psci_tos_resident_on. 2 function symbol(s) added 'int __traceiter_android_rvh_psci_cpu_suspend(void, u32, bool)' 'int __traceiter_android_rvh_psci_tos_resident_on(void, int, bool)' 2 variable symbol(s) added 'struct tracepoint __tracepoint_android_rvh_psci_cpu_suspend' 'struct tracepoint __tracepoint_android_rvh_psci_tos_resident_on' Bug: 284797902 Change-Id: Ie4e740757631fe6dc194bf83873a64df34769193 Signed-off-by: Cixi Geng <cixi.geng1@unisoc.com>	2023-06-01 09:20:41 +08:00
Jian Gong	b7a7fd15ed	ANDROID: vendor_hooks: psci: add hook to check if cpu is allowed to power off While TOS is running alongside with linux, cpu power off operation by linux may need be denied by TOS in some scenarios. This patch added two hooks in psci_tos_resident_on and psci_cpu_suspend to hook cpu off operation. The function psci_tos_resident_on originally is used to check if TOS is resident on a specific cpu and that cpu is dedicated for running TOS exclusively. If so, that cpu can not be power off. Actually if TOS supports SMP, TOS may need deny any other cpu to power down in some cases, i.e. there are no-expired timers in TOS. Thus the first hook for psci_tos_resident_on is used to determine if the specific cpu is allowed to power off in the cpu hotplug path. Besides cpu hotplug, a cpu also can power off by cpu_suspend. The second hook for psci_cpu_suspend determines if cpu suspend should go through or not. When the same conditions described above meets, cpu suspend will break up. The hook cherry-pick from commit 88d88955ae0b8b1f1a555d7810beb6c8ca4ca0f1 and changed vh to rvh according to commit 949edf7539b60058cf2da98f24db2b6d4d89eaa0 Bug: 284797902 Change-Id: Ib329beeff20f0cfef263f6a7813280d33f6a5eaa Signed-off-by: Jian Gong <Jian.Gong@unisoc.com> Signed-off-by: Cixi Geng <cixi.geng1@unisoc.com>	2023-06-01 09:18:28 +08:00
Xuewen Yan	3be7d118e7	ANDROID: Add vendor hook to the effective_cpu_util android_rvh_effective_cpu_util: To perform vendor-specific cpu util, it is used in EAS/schedutil/thermal. The effective_cpu_util would be called when thermal calc the dynamic power, it's non-atomic context, so set the hook be restricted. Bug: 226686099 Test: build pass Signed-off-by: Xuewen Yan <xuewen.yan@unisoc.com> Change-Id: I6fd77f44ca4328f5ef37d96989aa2e08d65e29bb	2023-06-01 00:39:32 +00:00
Chun-Hung Wu	0c2142745d	ANDROID: Update symbol list for mtk 5 function symbol(s) added 'void _trace_android_vh_record_pcpu_rwsem_starttime(struct task_struct, unsigned long)' 'struct file filp_open_block(const char, int, umode_t)' 'int iommu_dev_disable_feature(struct device, enum iommu_dev_features)' 'int of_pci_get_max_link_speed(struct device_node)' 'void sched_clock_register(u64()(), int, unsigned long)' Bug: 284836453 Change-Id: If41140f2f203664c58aeb9ce49498436a26113be Signed-off-by: Chun-Hung Wu <chun-hung.wu@mediatek.com>	2023-05-31 23:11:43 +00:00
YOUNGJIN JOO	6f7dc871a6	ANDROID: ABI: update symbol list for galaxy 5 function symbol(s) added 'void __kfree_skb(struct sk_buff)' 'int __traceiter_android_vh_ptype_head(void, const struct packet_type, struct list_head)' 'int __traceiter_kfree_skb(void, struct sk_buff, void, enum skb_drop_reason)' 'int skb_copy_ubufs(struct sk_buff, gfp_t)' 'struct usb_device* usb_alloc_dev(struct usb_device, struct usb_bus, unsigned int)' 2 variable symbol(s) added 'struct tracepoint __tracepoint_android_vh_ptype_head' 'struct tracepoint __tracepoint_kfree_skb' Bug: 284426833 Change-Id: If9dd8836500afd45ed49838f00ccca7effbdb54f Signed-off-by: YOUNGJIN JOO <youngjin79.joo@samsung.com>	2023-05-31 23:11:08 +00:00
Di Shen	b0a752c3aa	ANDROID: update symbol for unisoc vendor_hooks Add some thermal related symbols. 6 function symbol(s) added 'int __traceiter_android_vh_get_thermal_zone_device(void, struct thermal_zone_device)' 'int __traceiter_android_vh_modify_thermal_request_freq(void, struct cpufreq_policy, unsigned long)' 'int __traceiter_android_vh_modify_thermal_target_freq(void, struct cpufreq_policy, unsigned int)' 'int __traceiter_android_vh_thermal_power_cap(void, u32)' 'int __traceiter_android_vh_thermal_register(void, struct cpufreq_policy)' 'int __traceiter_android_vh_thermal_unregister(void, struct cpufreq_policy)' 6 variable symbol(s) added 'struct tracepoint __tracepoint_android_vh_get_thermal_zone_device' 'struct tracepoint __tracepoint_android_vh_modify_thermal_request_freq' 'struct tracepoint __tracepoint_android_vh_modify_thermal_target_freq' 'struct tracepoint __tracepoint_android_vh_thermal_power_cap' 'struct tracepoint __tracepoint_android_vh_thermal_register' 'struct tracepoint __tracepoint_android_vh_thermal_unregister' Bug: 285078223 Signed-off-by: Di Shen <di.shen@unisoc.com> Change-Id: I5c9e07c4754f24b70c6bb12333aec10b4db5b03f	2023-05-31 21:08:10 +00:00
Jeson Gao	ce7ceff8c8	ANDROID: thermal: Add vendor hook to check power range For SoC's skin temperature, we have to use more stringent temperature control to make IPA can monitor and mitigate temperature control earlier and faster, so add it to meet platform thermal requirement. Bug: 211564753 Signed-off-by: Jeson Gao <jeson.gao@unisoc.com> Signed-off-by: Di Shen <di.shen@unisoc.com> Change-Id: Iaef87287eef93d6fdbc3c58c93f70c1525e38296 (cherry picked from commit `6709f52325`) (cherry picked from commit `97a290b0e5`)	2023-05-31 21:08:10 +00:00
Di Shen	7191b6a759	ANDROID: thermal: Add vendor hook to get thermal zone device Need to get temperature data and config info from thermal zone device. Bug: 208946028 Signed-off-by: Di Shen <di.shen@unisoc.com> Signed-off-by: Jeson Gao <jeson.gao@unisoc.com> Change-Id: I5945df5258181b4a441b6bbe09327099491418b3 (cherry picked from commit `c53f0e3530`) (cherry picked from commit `12b8ef18b2`)	2023-05-31 21:08:10 +00:00
Jeson Gao	1fe511720a	ANDROID: thermal: Add hook for cpufreq thermal Add hook to get cpufreq policy data after registering and unregistering cpufreq thermal for platform thermal requirement. Bug: 228423762 Signed-off-by: Jeson Gao <jeson.gao@unisoc.com> Signed-off-by: Di Shen <di.shen@unisoc.com> Change-Id: I9c6bc88f348f252c428560427bd8bca91092edfa (cherry picked from commit `fbe6f8708d`)	2023-05-31 21:08:10 +00:00
Zhenhua Huang	78fe8913d1	UPSTREAM: mm,kfence: decouple kfence from page granularity mapping judgement Kfence only needs its pool to be mapped as page granularity, if it is inited early. Previous judgement was a bit over protected. From [1], Mark suggested to "just map the KFENCE region a page granularity". So I decouple it from judgement and do page granularity mapping for kfence pool only. Need to be noticed that late init of kfence pool still requires page granularity mapping. Page granularity mapping in theory cost more(2M per 1GB) memory on arm64 platform. Like what I've tested on QEMU(emulated 1GB RAM) with gki_defconfig, also turning off rodata protection: Before: [root@liebao ]# cat /proc/meminfo MemTotal: 999484 kB After: [root@liebao ]# cat /proc/meminfo MemTotal: 1001480 kB To implement this, also relocate the kfence pool allocation before the linear mapping setting up, arm64_kfence_alloc_pool is to allocate phys addr, __kfence_pool is to be set after linear mapping set up. LINK: [1] https://lore.kernel.org/linux-arm-kernel/Y+IsdrvDNILA59UN@FVFF77S0Q05N/ Suggested-by: Mark Rutland <mark.rutland@arm.com> Signed-off-by: Zhenhua Huang <quic_zhenhuah@quicinc.com> Reviewed-by: Kefeng Wang <wangkefeng.wang@huawei.com> Reviewed-by: Marco Elver <elver@google.com> Link: https://lore.kernel.org/r/1679066974-690-1-git-send-email-quic_zhenhuah@quicinc.com Signed-off-by: Will Deacon <will@kernel.org> BUG: 284812202 Change-Id: I8e7c565d3f4d6349a028a6a060259d62cf5beee7 (cherry picked from commit `bfa7965b33`) Signed-off-by: Zhenhua Huang <quic_zhenhuah@quicinc.com>	2023-05-31 17:22:42 +00:00
Tetsuo Handa	8035e57ec7	UPSTREAM: mm/page_alloc: fix potential deadlock on zonelist_update_seq seqlock commit `1007843a91` upstream. syzbot is reporting circular locking dependency which involves zonelist_update_seq seqlock [1], for this lock is checked by memory allocation requests which do not need to be retried. One deadlock scenario is kmalloc(GFP_ATOMIC) from an interrupt handler. CPU0 ---- __build_all_zonelists() { write_seqlock(&zonelist_update_seq); // makes zonelist_update_seq.seqcount odd // e.g. timer interrupt handler runs at this moment some_timer_func() { kmalloc(GFP_ATOMIC) { __alloc_pages_slowpath() { read_seqbegin(&zonelist_update_seq) { // spins forever because zonelist_update_seq.seqcount is odd } } } } // e.g. timer interrupt handler finishes write_sequnlock(&zonelist_update_seq); // makes zonelist_update_seq.seqcount even } This deadlock scenario can be easily eliminated by not calling read_seqbegin(&zonelist_update_seq) from !__GFP_DIRECT_RECLAIM allocation requests, for retry is applicable to only __GFP_DIRECT_RECLAIM allocation requests. But Michal Hocko does not know whether we should go with this approach. Another deadlock scenario which syzbot is reporting is a race between kmalloc(GFP_ATOMIC) from tty_insert_flip_string_and_push_buffer() with port->lock held and printk() from __build_all_zonelists() with zonelist_update_seq held. CPU0 CPU1 ---- ---- pty_write() { tty_insert_flip_string_and_push_buffer() { __build_all_zonelists() { write_seqlock(&zonelist_update_seq); build_zonelists() { printk() { vprintk() { vprintk_default() { vprintk_emit() { console_unlock() { console_flush_all() { console_emit_next_record() { con->write() = serial8250_console_write() { spin_lock_irqsave(&port->lock, flags); tty_insert_flip_string() { tty_insert_flip_string_fixed_flag() { __tty_buffer_request_room() { tty_buffer_alloc() { kmalloc(GFP_ATOMIC \| __GFP_NOWARN) { __alloc_pages_slowpath() { zonelist_iter_begin() { read_seqbegin(&zonelist_update_seq); // spins forever because zonelist_update_seq.seqcount is odd spin_lock_irqsave(&port->lock, flags); // spins forever because port->lock is held } } } } } } } } spin_unlock_irqrestore(&port->lock, flags); // message is printed to console spin_unlock_irqrestore(&port->lock, flags); } } } } } } } } } write_sequnlock(&zonelist_update_seq); } } } This deadlock scenario can be eliminated by preventing interrupt context from calling kmalloc(GFP_ATOMIC) and preventing printk() from calling console_flush_all() while zonelist_update_seq.seqcount is odd. Since Petr Mladek thinks that __build_all_zonelists() can become a candidate for deferring printk() [2], let's address this problem by disabling local interrupts in order to avoid kmalloc(GFP_ATOMIC) and disabling synchronous printk() in order to avoid console_flush_all() . As a side effect of minimizing duration of zonelist_update_seq.seqcount being odd by disabling synchronous printk(), latency at read_seqbegin(&zonelist_update_seq) for both !__GFP_DIRECT_RECLAIM and __GFP_DIRECT_RECLAIM allocation requests will be reduced. Although, from lockdep perspective, not calling read_seqbegin(&zonelist_update_seq) (i.e. do not record unnecessary locking dependency) from interrupt context is still preferable, even if we don't allow calling kmalloc(GFP_ATOMIC) inside write_seqlock(&zonelist_update_seq)/write_sequnlock(&zonelist_update_seq) section... Link: https://lkml.kernel.org/r/8796b95c-3da3-5885-fddd-6ef55f30e4d3@I-love.SAKURA.ne.jp Fixes: `3d36424b3b` ("mm/page_alloc: fix race condition between build_all_zonelists and page allocation") Link: https://lkml.kernel.org/r/ZCrs+1cDqPWTDFNM@alley [2] Reported-by: syzbot <syzbot+223c7461c58c58a4cb10@syzkaller.appspotmail.com> Link: https://syzkaller.appspot.com/bug?extid=223c7461c58c58a4cb10 [1] Change-Id: Ifc0c6ed9be6d36166367811ad412bedc66ed713e Signed-off-by: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp> Acked-by: Michal Hocko <mhocko@suse.com> Acked-by: Mel Gorman <mgorman@techsingularity.net> Cc: Petr Mladek <pmladek@suse.com> Cc: David Hildenbrand <david@redhat.com> Cc: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com> Cc: John Ogness <john.ogness@linutronix.de> Cc: Patrick Daly <quic_pdaly@quicinc.com> Cc: Sergey Senozhatsky <senozhatsky@chromium.org> Cc: Steven Rostedt <rostedt@goodmis.org> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `b528537d13`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-05-31 16:27:26 +00:00
Mel Gorman	fa3ef799ad	UPSTREAM: mm: page_alloc: skip regions with hugetlbfs pages when allocating 1G pages commit `4d73ba5fa7` upstream. A bug was reported by Yuanxi Liu where allocating 1G pages at runtime is taking an excessive amount of time for large amounts of memory. Further testing allocating huge pages that the cost is linear i.e. if allocating 1G pages in batches of 10 then the time to allocate nr_hugepages from 10->20->30->etc increases linearly even though 10 pages are allocated at each step. Profiles indicated that much of the time is spent checking the validity within already existing huge pages and then attempting a migration that fails after isolating the range, draining pages and a whole lot of other useless work. Commit `eb14d4eefd` ("mm,page_alloc: drop unnecessary checks from pfn_range_valid_contig") removed two checks, one which ignored huge pages for contiguous allocations as huge pages can sometimes migrate. While there may be value on migrating a 2M page to satisfy a 1G allocation, it's potentially expensive if the 1G allocation fails and it's pointless to try moving a 1G page for a new 1G allocation or scan the tail pages for valid PFNs. Reintroduce the PageHuge check and assume any contiguous region with hugetlbfs pages is unsuitable for a new 1G allocation. The hpagealloc test allocates huge pages in batches and reports the average latency per page over time. This test happens just after boot when fragmentation is not an issue. Units are in milliseconds. hpagealloc 6.3.0-rc6 6.3.0-rc6 6.3.0-rc6 vanilla hugeallocrevert-v1r1 hugeallocsimple-v1r2 Min Latency 26.42 ( 0.00%) 5.07 ( 80.82%) 18.94 ( 28.30%) 1st-qrtle Latency 356.61 ( 0.00%) 5.34 ( 98.50%) 19.85 ( 94.43%) 2nd-qrtle Latency 697.26 ( 0.00%) 5.47 ( 99.22%) 20.44 ( 97.07%) 3rd-qrtle Latency 972.94 ( 0.00%) 5.50 ( 99.43%) 20.81 ( 97.86%) Max-1 Latency 26.42 ( 0.00%) 5.07 ( 80.82%) 18.94 ( 28.30%) Max-5 Latency 82.14 ( 0.00%) 5.11 ( 93.78%) 19.31 ( 76.49%) Max-10 Latency 150.54 ( 0.00%) 5.20 ( 96.55%) 19.43 ( 87.09%) Max-90 Latency 1164.45 ( 0.00%) 5.53 ( 99.52%) 20.97 ( 98.20%) Max-95 Latency 1223.06 ( 0.00%) 5.55 ( 99.55%) 21.06 ( 98.28%) Max-99 Latency 1278.67 ( 0.00%) 5.57 ( 99.56%) 22.56 ( 98.24%) Max Latency 1310.90 ( 0.00%) 8.06 ( 99.39%) 26.62 ( 97.97%) Amean Latency 678.36 ( 0.00%) 5.44 * 99.20%* 20.44 * 96.99%* 6.3.0-rc6 6.3.0-rc6 6.3.0-rc6 vanilla revert-v1 hugeallocfix-v2 Duration User 0.28 0.27 0.30 Duration System 808.66 17.77 35.99 Duration Elapsed 830.87 18.08 36.33 The vanilla kernel is poor, taking up to 1.3 second to allocate a huge page and almost 10 minutes in total to run the test. Reverting the problematic commit reduces it to 8ms at worst and the patch takes 26ms. This patch fixes the main issue with skipping huge pages but leaves the page_count() out because a page with an elevated count potentially can migrate. BugLink: https://bugzilla.kernel.org/show_bug.cgi?id=217022 Link: https://lkml.kernel.org/r/20230414141429.pwgieuwluxwez3rj@techsingularity.net Fixes: `eb14d4eefd` ("mm,page_alloc: drop unnecessary checks from pfn_range_valid_contig") Change-Id: I552f0631f15e41038219e207c994fa7702b269fa Signed-off-by: Mel Gorman <mgorman@techsingularity.net> Reported-by: Yuanxi Liu <y.liu@naruida.com> Acked-by: Vlastimil Babka <vbabka@suse.cz> Reviewed-by: David Hildenbrand <david@redhat.com> Acked-by: Michal Hocko <mhocko@suse.com> Reviewed-by: Oscar Salvador <osalvador@suse.de> Cc: Matthew Wilcox <willy@infradead.org> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> (cherry picked from commit `059f24aff6`) Signed-off-by: Greg Kroah-Hartman <gregkh@google.com>	2023-05-31 16:27:26 +00:00
Uttkarsh Aggarwal	c0462c4b11	UPSTREAM: usb: gadget: f_fs: Add unbind event before functionfs_unbind While exercising the unbind path, with the current implementation the functionfs_unbind would be calling which waits for the ffs->mutex to be available, however within the same time ffs_ep0_read is invoked & if no setup packets are pending, it will invoke function wait_event_interruptible_exclusive_locked_irq which by definition waits for the ev.count to be increased inside the same mutex for which functionfs_unbind is waiting. This creates deadlock situation because the functionfs_unbind won't get the lock until ev.count is increased which can only happen if the caller ffs_func_unbind can proceed further. Following is the illustration: CPU1 CPU2 ffs_func_unbind() ffs_ep0_read() mutex_lock(ffs->mutex) wait_event(ffs->ev.count) functionfs_unbind() mutex_lock(ffs->mutex) mutex_unlock(ffs->mutex) ffs_event_add() <deadlock> Fix this by moving the event unbind before functionfs_unbind to ensure the ev.count is incrased properly. Fixes: `6a19da1110` ("usb: gadget: f_fs: Prevent race during ffs_ep0_queue_wait") Cc: stable <stable@kernel.org> Signed-off-by: Uttkarsh Aggarwal <quic_uaggarwa@quicinc.com> Link: https://lore.kernel.org/r/20230525092854.7992-1-quic_uaggarwa@quicinc.com Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Bug: 285072336 (cherry picked from commit `efb6b53520`) Change-Id: I1a001606f62f1966825d47809cd1c887e3d6fb71 Signed-off-by: Uttkarsh Aggarwal <quic_uaggarwa@quicinc.com>	2023-05-31 16:16:30 +00:00

1 2 3 4 5 ...

1149502 Commits