mirror of https://github.com/hardkernel/linux.git synced 2026-06-06 19:08:57 +09:00

Go to file

Colin Cross 301c56064d UPSTREAM: mm: add a field to store names for private anonymous memory

In many userspace applications, and especially in VM based applications
like Android uses heavily, there are multiple different allocators in
use.  At a minimum there is libc malloc and the stack, and in many cases
there are libc malloc, the stack, direct syscalls to mmap anonymous
memory, and multiple VM heaps (one for small objects, one for big
objects, etc.).  Each of these layers usually has its own tools to
inspect its usage; malloc by compiling a debug version, the VM through
heap inspection tools, and for direct syscalls there is usually no way
to track them.

On Android we heavily use a set of tools that use an extended version of
the logic covered in Documentation/vm/pagemap.txt to walk all pages
mapped in userspace and slice their usage by process, shared (COW) vs.
unique mappings, backing, etc.  This can account for real physical
memory usage even in cases like fork without exec (which Android uses
heavily to share as many private COW pages as possible between
processes), Kernel SamePage Merging, and clean zero pages.  It produces
a measurement of the pages that only exist in that process (USS, for
unique), and a measurement of the physical memory usage of that process
with the cost of shared pages being evenly split between processes that
share them (PSS).

If all anonymous memory is indistinguishable then figuring out the real
physical memory usage (PSS) of each heap requires either a pagemap
walking tool that can understand the heap debugging of every layer, or
for every layer's heap debugging tools to implement the pagemap walking
logic, in which case it is hard to get a consistent view of memory
across the whole system.

Tracking the information in userspace leads to all sorts of problems.
It either needs to be stored inside the process, which means every
process has to have an API to export its current heap information upon
request, or it has to be stored externally in a filesystem that somebody
needs to clean up on crashes.  It needs to be readable while the process
is still running, so it has to have some sort of synchronization with
every layer of userspace.  Efficiently tracking the ranges requires
reimplementing something like the kernel vma trees, and linking to it
from every layer of userspace.  It requires more memory, more syscalls,
more runtime cost, and more complexity to separately track regions that
the kernel is already tracking.

This patch adds a field to /proc/pid/maps and /proc/pid/smaps to show a
userspace-provided name for anonymous vmas.  The names of named
anonymous vmas are shown in /proc/pid/maps and /proc/pid/smaps as
[anon:<name>].

Userspace can set the name for a region of memory by calling

   prctl(PR_SET_VMA, PR_SET_VMA_ANON_NAME, start, len, (unsigned long)name)

Setting the name to NULL clears it.  The name length limit is 80 bytes
including NUL-terminator and is checked to contain only printable ascii
characters (including space), except '[',']','\','$' and '`'.

Ascii strings are being used to have a descriptive identifiers for vmas,
which can be understood by the users reading /proc/pid/maps or
/proc/pid/smaps.  Names can be standardized for a given system and they
can include some variable parts such as the name of the allocator or a
library, tid of the thread using it, etc.

The name is stored in a pointer in the shared union in vm_area_struct
that points to a null terminated string.  Anonymous vmas with the same
name (equivalent strings) and are otherwise mergeable will be merged.
The name pointers are not shared between vmas even if they contain the
same name.  The name pointer is stored in a union with fields that are
only used on file-backed mappings, so it does not increase memory usage.

CONFIG_ANON_VMA_NAME kernel configuration is introduced to enable this
feature.  It keeps the feature disabled by default to prevent any
additional memory overhead and to avoid confusing procfs parsers on
systems which are not ready to support named anonymous vmas.

The patch is based on the original patch developed by Colin Cross, more
specifically on its latest version [1] posted upstream by Sumit Semwal.
It used a userspace pointer to store vma names.  In that design, name
pointers could be shared between vmas.  However during the last
upstreaming attempt, Kees Cook raised concerns [2] about this approach
and suggested to copy the name into kernel memory space, perform
validity checks [3] and store as a string referenced from
vm_area_struct.

One big concern is about fork() performance which would need to strdup
anonymous vma names.  Dave Hansen suggested experimenting with
worst-case scenario of forking a process with 64k vmas having longest
possible names [4].  I ran this experiment on an ARM64 Android device
and recorded a worst-case regression of almost 40% when forking such a
process.

This regression is addressed in the followup patch which replaces the
pointer to a name with a refcounted structure that allows sharing the
name pointer between vmas of the same name.  Instead of duplicating the
string during fork() or when splitting a vma it increments the refcount.

[1] https://lore.kernel.org/linux-mm/20200901161459.11772-4-sumit.semwal@linaro.org/
[2] https://lore.kernel.org/linux-mm/202009031031.D32EF57ED@keescook/
[3] https://lore.kernel.org/linux-mm/202009031022.3834F692@keescook/
[4] https://lore.kernel.org/linux-mm/5d0358ab-8c47-2f5f-8e43-23b89d6a8e95@intel.com/

Changes for prctl(2) manual page (in the options section):

PR_SET_VMA
	Sets an attribute specified in arg2 for virtual memory areas
	starting from the address specified in arg3 and spanning the
	size specified	in arg4. arg5 specifies the value of the attribute
	to be set. Note that assigning an attribute to a virtual memory
	area might prevent it from being merged with adjacent virtual
	memory areas due to the difference in that attribute's value.

	Currently, arg2 must be one of:

	PR_SET_VMA_ANON_NAME
		Set a name for anonymous virtual memory areas. arg5 should
		be a pointer to a null-terminated string containing the
		name. The name length including null byte cannot exceed
		80 bytes. If arg5 is NULL, the name of the appropriate
		anonymous virtual memory areas will be reset. The name
		can contain only printable ascii characters (including
                space), except '[',']','\','$' and '`'.

                This feature is available only if the kernel is built with
                the CONFIG_ANON_VMA_NAME option enabled.

[surenb@google.com: docs: proc.rst: /proc/PID/maps: fix malformed table]
  Link: https://lkml.kernel.org/r/20211123185928.2513763-1-surenb@google.com
[surenb: rebased over v5.15-rc6, replaced userpointer with a kernel copy,
 added input sanitization and CONFIG_ANON_VMA_NAME config. The bulk of the
 work here was done by Colin Cross, therefore, with his permission, keeping
 him as the author]

Link: https://lkml.kernel.org/r/20211019215511.3771969-2-surenb@google.com
Signed-off-by: Colin Cross <ccross@google.com>
Signed-off-by: Suren Baghdasaryan <surenb@google.com>
Reviewed-by: Kees Cook <keescook@chromium.org>
Cc: Stephen Rothwell <sfr@canb.auug.org.au>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Cc: Cyrill Gorcunov <gorcunov@openvz.org>
Cc: Dave Hansen <dave.hansen@intel.com>
Cc: David Rientjes <rientjes@google.com>
Cc: "Eric W. Biederman" <ebiederm@xmission.com>
Cc: Hugh Dickins <hughd@google.com>
Cc: Ingo Molnar <mingo@kernel.org>
Cc: Jan Glauber <jan.glauber@gmail.com>
Cc: Johannes Weiner <hannes@cmpxchg.org>
Cc: John Stultz <john.stultz@linaro.org>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Oleg Nesterov <oleg@redhat.com>
Cc: Pekka Enberg <penberg@kernel.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Rob Landley <rob@landley.net>
Cc: "Serge E. Hallyn" <serge.hallyn@ubuntu.com>
Cc: Shaohua Li <shli@fusionio.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

(cherry picked from commit 9a10064f56)

Bug: 120441514
Signed-off-by: Suren Baghdasaryan <surenb@google.com>
Change-Id: I53d56d551a7d62f75341304751814294b447c04e

2022-01-18 15:30:27 -08:00

android

ANDROID: GKI: Enable system_dlkm build for gki

2022-01-14 20:01:23 +00:00

arch

ANDROID: GKI: Enable TRACE_MMIO_ACCESS config for gki_defconfig

2022-01-18 16:54:52 +00:00

block

Merge 5.15.11 into android13-5.15

2021-12-29 11:32:19 +01:00

certs

certs: Add support for using elliptic curve keys for signing modules

2021-08-23 19:55:42 +03:00

crypto

Merge 5.15.3 into android13-5.15

2021-11-19 15:38:07 +01:00

Documentation

UPSTREAM: mm: add a field to store names for private anonymous memory

2022-01-18 15:30:27 -08:00

drivers

ANDROID: gic: Add vendor hook to GIC

2022-01-17 08:32:50 +00:00

UPSTREAM: mm: add a field to store names for private anonymous memory

2022-01-18 15:30:27 -08:00

include

UPSTREAM: mm: add a field to store names for private anonymous memory

2022-01-18 15:30:27 -08:00

init

ANDROID: GKI: Do not force select MODULE_SIG_ALL

2022-01-14 20:00:05 +00:00

ipc

shm: extend forced shm destroy to support objects from several IPC nses

2021-11-25 09:48:42 +01:00

kernel

UPSTREAM: mm: add a field to store names for private anonymous memory

2022-01-18 15:30:27 -08:00

lib

Merge 5.15.7 into android13-5.15

2021-12-08 13:46:21 +01:00

LICENSES

LICENSES/dual/CC-BY-4.0: Git rid of "smart quotes"

2021-07-15 06:31:24 -06:00

UPSTREAM: mm: add a field to store names for private anonymous memory

2022-01-18 15:30:27 -08:00

net

Merge 5.15.14 into android13-5.15

2022-01-12 09:00:42 +01:00

samples

ftrace/samples: Add missing prototypes direct functions

2022-01-11 15:35:13 +01:00

scripts

ANDROID: GKI: Add script to generate symbol protection headers

2022-01-05 18:38:02 +00:00

security

Merge 5.15.13 into android13-5.15

2022-01-05 15:36:44 +01:00

sound

Merge 5.15.13 into android13-5.15

2022-01-05 15:36:44 +01:00

tools

FROMGIT: tools/resolve_btfids: Build with host flags

2022-01-18 21:28:16 +00:00

usr

.gitignore: prefix local generated files with a slash

2021-05-02 00:43:35 +09:00

virt

KVM: downgrade two BUG_ONs to WARN_ON_ONCE

2021-12-22 09:32:34 +01:00

.clang-format

clang-format: Update with the latest for_each macro list

2021-05-12 23:32:39 +02:00

.cocciconfig

…

.get_maintainer.ignore

Opt out of scripts/get_maintainer.pl

2019-05-16 10:53:40 -07:00

.gitattributes

.gitattributes: use 'dts' diff driver for dts files

2019-12-04 19:44:11 -08:00

.gitignore

.gitignore: ignore only top-level modules.builtin

2021-05-02 00:43:35 +09:00

.mailmap

mailmap: add Andrej Shadura

2021-10-18 20:22:03 -10:00

BUILD.bazel

ANDROID: kleaf: drop toolchain_version = CLANG_VERSION

2022-01-11 21:38:44 +00:00

build.config.aarch64

ANDROID: remove stale variables from build.config files

2022-01-05 14:33:38 +00:00

build.config.allmodconfig

ANDROID: allmodconfig: disable WERROR

2021-12-09 10:57:12 +01:00

build.config.allmodconfig.aarch64

ANDROID: drop KERNEL_DIR setting in build.config.common

2020-08-31 15:20:37 +00:00

build.config.allmodconfig.arm

ANDROID: drop KERNEL_DIR setting in build.config.common

2020-08-31 15:20:37 +00:00

build.config.allmodconfig.x86_64

ANDROID: drop KERNEL_DIR setting in build.config.common

2020-08-31 15:20:37 +00:00

build.config.amlogic

ANDROID: GKI: amlogic: add DTB overlays

2021-02-11 12:23:26 +00:00

build.config.arm

ANDROID: remove stale variables from build.config files

2022-01-05 14:33:38 +00:00

build.config.common

ANDROID: move CLANG_VERSION definition to build.config.constants

2021-12-12 20:10:27 +00:00

build.config.constants

ANDROID: clang: update to 14.0.1

2022-01-10 11:54:53 +00:00

build.config.db845c

ANDROID: db845c: Add symbol list file

2021-11-22 21:45:34 +00:00

build.config.gki

ANDROID: GKI: Add support for a GKI_BUILD_CONFIG_FRAGMENT

2021-06-24 00:26:54 +00:00

build.config.gki_kasan

ANDROID: build.configs: migrate away from CC_LD_ARG

2021-07-02 09:49:23 +00:00

build.config.gki_kasan.aarch64

ANDROID: drop KERNEL_DIR setting in build.config.common

2020-08-31 15:20:37 +00:00

build.config.gki_kasan.x86_64

ANDROID: drop KERNEL_DIR setting in build.config.common

2020-08-31 15:20:37 +00:00

build.config.gki_kprobes

ANDROID: build.configs: migrate away from CC_LD_ARG

2021-07-02 09:49:23 +00:00

build.config.gki_kprobes.aarch64

ANDROID: Adding kprobes build configs for Cuttlefish

2021-03-01 15:29:45 +00:00

build.config.gki_kprobes.x86_64

ANDROID: Adding kprobes build configs for Cuttlefish

2021-03-01 15:29:45 +00:00

build.config.gki-debug.aarch64

ANDROID: drop KERNEL_DIR setting in build.config.common

2020-08-31 15:20:37 +00:00

build.config.gki-debug.x86_64

ANDROID: drop KERNEL_DIR setting in build.config.common

2020-08-31 15:20:37 +00:00

build.config.gki.aarch64

ANDROID: GKI: Enable system_dlkm build for gki

2022-01-14 20:01:23 +00:00

build.config.gki.x86_64

ANDROID: GKI: Enable system_dlkm build for gki

2022-01-14 20:01:23 +00:00

build.config.khwasan

ANDROID: Add a build config fragment for KHWASan.

2021-10-13 19:44:44 +00:00

build.config.rockpi4

ANDROID: Build LZ4 ramdisk for rockpi4

2021-04-09 18:15:40 +00:00

build.config.x86_64

ANDROID: remove stale variables from build.config files

2022-01-05 14:33:38 +00:00

COPYING

COPYING: state that all contributions really are covered by this file

2020-02-10 13:32:20 -08:00

CREDITS

MAINTAINERS: Move Daniel Drake to credits

2021-09-21 08:34:58 +03:00

Kbuild

kbuild: rename hostprogs-y/always to hostprogs/always-y

2020-02-04 01:53:07 +09:00

Kconfig

ANDROID: kbuild: add Kconfig support for external modules

2021-12-21 09:15:51 -08:00

Kconfig.ext

ANDROID: kbuild: add Kconfig support for external modules

2021-12-21 09:15:51 -08:00

MAINTAINERS

Merge tag 'v5.15' into android-mainline

2021-11-01 07:40:54 +01:00

Makefile

Merge 5.15.14 into android13-5.15

2022-01-12 09:00:42 +01:00

OWNERS

ANDROID: Initial branch setup for android13-5.15

2021-11-02 10:19:03 +00:00

README

Drop all 00-INDEX files from Documentation/

2018-09-09 15:08:58 -06:00

README.md

ANDROID: README.md: fix checkpatch.pl path typo

2021-04-07 23:16:50 +00:00

README.md

How do I submit patches to Android Common Kernels

BEST: Make all of your changes to upstream Linux. If appropriate, backport to the stable releases. These patches will be merged automatically in the corresponding common kernels. If the patch is already in upstream Linux, post a backport of the patch that conforms to the patch requirements below.
- Do not send patches upstream that contain only symbol exports. To be considered for upstream Linux, additions of EXPORT_SYMBOL_GPL() require an in-tree modular driver that uses the symbol -- so include the new driver or changes to an existing driver in the same patchset as the export.
- When sending patches upstream, the commit message must contain a clear case for why the patch is needed and beneficial to the community. Enabling out-of-tree drivers or functionality is not not a persuasive case.
LESS GOOD: Develop your patches out-of-tree (from an upstream Linux point-of-view). Unless these are fixing an Android-specific bug, these are very unlikely to be accepted unless they have been coordinated with kernel-team@android.com. If you want to proceed, post a patch that conforms to the patch requirements below.

Common Kernel patch requirements

All patches must conform to the Linux kernel coding standards and pass scripts/checkpatch.pl
Patches shall not break gki_defconfig or allmodconfig builds for arm, arm64, x86, x86_64 architectures (see https://source.android.com/setup/build/building-kernels)
If the patch is not merged from an upstream branch, the subject must be tagged with the type of patch: UPSTREAM:, BACKPORT:, FROMGIT:, FROMLIST:, or ANDROID:.
All patches must have a Change-Id: tag (see https://gerrit-review.googlesource.com/Documentation/user-changeid.html)
If an Android bug has been assigned, there must be a Bug: tag.
All patches must have a Signed-off-by: tag by the author and the submitter

Additional requirements are listed below based on patch type

Requirements for backports from mainline Linux: `UPSTREAM:`, `BACKPORT:`

If the patch is a cherry-pick from Linux mainline with no changes at all
- tag the patch subject with UPSTREAM:.
- add upstream commit information with a (cherry picked from commit ...) line
- Example:
  - if the upstream commit message is

        important patch from upstream

        This is the detailed description of the important patch

        Signed-off-by: Fred Jones <fred.jones@foo.org>

then Joe Smith would upload the patch for the common kernel as

        UPSTREAM: important patch from upstream

        This is the detailed description of the important patch

        Signed-off-by: Fred Jones <fred.jones@foo.org>

        Bug: 135791357
        Change-Id: I4caaaa566ea080fa148c5e768bb1a0b6f7201c01
        (cherry picked from commit c31e73121f4c1ec41143423ac6ce3ce6dafdcec1)
        Signed-off-by: Joe Smith <joe.smith@foo.org>

If the patch requires any changes from the upstream version, tag the patch with BACKPORT: instead of UPSTREAM:.
- use the same tags as UPSTREAM:
- add comments about the changes under the (cherry picked from commit ...) line
- Example:

        BACKPORT: important patch from upstream

        This is the detailed description of the important patch

        Signed-off-by: Fred Jones <fred.jones@foo.org>

        Bug: 135791357
        Change-Id: I4caaaa566ea080fa148c5e768bb1a0b6f7201c01
        (cherry picked from commit c31e73121f4c1ec41143423ac6ce3ce6dafdcec1)
        [joe: Resolved minor conflict in drivers/foo/bar.c ]
        Signed-off-by: Joe Smith <joe.smith@foo.org>

Requirements for other backports: `FROMGIT:`, `FROMLIST:`,

If the patch has been merged into an upstream maintainer tree, but has not yet been merged into Linux mainline
- tag the patch subject with FROMGIT:
- add info on where the patch came from as (cherry picked from commit <sha1> <repo> <branch>). This must be a stable maintainer branch (not rebased, so don't use linux-next for example).
- if changes were required, use BACKPORT: FROMGIT:
- Example:
  - if the commit message in the maintainer tree is

        important patch from upstream

        This is the detailed description of the important patch

        Signed-off-by: Fred Jones <fred.jones@foo.org>

then Joe Smith would upload the patch for the common kernel as

        FROMGIT: important patch from upstream

        This is the detailed description of the important patch

        Signed-off-by: Fred Jones <fred.jones@foo.org>

        Bug: 135791357
        (cherry picked from commit 878a2fd9de10b03d11d2f622250285c7e63deace
         https://git.kernel.org/pub/scm/linux/kernel/git/foo/bar.git test-branch)
        Change-Id: I4caaaa566ea080fa148c5e768bb1a0b6f7201c01
        Signed-off-by: Joe Smith <joe.smith@foo.org>

If the patch has been submitted to LKML, but not accepted into any maintainer tree
- tag the patch subject with FROMLIST:
- add a Link: tag with a link to the submittal on lore.kernel.org
- add a Bug: tag with the Android bug (required for patches not accepted into a maintainer tree)
- if changes were required, use BACKPORT: FROMLIST:
- Example:

        FROMLIST: important patch from upstream

        This is the detailed description of the important patch

        Signed-off-by: Fred Jones <fred.jones@foo.org>

        Bug: 135791357
        Link: https://lore.kernel.org/lkml/20190619171517.GA17557@someone.com/
        Change-Id: I4caaaa566ea080fa148c5e768bb1a0b6f7201c01
        Signed-off-by: Joe Smith <joe.smith@foo.org>

Requirements for Android-specific patches: `ANDROID:`

If the patch is fixing a bug to Android-specific code
- tag the patch subject with ANDROID:
- add a Fixes: tag that cites the patch with the bug
- Example:

        ANDROID: fix android-specific bug in foobar.c

        This is the detailed description of the important fix

        Fixes: 1234abcd2468 ("foobar: add cool feature")
        Change-Id: I4caaaa566ea080fa148c5e768bb1a0b6f7201c01
        Signed-off-by: Joe Smith <joe.smith@foo.org>

If the patch is a new feature
- tag the patch subject with ANDROID:
- add a Bug: tag with the Android bug (required for android-specific features)

README.md

How do I submit patches to Android Common Kernels

Common Kernel patch requirements

Requirements for backports from mainline Linux: UPSTREAM:, BACKPORT:

Requirements for other backports: FROMGIT:, FROMLIST:,

Requirements for Android-specific patches: ANDROID:

Requirements for backports from mainline Linux: `UPSTREAM:`, `BACKPORT:`

Requirements for other backports: `FROMGIT:`, `FROMLIST:`,

Requirements for Android-specific patches: `ANDROID:`