linux

mirror of https://github.com/hardkernel/linux.git synced 2026-06-11 05:17:10 +09:00

Morten Rasmussen 4f37ec6ac1 FROMLIST: sched: Add over-utilization/tipping point indicator

Energy-aware scheduling is only meant to be active while the system is
_not_ over-utilized. That is, there are spare cycles available to shift
tasks around based on their actual utilization to get a more
energy-efficient task distribution without depriving any tasks. When
above the tipping point, task placement is done the traditional way based
on load_avg, spreading the tasks across as many cpus as possible based
on priority-scaled load to preserve smp_nice. Below the tipping point we
want to use util_avg instead. We need to define a criterion for when we
make the switch.

The util_avg for each cpu converges towards 100% regardless of how many
additional tasks we may put on it. If we define over-utilized as:

sum_{cpus}(rq.cfs.avg.util_avg) + margin > sum_{cpus}(rq.capacity)

some individual cpus may be over-utilized running multiple tasks even
when the above condition is false. That should be okay as long as we try
to spread the tasks out to avoid per-cpu over-utilization as much as
possible and if all tasks have the _same_ priority. If the latter isn't
true, we have to consider priority to preserve smp_nice.
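
The system-wide definition above can be sketched in user-space C as
follows. This is only an illustration: struct cpu_stat and
system_overutilized() are hypothetical names introduced for the sketch,
and the fields merely mirror rq.cfs.avg.util_avg and rq.capacity. It is
not the code added by this patch.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical per-cpu snapshot (assumption, not a kernel type). */
struct cpu_stat {
	unsigned long util_avg;	/* utilization, same scale as capacity */
	unsigned long capacity;	/* capacity available on this cpu */
};

/* True when sum(util_avg) + margin > sum(capacity) over all cpus. */
static bool system_overutilized(const struct cpu_stat *cpus, size_t n,
				unsigned long margin)
{
	unsigned long util = 0, cap = 0;

	assert(cpus != NULL && n > 0);
	for (size_t i = 0; i < n; i++) {
		util += cpus[i].util_avg;
		cap += cpus[i].capacity;
	}
	return util + margin > cap;
}
```

Note that a system with one cpu at util_avg == capacity and another
nearly idle still evaluates to false here, which is the per-cpu
over-utilization case described above.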

For example, we could have n_cpus nice=-10 util_avg=55% tasks and
n_cpus/2 nice=0 util_avg=60% tasks. Balancing based on util_avg we are
likely to end up with nice=-10 tasks sharing cpus and nice=0 tasks
getting their own, as we have 1.5*n_cpus tasks in total and 55%+55% is
less over-utilized than 55%+60% for those cpus that have to be shared.
The system utilization is only 85% of the system capacity, but we are
breaking smp_nice.
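
The arithmetic in this example can be checked with a small sketch. The
helper names are hypothetical, and the sketch assumes that all cpus have
equal capacity and that n_cpus is even.

```c
#include <assert.h>

/*
 * Hypothetical helper: percentage of total capacity used by n_cpus tasks
 * at util_a percent each plus n_cpus/2 tasks at util_b percent each.
 * Assumes equal-capacity cpus and an even n_cpus.
 */
static unsigned int system_util_pct(unsigned int n_cpus,
				    unsigned int util_a, unsigned int util_b)
{
	assert(n_cpus > 0 && n_cpus % 2 == 0);
	return (n_cpus * util_a + (n_cpus / 2) * util_b) / n_cpus;
}

/* Utilization, in percent, of one cpu shared by two tasks. */
static unsigned int shared_cpu_pct(unsigned int util_x, unsigned int util_y)
{
	return util_x + util_y;
}
```

With n_cpus = 8, system_util_pct(8, 55, 60) evaluates to 85, while a
shared cpu sits at 110% with two nice=-10 tasks and at 115% with one
task of each kind, so a util_avg-based balancer prefers the former.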

To be sure not to break smp_nice, we have defined over-utilization
conservatively as when any cpu in the system is fully utilized at its
highest frequency instead:

cpu_rq(any).cfs.avg.util_avg + margin > cpu_rq(any).capacity

IOW, as soon as one cpu is (nearly) 100% utilized, we switch to load_avg
to factor in priority to preserve smp_nice.
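
A minimal user-space sketch of this conservative, per-cpu condition is
shown below. The names struct cpu_sample, one_cpu_overutilized() and
any_cpu_overutilized() are assumptions made for illustration only; the
patch itself may structure the check differently.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical per-cpu snapshot (assumption, not a kernel type). */
struct cpu_sample {
	unsigned long util_avg;	/* utilization, same scale as capacity */
	unsigned long capacity;	/* capacity available on this cpu */
};

/* util_avg + margin > capacity for a single cpu. */
static bool one_cpu_overutilized(const struct cpu_sample *c,
				 unsigned long margin)
{
	return c->util_avg + margin > c->capacity;
}

/* The system counts as over-utilized as soon as any one cpu is. */
static bool any_cpu_overutilized(const struct cpu_sample *cpus, size_t n,
				 unsigned long margin)
{
	assert(cpus != NULL);
	for (size_t i = 0; i < n; i++) {
		if (one_cpu_overutilized(&cpus[i], margin))
			return true;
	}
	return false;
}
```

Compared with the sum-based definition, a single nearly saturated cpu is
enough to flip the result, regardless of how idle the other cpus are.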

With this definition, we can skip periodic load-balance as no cpu has an
always-running task when the system is not over-utilized. All tasks will
be periodic and we can balance them at wake-up. This conservative
condition does however mean that some scenarios that could benefit from
energy-aware decisions even if one cpu is fully utilized would not get
those benefits.

For systems where some cpus might have reduced capacity (RT-pressure
and/or big.LITTLE), we want periodic load-balance checks as soon as just
a single cpu is fully utilized, as it might be one of those with reduced
capacity, and in that case we want to migrate its task.

cc: Ingo Molnar <mingo@redhat.com>
cc: Peter Zijlstra <peterz@infradead.org>
Signed-off-by: Morten Rasmussen <morten.rasmussen@arm.com>
[ Added a comment explaining why new tasks are not accounted during
  overutilization detection ]
Signed-off-by: Quentin Perret <quentin.perret@arm.com>
Message-Id: <20181016101513.26919-13-quentin.perret@arm.com>
Signed-off-by: Quentin Perret <quentin.perret@arm.com>

Change-Id: I19f816054adfd2dfa9a69fa92c1589f62794a218

2018-10-26 11:47:12 +01:00

arch

UPSTREAM: sched/topology, arch/arm: Rebuild sched_domain hierarchy when CPU capacity changes

2018-10-26 11:44:45 +01:00

block

block: don't deal with discard limit in blkdev_issue_discard()

2018-10-18 07:23:40 -06:00

certs

export.h: remove VMLINUX_SYMBOL() and VMLINUX_SYMBOL_STR()

2018-08-22 23:21:44 +09:00

crypto

Merge tag 'dmaengine-4.19-rc1' of git://git.infradead.org/users/vkoul/slave-dma

2018-08-18 15:55:59 -07:00

Documentation

Code of Conduct: Change the contact email address

2018-10-22 07:33:36 +01:00

drivers

FROMLIST: sched/topology: Make Energy Aware Scheduling depend on schedutil

2018-10-26 11:47:11 +01:00

firmware

kbuild: remove all dummy assignments to obj-

2017-11-18 11:46:06 +09:00

fs

fscache: Fix out of bound read in long cookie keys

2018-10-18 11:32:21 +02:00

include

FROMLIST: sched: Introduce a sysctl for Energy Aware Scheduling

2018-10-26 11:47:12 +01:00

init

Merge tag 'kbuild-v4.19-2' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild

2018-08-25 13:40:38 -07:00

ipc

ipc/shm.c: use ERR_CAST() for shm_lock() error return

2018-10-05 16:32:04 -07:00

kernel

FROMLIST: sched: Add over-utilization/tipping point indicator

2018-10-26 11:47:12 +01:00

lib

test_ida: Fix lockdep warning

2018-10-15 16:31:29 -04:00

LICENSES

LICENSES: Remove CC-BY-SA-4.0 license text

2018-10-18 11:28:50 +02:00

mm

mremap: properly flush TLB before releasing the page

2018-10-18 11:30:52 +02:00

net

Revert "neighbour: force neigh_invalidate when NUD_FAILED update is from admin"

2018-10-20 22:25:01 -07:00

samples

samples: disable CONFIG_SAMPLES for UML

2018-10-11 02:15:46 +09:00

scripts

Merge tag 'kbuild-fixes-v4.19-2' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild

2018-10-11 19:23:07 +02:00

security

Revert "uapi/linux/keyctl.h: don't use C++ reserved keyword as a struct member name"

2018-09-25 13:28:58 +02:00

sound

ALSA: hda/realtek - Cannot adjust speaker's volume on Dell XPS 27 7760

2018-10-04 07:50:48 +02:00

tools

Merge branch 'perf-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

2018-10-20 15:02:51 +02:00

usr

initramfs: move gen_initramfs_list.sh from scripts/ to usr/

2018-08-22 23:21:44 +09:00

virt

KVM: Remove obsolete kvm_unmap_hva notifier backend

2018-09-07 15:06:02 +02:00

.clang-format

clang-format: Set IndentWrappedFunctionNames false

2018-08-01 18:38:51 +02:00

.cocciconfig

scripts: add Linux .cocciconfig for coccinelle

2016-07-22 12:13:39 +02:00

.get_maintainer.ignore

Add hch to .get_maintainer.ignore

2015-08-21 14:30:10 -07:00

.gitattributes

.gitattributes: set git diff driver for C source code files

2016-10-07 18:46:30 -07:00

.gitignore

Merge tag 'kbuild-v4.17-2' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild

2018-04-15 17:21:30 -07:00

.mailmap

Merge tag 'libnvdimm-for-4.19_misc' of gitolite.kernel.org:pub/scm/linux/kernel/git/nvdimm/nvdimm

2018-08-25 18:13:10 -07:00

COPYING

COPYING: use the new text with points to the license files

2018-03-23 12:41:45 -06:00

CREDITS

9p: remove Ron Minnich from MAINTAINERS

2018-08-17 16:20:26 -07:00

Kbuild

Merge tag 'kbuild-v4.15' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild

2017-11-17 17:45:29 -08:00

Kconfig

kconfig: move the "Executable file formats" menu to fs/Kconfig.binfmt

2018-08-02 08:06:55 +09:00

MAINTAINERS

MAINTAINERS: Add an entry for the code of conduct

2018-10-22 07:33:36 +01:00

Makefile

Linux 4.19

2018-10-22 07:37:37 +01:00

README

Docs: Added a pointer to the formatted docs to README

2018-03-21 09:02:53 -06:00

README

Linux kernel
============

There are several guides for kernel developers and users. These guides can
be rendered in a number of formats, like HTML and PDF. Please read
Documentation/admin-guide/README.rst first.

In order to build the documentation, use ``make htmldocs`` or
``make pdfdocs``.  The formatted documentation can also be read online at:

    https://www.kernel.org/doc/html/latest/

There are various text files in the Documentation/ subdirectory,
several of them using the reStructuredText markup notation.
See Documentation/00-INDEX for a list of what is contained in each file.

Please read the Documentation/process/changes.rst file, as it contains the
requirements for building and running the kernel, and information about
the problems which may result from upgrading your kernel.