alistair23-linux

redonkable

Josef Bacik d706751215 block: introduce blk-iolatency io controller	2018-07-09 09:07:54 -06:00

Current IO controllers for the block layer are less than ideal for our use case. The io.max controller is great at hard limiting, but it is not work conserving. This patch introduces io.latency. You provide a latency target for your group and we monitor the io in short windows to make sure we are not exceeding those latency targets. This makes use of the rq-qos infrastructure and works much like the wbt stuff. There are a few differences from wbt:

- It's bio based, so the latency covers the whole block layer in addition to the actual io.
- We will throttle all IO types that come in here if we need to.
- We use the mean latency over the 100ms window. This is because writes can be particularly fast, which could give us a false sense of the impact of other workloads on our protected workload.
- By default there's no throttling, we set the queue_depth to INT_MAX so that we can have as many outstanding bios as we're allowed to. Only at throttle time do we pay attention to the actual queue depth.
- We backcharge cgroups for root cg issued IO and induce artificial delays in order to deal with cases like metadata only or swap heavy workloads.

In testing this has worked out relatively well. Protected workloads will throttle noisy workloads down to 1 io at a time if they are doing normal IO on their own, or induce up to a 1 second delay per syscall if they are doing a lot of root issued IO (metadata/swap IO).

Our testing has revolved mostly around our production web servers where we have hhvm (the web server application) in a protected group and everything else in another group. We see slightly higher requests per second (RPS) on the test tier vs the control tier, and much more stable RPS across all machines in the test tier vs the control tier.

Another test we run is a slow memory allocator in the unprotected group. Before, this would eventually push us into swap and cause the whole box to die and not recover at all. With these patches we see slight RPS drops (usually 10-15%) before the memory consumer is properly killed and things recover within seconds.

Signed-off-by: Josef Bacik <jbacik@fb.com>
Acked-by: Tejun Heo <tj@kernel.org>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
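
The windowed check described in the commit message can be illustrated with a small user-space model. This is only a sketch, not the kernel code: the `Group` class, the `end_of_window` function, the halving step, and the immediate reset once the target is met are all assumptions made for illustration. Only the ideas of a per-group latency target, a mean taken over a window, an unlimited default depth, and throttling down to one IO at a time come from the commit message.

```python
from dataclasses import dataclass, field
from statistics import mean
from typing import List, Optional

# Stand-in for INT_MAX: by default a group is not throttled at all.
UNLIMITED = 2**31 - 1


@dataclass
class Group:
    """Hypothetical model of a cgroup as seen by the latency controller."""
    name: str
    target_us: Optional[int] = None      # latency target; None = unprotected
    queue_depth: int = UNLIMITED         # allowed outstanding IOs
    samples_us: List[int] = field(default_factory=list)

    def record(self, latency_us: int) -> None:
        """Record the completion latency of one IO in the current window."""
        self.samples_us.append(latency_us)


def end_of_window(groups: List[Group], device_depth: int) -> bool:
    """Evaluate one monitoring window (100ms in the commit message).

    Returns True if any protected group's mean latency exceeded its target.
    Assumed policy: on a miss, halve the depth of unprotected groups (never
    below 1); otherwise lift the limit again. The real controller's scaling
    and recovery rules are not modelled here.
    """
    missed = any(
        g.target_us is not None
        and g.samples_us
        and mean(g.samples_us) > g.target_us
        for g in groups
    )
    for g in groups:
        if g.target_us is None:
            if missed:
                # Only at throttle time does the actual device depth matter.
                g.queue_depth = max(1, min(g.queue_depth, device_depth) // 2)
            else:
                g.queue_depth = UNLIMITED
        g.samples_us.clear()             # start the next window fresh
    return missed
```

In this model, a protected group with a 1000 microsecond target that observes latencies of 500, 3000 and 2500 microseconds in one window has a mean of 2000 microseconds, so an unprotected group on a device with depth 64 would be cut to 32, and repeated misses would drive it down to a single outstanding IO.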
..
acpi	ACPI / processor: Finish making acpi_processor_ppc_has_changed() void	2018-06-20 10:50:40 +02:00
asm-generic	locking/qspinlock: Fix build for anonymous union in older GCC compilers	2018-06-22 04:19:16 +02:00
clocksource	…
crypto	Revert changes to convert to ->poll_mask() and aio IOCB_CMD_POLL	2018-06-28 10:40:47 -07:00
drm	drm for v4.18-rc1	2018-06-06 08:16:33 -07:00
dt-bindings	dt-bindings: clock: imx6ul: Do not change the clock definition order	2018-06-29 11:40:20 -07:00
keys	docs: Fix some broken references	2018-06-15 18:10:01 -03:00
kvm	KVM: arm/arm64: Bump VGIC_V3_MAX_CPUS to 512	2018-05-25 12:29:27 +01:00
linux	block: introduce blk-iolatency io controller	2018-07-09 09:07:54 -06:00
math-emu	…
media	media: v4l2-core: push taking ioctl mutex down to ioctl handler	2018-05-28 16:31:44 -04:00
memory	…
misc	ocxl: Expose the thread_id needed for wait on POWER9	2018-06-03 20:40:32 +10:00
net	Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net	2018-07-02 11:18:28 -07:00
pcmcia	…
ras	…
rdma	4.18-rc	2018-06-21 07:22:30 +09:00
scsi	SCSI misc on 20180610	2018-06-10 13:01:12 -07:00
soc	ARM: SoC: late updates	2018-06-11 18:19:45 -07:00
sound	sound updates for 4.18	2018-06-06 09:08:38 -07:00
target	scsi: target: transport should handle st FM/EOM/ILI reads	2018-05-18 12:22:48 -04:00
trace	NFS client updates for Linux 4.18	2018-06-12 10:09:03 -07:00
uapi	include/uapi/linux/blkzoned.h: Remove a superfluous __packed directive	2018-07-09 09:07:52 -06:00
video	fbdev changes for v4.18:	2018-06-17 05:00:24 +09:00
xen	xen: fixes for 4.18-rc2	2018-06-23 20:44:11 +08:00