Sunday, July 9, 2017

VRQ 0.96d release

VRQ 0.96d is released with the following changes

1. SMT-sensitive scheduling improvement, which reduces some migration overhead.
2. Fix a livepatch compilation issue.

This is a bug-fix and SMT-sensitive scheduling improvement release.

Enjoy VRQ 0.96d for the v4.12 kernel, :)

Code is available at
https://bitbucket.org/alfredchen/linux-gc/commits/branch/linux-4.12.y-vrq
and also at
https://github.com/cchalpha/linux-gc/commits/linux-4.12.y-vrq

An all-in-one patch is available too.

BR Alfred 

59 comments:

  1. @Alfred,

    compiled fine, will test soon on i7 @ work.

    Br, Eduardo

    Replies
    1. @Alfred,

      One day of usage on the i7 and 4 hrs of gaming on the Ryzen (Wine and native) work fine.
      I haven't tested performance yet, but my subjective feeling is that it's now better; using "ondemand" works well for all scenarios, even gaming.

      Br, Eduardo

  2. @Alfred
    One day of use on the Ryzen and i7 machines without problems. I tried some tests with memory allocation and the VRQ scheduler actually performs better than CFS, so my previous suspicion seems to be wrong. I will keep looking. Thanks for the updates.

    BR, Dzon.

  3. @all
    Thanks. So far, the feedback is positive. I am still working on reducing the overhead of SMT-sensitive scheduling. I will announce another release when it is done.

  4. Hi, I have one question: what is the best setting for CONFIG_HZ? I was never interested in this setting, but today I read that MuQSS recommends a tick rate of 100 for that patch, and I can't find any info about the recommended setting for VRQ.

    Replies
    1. I don't have a recommended HZ for VRQ. Just follow the general rule: high HZ for interactivity, low HZ where interactivity matters less (for example, servers).

    2. Thx for the quick reply. So, if I understand correctly (I use a laptop), is it better to set 100 than 1000 for better battery life? Or maybe this setting has nothing to do with battery? By the way, I noticed better battery life with VRQ on my i3 Skylake, so I use this patch on kernel 4.12 with pleasure!

    3. It depends on what you want to trade off. 100HZ should give longer battery life than 1000HZ, but you have to test how much longer that is (the CPU is not always the major part of power consumption) and consider whether you want to trade that extra run time for 1000HZ interactivity.

    4. @Krzysztof:
      Depending on your system timer's capabilities, you can also try non-decimal values to benefit from possible micro-optimizations (values being powers of 2). These were described on Con Kolivas' blog some months ago. He discarded them because too many people complained about errors compared with the decimal values allowed in mainline. I've tested many of them, and the lowest on my system without errors was 512HZ. I haven't noticed any negative impact compared with the interactivity-promoted 1000HZ.
      I'll attach a link to my little patch; I hope it works for you (you need to select 512 HZ!), and feel free to edit it downwards in powers of two (256, 128). Just replace the 512 value.
      https://pastebin.com/rdicHvvh

      Best regards,
      Manuel Krause

  5. Works better than MuQSS on a game server, thanks!

  6. Sorry for again posting an off-topic question -- now regarding the BFQ I/O scheduler:
    I'm a little confused about the current in-kernel status: is BFQ-sq (single-queue, the previously known one) also included, or only the newer mq- (multiqueue-) based one? If only the latter is in, does any of you know whether there is an extra patch for the former?

    Thank you in advance for any info on this, best regards,
    Manuel Krause

    Replies
    1. @Manuel,

      This question might be better asked in https://groups.google.com/forum/#!forum/bfq-iosched. Paolo is quite active there.

      Br, Eduardo

    2. @Eduardo:
      You're right, of course, and I'm reading the mentioned list, though without a subscription to write there.
      I just thought I'd ask you here before bothering Paolo himself. You understand? :-?

      BR, Manuel

    3. @Manuel,

      You may want to check this: https://groups.google.com/d/msg/bfq-iosched/2odL08qoPS0/qnqVLRwZAwAJ .
      I'm already building 4.12.2 + those patches :)

      Br, Eduardo

    4. @Eduardo:
      Thanks for the heads-up about these new fixes!
      I've decided to opt for the future, meaning the mainlined bfq-mq (now with 4.12.2), and not to bother begging for the old stack. Currently, I'm giving the new one a try.

      Maybe of benefit for now: thanks to post-factum/Oleksandr and his https://pf.natalenko.name/news/ for a new udev rule explicitly setting bfq as the default I/O scheduler:

      $ cat /etc/udev/rules.d/10-bfq.rules
      ACTION=="add|change", KERNEL=="sd[a-z]*", ATTR{queue/scheduler}="bfq"

      For those who have not yet set MQ as the default in their kernel .config, these kernel command-line parameters are needed: "scsi_mod.use_blk_mq=1 dm_mod.use_blk_mq=1".
      You can check with "cat /sys/block/sd*/queue/scheduler" whether it's set for your hard drives. (You may all know this already, I just added it for completeness.)

      Best regards to all of you on here,
      Manuel Krause

    5. @Manuel,

      Just a note: if one happens to have an NVMe drive and wants to use BFQ for whatever reason, like me, the path is a tad different: /sys/block/nvme0n1/queue/scheduler .

      Br, Eduardo

    6. @Eduardo:
      Thank you for adding this path info, and for encouraging me to test it.
      In my non-benchmarked everyday use I don't see a difference from the 4.11 kernel, which is a good sign IMO, including a well-working VRQ for 4.12, thanks to Alfred's work.

      BR, Manuel

  7. @Manuel,

    Personally, I have currently given up on extensive tests or use of "bfq-sq" or "bfq-mq", due to unexplained crashes. Once in a while I use bfq on my Ryzen system, but when it crashes due to bfq or other fancy stuff I do there (including VRQ), I switch back to deadline. The thing is that I don't even know which software is responsible for a crash, so I'm trying to isolate things :(
    Of course, bfq matters a lot for maintaining an interactive system on rotational hardware like my Ryzen box; I feel that every time I switch away from bfq.
    Once I've convinced myself that the other software on my system is reasonably stable, I'll be back to the bfq testing fun.

    As far as I know, everything is in the so-called "algodev" branch; you probably need to check the commits there to assemble a stable patch. I haven't tried that myself for some time now. In addition, as far as I understand, there is a bfq renaming effort going on, which adds a bit more confusion.

    If you figure everything out, please share your knowledge ;)

    Br, Eduardo

    P.S. But as for VRQ, it's all good now, everything seems to run nicely! Thanks Alfred!

    Replies
    1. I used to use bfq before 4.12; now I have switched to noop, as my two working machines are on SSDs (maybe deadline is a better choice, but I already have benchmark data collected with noop, so to keep things comparable I will try deadline after my work is done), and the mainlined bfq-mq needs too many other kernel options to be enabled. So let me know if you guys figure out a simple way to bring back bfq, :)

  8. Alfred,

    I have a report regarding a failed build on a 32-bit machine.

    Log: https://gist.githubusercontent.com/Pro-pra/6aed3990932d5b6906fc30e71a9ef8ee/raw/75d92c8e136b6e4b92b619c69d2e270a66fe8f65/err.txt

    Config: https://gist.githubusercontent.com/Pro-pra/6aed3990932d5b6906fc30e71a9ef8ee/raw/75d92c8e136b6e4b92b619c69d2e270a66fe8f65/DOTconfig-4.12

    I assume this happens due to a name collision: there are two distinct declarations with the same name, the "raw_spinlock_t sched_cpu_priodls_lock" variable and the function that acquires this spinlock, sched_cpu_priodls_lock().
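
    Roughly, the pattern and one possible way out look like this (a simplified sketch with hypothetical helper names, not the VRQ code and not my actual patch):

    #include <linux/spinlock.h>

    /* A file-scope object and a function cannot share one identifier in the
     * same translation unit; giving the helpers their own names avoids the
     * clash while keeping the lock itself as declared. */
    static DEFINE_RAW_SPINLOCK(sched_cpu_priodls_lock);

    static inline void sched_cpu_priodls_lock_acquire(void)
    {
            raw_spin_lock(&sched_cpu_priodls_lock);
    }

    static inline void sched_cpu_priodls_lock_release(void)
    {
            raw_spin_unlock(&sched_cpu_priodls_lock);
    }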

    Proposed patch from me: http://ix.io/ywT

    Could you please check this?

    Thanks.

    Replies
    1. @pf
      Thanks for reporting this. I must have changed the name some time ago and never verified it with a 32-bit kernel build. My only board running 32-bit is the Raspberry Pi 2, and it is still running a 4.6 kernel. It deserves some love, :)
      I'll include your patch in the next release.

  9. I know that this patch is for kernel 4.12, but I tried installing VRQ on a 4.13 rc kernel and I get only one error:

    Hunk #1 FAILED at 15.
    1 out of 1 hunk FAILED -- saving rejects to file kernel/sched/Makefile.rej

    cat Makefile.rej

    --- kernel/sched/Makefile
    +++ kernel/sched/Makefile
    @@ -15,13 +15,17 @@ ifneq ($(CONFIG_SCHED_OMIT_FRAME_POINTER),y)
    CFLAGS_core.o := $(PROFILING) -fno-omit-frame-pointer
    endif

    -obj-y += core.o loadavg.o clock.o cputime.o
    -obj-y += idle_task.o fair.o rt.o deadline.o stop_task.o
    -obj-y += wait.o swait.o completion.o idle.o
    -obj-$(CONFIG_SMP) += cpupri.o cpudeadline.o topology.o
    +ifdef CONFIG_SCHED_BFS
    +obj-y += bfs.o
    +else
    +obj-y += core.o idle_task.o fair.o rt.o deadline.o stop_task.o
    +obj-$(CONFIG_SMP) += cpudeadline.o topology.o
    obj-$(CONFIG_SCHED_AUTOGROUP) += autogroup.o
    -obj-$(CONFIG_SCHEDSTATS) += stats.o
    obj-$(CONFIG_SCHED_DEBUG) += debug.o
    obj-$(CONFIG_CGROUP_CPUACCT) += cpuacct.o
    +endif
    +obj-y += cputime.o wait.o swait.o completion.o idle.o clock.o loadavg.o
    +obj-$(CONFIG_SMP) += cpupri.o
    +obj-$(CONFIG_SCHEDSTATS) += stats.o
    obj-$(CONFIG_CPU_FREQ) += cpufreq.o
    obj-$(CONFIG_CPU_FREQ_GOV_SCHEDUTIL) += cpufreq_schedutil.o

    Replies
    1. Don't be in such a hurry for the next kernel release, :). There are always lots of scheduler changes in mainline from release to release, so we always need to pick up those sync-up changes and adapt them in VRQ. I usually start the sync-up work at rc6 or rc7; by that time, the scheduler changes are stable.

  10. Huh, just got this WARN_ON: https://gist.github.com/a4db6d8909f825d0691370eda354e2b1

    From here:

    ===
    1358 WARN_ONCE(rq != task_rq(p), "vrq: cpu[%d] take_task reside on %d.\n",
    1359 cpu, task_cpu(p));
    ===

    Something you know about?

    Replies
    1. (Just to note, after this warning everything got stuck; I recovered this message from netconsole.)

    2. @pf
      That's the deadliest case I wanted to avoid in the very early VRQ development: it means a task is in a run queue it should not belong to. So I put a WARN_ONCE there to check; once it happens, it lets the system keep running for a while, so there is a chance to capture the log, just like you did.
      But I haven't seen this for quite a long time. My first thought is that the new SMT code may break the old rule in some scenario I have missed. I will double-check it.
      I'd also like to ask whether the CPU in your machine has SMT capability and whether you have enabled CONFIG_SMT in your kernel config? Alternatively, "dmesg | grep -i vrq" will give the CPU topology setup info I need.
      And, is it easy to reproduce on your machine?

    3. @pf
      As I double-checked, there is also WARN_ONCE code in dequeue_task() and enqueue_task(). Would you please send me a full dmesg log, so I can grep for the messages I am looking for?
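
      For reference, those checks have the same shape as the take_task() one you quoted; roughly (a sketch from memory, not a verbatim quote of bfs.c):

      /* cpu is the local CPU, as in the take_task() snippet above */
      WARN_ONCE(rq != task_rq(p),
                "vrq: cpu[%d] dequeue_task reside on %d.\n",
                cpu, task_cpu(p));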

    4. ===
      [~]$ dmesg | grep -i vrq
      [ +0.002527] vrq: sched_cpu_affinity_chk_masks[0] smt 0x00000002
      [ +0.000003] vrq: sched_cpu_affinity_chk_masks[0] coregroup 0x0000000c
      [ +0.000008] vrq: sched_cpu_affinity_chk_masks[1] smt 0x00000001
      [ +0.000005] vrq: sched_cpu_affinity_chk_masks[1] coregroup 0x0000000c
      [ +0.000005] vrq: sched_cpu_affinity_chk_masks[2] smt 0x00000008
      [ +0.000006] vrq: sched_cpu_affinity_chk_masks[2] coregroup 0x00000003
      [ +0.000005] vrq: sched_cpu_affinity_chk_masks[3] smt 0x00000004
      [ +0.000005] vrq: sched_cpu_affinity_chk_masks[3] coregroup 0x00000003
      [ +0.016787] BFS enhancement patchset VRQ 0.96d by Alfred Chen.
      ===

      > And, is it easy to reproduce on your machine?

      No. It happened only once: I tried to search for something using the "the_silver_searcher" tool, and it triggered the crash.

      > Would you please send me a full dmesg log, so I can grep the log I am locking for.

      The current dmesg? Because I don't have a full dmesg left from the boot where it crashed (but I also haven't seen another warning; this is the only one).

    5. @pf
      Just an update. I still can't find any scenario that triggers the WARN_ONCE in take_task() without also triggering the WARN_ONCE in dequeue_task()/enqueue_task(), unless the task struct is corrupted while it is in the run queue.
      So please keep an eye on this issue and try to capture a full dmesg log next time.
      I also have a sync-up commit added to the next release which may be useful when investigating this issue.

    6. I keep netconsole running all the time with 4.12, so as of now that is the fullest dmesg I was able to get.

      I'm OK with applying all necessary commits to -pf to re-check. The issue is difficult to trigger, however; it has happened to me only once, unfortunately.

    7. ag utility (the_silver_searcher) is a fscking beast — it can trigger various issues here and there.

      Here is another stacktrace captured, again, only one WARN: https://gist.github.com/ff32664c82e0e5861fb7ed3d7aa18e67

    8. Alfred,

      I'm able to reproduce this crap in a QEMU VM with -smp 4,maxcpus=4,cores=2,threads=2,sockets=1 (just like my laptop). So 1) it is not a HW issue; 2) it is likely related to the SMT code (I wasn't able to reproduce it without SMT); and 3) I'll try to capture a vmcore.

    9. Here are some panics from VM:

      https://gist.github.com/03a2be514cbd813ebf5513922c506a92

      I've noticed that it is much harder to trigger the panic if something is working in the background (like dd if=/dev/zero of=/dev/null), especially if there are 2 to 4 dd's. If there is only one dd, or the VM is idle, the panic happens instantly.

      To trigger the panic I run "cd /etc; while true; do ag post-factum 2>/dev/null; done".

    10. Important note:

      in the stacktraces above I was able to trigger a panic not only at kernel/sched/bfs.c:1359, but also at kernel/workqueue.c:2041 and kernel/sched/bfs.c:3685, and even a double fault.

    11. Well, it is actually triggered without SMT as well, just with -smp 8, for instance. Same backtrace on panic.

    12. OK, I've managed to find out that launching ag with the --no-affinity option does not trigger the panic.

      Setting affinity in ag corresponds to the following code:

      https://github.com/ggreer/the_silver_searcher/blob/master/src/main.c#L155

      Maybe, VRQ has some f*ck-ups with setting affinity properly?

    13. …or it is a problem with locking in the migration code path. In ag, threads are started first, then affinity is set thread by thread, triggering migration, and BAAM, a panic occurs.

    14. OK, here is my reproducer: http://ix.io/yF3

      Compile it like this:

      gcc reproducer.c -o reproducer -lpthread -D_GNU_SOURCE

      Then launch it and let it spin a little bit. It crashes my VM in 2 seconds.
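
      In essence it does something like this (a rough sketch of the idea, not the exact file behind the link): start threads first, then set each thread's affinity afterwards, so the kernel keeps migrating tasks that are waking up.

      /* Sketch only: threads are created first, then pinned one by one, so
       * every pthread_setaffinity_np() call migrates a live, frequently
       * waking task. _GNU_SOURCE comes from the compile line above. */
      #include <pthread.h>
      #include <sched.h>
      #include <unistd.h>

      #define NR_THREADS 8

      static void *worker(void *arg)
      {
              (void)arg;
              for (;;)
                      usleep(1000);   /* sleep and wake over and over */
              return NULL;
      }

      int main(void)
      {
              pthread_t tid[NR_THREADS];
              long ncpus = sysconf(_SC_NPROCESSORS_ONLN);
              unsigned int round = 0;
              int i;

              for (i = 0; i < NR_THREADS; i++)        /* start the threads first */
                      pthread_create(&tid[i], NULL, worker, NULL);

              for (;;) {                              /* then re-pin them, thread by thread */
                      for (i = 0; i < NR_THREADS; i++) {
                              cpu_set_t set;

                              CPU_ZERO(&set);
                              CPU_SET((i + round) % ncpus, &set);
                              pthread_setaffinity_np(tid[i], sizeof(set), &set);
                      }
                      round++;
                      usleep(10000);
              }
              return 0;
      }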

    15. @Oleksandr, @Alfred:
      Thank you for this in-depth testing. I hope Alfred is not hyperventilating atm.
      Since you mentioned kernel/workqueue.c above, I'd add one "WARNING" from here:
      https://pastebin.com/N7sVATwZ -- happening only on the first resume from TOI s2disk.
      As my kernel has the non-official TuxOnIce in it and the trace also references i915, it may be totally unrelated.
      I thought I'd better ask here before getting more grey hair.

      BR, Manuel Krause

    16. @pf
      Thanks for reproducing the issue.
      Based on the current info, the issue is triggered when calling the set_cpus_allowed_ptr() API; it must be some complicated race with other code, and it is not SMT related.
      The good news is that we can now reproduce it. I can try to reproduce it tonight, then try to isolate the possible code paths. Will keep you updated.

    17. Just a quick update: it is very easy to reproduce with @pf's reproducer, so that means no game play tonight, :)

    18. I'm very happy to be reliably crashing your system :D.

    19. If it helps, I've replaced the WARN_ON with BUG_ON and prepared a vmcore. Check it here:

      https://natalenko.name/myfiles/bfs_vrq_crash/

      The README file there describes everything.

    20. Updates:
      1. Tried other tools that use CPU affinity, like mprime, which ran fine for 20+ minutes, and manually checked the set_cpus_allowed_ptr() API using taskset; that works ok.
      2. Looking closer at the reproducer code, the wake-up of CPU-affinity tasks may cause the issue. That means it is related to the ttwu code path.
      3. Isolated other code paths like take_other_rq_task, policy balance and the new SMT scheduling; with those isolated it takes longer to trigger the issue. There may be other code in these isolated paths that raises the trigger rate; I will check later.
      4. The types of oops are random, but now the WARN_ONCE in dequeue/enqueue is triggered, which makes me believe ttwu selects a CPU that these CPU-affinity tasks should not run on.
      5. Added quick workaround code for these CPU-affinity tasks in the ttwu code path (a rough sketch of the idea is below). This helps with the oops, but the reproducer still freezes the system after running 2+ minutes, with no crash log printed.

      IMO, there may be more than one issue here. One is that ttwu selects a CPU the CPU-affinity tasks should not run on; the workaround code fixes that, but the root cause in the original ttwu code path should still be found, and finding it may help with other issues.
      The other issue is that the reproducer still freezes the system with no log. Well, one thing at a time, first come first served.
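
      The workaround is roughly of this shape (a sketch only, with a hypothetical helper name, not the actual patch): before queueing the woken task, make sure the selected CPU is still inside its affinity mask.

      #include <linux/cpumask.h>
      #include <linux/sched.h>

      /* Sketch of the idea (hypothetical helper, not the actual VRQ code):
       * never queue a woken task on a CPU outside its affinity mask; fall
       * back to an allowed CPU instead. */
      static inline int vrq_sanitize_wake_cpu(struct task_struct *p, int cpu)
      {
              if (cpumask_test_cpu(cpu, &p->cpus_allowed))
                      return cpu;
              return cpumask_any(&p->cpus_allowed);
      }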

    21. I can just confirm that indeed sometimes it hangs without any log message, and sometimes it goes crazy unwinding the kernel stack if panic-on-warning is set.

    22. Updates:
      After some testing, I believe there are double faults. Fortunately I think I have fixed one of them, so now the crash log is solid, like "vrq: cpu2 take task[2031] 0x08 reside on 3, curr[19], -1 running". Task 2031 is one of the reproducer threads: it only runs on CPU 3 and its affinity is 0x08, which both look fine, but it is in the run queue of cpu2 and the rq nr_running count is -1, which is terribly wrong.
      As this log only indicates the wrong situation, I'd need more debug builds to find out WTH the affinity task gets into the wrong run queue (and the WARN_ONCE in enqueue_task() is not triggered, crap).

    23. Also an update from me: I've encountered a complete freeze with v4.11 and MuQSS on an ag invocation, and currently suspect MuQSS is affected too, although I'm not able to reproduce it yet with my reproducer (maybe I need to spin it for more time).

    24. OK, I've reproduced the issue with MuQSS as well. Now I'm going to grab a proper stacktrace from it and a vmcore from a debug kernel and compare…

    25. @pf
      I think I may have figured it out. The reproducer has been running for 10+ minutes on my debug kernel; I will let it run overnight and hopefully still see a living system tomorrow. Time for bed.

    26. Thanks, Alfred.

      I've filed a similar bug report with Con. If you are interested, check it here:

      https://natalenko.name/myfiles/muqss_crash/

      and here:

      https://ck-hack.blogspot.cz/2017/07/electric-distraction.html?showComment=1501087358795#c102036580742833059

    27. It has been 6+ hours and the reproducer is still running, so the issue has been fixed in this debug build. Now I have to re-enable the isolated code paths and retest; if everything goes smoothly, there will be a bug-fix release before the weekend.

    28. Retested and verified as passing. Here is the fix for this CPU affinity issue:
      https://bitbucket.org/alfredchen/linux-gc/downloads/affinity_fix.patch
      You can try it before 0.96e is officially released.

  11. Hi all,
    chiming in just to warn you that there is definitely some conflict between VRQ and BFQ. I tried bfq and bfq-mq, and in two cases on both systems (Ryzen and i7) a defrag in a virtual Win10 machine hardlocked the host system and corrupted data. CFS with bfq seems to be working, and VRQ with mq-deadline too (almost 2 weeks on both machines with VRQ).
    I randomly hit one peculiar case (on the i7) where only the guest froze, and when I switched the I/O scheduler from bfq to mq-deadline the guest resumed operation. I was not able to find anything useful in the logs.

    BR,
    Dzon

    Replies
    1. Currently, the BFQ team/community is investigating one or more bugs, so it might be useful to check the recent active threads on https://groups.google.com/forum/?fromgroups=#!forum/bfq-iosched. And maybe it's just the old story that BFS/VRQ is likely to trigger such bugs earlier and more often.
      I've only applied the fixes from the thread Eduardo mentioned some posts above and have not experienced errors so far.

      BR, Manuel Krause

    2. @Manuel,
      thanks for the explanation and reference. It is probably as you suggest: there is just a higher chance of triggering an existing bug in combination with VRQ. I will keep an eye on the BFQ bug threads and, if I can think of a safe test environment, test the patches.

      BR,
      Dzon

  12. Thanks Alfred.
    The usual throughput tests are here:
    https://docs.google.com/spreadsheets/d/163U3H-gnVeGopMrHiJLeEY1b7XlvND2yoceKbOvQRm4/edit?usp=sharing

    VRQ throughput is slightly better than with CFS.
    Changing the timer value seems to have a rather low impact, with either VRQ or CFS.
    I've also run interbench, for those who understand its results.

    Pedro

  13. @all
    Thanks, all, for the testing/feedback/benchmarks on these two releases. This scheduler is more stable than the previous release. There will be no new release this week, but here is an update.
    The improvement of SMT-sensitive scheduling is ongoing; more testing is required to finalize a timeout value.
    During the investigation of pf's issue, I was inspired with an idea to cut enqueue/dequeue overhead, and I am now working on it.

    Replies
    1. Good luck, a good hand and a good mind to you for the ongoing steps!

      BR, Manuel Krause
