Monday, March 11, 2019

BMQ Scheduler call out for testing

BMQ (BitMap Queue) Scheduler is a brand new CPU scheduler, developed from PDS and inspired by the scheduler in Google's Zircon project. It has been in development for months and it's now time for open testing (the current version is 0.89, on Linux kernel 5.0).

For more design details of BMQ, please refer to Documentation/scheduler/sched-BMQ.txt in the repository. The documentation is not yet complete, because the scheduler is still under development and major features are not finalized.

Here is a list of the major user-visible differences between BMQ and PDS.

1. SCHED_ISO is *NOT* supported; please use "nice --20" instead.
2. There is *NO* rr_interval, but a compile-time kernel config, CONFIG_SCHED_TIMESLICE (default 4ms), is available for similar usage. However, it is *strongly NOT recommended* to change it.
3. "yield_type" is still supported, but only values 0 and 1 (default) are available; 2 is still accepted by the interface, but it behaves the same as 1. (This will change when the yield implementation is finalized.)
4. BATCH and IDLE tasks are treated as the same policy. They compete for CPU with NORMAL policy tasks, but they simply don't boost. To control the priority of NORMAL/BATCH/IDLE tasks, simply use nice levels.
5. BMQ automatically adjusts (boosts/deboosts) task priority within a +/- MAX_PRIORITY_ADJ (default 4) range. For example, in top/htop and other CPU monitoring programs, a task at nice level 0 may be seen running with a non-zero nice value in CPU time accounting.
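
As a rough sketch of the boost clamping in item 5 (illustrative Python only, not the actual kernel code; MAX_PRIORITY_ADJ mirrors the default named above, while the helper function is made up for illustration):

```python
# Illustrative sketch only; BMQ itself is kernel C code.
# MAX_PRIORITY_ADJ mirrors the default named in the post;
# adjusted_priority() is a hypothetical helper, not a real BMQ function.

MAX_PRIORITY_ADJ = 4

def adjusted_priority(nice: int, boost: int) -> int:
    """Clamp the dynamic boost to +/- MAX_PRIORITY_ADJ and apply it."""
    boost = max(-MAX_PRIORITY_ADJ, min(MAX_PRIORITY_ADJ, boost))
    return nice + boost

# A nice-0 task can therefore surface anywhere in the -4..+4 range
# in top/htop's CPU time accounting:
print(adjusted_priority(0, -7))  # clamped to -4
print(adjusted_priority(0, 2))   # stays at 2
```

This is only meant to show why monitors may display an unexpected nice value; the real accounting lives in the scheduler code itself.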

BMQ has been running smoothly on 3 machines (a NUC desktop, a NAS file server and a 24/7 Raspberry Pi) for about a month. Suspend/resume has been tested on the NUC desktop and the NAS file server. BMQ shows promising results in desktop activity and kernel compilation sanity tests compared to PDS. More benchmarking is ongoing.

BMQ is simpler in design compared to PDS, which results in a patch ~20KB smaller and a compressed kernel binary ~4KB smaller.

Full kernel tree repository can be found at https://gitlab.com/alfredchen/linux-bmq
An all-in-one patch can also be found on GitLab.

Thanks for testing; your feedback is welcome.

28 comments:

  1. Sounds nice. I will install it promptly and report back if I find any issues.

  2. @Alfred, very good news, will try it out as soon as I can.
    Thanks for new scheduler!
    BR, Eduardo

  3. Oh, I was trying to choose between PDS and MuQSS, now there's BMQ too.... I really need some benchmarks for daily desktop use!

    Replies
    1. Not sure if the plan is "...PDS too", because I imagine BMQ will take over from PDS, and the latest PDS (save for some bug fixes) is 0.99.

      Correct me if i'm wrong tho.

      PS. MuQSS is pretty stable and has (IMO) good performance on 5.0, but if anyone cares to do some comparison benchmarks between the three (PDS, BMQ, MuQSS), I would be very happy.

    2. @Sveinar
      Correct. BMQ will take over from PDS if no major issue is reported in the coming weeks.
      I am considering putting sanity test results along with the all-in-one patch files, so they can be traced for future reference.

  4. What is not clear to me is what makes this like the Zircon scheduler. The Zircon kernel is inherently so different from the Linux kernel. Zircon is preemptible and also supports preempting other cores. Linux by default does neither.

    Plus, Zircon is just so different from Linux in so many other ways that it seems strange a scheduler on one would be that valuable on the other. Linux I/O executes on the same core that requested it, for example, whereas every I/O on Zircon involves an IPC; you use shared memory and can therefore do a type of pipelining.

    I am super curious about the title and hoping it is not some clickbait thing. I would not have spent time on this if "Zircon" were not in the title. I am super excited about Zircon.

    Replies
    1. Hey bartturner,

      A copy paste doesn't make sense here.

    2. @Anonymous,
      Alfred wrote it rather clearly: "...inspired by the scheduler in zircon project(google)." Inspired, which is the keyword here, does NOT mean based on or closely following it.

      I'm sure Alfred can explain it nice and clear, but to me, words like "clickbait" (hey, Zircon is not even in the title!) are not exactly appropriate just for the source of ideas.
      If you are referring to the Phoronix title, please go and complain to Michael ;)

      And, to my knowledge btw, we are talking about CPU scheduler here, nothing more, no need to involve the rest of the kernel.
      BR, Eduardo

    3. @Eduardo is right. It's all about the CPU scheduler code here.
      The "BitMap queue" data structure used in the Zircon CPU scheduler code is very similar to the data structure that has been used in BFS. The sparkle in the Zircon CPU scheduler is the boost priority adjustment.
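
For readers curious what a bitmap queue is in general: an array of per-priority run queues plus a bitmap whose set bits mark the non-empty queues, so picking the next task reduces to a find-first-set-bit operation. A minimal generic sketch (not the actual BMQ, BFS, or Zircon code; all names here are made up):

```python
# Generic bitmap-queue sketch (illustrative; not actual scheduler code).
from collections import deque

NUM_PRIOS = 8  # hypothetical number of priority levels

class BitmapQueue:
    def __init__(self):
        self.queues = [deque() for _ in range(NUM_PRIOS)]
        self.bitmap = 0  # bit i set => queues[i] is non-empty

    def enqueue(self, prio, task):
        self.queues[prio].append(task)
        self.bitmap |= 1 << prio

    def dequeue(self):
        """Pop a task from the highest-priority (lowest-index) non-empty queue."""
        if self.bitmap == 0:
            return None
        # Isolate the lowest set bit, then convert it to a queue index.
        prio = (self.bitmap & -self.bitmap).bit_length() - 1
        task = self.queues[prio].popleft()
        if not self.queues[prio]:
            self.bitmap &= ~(1 << prio)
        return task

q = BitmapQueue()
q.enqueue(3, "batch job")
q.enqueue(0, "interactive task")
print(q.dequeue())  # "interactive task" (priority 0 wins)
```

In a kernel, the find-first-set step would be a single hardware instruction, which is what makes task selection O(1) regardless of queue population.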

    4. Thanks Alfred! That helps a ton. I now get it. I do think the bitmap queue might make more sense with how Zircon is architected than it would with Linux.

      I have spent a decent amount of time learning the Zircon kernel and really like what I see. I do hope, with all the chip hiring going on at Google, they will do a SoC optimized for Zircon.

      There are pretty obvious design decisions you would make for Zircon that are very different from those you would make for Linux.

      Hopefully that is what Google will do and will be able to optimize all the way up the stack from silicon to Flutter.

  5. Builds/boots fine here, thanks. I'll deliver this to a wider audience via -pf.

    Replies
    1. @pf
      Thanks for that. BMQ is based on the PDS code base; though there is still tuning to be done, it is stable for open testing. :)

  6. Running on 3 hosts so far, no problems. Will try the laptop when the i915 problems are fixed (black screen on the laptop display and no X, but that's 5.0-related, not BMQ).

  7. I've been running PDS for a few weeks and now I've switched to BMQ, and it seems to behave no worse than PDS. I didn't do any benchmarks, but interactivity seems as fine as it was. I've been using a yield_type of 0, but maybe I'll switch it to 1 to see if it changes anything.

  8. @Alfred:
    I was really curious about your new CPU scheduler approach.
    For ~24h now, BMQ has been running very well on my old dual-core notebook (without HT) in my daily use (web browser, CAD, LibreOffice, video playback). I haven't seen any drawback with 5.0.1 BMQ vs. my last PDS on kernel 4.20.12. IMHO, though not benchmarked, it lowers CPU usage by a little bit.
    So your initial tuning values seem to be wisely chosen.

    Many thanks for your great work!

    BR, Manuel

  9. Compile fails for x86-UP (modified to show arrow pointing to "sched_rq_pending_mask"):
    ================================
    kernel/sched/bmq.c: In function ‘dequeue_task’:
    kernel/sched/bmq.c:609:34: error: ‘sched_rq_pending_mask’ undeclared (first use in this function); did you mean ‘sched_rq_watermark’?
    cpumask_clear_cpu(cpu_of(rq), &sched_rq_pending_mask);
    ~~~~~~~~~~~~~~~~~~~~^
    sched_rq_watermark
    kernel/sched/bmq.c:609:34: note: each undeclared identifier is reported only once for each function it appears in
    kernel/sched/bmq.c: In function ‘enqueue_task’:
    kernel/sched/bmq.c:634:32: error: ‘sched_rq_pending_mask’ undeclared (first use in this function); did you mean ‘sched_rq_watermark’?
    cpumask_set_cpu(cpu_of(rq), &sched_rq_pending_mask);
    ~~~~~~~~~~~~~~~~~~~^
    sched_rq_watermark
    make[2]: *** [scripts/Makefile.build:277: kernel/sched/bmq.o] Error 1
    make[1]: *** [scripts/Makefile.build:492: kernel/sched] Error 2

    Replies
    1. @jwh7
      Thanks for reporting. Please also report it at https://gitlab.com/alfredchen/bmq in case I forget. Currently, most of my time is reserved for regression investigation during the tests.

    2. Done; looks like I'm #1 :-P
      https://gitlab.com/alfredchen/bmq/issues/1
      Thanks Alfred!

  10. Had a strange bug today. It happens when I try to emerge anything: inet hangs, emerge hangs.
    [ 92.095994] BUG: workqueue lockup - pool cpus=2 node=0 flags=0x0 nice=0 stuck for 50s!
    [ 92.096000] Showing busy workqueues and worker pools:
    [ 92.096000] workqueue events: flags=0x0
    [ 92.096001] pwq 4: cpus=2 node=0 flags=0x0 nice=0 active=2/256
    [ 92.096003] pending: destroy_super_work, gen6_pm_rps_work
    [ 92.096011] workqueue mm_percpu_wq: flags=0x8
    [ 92.096012] pwq 4: cpus=2 node=0 flags=0x0 nice=0 active=1/256
    [ 92.096013] pending: vmstat_update
    [ 92.096023] workqueue netns: flags=0xe000a
    [ 92.096023] pwq 16: cpus=0-7 flags=0x4 nice=0 active=1/1
    [ 92.096025] in-flight: 122:cleanup_net
    [ 92.096037] pool 16: cpus=0-7 flags=0x4 nice=0 hung=0s workers=9 idle: 121 123 120 60 7 107 125 124

    Replies
    1. [ 119.616051] Not tainted 5.0.0-pf3_1+ #6
      [ 119.616052] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      [ 119.616052] uksmd D 0 127 2 0x80000000
      [ 119.616053] Call Trace:
      [ 119.616055] ? __schedule+0x4b9/0xf70
      [ 119.616056] ? schedule+0x2a/0xa0
      [ 119.616057] ? schedule_timeout+0x18e/0x270
      [ 119.616059] ? wait_for_common+0x132/0x160
      [ 119.616060] ? wake_up_process+0x10/0x10
      [ 119.616061] ? __flush_work+0xf8/0x190
      [ 119.616063] ? flush_workqueue_prep_pwqs+0x130/0x130
      [ 119.616065] ? lru_add_drain+0x30/0x30
      [ 119.616066] ? lru_add_drain_all+0x112/0x150
      [ 119.616069] ? uksm_do_scan+0x1d59/0x2b50
      [ 119.616071] ? uksm_do_scan+0x2b50/0x2b50
      [ 119.616072] ? uksm_scan_thread+0x113/0x150
      [ 119.616074] ? __kthread_parkme+0x47/0x60
      [ 119.616075] ? kthread+0x107/0x120
      [ 119.616076] ? kthread_create_on_node+0x40/0x40
      [ 119.616078] ? ret_from_fork+0x1f/0x30
      [ 119.616084] INFO: task sandbox:1825 blocked for more than 5 seconds.
      [ 119.616085] Not tainted 5.0.0-pf3_1+ #6
      [ 119.616085] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
      [ 119.616085] sandbox D 0 1825 1813 0x80000002
      [ 119.616087] Call Trace:
      [ 119.616088] ? __schedule+0x4b9/0xf70
      [ 119.616089] ? schedule+0x2a/0xa0
      [ 119.616090] ? schedule_timeout+0x18e/0x270
      [ 119.616092] ? release_pages+0x28b/0x2c0
      [ 119.616093] ? wait_for_common+0x132/0x160
      [ 119.616094] ? wake_up_process+0x10/0x10
      [ 119.616095] ? __wait_rcu_gp+0xfb/0x130
      [ 119.616097] ? synchronize_rcu+0x4d/0x60
      [ 119.616098] ? kfree_call_rcu+0x10/0x10
      [ 119.616099] ? rcu_panic+0x10/0x10
      [ 119.616101] ? kern_unmount+0x22/0x50
      [ 119.616103] ? put_ipc_ns+0x32/0x70
      [ 119.616104] ? free_nsproxy+0x34/0xa0
      [ 119.616107] ? do_exit+0x2cc/0xa80
      [ 119.616108] ? do_group_exit+0x2e/0xa0
      [ 119.616110] ? __x64_sys_exit_group+0xf/0x10
      [ 119.616111] ? do_syscall_64+0x39/0xe0
      [ 119.616113] ? entry_SYSCALL_64_after_hwframe+0x44/0xa9

    2. [ 849.869200] rcu: INFO: rcu_preempt self-detected stall on CPU
      [ 849.869204] rcu: 4-....: (17999 ticks this GP) idle=e7e/1/0x4000000000000002 softirq=13908/13908 fqs=5955
      [ 849.869204] rcu: (t=18001 jiffies g=28961 q=57021)
      [ 849.869206] NMI backtrace for cpu 4
      [ 849.869208] CPU: 4 PID: 5242 Comm: emerge Not tainted 5.0.0-pf3_1+ #6
      [ 849.869209] Hardware name: LENOVO qqqq /qqqq , BIOS 8BET56WW (1.36 ) 01/19/2012
      [ 849.869209] Call Trace:
      [ 849.869211]
      [ 849.869215] ? dump_stack+0x46/0x60
      [ 849.869216] ? nmi_cpu_backtrace.cold.0+0x13/0x50
      [ 849.869219] ? lapic_can_unplug_cpu.cold.5+0x42/0x42
      [ 849.869220] ? nmi_trigger_cpumask_backtrace+0xa8/0xb1
      [ 849.869222] ? rcu_dump_cpu_stacks+0x80/0xac
      [ 849.869223] ? rcu_check_callbacks.cold.44+0x199/0x448
      [ 849.869225] ? update_process_times+0x23/0x60
      [ 849.869227] ? tick_sched_timer+0x36/0x70
      [ 849.869228] ? tick_sched_handle.isra.6+0x50/0x50
      [ 849.869236] ? __hrtimer_run_queues+0xee/0x190
      [ 849.869237] ? hrtimer_interrupt+0xef/0x200
      [ 849.869239] ? smp_apic_timer_interrupt+0x48/0x80
      [ 849.869241] ? apic_timer_interrupt+0xf/0x20
      [ 849.869241]

    3. Maybe you want to re-test with current vanilla kernel + BMQ?
      I also needed some time to fix -pf2 for kernel 5.0.1, regarding BFQ hassles.
      But I haven't seen any of those messages with my mentioned combination.

      BR, Manuel

    4. @Anonymous
      Please report the bug at https://gitlab.com/alfredchen/bmq
      The blog is not a good place for log display and focused issue discussion. :)

  11. Thanks Alfred !
    I've done throughput benchmarks of PDS and BMQ.
    You can find them here:
    https://docs.google.com/spreadsheets/d/163U3H-gnVeGopMrHiJLeEY1b7XlvND2yoceKbOvQRm4/edit#gid=1309629120

    BMQ is very promising and already on par with PDS.

    BMQ and PDS are configured with NO_HZ_FULL and HZ=1000.
    What is the recommended configuration for BMQ?

    Pedro

    Replies
    1. @Pedro:
      Many thanks for your benchmarks!!!
      Can it be that in the posted sheet some coloring went wrong?
      I mean, e.g., regarding "make j16"? From the values BMQ performs better there than PDS. Can you please have a look?

      BR,
      Manuel

    2. @Manuel
      I've seen this one. I believe it's the first time it happens, but it's not an error.
      It's because of the way the means are compared.

      I’ll explain here how I apply the colors, because I’m not sure I’ve got the statistic thing right.
      If someone sees an error, please let me know.

      I use a Welch t-test to assess whether two results are significantly different.
      The Welch t-test gives the probability (p-value) that we are wrong when we assume that two means are the same. I chose to compare this p-value against a 0.05 threshold (the type I error rate in statistical terms).
      So in the t-test table of the sheet, when the p-value of a benchmark result is less than 0.05, it means we have less than a 5% chance of being wrong when we assume the means are the same, and green or red colors are applied. If the p-value is more than 0.05, we have more than a 5% chance of being wrong, so I consider the results not significantly different, and blue is applied.
      I've written that blue means within a 95% confidence interval, but I'm unsure about this. I'll read some more on the subject.

      In the make j16 test, PDS's result is assumed equal to CFS's, whereas BMQ's is considered different, and as it's the worst of the 4 results assumed different, it's in red.
      I admit it's surprising. I think it's because the BMQ stdev is quite small on make j16, and this "biases" the t-test between results that are close (0.4% and 0.2% of CFS).
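
The test Pedro describes can be sketched as follows (illustrative Python with made-up sample times, not the actual benchmark data; only the t statistic and Welch-Satterthwaite degrees of freedom are computed here, since turning them into a p-value needs the t distribution's CDF, which the standard library lacks):

```python
# Welch t-test sketch (illustrative; sample numbers below are invented,
# not taken from the actual benchmark spreadsheet).
from statistics import mean, variance

def welch_t(a, b):
    """Return (t statistic, Welch-Satterthwaite degrees of freedom)."""
    na, nb = len(a), len(b)
    va, vb = variance(a), variance(b)  # sample variances (n-1 denominator)
    se2 = va / na + vb / nb            # squared standard error of the difference
    t = (mean(a) - mean(b)) / se2 ** 0.5
    df = se2 ** 2 / ((va / na) ** 2 / (na - 1) + (vb / nb) ** 2 / (nb - 1))
    return t, df

# Hypothetical kernel-compile times (seconds) under two schedulers:
cfs = [100.1, 99.8, 100.3, 100.0, 99.9]
bmq = [99.5, 99.4, 99.6, 99.5, 99.4]
t, df = welch_t(cfs, bmq)
print(t, df)
# Feed |t| and df into the t distribution to get the p-value; per the
# coloring rule, p < 0.05 gets green/red, otherwise blue (not significant).
```

A small sample stdev shrinks the standard error, which inflates |t|; that matches Pedro's remark about why BMQ's tight make j16 results come out "significant" even when the means are close.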

      Pedro

    3. @Manuel
      Thanks for the tests :)

      What you COULD also do is run some arbitrary benchmarks WHILE compiling.

      Set up some compile job in a loop (or a huge source tree), so that you can benchmark something while doing, e.g., a -j2, -j4 and so on compile in the background, and see the comparison.

      The differences between schedulers might be a lot more visible when doing something that really requires a scheduler. As I have posted elsewhere, I don't know the situation with the current 5.0 kernel (with CFS and cgroups++), but older kernels made things pretty much unusable while doing a -j12 (for my i7) WHILE gaming or doing other stuff. This is where PDS/MuQSS/BMQ would shine, I think.

  12. Alfred, can you suggest the optimal settings for:
    - Hz: periodic, no_hz, no_hz_full
    - Preemption: server, desktop, preempt
    - Tick frequency: 100, 250, 300, 1000
    Thank you!
