Alfred Chen's Blog: VRQ 0.92 release

Tuesday, January 17, 2017

VRQ 0.92 release

VRQ 0.92 is released with the following changes

1. remove printk in migrate_tasks()
2. vrq: refine normalize_rt_tasks()
3. vrq: Optimist ffb usage in skiplist_random_level()
4. vrq: introduce cputime.c
5. vrq: remove unused sched_domain_level
6. vrq: remove rq->timekeep_clock

Most are code clean up and little optimist. The major one is introducing mainline cputime.c, which help to reduce vrq scheduler main code size under 7k LOC and reduce the effect syncing up with mainline kernel scheduler code from release to release.

Enjoy VRQ 0.92, :)

code are available at
https://bitbucket.org/alfredchen/linux-gc/commits/branch/linux-4.9.y-vrq
and also
https://github.com/cchalpha/linux-gc/commits/linux-4.9.y-vrq

All-in-one patch is available too.

BR Alfred

41 comments:

AnonymousJanuary 18, 2017 at 1:17 PM
@Alfred:
VRQ 0.92 works as well on here as the predecessor. Not worse but also not better.
The compile time load balancing is still an issue (make -j2 on dualcore).

BR, Manuel Krause
ReplyDelete
Replies
jwh7January 18, 2017 at 4:17 PM
Hey Alfred; x64 built fine, i686 UP failed though:
CC kernel/sched/cputime.o
kernel/sched/cputime.c: In function ‘read_sum_exec_runtime’:
kernel/sched/cputime.c:319:18: error: storage size of ‘rf’ isn’t known
struct rq_flags rf;
^~
kernel/sched/cputime.c:322:7: error: implicit declaration of function ‘task_rq_lock’ [-Werror=implicit-function-declaration]
rq = task_rq_lock(t, &rf);
^~~~~~~~~~~~
kernel/sched/cputime.c:324:2: error: implicit declaration of function ‘task_rq_unlock’ [-Werror=implicit-function-declaration]
task_rq_unlock(rq, t, &rf);
^~~~~~~~~~~~~~
kernel/sched/cputime.c:319:18: warning: unused variable ‘rf’ [-Wunused-variable]
struct rq_flags rf;
^~
cc1: some warnings being treated as errors
make[2]: *** [scripts/Makefile.build:293: kernel/sched/cputime.o] Error 1
make[1]: *** [scripts/Makefile.build:544: kernel/sched] Error 2
make: *** [Makefile:988: kernel] Error 2
ReplyDelete
Replies
AnonymousJanuary 20, 2017 at 3:35 AM
Good job.
Less latency than MUQSS.
Almost half. :)
Not bad.
ReplyDelete
Replies
AnonymousJanuary 22, 2017 at 2:47 AM
Hi,

I have been using 092 since it was available. I wanted to have more testing time to report the results.
To me, at least on skylake laptop, kernel behaves very good, battery life is better than muqss, this is main driver (on laptop) why I'm using it.

All seems to be very good, interactivity is good, no crashes, etc. except... cpufreq on intel seems to be broken (again?) in this version on skylake. It's the same stuck frequency problem for me again, this time it was stuck at 1.2GHz, nothing, except reboot, is helping. Setting performance, or specific HZ or anything I did - no results, had to reboot.
On the other hand, to my big surprise intel_pstate started to behave, I'm using it + VRQ right now and all seems to be quite good. Will test it more, maybe finally that is fixed.

On Phenom all is fine, no complaints at all, but I'm not doing performance tests anymore.
If there will be substantial performance improvements, Alfred let me know, I'll consider testing couple of kernels with couple of games again.

Thanks for Your work and best regards
Eduardo
ReplyDelete
Replies
AnonymousJanuary 22, 2017 at 11:31 PM
@Alfred,

I have i7-6700HQ CPU. The thing is that "stuck frequency" is not visible when I boot the computer up and the problem does not show up right away. It just starts at some point after a boot, not right away, but to "unstuck" the CPU I have to reboot, that's why it's strange enough. Let's see what happens with pstate, so far it's 3 days and all is fine.

As for Phenom (running the same kernel as for skylake), I have discovered a problem, not a performance problem I believe, but total usage values observed in "top" does not correspond to individual task CPU sum. The situation I encountered was the overall system usage shows half of cores are busy (which it was, I believe), but when I check tasks, there are 20% usage at one cpu only. Might be a thing of adapting mainline task accounting. Will check it further, when I encounter it again, is there anything specific to look for?

Br, Eduardo
ReplyDelete
Replies
AnonymousJanuary 23, 2017 at 8:52 AM
Hi, I've found some time to run my usual throughput benchmarks of VRQ0.92. They are here :
https://docs.google.com/spreadsheets/d/163U3H-gnVeGopMrHiJLeEY1b7XlvND2yoceKbOvQRm4/edit?usp=sharing

I've put some colors to make the results more readable (hopefully).
The reference kernel is the one on the first column. Following the value of the realtime difference between tested kernel and reference kernel, the colors are :
- blue if difference is within 'realtime of reference kernel +/- maximum standard deviation'
- green if difference is lower than 'realtime of reference kernel - maximum standard deviation'
- red if difference is higher than 'realtime of reference kernel + maximum standard deviation'
Overall best and worst are also shown ,if not in between +/- std dev.

This time I used intel_pstate, and it seems you fixed the performance issue with this driver. The results are good except for sysbench oltp. I didn't follow the development of VRQ lately so you might be aware of that issue already.

Pedro
ReplyDelete
Replies
AnonymousJanuary 24, 2017 at 10:08 AM
@Alfred:
Atm I'm doing a disk(s) and partition(s) reorganising with gparted. I don't do it often, so this is no reference at all. But it behaves so well and fast with my the current setup, that I need to let you know.
Kernel 4.9.5, VRQ 092, BFQv8r7, WBT7(from ck/pf) and my humble TOI port.

And, besides of other posters' results, the improving 4.9.x kernels do improve reliability on here. Read it like: I don't want a kernel fail at every 2nd resume from disk because of speed issues from the bootup-speed people.

BR, Manuel Krause
ReplyDelete
Replies
AnonymousJanuary 26, 2017 at 10:43 AM
[OFFTOPIC]
I've noticed several people on here that use shared graphics, comparable with mine on a HP laptop. In my case it's a GM45 using the i915 kernel module and the i965 from the xorg intel driver.

My question:
Do you know a way to find out what amount of memory/RAM is actually really allocated?
From 'man intel', it's automatically allocated by needs, "VideoRAM" in xorg.conf thus ignored for my chipset.
Kernel says: [drm] Memory usable by graphics device = 2048M.
lspci -v says: Memory at c0000000 (64-bit, prefetchable) [size=256M]
< Note: The latter is most likely the max. possible AGP aperture mem size and not the gfx'

And that was all. The Xorg.0.log doesn't number memory sizes at all on here.

Thank you in advance for helpful hints and links,
BR, Manuel Krause
ReplyDelete
Replies
AnonymousFebruary 1, 2017 at 10:13 PM
Hi,

I was testing 092 version for good (almost 13 days uptime) measure, now switching to mux to test how pstate behaves there.
As for VRQ - stable, fast, all good except that process total CPU and separate process usage, which is not quite aligned (reported above, do anyone else are having this?), at least that's what I see.

Keep up good work and thanks,
Eduardo
ReplyDelete
Replies

Add comment