Alfred Chen's Blog: GC and VRQ branch update for v4.3.1 and latency test

Thursday, December 10, 2015

GC and VRQ branch update for v4.3.1 and latency test

Finally it comes the first stable release for 4.3, and gc and vrq branch are updated with bug fixes during these few weeks.

*A non-return error when enable SMT_NICE(though SMT_NICE is not recommended for VRQ)
*Go through threads list with tasklist_lock held when cpu hotplugs. It's for both gc and vrq branch.

*Task caching scheduling PartIII, as usual I will write another post for it.

The gc branch for v4.3.1 can be found at bitbucket and github.
The vrq branch for v4.3.1 can be found at bitbucket and github.

One more thing, I would like to add more tests/benchmark for scheduling for a long time. And I finally found one yesterday, it is Cyclictest, you can check the detail on this wiki(it's a little old but it's a good start point). Based on my research, it is scheduler independent and use no scheduler statics.

Here is my first idle workload cyclictest result for v4.3 cfs, bfs and vrq. (I'm still playing with it)

4.3 CFS
# /dev/cpu_dma_latency set to 0us
policy: fifo: loadavg: 0.05 0.04 0.05 1/219 1504

T: 0 ( 1499) P:80 I:10000 C: 10000 Min:   1831 Act:    2245 Avg:    2413 Max:   12687
T: 1 ( 1500) P:80 I:10500 C:   9524 Min:   1917 Act:    2965 Avg:    2560 Max:    7547
T: 2 ( 1501) P:80 I:11000 C:   9091 Min:   1702 Act:    2254 Avg:    2313 Max:   10650
T: 3 ( 1502) P:80 I:11500 C:   8696 Min:   1546 Act:    2297 Avg:    2274 Max:   13723

4.3 BFS
# /dev/cpu_dma_latency set to 0us
policy: fifo: loadavg: 0.15 0.10 0.04 1/234 1540

T: 0 ( 1536) P:80 I:10000 C: 10000 Min:   1437 Act:    2002 Avg:    1893 Max:   10912
T: 1 ( 1537) P:80 I:10500 C:   9524 Min:   1427 Act:    2010 Avg:    1907 Max:    7534
T: 2 ( 1538) P:80 I:11000 C:   9091 Min:   1402 Act:    1755 Avg:    1902 Max:   13059
T: 3 ( 1539) P:80 I:11500 C:   8696 Min:   1408 Act:    1878 Avg:    1866 Max:   12921

4.3 VRQ
# /dev/cpu_dma_latency set to 0us
policy: fifo: loadavg: 0.00 0.01 0.00 0/226 1607

T: 0 ( 1602) P:80 I:10000 C: 10000 Min:   1349 Act:    1785 Avg:    1647 Max:    4934
T: 1 ( 1603) P:80 I:10500 C:   9524 Min:   1355 Act:    1464 Avg:    1642 Max:   12378
T: 2 ( 1604) P:80 I:11000 C:   9091 Min:   1334 Act:    1926 Avg:    1676 Max:   12544
T: 3 ( 1605) P:80 I:11500 C:   8696 Min:   1350 Act:    1801 Avg:    1627 Max:   10989

Enjoy with gc/vrq on v4.3.1 and try cyclictest if you care about the latency and task interaction.

BR Alfred

Edit:
If you have failed s2ram/resume issue with this new gc/vrq release, you can try below 2 patches(one for gc and one for vrq) and see if it help with you.
4.3_gc3_fix.patch and 4.3_vrq1_fix.patch

45 comments:

AnonymousDecember 10, 2015 at 7:27 AM
your cyclictest numbers are very high in all testst.
How did you call cyclictest?
Here is what I'm getting on latests vrq:

sudo cyclictest --smp -p 80
# /dev/cpu_dma_latency set to 0us
policy: fifo: loadavg: 0.00 0.00 0.00 0/207 2623

T: 0 ( 2620) P:80 I:1000 C: 13685 Min: 4 Act: 7 Avg: 6 Max: 293
T: 1 ( 2621) P:80 I:1500 C: 9123 Min: 4 Act: 7 Avg: 7 Max: 309
T: 2 ( 2622) P:80 I:2000 C: 6842 Min: 4 Act: 6 Avg: 6 Max: 278
T: 3 ( 2623) P:80 I:2500 C: 5473 Min: 4 Act: 7 Avg: 7 Max: 147
ReplyDelete
Replies
AnonymousDecember 10, 2015 at 7:39 AM
:-) I just wanted to ask in the older thread, if there are changes ahead for your planned 4.3.1 branches' update...
So there are significant changes?

When I manually load the patches from bitbucket, will it be sufficient to only fetch the patches in:
https://bitbucket.org/alfredchen/linux-gc/commits/tag/v4.3.1-vrq
?

It would be very nice, if you could also provide all-in-one patches for this release, complete -gc and -vrq, for us lazy people. ;-)

The last patch for vrq works flawlessly on 4.3.0 for me.
During the last weeks I've spent some time on .config changes testing, in order to move timing relevant kernel/ module loading/ re-initialisation sections out of the process of resume-from-hibernation. You remember my problem of many random and unreproducible TuxOnIce resumes. I seem to have solved it by first (& first time) making use of your "Use prefered raid6 gen function" patch, and compiling btrfs (plus dependencies) into the kernel (what otherwise gave a "random: nonblocking pool initialized" message @ resume). But I left my GFX i915 as a module. Fine since ~1 week. As for now, I can't pin down to one particular change. I'm just glad about more than 30 failureless resumes.

Best regards,
Manuel Krause
ReplyDelete
Replies
AnonymousDecember 11, 2015 at 3:11 AM
Hi, Alfred!
Another question that arises from Con's updated BFS 466:
Can the code change (look at http://ck.kolivas.org/patches/4.0/4.3/4.3-ck2/patches/bfs465-466.patch) also be relevant for your current -vrq ?

And how would the code section look like then?

Thank you and BR, Manuel
ReplyDelete
Replies
Oleksandr NatalenkoDecember 11, 2015 at 10:22 AM
Works OK for me, thanks.
ReplyDelete
Replies
Oleksandr NatalenkoDecember 12, 2015 at 1:53 PM
Hmm, it seems that latest pf-kernel update breaks resuming from s2ram for me. I have to find out who is guilty — 4.3.1/4.3.2 update or -vrq.
ReplyDelete
Replies
Oleksandr NatalenkoDecember 12, 2015 at 3:09 PM
Also got the following same panic twice on my router:

http://i.piccy.info/i9/454c5f1d8063e0f707bdc0896d86e565/1449961790/283985/951663/panic.jpg

Weird :(.
ReplyDelete
Replies
AnonymousDecember 15, 2015 at 11:42 AM
@Alfred: Have you already found an opinion about Con's BFS 0467 aproach?
Keeping sched_interactive at 1 can improve low-latency and setting it to 0 would lead to the BFS 0466 results (possibly higher latencies but possibly higher throughput), if I understand Con's posts and the code changes correctly? I haven't found time to test his code, so far.

Best regards,
Manuel Krause
ReplyDelete
Replies

Add comment