Stockfish Testing Queue

Finished - 29418 tests

14-10-23 sni threats2 diff
LLR: -1.90 (-2.94,2.94) [0.00,4.00]
Total: 25507 W: 5147 L: 5156 D: 15204
sprt @ 15+0.05 th 1 Raise value of threats on bishops, compared to threats on knights: hopefully this might help to get the bishop pair.
14-10-23 sni threats diff
LLR: 2.97 (-2.94,2.94) [0.00,6.00]
Total: 7896 W: 1495 L: 1350 D: 5051
sprt @ 60+0.05 th 1 LTC: test with tuned values and no lsb()
14-10-23 jos tune_shelter diff
ELO: -2.62 +-2.9 (95%) LOS: 3.7%
Total: 25766 W: 5836 L: 6030 D: 13900
30000 @ 7+0.05 th 1 Test some preliminary values for ShelterWeakness and StormDanger.
14-10-23 sni threats diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 7598 W: 1596 L: 1468 D: 4534
sprt @ 15+0.05 th 1 Test with tuned values and no lsb()
14-10-22 hwi zobside48 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 69710 W: 14292 L: 14204 D: 41214
sprt @ 15+0.05 th 1
14-10-22 sni threats diff
19449/20000 iterations
40000/40000 games played
40000 @ 15+0.05 th 1 SPSA tuning for Threat array
14-10-22 Roc RookOnPawns diff
9884/10000 iterations
20000/20000 games played
20000 @ 15+0.05 th 1 SPSA (fixed uci.cpp). When Rook on Rank 1 to 4, no bonus was given for direct Rook attack on pawn. Also, we expect that direct Rook attacks from behind are worth more than Rook attacks from front, especially in end game. 20000 might not be enough, we will first see how tuning evolve.
14-10-22 Mys unsupported diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 6434 W: 1272 L: 1362 D: 3800
sprt @ 15+0.05 th 1 Unsupported pawn penalty now by rank.
14-10-22 aji ProtectedRookAttacks diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 11106 W: 2250 L: 2326 D: 6530
sprt @ 15+0.05 th 1 Previous RookOnPawn_fix test showed that there is a value in attacking protected pawns. Extend the idea to protected pieces : STC
14-10-21 aji RookOnPawn_fix diff
ELO: -2.07 +-3.1 (95%) LOS: 9.5%
Total: 20000 W: 4060 L: 4179 D: 11761
20000 @ 15+0.05 th 1 Rewarding rook attacks on protected pawns doesn't seem correct. Measure if it is a gain or loss.
14-10-21 joa previousDepth diff
ELO: 1.37 +-2.2 (95%) LOS: 89.2%
Total: 40000 W: 8246 L: 8088 D: 23666
40000 @ 15+0.05 th 1 recreating the values of hfwittmann which passed STC and LTC for ELO measurement. (Hopefully this time without bugs)
14-10-21 uri time_mang_changes diff
ELO: -45.92 +-4.4 (95%) LOS: 0.0%
Total: 10000 W: 1432 L: 2746 D: 5822
10000 @ 15+0.05 th 1 reducing the average time by a factor of 2
14-10-21 Mys pins diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 2394 W: 415 L: 515 D: 1464
sprt @ 15+0.05 th 1 Pinned enemies
14-10-21 uri time_mang_changes diff
ELO: -137.62 +-4.8 (95%) LOS: 0.0%
Total: 10000 W: 688 L: 4454 D: 4858
10000 @ 15+0.05 th 1 last try to test elo loss and this time reduce the average time by a factor of 4.
14-10-21 uri time_mang_changes diff
ELO: -51.69 +-4.5 (95%) LOS: 0.0%
Total: 10000 W: 1458 L: 2935 D: 5607
10000 @ 15+0.05 th 1 test the loss in elo by increasing the average time by a factor of 2
14-10-21 lbr threats diff
ELO: -6.97 +-3.5 (95%) LOS: 0.0%
Total: 14697 W: 2709 L: 3004 D: 8984
40000 @ 27+0.09 th 1 triple tc to see if there's a sign of scaling. take 3 = take 2 + spsa tuned values (40k iterations)
14-10-20 joa previousDepth diff
ELO: -0.24 +-3.1 (95%) LOS: 43.9%
Total: 20000 W: 4142 L: 4156 D: 11702
20000 @ 15+0.05 th 1 reduce time if we are exceeding previous depth (and had at least 10 "stable" iterations)
14-10-21 pro 3fold_1stMove diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 89417 W: 18169 L: 18175 D: 53073
sprt @ 15+0.05 th 1 fixing draw score on first repetition without affecting the search
14-10-20 joa previousDepth diff
ELO: -68.52 +-4.6 (95%) LOS: 0.0%
Total: 10000 W: 1337 L: 3284 D: 5379
10000 @ 5+0.05 th 1 do we not measure noise this time?
14-10-20 Roc KS_Corner_3 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 10814 W: 2135 L: 2212 D: 6467
sprt @ 15+0.05 th 1 KS_Corner_3_1: only test that showed some ELO gain. Testing if was only by luck before testing at LTC
14-10-20 joa previousDepth diff
ELO: -0.02 +-3.1 (95%) LOS: 49.6%
Total: 19895 W: 4055 L: 4056 D: 11784
20000 @ 15+0.05 th 1 even more aggressive settings...
14-10-20 joa previousDepth diff
ELO: -0.02 +-3.1 (95%) LOS: 49.5%
Total: 18995 W: 3840 L: 3841 D: 11314
20000 @ 15+0.05 th 1 previousDepth with more aggressive settings.
14-10-20 Fis TTpolicy diff
ELO: -0.78 +-3.1 (95%) LOS: 30.9%
Total: 20000 W: 4052 L: 4097 D: 11851
20000 @ 15+0.05 th 1 Make TT replacement policy more symmetric and discerning of generations always saving the new entry first. Tuning 3
14-10-20 pec tm_shorter_book diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 36462 W: 6208 L: 6244 D: 24010
sprt @ 60+0.05 th 1 STC. Now TM changes are done versus shorter book. But after change to shorter book, number of moves per game made by engine increased, and results of current tm tests may be influenced by this. So check if relevant parameter is still optimal.
14-10-20 sg backward2 diff
LLR: -0.87 (-2.94,2.94) [-1.50,4.50]
Total: 7324 W: 1475 L: 1486 D: 4363
sprt @ 15+0.05 th 1 less penalty if backward and stopper pawn far away
14-10-20 sg backward1 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 23182 W: 4745 L: 4787 D: 13650
sprt @ 15+0.05 th 1 Double penalty for very weak backward pawns (stopped by two enemy pawns)
14-10-16 jhe sp_nodes diff
LLR: 4.93 (-2.94,2.94) [-3.00,1.00]
Total: 118661 W: 21899 L: 21842 D: 74920
sprt @ 15+0.05 th 3 Try a small simplification.
14-10-19 pec tm_shorter_book diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 24191 W: 5004 L: 4829 D: 14358
sprt @ 15+0.05 th 1 STC. Now TM changes are done versus shorter book. But after change to shorter book, number of moves per game made by engine increased, and results of current tm tests may be influenced by this. So check if relevant parameter is still optimal.
14-10-19 joa previousDepth diff
ELO: 2.43 +-3.1 (95%) LOS: 93.8%
Total: 20000 W: 4204 L: 4064 D: 11732
20000 @ 15+0.05 th 1 teststuff
14-10-19 fwi tm_depthbased_simplifie diff
ELO: -0.43 +-3.0 (95%) LOS: 39.0%
Total: 20000 W: 3994 L: 4019 D: 11987
20000 @ 15+0.05 th 1 Simplification (functionally equivalent to version that passed)
14-10-17 lbr master diff
ELO: 27.58 +-1.9 (95%) LOS: 100.0%
Total: 40000 W: 7782 L: 4613 D: 27605
40000 @ 60+0.05 th 1 Regression test, standard conditions. Previous one 22.80 +-1.9
14-10-19 sni threats diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 26864 W: 5468 L: 5500 D: 15896
sprt @ 15+0.05 th 1 Take 3 : use maximum threat idea for hanging pieces.
14-10-19 fwi tm_depthbased_simplifie diff
ELO: 0.10 +-3.0 (95%) LOS: 52.7%
Total: 20000 W: 4001 L: 3995 D: 12004
20000 @ 15+0.05 th 1 Simplification (slightly more aggressive than version that passed)
14-10-19 Fis TTpolicy diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 39354 W: 6835 L: 6744 D: 25775
sprt @ 60+0.05 th 1 Make TT replacement policy more symmetric and discerning of generations. Tuning 1. Pri -1 Also got +3 in a local 10k test so let's see.
14-10-18 Fis TTpolicy diff
ELO: -0.02 +-3.1 (95%) LOS: 49.6%
Total: 20000 W: 4107 L: 4108 D: 11785
20000 @ 15+0.05 th 1 Make TT replacement policy more symmetric and discerning of generations. Tuning 2. Pri -1
14-10-19 Fis TTpolicy diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 36129 W: 7471 L: 7262 D: 21396
sprt @ 15+0.05 th 1 Make TT replacement policy more symmetric and discerning of generations. Tuning 1. Pri -1 Also got +3 in a local 10k test so let's see.
14-10-18 lbr threats diff
ELO: -5.90 +-2.3 (95%) LOS: 0.0%
Total: 40000 W: 8704 L: 9383 D: 21913
40000 @ 9+0.03 th 1 take 3 = take 2 + spsa tuned values (40k iterations)
14-10-18 sni threats diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 3390 W: 630 L: 728 D: 2032
sprt @ 15+0.05 th 1 Take 2 : add some multithreats ideas
14-10-18 Fis TTpolicy diff
ELO: 1.22 +-3.1 (95%) LOS: 78.1%
Total: 20000 W: 4108 L: 4038 D: 11854
20000 @ 15+0.05 th 1 Make TT replacement policy more symmetric and discerning of generations. Tuning 1. Pri -1
14-10-17 lbr threats diff
ELO: -8.71 +-2.3 (95%) LOS: 0.0%
Total: 40000 W: 8489 L: 9491 D: 22020
40000 @ 9+0.03 th 1 how far are we now?
14-10-17 jos delay_aspiration diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 46904 W: 9582 L: 9558 D: 27764
sprt @ 15+0.05 th 1 Reset aspiration window two iterations later.
14-10-17 sni threats diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 42947 W: 7347 L: 7241 D: 28359
sprt @ 60+0.05 th 1 LTC: get rid of lsb() in threat evaluation by calculating maximum threat of each type
14-10-17 lbr threats diff
ELO: -12.04 +-2.3 (95%) LOS: 0.0%
Total: 40000 W: 8364 L: 9750 D: 21886
40000 @ 9+0.03 th 1 how far do we get with that very crude/untuned replacement ?
14-10-17 lbr threats^^ diff
ELO: -19.16 +-2.4 (95%) LOS: 0.0%
Total: 37824 W: 7654 L: 9738 D: 20432
40000 @ 9+0.03 th 1 what is the value of threats ?
14-10-17 fwi timemanagement_depthbas diff
LLR: 2.97 (-2.94,2.94) [0.00,6.00]
Total: 27110 W: 4707 L: 4472 D: 17931
sprt @ 60+0.05 th 1 using interpolated values between initial and tuned values as a conservative estimate. This makes sense as values did not converge.
14-10-17 Mys pawn_checks diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 14824 W: 2993 L: 3059 D: 8772
sprt @ 15+0.05 th 1 Safe pawn checks
14-10-17 fwi timemanagement_depthbas diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 11562 W: 2421 L: 2281 D: 6860
sprt @ 15+0.05 th 1 using interpolated values between initial and tuned values as a conservative estimate. This makes sense as values did not converge.
14-10-16 aji multithreats_D diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 47512 W: 9828 L: 9801 D: 27883
sprt @ 15+0.05 th 1 Multithreats, final attempt :STC
14-10-17 sni threats diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 10077 W: 2160 L: 2023 D: 5894
sprt @ 15+0.05 th 1 Get rid of lsb() in threat evaluation by calculating maximum threat of each type
14-10-16 fwi timemanagement_depthbas diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 33329 W: 6897 L: 6696 D: 19736
sprt @ 15+0.05 th 1 tuned values