Stockfish Testing Queue

Finished - 35756 tests

15-10-19 gli razor_margin diff
LLR: -0.33 (-2.94,2.94) [0.00,5.00]
Total: 396 W: 42 L: 54 D: 300
sprt @ 240+0.4 th 3 Testing timeout fix.
15-10-19 IIv time_management diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 12148 W: 2337 L: 2410 D: 7401
sprt @ 10+0.1 th 1 Time usage, sprt [0,5], Take1
15-10-19 sni material5 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 16728 W: 3131 L: 3184 D: 10413
sprt @ 10+0.1 th 1 Keep balanced material to attack (take2, lower malus)
15-10-19 sg initiative diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 47259 W: 9174 L: 9089 D: 28996
sprt @ 10+0.1 th 1 Use pawn span for initiative. Half weight. Take 4
15-10-19 Roc MinorThreatSimplified diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 18879 W: 3725 L: 3600 D: 11554
sprt @ 10+0.1 th 1 Simpler threat handling
15-10-15 Roc MobExtended diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 59483 W: 11382 L: 11245 D: 36856
sprt @ 10+0.1 th 1 Take 3: extend only the Bishop
15-10-19 Voy HistoryStats diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 42420 W: 8177 L: 8184 D: 26059
sprt @ 10+0.1 th 1 Take 2. (More aggressive tweak)
15-10-19 sg initiative diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 16502 W: 3194 L: 3247 D: 10061
sprt @ 10+0.1 th 1 Pawn count and pawn span are highly correlated (r=0.78). So try to replace former with later one.
15-10-19 sg initiative diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 20994 W: 4103 L: 4135 D: 12756
sprt @ 10+0.1 th 1 Use pawn span for initiative. Double up weight. Take 2
15-10-18 sni material5 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 20309 W: 3871 L: 3908 D: 12530
sprt @ 10+0.1 th 1 Keep balanced material to attack
15-10-19 gli razor_margin diff
LLR: -0.12 (-2.94,2.94) [0.00,5.00]
Total: 1070 W: 103 L: 105 D: 862
sprt @ 240+0.4 th 3 Testing timeout fix.
15-10-18 jhe simple_time_2 diff
LLR: -3.04 (-2.94,2.94) [-3.00,1.00]
Total: 7571 W: 1065 L: 1232 D: 5274
sprt @ 40+0.4 th 1 LTC
15-10-18 sni material5 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 12721 W: 2415 L: 2485 D: 7821
sprt @ 10+0.1 th 1 Try to keep enough material (take 2)
15-10-18 IIv time_management diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 15253 W: 2998 L: 2866 D: 9389
sprt @ 10+0.1 th 1 My last try to help with time_management. I know this is not an ideal solution, but maybe could serve until better solution would be found.
15-10-18 tvi lazy_NUMA2 diff
ELO: 0.23 +-6.0 (95%) LOS: 53.0%
Total: 4551 W: 801 L: 798 D: 2952
5000 @ 10+0.1 th 3 Quick test of NUMA hack
15-10-19 gli razor_margin diff
Pending...
sprt @ 240+0.4 th 3 Testing timeout fix.
15-10-18 sg initiative diff
LLR: -3.26 (-2.94,2.94) [0.00,5.00]
Total: 88856 W: 17286 L: 17028 D: 54542
sprt @ 10+0.1 th 1 Use pawn span for initiative.
15-10-17 Voy HistoryStats diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 88058 W: 16923 L: 16768 D: 54367
sprt @ 10+0.1 th 1 Improve History Stats weights...
15-10-18 SC assorted_tuning diff
LLR: 2.95 (-2.94,2.94) [0.00,4.00]
Total: 21577 W: 3507 L: 3289 D: 14781
sprt @ 40+0.4 th 1 Retest assorted tuning without time management as discussed in https://github.com/official-stockfish/Stockfish/pull/464 LTC
15-10-17 jhe simple_time_2 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 53079 W: 10230 L: 10166 D: 32683
sprt @ 10+0.1 th 1 Further Code Cleanup.
15-10-15 sni threats6 diff
LLR: -3.04 (-2.94,2.94) [0.00,5.00]
Total: 38229 W: 7295 L: 7256 D: 23678
sprt @ 10+0.1 th 1 Change values of threats on attacked queens (take 1, change = S(10,10))
15-10-18 mco lazy_smp diff
ELO: -4.09 +-33.1 (95%) LOS: 40.4%
Total: 85 W: 8 L: 9 D: 68
10000 @ 180+0.1 th 20 lazy smp pre-merge test: high-threads scenario. With 20 threads test with fixed number of games at XXLTC to compare against same conditions at LTC.
15-10-18 sni material5 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 26878 W: 5149 L: 5156 D: 16573
sprt @ 10+0.1 th 1 Try to keep enough material
15-10-18 jos razor_margin diff
LLR: 0.08 (-2.94,2.94) [0.00,5.00]
Total: 64 W: 4 L: 1 D: 59
sprt @ 240+0.4 th 3 Respin an old patch at very long tc to check, the timeout fix works. Test will be cancelled as soon as it's clear, the one or the other way. Now with 3 threads and 4 minutes to make sure, we hit the 5min limit.
15-10-18 sni rook_scale_factor diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 33176 W: 6229 L: 6270 D: 20677
sprt @ 10+0.1 th 1 Simplify logic in SF's KRPPKRP endgame. Simplification, tested as sprt(0..4)
15-10-18 mco lazy_smp diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 4235 W: 671 L: 528 D: 3036
sprt @ 60+0.1 th 7 lazy smp pre-merge test: middle case scenario. With 7 threads YBW is still godd but lazy should start to be effective. Test as simplification because lazy_smp saves 385(!) less lines of code. LTC version.
15-10-18 mco lazy_smp diff
ELO: 44.75 +-7.6 (95%) LOS: 100.0%
Total: 2069 W: 407 L: 142 D: 1520
10000 @ 60+0.1 th 20 lazy smp pre-merge test: high-threads scenario. With 20 threads test with fixed number of games at LTC because the advantage of lazy should be already clear after just 10K games and resources available for such a test are very few. Set at high priority because we want to allocate the few high core machines available. This is the _real_ test where lazy should prove stronger.
15-10-18 jos razor_margin diff
LLR: -0.06 (-2.94,2.94) [0.00,5.00]
Total: 29 W: 1 L: 3 D: 25
sprt @ 180+0.4 th 1 Respin an old patch at very long tc to check, the timeout fix works. Test will be cancelled as soon as it's clear, the one or the other way.
15-10-18 SC assorted_tuning diff
LLR: 3.07 (-2.94,2.94) [0.00,4.00]
Total: 15124 W: 2974 L: 2756 D: 9394
sprt @ 10+0.1 th 1 Retest assorted tuning without time management as discussed in https://github.com/official-stockfish/Stockfish/pull/464
15-10-17 sni king_march diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 33232 W: 6508 L: 6485 D: 20239
sprt @ 10+0.1 th 1 Try to use vertical king separation info
15-10-18 mco lazy_smp diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 3607 W: 674 L: 526 D: 2407
sprt @ 10+0.1 th 7 lazy smp pre-merge test: middle case scenario. With 7 threads YBW is still godd but lazy should start to be effective. Test as simplification because lazy_smp saves 385(!) less lines of code.
15-10-17 aji hybrid_history diff
ELO: 2.92 +-3.9 (95%) LOS: 93.0%
Total: 10000 W: 1669 L: 1585 D: 6746
10000 @ 10+0.1 th 7 Use a shared history/countermoves within the cluster: STC
15-10-18 sni pawn_asymmetry5 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 12504 W: 2322 L: 2436 D: 7746
sprt @ 10+0.1 th 1 Double the asymmetry bonus
15-10-17 Roc InitiativeBlocked diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 25788 W: 5055 L: 5066 D: 15667
sprt @ 10+0.1 th 1 IB_20151017_1
15-10-17 mco lazy_smp diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 40232 W: 6908 L: 7130 D: 26194
sprt @ 10+0.1 th 3 lazy smp pre-merge test: worst case scenario. With 3 threads YBW scales almost in an ideal way so this is a tough test for lazy. Test as simplification becasue lazy_smp saves 385(!) less lines of code.
15-10-17 sni blocked_pawns2 diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 24640 W: 4687 L: 4704 D: 15249
sprt @ 10+0.1 th 1 Take 2 (half malus for blocked pawns)
15-10-17 mco lazy_smp diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 28186 W: 3908 L: 3798 D: 20480
sprt @ 60+0.1 th 3 lazy smp pre-merge test: worst case scenario. With 3 threads YBW scales almost in an ideal way so this is a tough test for lazy. Test as simplification becasue lazy_smp saves 385(!) less lines of code. LTC version.
15-10-17 sni threats7 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 33400 W: 6402 L: 6441 D: 20557
sprt @ 10+0.1 th 1 Values of threats on attacked queens, take 4 (change=S(8,8)). Rebased on current master.
15-10-17 sni rook5 diff
LLR: -2.79 (-2.94,2.94) [0.00,5.00]
Total: 19292 W: 3668 L: 3702 D: 11922
sprt @ 10+0.1 th 1 Also simplify logic in SF's KRPPKRP endgame function
15-10-17 SC initiative_pawns diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 18987 W: 3604 L: 3647 D: 11736
sprt @ 10+0.1 th 1 Try the opposite.
15-10-17 sg initiative diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 6555 W: 1213 L: 1311 D: 4031
sprt @ 10+0.1 th 1 Use stronger side passed pawns for initiative.
15-10-17 Roc InitiativeMg diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 16166 W: 3031 L: 3086 D: 10049
sprt @ 10+0.1 th 1 Using mg=eg/2 instead of 0, and max=4 on king_separation
15-10-17 sg initiative diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 10048 W: 1926 L: 2008 D: 6114
sprt @ 10+0.1 th 1 Use weaker side passed pawns for initiative.
15-10-17 sni blocked_pawns2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 10912 W: 2052 L: 2130 D: 6730
sprt @ 10+0.1 th 1 Blocked pawn malus for the attacking side
15-10-17 SC initiative_pawns diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 8467 W: 1596 L: 1685 D: 5186
sprt @ 10+0.1 th 1 Only use pawn count from strong side in scaling endgame.
15-10-17 lbr test diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 1159 W: 144 L: 263 D: 752
sprt @ 10+0.1 th 3 anarchic smp
15-10-16 IIv time_management diff
LLR: -1.24 (-2.94,2.94) [-3.00,1.00]
Total: 62543 W: 12016 L: 12195 D: 38332
sprt @ 10+0.1 th 1 Using 75% of time and the whole increment should cover most needs, both in making good moves, and leaving enough time on the clock. It seems like a natural limitation to me.
15-10-16 aji hybrid1 diff
ELO: -8.74 +-11.2 (95%) LOS: 6.3%
Total: 1193 W: 178 L: 208 D: 807
10000 @ 10+0.1 th 20 hybrid with 20 threads: STC (hardcoded Min Split Depth to 5)
15-10-15 sg update_stats diff
LLR: -3.76 (-2.94,2.94) [0.00,4.00]
Total: 58979 W: 11295 L: 11287 D: 36397
sprt @ 10+0.1 th 1 Test first the tuned history update. But possible further tuning is necessary (some values seems not converged).
15-10-16 sg update_stats diff
LLR: -3.10 (-2.94,2.94) [0.00,4.00]
Total: 83734 W: 16097 L: 15965 D: 51672
sprt @ 10+0.1 th 1 Test tuned values. Take 2