Stockfish Testing Queue

Finished - 51187 tests

15-02-01 Roc AttackBuildUp diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 3916 W: 745 L: 841 D: 2330
sprt @ 15+0.05 th 1 Half the weight of the first test submitted.
15-02-01 SC search_tempo diff
ELO: -0.14 +-4.4 (95%) LOS: 47.6%
Total: 10000 W: 2128 L: 2132 D: 5740
10000 @ 9+0.03 th 1 Do not add tempo during evaluation and move it completely to search. Now tempo is being added for specialized endgames. It should not change anything (apart of specialized endgames).
15-02-02 Roc LatentQAttack diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 17700 W: 3503 L: 3561 D: 10636
sprt @ 15+0.05 th 1 Fix # 2. Only scoring Minor attacks. If it does not work, next step will be to lower the weight.
15-02-02 Roc AttackBuildUp diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 22258 W: 4420 L: 4466 D: 13372
sprt @ 15+0.05 th 1 See git notes. Simplified version just looking at latent Knight move attacking the king area.
15-02-02 sg tuned_pawn_attack_threa diff
ELO: 3.64 +-2.2 (95%) LOS: 100.0%
Total: 40000 W: 8213 L: 7794 D: 23993
40000 @ 15+0.05 th 1 Measure elo of tuned vs untuned pawn attack threat
15-02-02 jos matimb diff
ELO: -0.58 +-2.1 (95%) LOS: 29.8%
Total: 40000 W: 7929 L: 7996 D: 24075
40000 @ 15+0.05 th 1 Verify tuned queen imbalance values.
15-02-02 sg spsa_pawn_attack_threat diff
38612/40000 iterations
80000/80000 games played
80000 @ 15+0.05 th 1 The last tuning was promising and at least one parameter seems not converged, so try further tuning on top.
15-02-02 vin passed_blockers2_spsa diff
14981/10000 iterations
22627/20000 games played
20000 @ 15+0.05 th 1 The idea worked well at STC but no gain at LTC. Since the initial weights were a complete guess, try a quick tuning run to see if this can be improved.
15-02-02 jhe stockfish6mf diff
ELO: -10.50 +-2.6 (95%) LOS: 0.0%
Total: 24951 W: 4124 L: 4878 D: 15949
30000 @ 30+0.10 th 1 Measure Elo impact of better mate detection.
15-02-03 sg tuned2_pawn_attack_thre diff
ELO: -0.97 +-2.2 (95%) LOS: 19.3%
Total: 38311 W: 7575 L: 7682 D: 23054
40000 @ 15+0.05 th 1 Measure elo of second tuned vs first tuned pawn attack threat
15-02-03 jos spsa_queen_imbal diff
47187/50000 iterations
91000/100000 games played
100000 @ 15+0.05 th 1 Try to tune material imbalance values of the queen. They seem to have the most impact elowise. Try a second tuning session with the values from the first one as starting point.
15-02-03 lbr kqkrps diff
LLR: 3.49 (-2.94,2.94) [-3.50,0.50]
Total: 32384 W: 5138 L: 5032 D: 22214
sprt @ 15+0.05 th 1 verify that KQKRPs is useless (8moves book to steer towards endgames)
15-02-03 sg tuned_pawn_attack_threa diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 37488 W: 6213 L: 6247 D: 25028
sprt @ 60+0.05 th 1 My first tuning seems to give the best parameters, so test them now at LTC against current master.
15-02-03 lbr kbpsk diff
LLR: 2.95 (-2.94,2.94) [-3.50,0.50]
Total: 31431 W: 5009 L: 4929 D: 21493
sprt @ 15+0.05 th 1 verify that KBPsK is useless (8moves)
15-02-03 lbr kbppkb diff
LLR: 3.43 (-2.94,2.94) [-3.50,0.50]
Total: 71912 W: 11248 L: 11235 D: 49429
sprt @ 15+0.05 th 1 verify that KBPPKB is useless (8moves)
15-02-03 Roc LatentQAttack diff
12179/12500 iterations
25000/25000 games played
25000 @ 15+0.05 th 1 SPSA run 1, starting at S(20,20) only 25M to find the trend if any
15-02-03 Roc LatentQAttack diff
12264/12500 iterations
25001/25000 games played
25000 @ 15+0.05 th 1 SPSA run 2, starting at S(30,30), only 25M to find the trend if any.
15-02-03 sni pawn_support diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 16024 W: 3233 L: 3296 D: 9495
sprt @ 15+0.05 th 1 Bonus for safe pawn pushes getting connected pawns
15-02-03 gli movecount_qsearch diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 3971 W: 744 L: 840 D: 2387
sprt @ 15+0.05 th 1 One more try on making movecount pruning more accurate (hopefully cheaply enough this time)
15-02-03 sni piece_support diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 8297 W: 1559 L: 1643 D: 5095
sprt @ 15+0.05 th 1 Bonus for safe pawn pushes supporting one of our (centralized) pieces.
15-02-04 sg pawn_attack_threat3 diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 14594 W: 2876 L: 2943 D: 8775
sprt @ 15+0.05 th 1 Recognize only attacks on minor pieces (That was the original idea of Ludmil, i extended that in my succesful patch to all pieces).
15-02-04 Roc LatentQAttack diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 30549 W: 6060 L: 6083 D: 18406
sprt @ 15+0.05 th 1 See if we can trust the first SPSA result which was started with S(20,20) and roughly gave S(21,16) in all cases.
15-02-04 sni piece_support diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 4558 W: 903 L: 998 D: 2657
sprt @ 15+0.05 th 1 Take 2
15-02-04 sni pawn_support diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 13462 W: 2759 L: 2615 D: 8088
sprt @ 15+0.05 th 1 Take 2
15-02-04 sni pawn_attack_threat5 diff
LLR: 4.25 (-2.94,2.94) [-1.50,4.50]
Total: 28734 W: 5846 L: 5613 D: 17275
sprt @ 15+0.05 th 1 Treat pawn pushes supported by pieces as safe pushes
15-02-04 jos matimb diff
ELO: 1.84 +-2.2 (95%) LOS: 95.3%
Total: 40000 W: 8131 L: 7919 D: 23950
40000 @ 15+0.05 th 1 Check new values after 2nd SPSA session.
15-02-04 n_p KingSafety diff
ELO: -1.08 +-2.2 (95%) LOS: 16.3%
Total: 40000 W: 7936 L: 8060 D: 24004
40000 @ 15+0.05 th 1 Checking the values from the SPSA-session on king safety.
15-02-04 SC tuned_tempo diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 9188 W: 1771 L: 1853 D: 5564
sprt @ 15+0.05 th 1 Increase tempo value in middlegame and decrease in endgame.
15-02-04 SC tuned_tempo diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 10206 W: 1974 L: 2053 D: 6179
sprt @ 15+0.05 th 1 Decrease tempo value for middlegame and increase for endgame.
15-02-04 sg pawn_attack_threat3 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 15280 W: 3015 L: 3080 D: 9185
sprt @ 15+0.05 th 1 exclude queens as target
15-02-04 Roc LatentQAttack diff
12145/12500 iterations
25000/25000 games played
25000 @ 15+0.05 th 1 SPSA run 1 looked ok, the test at SPRT 15 +0.05 was roughly 50/50 with quite a low draw rate. Another 25M games for more tuning.
15-02-04 lbr kmpkm diff
LLR: 2.96 (-2.94,2.94) [-3.50,0.50]
Total: 21597 W: 3470 L: 3367 D: 14760
sprt @ 15+0.05 th 1 are KmPKm also useless?
15-02-04 sni pawn_support diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 29087 W: 4864 L: 4827 D: 19396
sprt @ 60+0.05 th 1 LTC: take 2
15-02-04 sni pawn_attack_threat5 diff
LLR: -3.00 (-2.94,2.94) [0.00,4.00]
Total: 72097 W: 12026 L: 11952 D: 48119
sprt @ 60+0.05 th 1 LTC : treat pawn pushes supported by pieces as safe pushes. Tested with SPRT[0,4] as this is a tuning patch.
15-02-05 lbr kqkr diff
LLR: 3.82 (-2.94,2.94) [-3.50,0.50]
Total: 72660 W: 11287 L: 11255 D: 50118
sprt @ 15+0.05 th 1 is KQKR useless ?
15-02-05 jos spsa_queen_imbal diff
55624/50000 iterations
107000/100000 games played
100000 @ 15+0.05 th 1 Try a 3rd tuning session with the values from the 2nd one as starting point. Now with smaller ck.
15-02-05 n_p KingSafety diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 16384 W: 2594 L: 2696 D: 11094
sprt @ 60+0.05 th 1 There does not seem to be a significant gain in STC but because king safety is TC dependent and tuning done in LTC, chech if improvement in LTC.
15-02-05 sni connected_passed diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 6489 W: 1245 L: 1334 D: 3910
sprt @ 15+0.05 th 1 Bonus for connected passed pawns
15-02-05 Roc LatentQAttack diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 13045 W: 2583 L: 2654 D: 7808
sprt @ 15+0.05 th 1 My last try on this idea, with last SPSA values. S(22,22) for Bishop and S(17.17) for Knight.
15-02-05 SC tuned_tempo diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 8077 W: 1601 L: 1686 D: 4790
sprt @ 15+0.05 th 1 Bugfix for tempo evaluation. More tempo value in endgames.
15-02-05 SC tuned_tempo diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 12197 W: 2416 L: 2489 D: 7292
sprt @ 15+0.05 th 1 More tempo value in middlegames, bugfix
15-02-05 jos no_split_after_null diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 16620 W: 2935 L: 2997 D: 10688
sprt @ 15+0.05 th 3 Don't split after null move. This is most likely not a good place to split. Most of the times it is rejected anyways due to min split depth. This should reduce a bit the split/search overhead. My guess is, this may be of more benefit with a higher number of threads, but start testing with 3 threads.
15-02-06 lbr krkm diff
LLR: 4.36 (-2.94,2.94) [-3.50,0.50]
Total: 40050 W: 6382 L: 6249 D: 27419
sprt @ 15+0.05 th 1 are KRKm also useless?
15-02-06 lbr kpkp diff
LLR: 3.04 (-2.94,2.94) [-3.50,0.50]
Total: 50933 W: 8035 L: 7995 D: 34903
sprt @ 15+0.05 th 1 is KPKP useless?
15-02-06 uri not_prune_high diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 17122 W: 3333 L: 3393 D: 10396
sprt @ 15+0.05 th 1 Usually do not change the search but avoid stupid pruning when the score is a losing score. fix the problem of stupid scores in small depth at r1q1nr1k/pp1b2b1/n2p2pp/2pP1p2/2B4B/3Q1N1P/PPP1NPP1/1R3RK1 b - - 0 12
15-02-06 jos matimb diff
ELO: 0.22 +-2.1 (95%) LOS: 57.8%
Total: 40000 W: 8007 L: 7982 D: 24011
40000 @ 15+0.05 th 1 Check the new values after 3rd SPSA session.
15-02-06 n_p SPSAKingSafety3 diff
45052/50000 iterations
99879/100000 games played
100000 @ 60+0.05 th 1 Another SPSA-session on king safety.
15-02-06 jos matimb diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 90570 W: 15049 L: 14915 D: 60606
sprt @ 60+0.05 th 1 LTC: values after 2nd SPSA session. (If these pass, I will test the final values of the 3rd session against the new master.)
15-02-06 sg pawn_attack_threat3 diff
ELO: 1.34 +-3.0 (95%) LOS: 80.7%
Total: 20000 W: 3989 L: 3912 D: 12099
20000 @ 15+0.05 th 1 Allow queen as defender. The test of sn which allows all pieces as defenders passed STC, but struggles with LTC. So lets measure the effect for each piece type separatly.
15-02-06 sg pawn_attack_threat3 diff
ELO: 0.28 +-3.0 (95%) LOS: 57.1%
Total: 20000 W: 4006 L: 3990 D: 12004
20000 @ 15+0.05 th 1 Allow rook as defender