Stockfish Testing Queue

Finished - 42738 tests

15-02-01 jos spsa_queen_imbal diff
50339/50000 iterations
102000/100000 games played
100000 @ 15+0.05 th 1 Try to tune material imbalance values of the queen. They seem to have the most impact elowise.
15-02-01 Roc AttackBuildUp diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 3916 W: 745 L: 841 D: 2330
sprt @ 15+0.05 th 1 Half the weight of the first test submitted.
15-02-01 SC search_tempo diff
ELO: -0.14 +-4.4 (95%) LOS: 47.6%
Total: 10000 W: 2128 L: 2132 D: 5740
10000 @ 9+0.03 th 1 Do not add tempo during evaluation and move it completely to search. Now tempo is being added for specialized endgames. It should not change anything (apart of specialized endgames).
15-02-02 Roc LatentQAttack diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 17700 W: 3503 L: 3561 D: 10636
sprt @ 15+0.05 th 1 Fix # 2. Only scoring Minor attacks. If it does not work, next step will be to lower the weight.
15-02-02 Roc AttackBuildUp diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 22258 W: 4420 L: 4466 D: 13372
sprt @ 15+0.05 th 1 See git notes. Simplified version just looking at latent Knight move attacking the king area.
15-02-02 sg tuned_pawn_attack_threa diff
ELO: 3.64 +-2.2 (95%) LOS: 100.0%
Total: 40000 W: 8213 L: 7794 D: 23993
40000 @ 15+0.05 th 1 Measure elo of tuned vs untuned pawn attack threat
15-02-02 jos matimb diff
ELO: -0.58 +-2.1 (95%) LOS: 29.8%
Total: 40000 W: 7929 L: 7996 D: 24075
40000 @ 15+0.05 th 1 Verify tuned queen imbalance values.
15-02-02 sg spsa_pawn_attack_threat diff
38612/40000 iterations
80000/80000 games played
80000 @ 15+0.05 th 1 The last tuning was promising and at least one parameter seems not converged, so try further tuning on top.
15-02-02 vin passed_blockers2_spsa diff
14981/10000 iterations
22627/20000 games played
20000 @ 15+0.05 th 1 The idea worked well at STC but no gain at LTC. Since the initial weights were a complete guess, try a quick tuning run to see if this can be improved.
15-02-02 jhe stockfish6mf diff
ELO: -10.50 +-2.6 (95%) LOS: 0.0%
Total: 24951 W: 4124 L: 4878 D: 15949
30000 @ 30+0.10 th 1 Measure Elo impact of better mate detection.
15-02-03 sg tuned2_pawn_attack_thre diff
ELO: -0.97 +-2.2 (95%) LOS: 19.3%
Total: 38311 W: 7575 L: 7682 D: 23054
40000 @ 15+0.05 th 1 Measure elo of second tuned vs first tuned pawn attack threat
15-02-03 jos spsa_queen_imbal diff
47187/50000 iterations
91000/100000 games played
100000 @ 15+0.05 th 1 Try to tune material imbalance values of the queen. They seem to have the most impact elowise. Try a second tuning session with the values from the first one as starting point.
15-02-03 lbr kqkrps diff
LLR: 3.49 (-2.94,2.94) [-3.50,0.50]
Total: 32384 W: 5138 L: 5032 D: 22214
sprt @ 15+0.05 th 1 verify that KQKRPs is useless (8moves book to steer towards endgames)
15-02-03 sg tuned_pawn_attack_threa diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 37488 W: 6213 L: 6247 D: 25028
sprt @ 60+0.05 th 1 My first tuning seems to give the best parameters, so test them now at LTC against current master.
15-02-03 lbr kbpsk diff
LLR: 2.95 (-2.94,2.94) [-3.50,0.50]
Total: 31431 W: 5009 L: 4929 D: 21493
sprt @ 15+0.05 th 1 verify that KBPsK is useless (8moves)
15-02-03 lbr kbppkb diff
LLR: 3.43 (-2.94,2.94) [-3.50,0.50]
Total: 71912 W: 11248 L: 11235 D: 49429
sprt @ 15+0.05 th 1 verify that KBPPKB is useless (8moves)
15-02-03 Roc LatentQAttack diff
12179/12500 iterations
25000/25000 games played
25000 @ 15+0.05 th 1 SPSA run 1, starting at S(20,20) only 25M to find the trend if any
15-02-03 Roc LatentQAttack diff
12264/12500 iterations
25001/25000 games played
25000 @ 15+0.05 th 1 SPSA run 2, starting at S(30,30), only 25M to find the trend if any.
15-02-03 sni pawn_support diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 16024 W: 3233 L: 3296 D: 9495
sprt @ 15+0.05 th 1 Bonus for safe pawn pushes getting connected pawns
15-02-03 gli movecount_qsearch diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 3971 W: 744 L: 840 D: 2387
sprt @ 15+0.05 th 1 One more try on making movecount pruning more accurate (hopefully cheaply enough this time)
15-02-03 sni piece_support diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 8297 W: 1559 L: 1643 D: 5095
sprt @ 15+0.05 th 1 Bonus for safe pawn pushes supporting one of our (centralized) pieces.
15-02-04 sg pawn_attack_threat3 diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 14594 W: 2876 L: 2943 D: 8775
sprt @ 15+0.05 th 1 Recognize only attacks on minor pieces (That was the original idea of Ludmil, i extended that in my succesful patch to all pieces).
15-02-04 Roc LatentQAttack diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 30549 W: 6060 L: 6083 D: 18406
sprt @ 15+0.05 th 1 See if we can trust the first SPSA result which was started with S(20,20) and roughly gave S(21,16) in all cases.
15-02-04 sni piece_support diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 4558 W: 903 L: 998 D: 2657
sprt @ 15+0.05 th 1 Take 2
15-02-04 sni pawn_support diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 13462 W: 2759 L: 2615 D: 8088
sprt @ 15+0.05 th 1 Take 2
15-02-04 sni pawn_attack_threat5 diff
LLR: 4.25 (-2.94,2.94) [-1.50,4.50]
Total: 28734 W: 5846 L: 5613 D: 17275
sprt @ 15+0.05 th 1 Treat pawn pushes supported by pieces as safe pushes
15-02-04 jos matimb diff
ELO: 1.84 +-2.2 (95%) LOS: 95.3%
Total: 40000 W: 8131 L: 7919 D: 23950
40000 @ 15+0.05 th 1 Check new values after 2nd SPSA session.
15-02-04 n_p KingSafety diff
ELO: -1.08 +-2.2 (95%) LOS: 16.3%
Total: 40000 W: 7936 L: 8060 D: 24004
40000 @ 15+0.05 th 1 Checking the values from the SPSA-session on king safety.
15-02-04 SC tuned_tempo diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 9188 W: 1771 L: 1853 D: 5564
sprt @ 15+0.05 th 1 Increase tempo value in middlegame and decrease in endgame.
15-02-04 SC tuned_tempo diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 10206 W: 1974 L: 2053 D: 6179
sprt @ 15+0.05 th 1 Decrease tempo value for middlegame and increase for endgame.
15-02-04 sg pawn_attack_threat3 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 15280 W: 3015 L: 3080 D: 9185
sprt @ 15+0.05 th 1 exclude queens as target
15-02-04 Roc LatentQAttack diff
12145/12500 iterations
25000/25000 games played
25000 @ 15+0.05 th 1 SPSA run 1 looked ok, the test at SPRT 15 +0.05 was roughly 50/50 with quite a low draw rate. Another 25M games for more tuning.
15-02-04 lbr kmpkm diff
LLR: 2.96 (-2.94,2.94) [-3.50,0.50]
Total: 21597 W: 3470 L: 3367 D: 14760
sprt @ 15+0.05 th 1 are KmPKm also useless?
15-02-04 sni pawn_support diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 29087 W: 4864 L: 4827 D: 19396
sprt @ 60+0.05 th 1 LTC: take 2
15-02-04 sni pawn_attack_threat5 diff
LLR: -3.00 (-2.94,2.94) [0.00,4.00]
Total: 72097 W: 12026 L: 11952 D: 48119
sprt @ 60+0.05 th 1 LTC : treat pawn pushes supported by pieces as safe pushes. Tested with SPRT[0,4] as this is a tuning patch.
15-02-05 lbr kqkr diff
LLR: 3.82 (-2.94,2.94) [-3.50,0.50]
Total: 72660 W: 11287 L: 11255 D: 50118
sprt @ 15+0.05 th 1 is KQKR useless ?
15-02-05 jos spsa_queen_imbal diff
55624/50000 iterations
107000/100000 games played
100000 @ 15+0.05 th 1 Try a 3rd tuning session with the values from the 2nd one as starting point. Now with smaller ck.
15-02-05 n_p KingSafety diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 16384 W: 2594 L: 2696 D: 11094
sprt @ 60+0.05 th 1 There does not seem to be a significant gain in STC but because king safety is TC dependent and tuning done in LTC, chech if improvement in LTC.
15-02-05 sni connected_passed diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 6489 W: 1245 L: 1334 D: 3910
sprt @ 15+0.05 th 1 Bonus for connected passed pawns
15-02-05 Roc LatentQAttack diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 13045 W: 2583 L: 2654 D: 7808
sprt @ 15+0.05 th 1 My last try on this idea, with last SPSA values. S(22,22) for Bishop and S(17.17) for Knight.
15-02-05 SC tuned_tempo diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 8077 W: 1601 L: 1686 D: 4790
sprt @ 15+0.05 th 1 Bugfix for tempo evaluation. More tempo value in endgames.
15-02-05 SC tuned_tempo diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 12197 W: 2416 L: 2489 D: 7292
sprt @ 15+0.05 th 1 More tempo value in middlegames, bugfix
15-02-05 jos no_split_after_null diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 16620 W: 2935 L: 2997 D: 10688
sprt @ 15+0.05 th 3 Don't split after null move. This is most likely not a good place to split. Most of the times it is rejected anyways due to min split depth. This should reduce a bit the split/search overhead. My guess is, this may be of more benefit with a higher number of threads, but start testing with 3 threads.
15-02-06 lbr krkm diff
LLR: 4.36 (-2.94,2.94) [-3.50,0.50]
Total: 40050 W: 6382 L: 6249 D: 27419
sprt @ 15+0.05 th 1 are KRKm also useless?
15-02-06 lbr kpkp diff
LLR: 3.04 (-2.94,2.94) [-3.50,0.50]
Total: 50933 W: 8035 L: 7995 D: 34903
sprt @ 15+0.05 th 1 is KPKP useless?
15-02-06 uri not_prune_high diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 17122 W: 3333 L: 3393 D: 10396
sprt @ 15+0.05 th 1 Usually do not change the search but avoid stupid pruning when the score is a losing score. fix the problem of stupid scores in small depth at r1q1nr1k/pp1b2b1/n2p2pp/2pP1p2/2B4B/3Q1N1P/PPP1NPP1/1R3RK1 b - - 0 12
15-02-06 jos matimb diff
ELO: 0.22 +-2.1 (95%) LOS: 57.8%
Total: 40000 W: 8007 L: 7982 D: 24011
40000 @ 15+0.05 th 1 Check the new values after 3rd SPSA session.
15-02-06 n_p SPSAKingSafety3 diff
45052/50000 iterations
99879/100000 games played
100000 @ 60+0.05 th 1 Another SPSA-session on king safety.
15-02-06 jos matimb diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 90570 W: 15049 L: 14915 D: 60606
sprt @ 60+0.05 th 1 LTC: values after 2nd SPSA session. (If these pass, I will test the final values of the 3rd session against the new master.)
15-02-06 sg pawn_attack_threat3 diff
ELO: 1.34 +-3.0 (95%) LOS: 80.7%
Total: 20000 W: 3989 L: 3912 D: 12099
20000 @ 15+0.05 th 1 Allow queen as defender. The test of sn which allows all pieces as defenders passed STC, but struggles with LTC. So lets measure the effect for each piece type separatly.