Stockfish Testing Queue

Finished - 37482 tests

15-02-11 sg spsa_pawn_attack_threat diff
46633/50000 iterations
99018/100000 games played
100000 @ 15+0.05 th 1 Use different pawn attack threat bonus by piece type. Now tune this parameters, starting at value (20,20) from the current version.
15-02-12 sg pawn_attack_threat4 diff
ELO: 1.21 +-2.8 (95%) LOS: 79.8%
Total: 23278 W: 4746 L: 4665 D: 13867
30000 @ 15+0.05 th 1 Quick measure of the First tuned parameters. Successful any safe pawn push patch is now merged in.
15-02-12 Roc MoreDblAttacks diff
ELO: 0.80 +-3.3 (95%) LOS: 68.3%
Total: 16856 W: 3393 L: 3354 D: 10109
20000 @ 15+0.05 th 1 See if any value to consider different weights and also e4, d4 and file c and f for double pawn attack idea.
15-02-12 sg spsa_pawn_attack_threat diff
46974/50000 iterations
97604/100000 games played
100000 @ 15+0.05 th 1 The first tuning is done without the any_safe_pawn2 patch. The measurement (now including any_safe_pawn2 patch) gives no significant gain and this two ideas seems strongly interacting as expected. Tuning now is done based on this passed patch. Only my new parameters tuned, not the 2 from the other patch, because we add code so it have to prove first by itself.
15-02-12 Roc MoreDblAttacks diff
37570/40000 iterations
80000/80000 games played
80000 @ 15+0.05 th 1 SPSA test on all possible double pawn binds. Once we get the result, will look for a simplification.
15-02-12 SC tune_tempo diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 38725 W: 6532 L: 6561 D: 25632
sprt @ 60+0.05 th 1 Both tempo += 1 and tempo += 2 were "yellow" at STC. Test tempo += 2 at LTC and then call it a day.
15-02-12 vin en_passant_bonus diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 11769 W: 2443 L: 2303 D: 7023
sprt @ 15+0.05 th 1 Test the SPSA-tuned values at STC. They look implausibly high, but also spookily similar to Lyudmil's original estimate.
15-02-12 vin en_passant_bonus diff
LLR: 2.95 (-2.94,2.94) [0.00,6.00]
Total: 43724 W: 7469 L: 7156 D: 29099
sprt @ 60+0.05 th 1 Test of tuned values at LTC after STC SPRT test passed.
15-02-13 Roc CenterLever diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 34876 W: 6927 L: 6938 D: 21011
sprt @ 15+0.05 th 1 S(10,10) bonus for lever c4 against d5 and f4 against e5 (and symmetrical for Black).
15-02-13 sg pawn_attack_threat4 diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 9307 W: 1916 L: 1784 D: 5607
sprt @ 15+0.05 th 1 Now hopefully the correct test of the tuned parameters. It's just not my day.
15-02-13 Roc MoreDblAttacks diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 17932 W: 3530 L: 3624 D: 10778
sprt @ 15+0.05 th 1 Testing the SPSA values for the double binds on center files only. There is more weight for the endgame.
15-02-13 jos rook_filebonus diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 50560 W: 10077 L: 10045 D: 30438
sprt @ 15+0.05 th 1 Give file-dependant bonus for rook on open file. Another idea by L. Tsvetkov.
15-02-13 sg pawn_attack_threat4 diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 10955 W: 1726 L: 1777 D: 7452
sprt @ 60+0.05 th 1 LTC: Now hopefully the correct test of the tuned parameters. It's just not my day.
15-02-13 sni any_safe_push3 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 47204 W: 9405 L: 9382 D: 28417
sprt @ 15+0.05 th 1 Double the bonus for safe pawn pushes
15-02-13 jki ep2 diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 4776 W: 895 L: 989 D: 2892
sprt @ 15+0.05 th 1 safe pawn push + ep handling
15-02-13 sni square_control diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 7182 W: 1413 L: 1500 D: 4269
sprt @ 15+0.05 th 1 Add bonus for threatening piece domination
15-02-14 Roc MoreDblAttacks diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 14688 W: 2833 L: 2939 D: 8916
sprt @ 15+0.05 th 1 The SPSA values looked quite stable for dbl-attacked central squares on rank 5 and above, so keep only them, and see if the endgame bonus has any value.
15-02-14 sg spsa_pawn_attack_threat diff
48900/50000 iterations
99385/100000 games played
100000 @ 60+0.05 th 1 The tunings on STC gives good results on STC but bad at LTC. So a strong TC dependency seems to exist. So do a last tuning try on LTC.
15-02-14 vin en_passant_bonus_spsa diff
24973/20000 iterations
45000/40000 games played
40000 @ 15+0.05 th 1 Since there is possible overlap between this patch and safe pawn pushes, re-tune now the latter has been merged.
15-02-14 sni square_control diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 11000 W: 2218 L: 2295 D: 6487
sprt @ 15+0.05 th 1 Piece domination: take 2 bis (simpler than take 2, same bench)
15-02-14 sni square_control diff
LLR: -3.20 (-2.94,2.94) [-1.50,4.50]
Total: 12257 W: 2403 L: 2485 D: 7369
sprt @ 15+0.05 th 1 Piece domination: take 3
15-02-14 jos knight_outpost diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 6603 W: 1257 L: 1346 D: 4000
sprt @ 15+0.05 th 1 Give bonus for knight outposts close to the enemy king.
15-02-14 n_p MedKingSafety diff
ELO: -0.71 +-2.6 (95%) LOS: 29.5%
Total: 27864 W: 5568 L: 5625 D: 16671
40000 @ 15+0.05 th 1 King safety tuning using the last four SPSA-session and trying to use these results to extrapolate.
15-02-14 jki pawnmob diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 6843 W: 1387 L: 1476 D: 3980
sprt @ 15+0.05 th 1 Safe pawn push tweak try
15-02-14 sni any_safe_push3 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 7697 W: 1532 L: 1618 D: 4547
sprt @ 15+0.05 th 1 Increase the bonus for safe pawn pushes in endgame
15-02-14 mco SpaceThreshold diff
5171/50000 iterations
10509/100000 games played
100000 @ 15+0.05 th 1 Tune space evaluation threshold. I think it has never been properly tuned before.
15-02-14 jki smp diff
ELO: 50.50 +-8.1 (95%) LOS: 100.0%
Total: 2591 W: 667 L: 293 D: 1631
5000 @ 15+0.05 th 16 smp improvement attempt (16 threads)
15-02-14 jki pmob diff
LLR: -0.36 (-2.94,2.94) [-3.00,3.00]
Total: 13242 W: 2602 L: 2615 D: 8025
sprt @ 15+0.05 th 1 Remove piece checks for safe pawn pushes. sprt [-3, 3]
15-02-14 mco SpaceThreshold diff
21048/50000 iterations
43140/100000 games played
100000 @ 15+0.05 th 1 Tune space evaluation threshold. I think it has never been properly tuned before. Take 2 (wider changes)
15-02-14 jki pmob diff
LLR: -3.85 (-2.94,2.94) [-3.00,1.00]
Total: 33801 W: 6682 L: 6954 D: 20165
sprt @ 15+0.05 th 1 Remove piece checks for safe pawn pushes. No regression test.
15-02-15 Roc Battery diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 7526 W: 1473 L: 1559 D: 4494
sprt @ 15+0.05 th 1 Another attempt at the Q-> R battery idea.
15-02-15 jki space diff
LLR: -3.92 (-2.94,2.94) [-1.50,4.50]
Total: 18337 W: 3718 L: 3809 D: 10810
sprt @ 15+0.05 th 1 space try inspired by Lyudmil
15-02-15 Roc MoreDblAttacks diff
LLR: -3.20 (-2.94,2.94) [0.00,4.00]
Total: 38342 W: 7572 L: 7605 D: 23165
sprt @ 15+0.05 th 1 Back to original S(16, 0), Verifying additional binds on c6, f6 and b7 g7 since SPSA tuning showed singularities on those squares,
15-02-15 jki smp diff
ELO: -1.27 +-2.1 (95%) LOS: 12.3%
Total: 40000 W: 7829 L: 7975 D: 24196
40000 @ 15+0.05 th 1 smp improvement attempt (check regression: 1 thread)
15-02-15 jki smp diff
ELO: 0.28 +-2.9 (95%) LOS: 57.4%
Total: 20000 W: 3650 L: 3634 D: 12716
20000 @ 15+0.05 th 2 smp improvement attempt (check regression: 2 thread)
15-02-15 jki smp diff
ELO: 0.19 +-2.8 (95%) LOS: 55.3%
Total: 20000 W: 3440 L: 3429 D: 13131
20000 @ 15+0.05 th 4 smp improvement attempt (check regression: 4 thread)
15-02-15 jki smp diff
ELO: 6.19 +-3.9 (95%) LOS: 99.9%
Total: 10325 W: 1824 L: 1640 D: 6861
10000 @ 15+0.05 th 8 smp improvement attempt (check regression: 8 thread)
15-02-15 vin en_passant_bonus diff
ELO: -0.55 +-2.4 (95%) LOS: 32.9%
Total: 31000 W: 6085 L: 6134 D: 18781
30000 @ 15+0.05 th 1 Now that the pawn push activity has subsided, measure Elo of re-tuned values at STC.
15-02-15 mco SpaceThreshold diff
48367/50000 iterations
85524/100000 games played
100000 @ 15+0.05 th 1 Tune space evaluation threshold. I think it has never been properly tuned before. Take 3 (even wider changes and smaller range)
15-02-15 mco king8 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 55077 W: 10999 L: 10940 D: 33138
sprt @ 15+0.05 th 1 Simplify attackUnits formula
15-02-15 vin en_passant_bonus diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 62686 W: 10413 L: 10287 D: 41986
sprt @ 60+0.05 th 1 LTC test of retuned values, after STC test inconclusive. As suggested by Joona: "I suggest to make a final conclusive test at LTC. Because this test has already passed once, I think you could use less strict bounds, like [0, 5] to reduce the risk of "unlucky run"."
15-02-16 Roc Battery diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 11479 W: 2223 L: 2298 D: 6958
sprt @ 15+0.05 th 1 This time, give a bonus only for squares which are in the opponent half of the board.
15-02-16 sni connected_pawns2 diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 52393 W: 10912 L: 10656 D: 30825
sprt @ 15+0.05 th 1 Try to create mobile phalanxes
15-02-16 sni connected_pawns2 diff
LLR: -0.31 (-2.94,2.94) [-1.50,4.50]
Total: 38196 W: 7859 L: 7763 D: 22574
sprt @ 15+0.05 th 1 Try to create mobile phalanxes. Take 2: with increased pawn mobility value
15-02-16 mco king8 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 14751 W: 2530 L: 2400 D: 9821
sprt @ 60+0.05 th 1 LTC: Simplify attackUnits formula
15-02-16 mco SpaceThreshold diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 32155 W: 6384 L: 6426 D: 19345
sprt @ 15+0.05 th 1 SpaceThreshold tuning verification
15-02-16 Roc Battery diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 3062 W: 552 L: 650 D: 1860
sprt @ 15+0.05 th 1 Consider only vertical batteries.
15-02-16 Roc Battery diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 8183 W: 1610 L: 1695 D: 4878
sprt @ 15+0.05 th 1 Consider only horz batteries in the opponent half of the board.
15-02-16 vin wedges diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 13056 W: 2681 L: 2538 D: 7837
sprt @ 15+0.05 th 1 Try bonus for an advanced cramping pawn on d5/e5/c6/d6/e6/f6 that cuts opponent's lines.
15-02-16 jos space_threshold diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 77390 W: 15402 L: 15279 D: 46709
sprt @ 15+0.05 th 1 SPSA tuning try by Marco failed, now let's try CLOP value after 38k games.