Stockfish Testing Queue

Finished - 35727 tests

15-02-12 SC tune_tempo diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 38725 W: 6532 L: 6561 D: 25632
sprt @ 60+0.05 th 1 Both tempo += 1 and tempo += 2 were "yellow" at STC. Test tempo += 2 at LTC and then call it a day.
15-02-12 vin en_passant_bonus diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 11769 W: 2443 L: 2303 D: 7023
sprt @ 15+0.05 th 1 Test the SPSA-tuned values at STC. They look implausibly high, but also spookily similar to Lyudmil's original estimate.
15-02-12 vin en_passant_bonus diff
LLR: 2.95 (-2.94,2.94) [0.00,6.00]
Total: 43724 W: 7469 L: 7156 D: 29099
sprt @ 60+0.05 th 1 Test of tuned values at LTC after STC SPRT test passed.
15-02-13 Roc CenterLever diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 34876 W: 6927 L: 6938 D: 21011
sprt @ 15+0.05 th 1 S(10,10) bonus for lever c4 against d5 and f4 against e5 (and symmetrical for Black).
15-02-13 sg pawn_attack_threat4 diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 9307 W: 1916 L: 1784 D: 5607
sprt @ 15+0.05 th 1 Now hopefully the correct test of the tuned parameters. It's just not my day.
15-02-13 Roc MoreDblAttacks diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 17932 W: 3530 L: 3624 D: 10778
sprt @ 15+0.05 th 1 Testing the SPSA values for the double binds on center files only. There is more weight for the endgame.
15-02-13 jos rook_filebonus diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 50560 W: 10077 L: 10045 D: 30438
sprt @ 15+0.05 th 1 Give file-dependent bonus for rook on open file. Another idea by L. Tsvetkov.
15-02-13 sg pawn_attack_threat4 diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 10955 W: 1726 L: 1777 D: 7452
sprt @ 60+0.05 th 1 LTC: Now hopefully the correct test of the tuned parameters. It's just not my day.
15-02-13 sni any_safe_push3 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 47204 W: 9405 L: 9382 D: 28417
sprt @ 15+0.05 th 1 Double the bonus for safe pawn pushes
15-02-13 jki ep2 diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 4776 W: 895 L: 989 D: 2892
sprt @ 15+0.05 th 1 safe pawn push + ep handling
15-02-13 sni square_control diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 7182 W: 1413 L: 1500 D: 4269
sprt @ 15+0.05 th 1 Add bonus for threatening piece domination
15-02-14 Roc MoreDblAttacks diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 14688 W: 2833 L: 2939 D: 8916
sprt @ 15+0.05 th 1 The SPSA values looked quite stable for dbl-attacked central squares on rank 5 and above, so keep only them, and see if the endgame bonus has any value.
15-02-14 sg spsa_pawn_attack_threat diff
48900/50000 iterations
99385/100000 games played
100000 @ 60+0.05 th 1 The STC tunings give good results at STC but bad ones at LTC, so a strong TC dependency seems to exist. Do a last tuning try at LTC.
15-02-14 vin en_passant_bonus_spsa diff
24973/20000 iterations
45000/40000 games played
40000 @ 15+0.05 th 1 Since there is possible overlap between this patch and safe pawn pushes, re-tune now that the latter has been merged.
15-02-14 sni square_control diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 11000 W: 2218 L: 2295 D: 6487
sprt @ 15+0.05 th 1 Piece domination: take 2 bis (simpler than take 2, same bench)
15-02-14 sni square_control diff
LLR: -3.20 (-2.94,2.94) [-1.50,4.50]
Total: 12257 W: 2403 L: 2485 D: 7369
sprt @ 15+0.05 th 1 Piece domination: take 3
15-02-14 jos knight_outpost diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 6603 W: 1257 L: 1346 D: 4000
sprt @ 15+0.05 th 1 Give bonus for knight outposts close to the enemy king.
15-02-14 n_p MedKingSafety diff
ELO: -0.71 +-2.6 (95%) LOS: 29.5%
Total: 27864 W: 5568 L: 5625 D: 16671
40000 @ 15+0.05 th 1 King safety tuning using the last four SPSA sessions and trying to use these results to extrapolate.
15-02-14 jki pawnmob diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 6843 W: 1387 L: 1476 D: 3980
sprt @ 15+0.05 th 1 Safe pawn push tweak try
15-02-14 sni any_safe_push3 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 7697 W: 1532 L: 1618 D: 4547
sprt @ 15+0.05 th 1 Increase the bonus for safe pawn pushes in endgame
15-02-14 mco SpaceThreshold diff
5171/50000 iterations
10509/100000 games played
100000 @ 15+0.05 th 1 Tune space evaluation threshold. I think it has never been properly tuned before.
15-02-14 jki smp diff
ELO: 50.50 +-8.1 (95%) LOS: 100.0%
Total: 2591 W: 667 L: 293 D: 1631
5000 @ 15+0.05 th 16 smp improvement attempt (16 threads)
15-02-14 jki pmob diff
LLR: -0.36 (-2.94,2.94) [-3.00,3.00]
Total: 13242 W: 2602 L: 2615 D: 8025
sprt @ 15+0.05 th 1 Remove piece checks for safe pawn pushes. sprt [-3, 3]
15-02-14 mco SpaceThreshold diff
21048/50000 iterations
43140/100000 games played
100000 @ 15+0.05 th 1 Tune space evaluation threshold. I think it has never been properly tuned before. Take 2 (wider changes)
15-02-14 jki pmob diff
LLR: -3.85 (-2.94,2.94) [-3.00,1.00]
Total: 33801 W: 6682 L: 6954 D: 20165
sprt @ 15+0.05 th 1 Remove piece checks for safe pawn pushes. No regression test.
15-02-15 Roc Battery diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 7526 W: 1473 L: 1559 D: 4494
sprt @ 15+0.05 th 1 Another attempt at the Q-> R battery idea.
15-02-15 jki space diff
LLR: -3.92 (-2.94,2.94) [-1.50,4.50]
Total: 18337 W: 3718 L: 3809 D: 10810
sprt @ 15+0.05 th 1 space try inspired by Lyudmil
15-02-15 Roc MoreDblAttacks diff
LLR: -3.20 (-2.94,2.94) [0.00,4.00]
Total: 38342 W: 7572 L: 7605 D: 23165
sprt @ 15+0.05 th 1 Back to original S(16, 0). Verifying additional binds on c6, f6, b7 and g7, since SPSA tuning showed singularities on those squares.
15-02-15 jki smp diff
ELO: -1.27 +-2.1 (95%) LOS: 12.3%
Total: 40000 W: 7829 L: 7975 D: 24196
40000 @ 15+0.05 th 1 smp improvement attempt (check regression: 1 thread)
15-02-15 jki smp diff
ELO: 0.28 +-2.9 (95%) LOS: 57.4%
Total: 20000 W: 3650 L: 3634 D: 12716
20000 @ 15+0.05 th 2 smp improvement attempt (check regression: 2 threads)
15-02-15 jki smp diff
ELO: 0.19 +-2.8 (95%) LOS: 55.3%
Total: 20000 W: 3440 L: 3429 D: 13131
20000 @ 15+0.05 th 4 smp improvement attempt (check regression: 4 threads)
15-02-15 jki smp diff
ELO: 6.19 +-3.9 (95%) LOS: 99.9%
Total: 10325 W: 1824 L: 1640 D: 6861
10000 @ 15+0.05 th 8 smp improvement attempt (check regression: 8 threads)
15-02-15 vin en_passant_bonus diff
ELO: -0.55 +-2.4 (95%) LOS: 32.9%
Total: 31000 W: 6085 L: 6134 D: 18781
30000 @ 15+0.05 th 1 Now that the pawn push activity has subsided, measure Elo of re-tuned values at STC.
15-02-15 mco SpaceThreshold diff
48367/50000 iterations
85524/100000 games played
100000 @ 15+0.05 th 1 Tune space evaluation threshold. I think it has never been properly tuned before. Take 3 (even wider changes and smaller range)
15-02-15 mco king8 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 55077 W: 10999 L: 10940 D: 33138
sprt @ 15+0.05 th 1 Simplify attackUnits formula
15-02-15 vin en_passant_bonus diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 62686 W: 10413 L: 10287 D: 41986
sprt @ 60+0.05 th 1 LTC test of retuned values, after STC test inconclusive. As suggested by Joona: "I suggest to make a final conclusive test at LTC. Because this test has already passed once, I think you could use less strict bounds, like [0, 5] to reduce the risk of "unlucky run"."
15-02-16 Roc Battery diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 11479 W: 2223 L: 2298 D: 6958
sprt @ 15+0.05 th 1 This time, give a bonus only for squares which are in the opponent half of the board.
15-02-16 sni connected_pawns2 diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 52393 W: 10912 L: 10656 D: 30825
sprt @ 15+0.05 th 1 Try to create mobile phalanxes
15-02-16 sni connected_pawns2 diff
LLR: -0.31 (-2.94,2.94) [-1.50,4.50]
Total: 38196 W: 7859 L: 7763 D: 22574
sprt @ 15+0.05 th 1 Try to create mobile phalanxes. Take 2: with increased pawn mobility value
15-02-16 mco king8 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 14751 W: 2530 L: 2400 D: 9821
sprt @ 60+0.05 th 1 LTC: Simplify attackUnits formula
15-02-16 mco SpaceThreshold diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 32155 W: 6384 L: 6426 D: 19345
sprt @ 15+0.05 th 1 SpaceThreshold tuning verification
15-02-16 Roc Battery diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 3062 W: 552 L: 650 D: 1860
sprt @ 15+0.05 th 1 Consider only vertical batteries.
15-02-16 Roc Battery diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 8183 W: 1610 L: 1695 D: 4878
sprt @ 15+0.05 th 1 Consider only horz batteries in the opponent half of the board.
15-02-16 vin wedges diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 13056 W: 2681 L: 2538 D: 7837
sprt @ 15+0.05 th 1 Try bonus for an advanced cramping pawn on d5/e5/c6/d6/e6/f6 that cuts opponent's lines.
15-02-16 jos space_threshold diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 77390 W: 15402 L: 15279 D: 46709
sprt @ 15+0.05 th 1 SPSA tuning try by Marco failed, now let's try CLOP value after 38k games.
15-02-17 Roc Battery diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 8215 W: 1602 L: 1686 D: 4927
sprt @ 15+0.05 th 1 Lower score, only horz battery.
15-02-17 Roc Battery diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 5684 W: 1049 L: 1140 D: 3495
sprt @ 15+0.05 th 1 Lower score, for file battery only.
15-02-17 vin wedges diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 17010 W: 2787 L: 2809 D: 11414
sprt @ 60+0.05 th 1 Test passed at STC, so proceed to test at LTC to see if it scales as-is.
15-02-17 vin wedges diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 14827 W: 3028 L: 2880 D: 8919
sprt @ 15+0.05 th 1 Try variant of wedges idea, making levers and wedges exclusive, in case this is even better.
15-02-17 Roc PawnDefensePush diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 12161 W: 2385 L: 2458 D: 7318
sprt @ 15+0.05 th 1 There is a S(20,20) bonus if a pawn can attack a piece. About S(10,10) if a pawn can defend a piece.
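
Note on reading the numbers: the ELO, LOS and LLR figures in the entries above can be recomputed from the W/L/D counts alone. The sketch below is one way to do that, not the framework's actual code. It rests on several assumptions: the function names are made up for illustration; the Elo figure is taken to be a plain logistic estimate with a normal-approximation 95% margin; the SPRT bounds in square brackets are taken to be in BayesElo units with the draw parameter estimated from the sample; and the (-2.94,2.94) stopping bounds are taken to correspond to alpha = beta = 0.05. Under these assumptions the MedKingSafety counts (W: 5568 L: 5625 D: 16671) and the first en_passant_bonus STC counts (W: 2443 L: 2303 D: 7023, bounds [-1.50,4.50]) should come out close to the logged values.

```python
import math


def elo_and_los(wins, losses, draws):
    """Return (elo, 95% margin, LOS) from raw game counts.

    Assumption: logistic Elo model and a normal approximation
    for the error margin and the likelihood of superiority.
    """
    n = wins + losses + draws
    score = (wins + 0.5 * draws) / n

    def to_elo(s):
        return -400.0 * math.log10(1.0 / s - 1.0)

    # Per-game variance of the score (win = 1, draw = 0.5, loss = 0).
    var = (wins * (1.0 - score) ** 2
           + losses * score ** 2
           + draws * (0.5 - score) ** 2) / n
    half_width = 1.96 * math.sqrt(var / n)
    margin = (to_elo(score + half_width) - to_elo(score - half_width)) / 2.0

    # LOS: probability that the patch is stronger; draws carry no information.
    z = (wins - losses) / math.sqrt(wins + losses)
    los = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    return to_elo(score), margin, los


def bayeselo_probs(elo, drawelo):
    """Win/loss/draw probabilities under the BayesElo model (assumed here)."""
    win = 1.0 / (1.0 + 10.0 ** ((drawelo - elo) / 400.0))
    loss = 1.0 / (1.0 + 10.0 ** ((drawelo + elo) / 400.0))
    return win, loss, 1.0 - win - loss


def sprt_llr(wins, losses, draws, elo0, elo1):
    """Log-likelihood ratio of H1 (elo1) against H0 (elo0).

    Assumption: elo0/elo1 are BayesElo values and drawelo is
    estimated from the observed win and loss frequencies.
    """
    n = wins + losses + draws
    pw, pl = wins / n, losses / n
    drawelo = 200.0 * math.log10((1.0 - pl) / pl * (1.0 - pw) / pw)
    w0, l0, d0 = bayeselo_probs(elo0, drawelo)
    w1, l1, d1 = bayeselo_probs(elo1, drawelo)
    return (wins * math.log(w1 / w0)
            + losses * math.log(l1 / l0)
            + draws * math.log(d1 / d0))


def sprt_bounds(alpha=0.05, beta=0.05):
    """Wald's stopping bounds: fail below the first, pass above the second."""
    return math.log(beta / (1.0 - alpha)), math.log((1.0 - beta) / alpha)


if __name__ == "__main__":
    elo, margin, los = elo_and_los(5568, 5625, 16671)
    print(f"ELO: {elo:.2f} +-{margin:.1f} (95%) LOS: {los:.1%}")
    lower, upper = sprt_bounds()
    llr = sprt_llr(2443, 2303, 7023, -1.5, 4.5)
    print(f"LLR: {llr:.2f} ({lower:.2f},{upper:.2f}) [-1.50,4.50]")
```

A test stops as soon as the LLR leaves the interval between the two bounds, which is why finished SPRT entries usually show an LLR just past -2.94 or 2.94.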