Stockfish Testing Queue

Finished - 37482 tests

15-02-07 vin passed_blockers2_spsa diff
12234/12500 iterations
25000/25000 games played
25000 @ 15+0.05 th 1 Previous tuning run showed a trend in the rook, so try a second run expanded to include the minor pieces and the endgame. 25K games to find any trend (e.g. *should* the minor pieces be included or was my original suspicion that no bonus is needed correct)
15-02-07 vin pawns_both_wings diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 22251 W: 4488 L: 4533 D: 13230
sprt @ 15+0.05 th 1 Try an idea of Mindbreaker's - that the side ahead in an endgame (or approaching it) should try to preserve its pawns on both wings. The resulting eval is still symmetric. Two approaches seem possible - direct eval bonus or scaling factor adjust. Take 1 is the direct approach.
15-02-07 jos matimb diff
ELO: -0.57 +-3.0 (95%) LOS: 35.6%
Total: 20000 W: 3984 L: 4017 D: 11999
20000 @ 15+0.05 th 1 Pit the final values of 3rd tuning session against those of the 2nd one.
15-02-07 sg pawn_attack_threat3 diff
ELO: -0.81 +-2.6 (95%) LOS: 27.2%
Total: 27059 W: 5374 L: 5437 D: 16248
30000 @ 15+0.05 th 1 Allow knight, king and queen as defender. Combine the pieces which show some elo gain and measure if this adds up.
15-02-07 vin pawns_both_wings_alt diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 16207 W: 3224 L: 3286 D: 9697
sprt @ 15+0.05 th 1 Try an idea of Mindbreaker's - that the side ahead in an endgame (or approaching it) should try to preserve its pawns on both wings. The resulting eval is still symmetric. Take 2 is the scale factor approach, which is cleaner, so it would be nice if this worked rather than take 1.
15-02-07 sg pawn_attack_threat3 diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 28215 W: 5739 L: 5767 D: 16709
sprt @ 15+0.05 th 1 Allow knight as defender. Retest with SPRT to check for luck in first run
15-02-07 vin passed_blockers2_spsa diff
39792/40000 iterations
81000/80000 games played
80000 @ 15+0.05 th 1 The trial run showed continued movement for rook and knight in particular, so go for a longer run with the previous values as the starting point. Upwards movement for the heavy piece endgame scores hints that this is still a factor there. Also switch to a much nicer table-driven way of expressing the weighting.
15-02-07 SC search_tempo_game_phase diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 19773 W: 3911 L: 3964 D: 11898
sprt @ 15+0.05 th 1 Game phase based tempo value (faster version), take 1. More tempo value for endgames.
15-02-07 SC search_tempo_game_phase diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 17531 W: 3508 L: 3567 D: 10456
sprt @ 15+0.05 th 1 Game phase based tempo value (faster version), take 2. More tempo value for middlegame.
15-02-07 SC search_tempo_king_dista diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 23776 W: 4839 L: 4880 D: 14057
sprt @ 15+0.05 th 1 King distance based tempo value. More tempo if kings are distant.
15-02-07 sg pawn_attack_threat_see diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 23696 W: 4778 L: 4819 D: 14099
sprt @ 15+0.05 th 1 My current implemetation detects as cheap as possible safe pawn pushes, so that many cases not covered. Try now for the remaining pushes safety calculation with SEE
15-02-07 jki pdouble diff
LLR: 3.14 (-2.94,2.94) [-1.50,4.50]
Total: 15974 W: 3291 L: 3133 D: 9550
sprt @ 15+0.05 th 1 Bonus for pawn double attack controlling a central square.
15-02-07 jki pdoublesafety diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 6428 W: 1264 L: 1354 D: 3810
sprt @ 15+0.05 th 1 Pawn double attack and king safety try
15-02-07 sni pawn_attack_threat5 diff
3538/3750 iterations
7102/7500 games played
7500 @ 60+0.05 th 1 SPSA tuning of connected pawns values
15-02-07 jki pdouble diff
LLR: 3.24 (-2.94,2.94) [0.00,6.00]
Total: 10449 W: 1837 L: 1674 D: 6938
sprt @ 60+0.05 th 1 LTC: Bonus for pawn double attack controlling a central square.
15-02-08 sg backward_pawn diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 11104 W: 2202 L: 2278 D: 6624
sprt @ 15+0.05 th 1 double up penalty if backward pawn is stopped by a pawn double attack
15-02-08 jki pdouble_cf diff
LLR: 3.68 (-2.94,2.94) [-1.50,4.50]
Total: 16176 W: 3290 L: 3113 D: 9773
sprt @ 15+0.05 th 1 Pawn bind bonus also for c- and f-files
15-02-08 jki pdouble2 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 3977 W: 741 L: 837 D: 2399
sprt @ 15+0.05 th 1 Center bind bonus x2
15-02-08 jki pdouble_end diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 25443 W: 5028 L: 5065 D: 15350
sprt @ 15+0.05 th 1 Bind bonus also for endgames.
15-02-08 sg outposts_double diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 12099 W: 2426 L: 2500 D: 7173
sprt @ 15+0.05 th 1 Add 50% more bonus if outpost is defended by two pawns.
15-02-08 sg backward_pawn diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 23324 W: 4585 L: 4628 D: 14111
sprt @ 15+0.05 th 1 Add 50% penalty if backward pawn is stopped by a pawn double attack (Take 2)
15-02-08 sg isolated_pawn diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 13095 W: 2575 L: 2646 D: 7874
sprt @ 15+0.05 th 1 Fix error: add 50% penalty for isolated pawn which is stopped by a pawn double attack
15-02-08 SC search_tempo_king_dista diff
28685/30000 iterations
60000/60000 games played
60000 @ 15+0.05 th 1 Tune king distance based tempo.
15-02-08 SC search_tempo_game_phase diff
19566/20000 iterations
40000/40000 games played
40000 @ 15+0.05 th 1 Tune game phase based tempo evaluation.
15-02-08 jki pdouble_cf diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 13471 W: 2201 L: 2312 D: 8958
sprt @ 60+0.05 th 1 LTC: Pawn bind bonus also for c- and f-files
15-02-08 Roc LooseKnight diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 56073 W: 11200 L: 11152 D: 33721
sprt @ 15+0.05 th 1 Small penalty for loose knight.
15-02-09 sni connected_pawns diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 19367 W: 3863 L: 3916 D: 11588
sprt @ 15+0.05 th 1 Half values for connected pawns in endgame
15-02-09 sni any_safe_push diff
LLR: 2.70 (-2.94,2.94) [-1.50,4.50]
Total: 18233 W: 3705 L: 3557 D: 10971
sprt @ 15+0.05 th 1 Add small bonus for all safe pawn pushes
15-02-09 jki master diff
ELO: 7.50 +-2.0 (95%) LOS: 100.0%
Total: 40000 W: 7286 L: 6423 D: 26291
40000 @ 60+0.05 th 1 First regression test vs. SF6, Pawn Center Bind Bonus (e1185700), using 2moves_v1.pgn
15-02-09 Roc LooseKnight diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 25295 W: 5077 L: 4901 D: 15317
sprt @ 15+0.05 th 1 With S(20,0)
15-02-09 SC search_tempo_game_phase diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 22535 W: 4558 L: 4602 D: 13375
sprt @ 15+0.05 th 1 Tuned value for gampe phase based tempo evaluation.
15-02-09 SC search_tempo_king_dista diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 6997 W: 1313 L: 1400 D: 4284
sprt @ 15+0.05 th 1 Tuned values for king distance based tempo.
15-02-09 vin passed_blockers2_spsa diff
19390/20000 iterations
40000/40000 games played
40000 @ 15+0.05 th 1 Some of the parameters have now converged but not all, so one final run with the King included and then we re-test. -1 priority.
15-02-09 SC tune_tempo diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 52492 W: 10509 L: 10476 D: 31507
sprt @ 15+0.05 th 1 In all the tempo tuning SPSA, tempo was increased in average. Is tempo +1 enough?
15-02-09 SC tune_tempo diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 55084 W: 10923 L: 10882 D: 33279
sprt @ 15+0.05 th 1 And tempo +2?
15-02-09 Roc LooseKnight diff
LLR: -2.97 (-2.94,2.94) [0.00,6.00]
Total: 23032 W: 3879 L: 3871 D: 15282
sprt @ 60+0.05 th 1 With S(20,0)
15-02-10 jos spsa_rook_imbal diff
52723/50000 iterations
103000/100000 games played
100000 @ 15+0.05 th 1 First tuning session, rook values.
15-02-10 sni any_safe_push2 diff
LLR: -0.18 (-2.94,2.94) [0.00,6.00]
Total: 892 W: 151 L: 153 D: 588
sprt @ 60+0.05 th 1 LTC: Add small bonus for all safe pawn pushes (with the KingDanger[] array initialization fix)
15-02-10 SC search_tempo_game_phase diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 9934 W: 1968 L: 2048 D: 5918
sprt @ 15+0.05 th 1 Game-phase based tempo evaluation. Probe material table instead of computing game phase every time. Same bench and speed-up 0.50% wrt the version which failed STC.
15-02-10 vin passed_blockers2 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 6149 W: 1188 L: 1278 D: 3683
sprt @ 15+0.05 th 1 Test SPSA-tuned values at STC.
15-02-10 n_p KingSafety diff
ELO: -0.46 +-2.1 (95%) LOS: 33.7%
Total: 40000 W: 7933 L: 7986 D: 24081
40000 @ 15+0.05 th 1 Test the new values on king safety from the SPSA-session.
15-02-10 vin en_passant_bonus diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 17846 W: 3620 L: 3464 D: 10762
sprt @ 15+0.05 th 1 Test idea from Lyudmil Tsvetkov, adding small bonus to 5th rank pawns versus opponent 2nd rank potential en passant pawns.
15-02-11 lbr detempletize diff
LLR: -2.98 (-2.94,2.94) [-3.00,1.00]
Total: 22505 W: 4333 L: 4535 D: 13637
sprt @ 15+0.05 th 1 see pull request. non functional change, but bench may not be representative (depends on how frequently generate_castling is called)
15-02-11 n_p KingSafety diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 46720 W: 7768 L: 7773 D: 31179
sprt @ 60+0.05 th 1 LTC: Test the new values on king safety from the SPSA-session. Does these values scale well do to tuned in LTC.
15-02-11 Fis stats_update diff
LLR: -3.38 (-2.94,2.94) [-1.50,4.50]
Total: 12402 W: 2438 L: 2526 D: 7438
sprt @ 15+0.05 th 1 Stats::update() now clamps out of bound values to Max instead of ignoring them. See issue #251. If this passes STC, LTC will be at [0,4]. Pri -3
15-02-11 jos matimb diff
ELO: -1.56 +-2.1 (95%) LOS: 7.7%
Total: 40000 W: 7880 L: 8060 D: 24060
40000 @ 15+0.05 th 1 Check the new rook values after 1st tuning session.
15-02-11 SC search_tempo_game_phase diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 7308 W: 1387 L: 1474 D: 4447
sprt @ 15+0.05 th 1 A further try on tempo evaluation. Restore feature of ignoring tempo for specialized evaluations. (I removed it to avoid increasing the code too muchl).
15-02-11 vin en_passant_bonus_spsa diff
20899/20000 iterations
40000/40000 games played
40000 @ 15+0.05 th 1 STC passed - Try a quick tune of values at STC. Could Lyudmil be onto another winner?
15-02-11 Roc LooseKnight diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 28843 W: 5732 L: 5760 D: 17351
sprt @ 15+0.05 th 1 Adding roughly tuned penalty for each loose piece (yes... including the King !) Let's see if it more than just noise.
15-02-11 sni any_safe_push2 diff
LLR: 2.97 (-2.94,2.94) [0.00,6.00]
Total: 17684 W: 3042 L: 2854 D: 11788
sprt @ 60+0.05 th 1 LTC: Add small bonus for all safe pawn pushes (with Stefan's fix for the bug in my code: now using popcount<full> instead of popcount<max15>)