Stockfish Testing Queue

Finished - 40699 tests

15-02-05 Roc LatentQAttack diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 13045 W: 2583 L: 2654 D: 7808
sprt @ 15+0.05 th 1 My last try on this idea, with last SPSA values. S(22,22) for Bishop and S(17,17) for Knight.
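The "LLR: value (lower,upper) [elo0,elo1]" lines in the sprt entries can be approximately reproduced from the W/L/D counts. The listing itself does not say how the LLR is computed, so the following is only a sketch under two assumptions: a BayesElo win/draw/loss model with the draw parameter estimated from the sample, and Wald stopping bounds with alpha = beta = 0.05 (which yields the ±2.94 shown). All function names are illustrative.

```python
import math

def bayeselo_to_proba(elo, drawelo):
    """Win/loss/draw probabilities under the (assumed) BayesElo model."""
    win = 1.0 / (1.0 + 10.0 ** ((-elo + drawelo) / 400.0))
    loss = 1.0 / (1.0 + 10.0 ** ((elo + drawelo) / 400.0))
    return win, loss, 1.0 - win - loss

def estimate_drawelo(wins, losses, draws):
    """Estimate the BayesElo draw parameter from the observed frequencies."""
    n = wins + losses + draws
    w, l = wins / n, losses / n
    return 200.0 * math.log10((1.0 - l) / l * (1.0 - w) / w)

def sprt_llr(wins, losses, draws, elo0, elo1):
    """Log-likelihood ratio of H1 (elo1) against H0 (elo0)."""
    drawelo = estimate_drawelo(wins, losses, draws)
    p0 = bayeselo_to_proba(elo0, drawelo)
    p1 = bayeselo_to_proba(elo1, drawelo)
    counts = (wins, losses, draws)
    return sum(n * math.log(b / a) for n, a, b in zip(counts, p0, p1))

def sprt_bounds(alpha=0.05, beta=0.05):
    """Wald's stopping bounds; alpha and beta are assumed, not given in the listing."""
    return math.log(beta / (1.0 - alpha)), math.log((1.0 - beta) / alpha)

# Counts and Elo bounds taken from the LatentQAttack entry above.
llr = sprt_llr(2583, 2654, 7808, elo0=-1.5, elo1=4.5)
lower, upper = sprt_bounds()
print(f"LLR: {llr:.2f} ({lower:.2f},{upper:.2f}) [-1.50,4.50]")
```

Under these assumptions the result should land close to the reported LLR for that entry. A test stops and fails once the LLR drops below the lower bound, and stops and passes once it exceeds the upper bound.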
15-02-05 SC tuned_tempo diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 8077 W: 1601 L: 1686 D: 4790
sprt @ 15+0.05 th 1 Bugfix for tempo evaluation. More tempo value in endgames.
15-02-05 SC tuned_tempo diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 12197 W: 2416 L: 2489 D: 7292
sprt @ 15+0.05 th 1 More tempo value in middlegames, bugfix
15-02-05 jos no_split_after_null diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 16620 W: 2935 L: 2997 D: 10688
sprt @ 15+0.05 th 3 Don't split after null move. This is most likely not a good place to split. Most of the time it is rejected anyway due to min split depth. This should slightly reduce the split/search overhead. My guess is that this may be of more benefit with a higher number of threads, but start testing with 3 threads.
15-02-06 lbr krkm diff
LLR: 4.36 (-2.94,2.94) [-3.50,0.50]
Total: 40050 W: 6382 L: 6249 D: 27419
sprt @ 15+0.05 th 1 are KRKm also useless?
15-02-06 lbr kpkp diff
LLR: 3.04 (-2.94,2.94) [-3.50,0.50]
Total: 50933 W: 8035 L: 7995 D: 34903
sprt @ 15+0.05 th 1 is KPKP useless?
15-02-06 uri not_prune_high diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 17122 W: 3333 L: 3393 D: 10396
sprt @ 15+0.05 th 1 Usually does not change the search, but avoids stupid pruning when the score is a losing score. Fixes the problem of stupid scores at small depths in r1q1nr1k/pp1b2b1/n2p2pp/2pP1p2/2B4B/3Q1N1P/PPP1NPP1/1R3RK1 b - - 0 12
15-02-06 jos matimb diff
ELO: 0.22 +-2.1 (95%) LOS: 57.8%
Total: 40000 W: 8007 L: 7982 D: 24011
40000 @ 15+0.05 th 1 Check the new values after 3rd SPSA session.
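For the fixed-length tests, the "ELO: x +-y (95%) LOS: z%" line can be approximated from the W/L/D counts. This is a sketch rather than the queue's own code. It assumes a logistic Elo conversion of the mean score, a normal-approximation 95% interval, and the common LOS approximation based on (W - L) / sqrt(W + L). The function name `elo_and_los` is made up for illustration.

```python
import math

def elo_and_los(wins, losses, draws):
    """Return (elo, 95% margin in Elo, likelihood of superiority)."""
    n = wins + losses + draws
    score = (wins + 0.5 * draws) / n

    def to_elo(s):
        # Logistic Elo difference implied by an expected score s.
        return 400.0 * math.log10(s / (1.0 - s))

    # Per-game variance of the result (win = 1, draw = 0.5, loss = 0).
    var = (wins * (1.0 - score) ** 2
           + losses * score ** 2
           + draws * (0.5 - score) ** 2) / n
    half_width = 1.96 * math.sqrt(var / n)  # 95% interval on the score
    margin = (to_elo(score + half_width) - to_elo(score - half_width)) / 2.0

    # LOS: probability that the tested version is stronger (draws ignored).
    z = (wins - losses) / math.sqrt(wins + losses)
    los = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    return to_elo(score), margin, los

# Counts from the matimb entry above.
elo, margin, los = elo_and_los(8007, 7982, 24011)
print(f"ELO: {elo:.2f} +-{margin:.1f} (95%) LOS: {los * 100:.1f}%")
```

The Elo and LOS values should agree closely with the reported line. The error margin depends on the exact interval method and rounding, so it may differ in the last digit.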
15-02-06 n_p SPSAKingSafety3 diff
45052/50000 iterations
99879/100000 games played
100000 @ 60+0.05 th 1 Another SPSA-session on king safety.
15-02-06 jos matimb diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 90570 W: 15049 L: 14915 D: 60606
sprt @ 60+0.05 th 1 LTC: values after 2nd SPSA session. (If these pass, I will test the final values of the 3rd session against the new master.)
15-02-06 sg pawn_attack_threat3 diff
ELO: 1.34 +-3.0 (95%) LOS: 80.7%
Total: 20000 W: 3989 L: 3912 D: 12099
20000 @ 15+0.05 th 1 Allow queen as defender. The test of sn which allows all pieces as defenders passed STC, but struggles with LTC. So let's measure the effect for each piece type separately.
15-02-06 sg pawn_attack_threat3 diff
ELO: 0.28 +-3.0 (95%) LOS: 57.1%
Total: 20000 W: 4006 L: 3990 D: 12004
20000 @ 15+0.05 th 1 Allow rook as defender
15-02-06 sg pawn_attack_threat3 diff
ELO: 0.12 +-3.0 (95%) LOS: 53.1%
Total: 20000 W: 3933 L: 3926 D: 12141
20000 @ 15+0.05 th 1 Allow bishop as defender
15-02-06 sg pawn_attack_threat3 diff
ELO: 2.69 +-3.0 (95%) LOS: 95.9%
Total: 20000 W: 4072 L: 3917 D: 12011
20000 @ 15+0.05 th 1 Allow knight as defender
15-02-06 sg pawn_attack_threat3 diff
ELO: 1.73 +-3.1 (95%) LOS: 86.2%
Total: 19246 W: 3921 L: 3825 D: 11500
20000 @ 15+0.05 th 1 Allow king as defender
15-02-07 lbr useless diff
LLR: -2.96 (-2.94,2.94) [-3.50,0.50]
Total: 54033 W: 8354 L: 8630 D: 37049
sprt @ 15+0.05 th 1 is a combo of useless endgames still useless?
15-02-07 mco a7592e69d728ac839f098f2 diff
LLR: 3.32 (-2.94,2.94) [-3.00,1.00]
Total: 146680 W: 23253 L: 23307 D: 100120
sprt @ 15+0.05 th 1 Verify that KQKRPs does not regress (8moves book to steer towards endgames)
15-02-07 vin passed_blockers2_spsa diff
12234/12500 iterations
25000/25000 games played
25000 @ 15+0.05 th 1 Previous tuning run showed a trend in the rook, so try a second run expanded to include the minor pieces and the endgame. 25K games to find any trend (e.g. *should* the minor pieces be included or was my original suspicion that no bonus is needed correct)
15-02-07 vin pawns_both_wings diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 22251 W: 4488 L: 4533 D: 13230
sprt @ 15+0.05 th 1 Try an idea of Mindbreaker's - that the side ahead in an endgame (or approaching it) should try to preserve its pawns on both wings. The resulting eval is still symmetric. Two approaches seem possible - direct eval bonus or scaling factor adjust. Take 1 is the direct approach.
15-02-07 jos matimb diff
ELO: -0.57 +-3.0 (95%) LOS: 35.6%
Total: 20000 W: 3984 L: 4017 D: 11999
20000 @ 15+0.05 th 1 Pit the final values of 3rd tuning session against those of the 2nd one.
15-02-07 sg pawn_attack_threat3 diff
ELO: -0.81 +-2.6 (95%) LOS: 27.2%
Total: 27059 W: 5374 L: 5437 D: 16248
30000 @ 15+0.05 th 1 Allow knight, king and queen as defenders. Combine the pieces which show some Elo gain and measure whether this adds up.
15-02-07 vin pawns_both_wings_alt diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 16207 W: 3224 L: 3286 D: 9697
sprt @ 15+0.05 th 1 Try an idea of Mindbreaker's - that the side ahead in an endgame (or approaching it) should try to preserve its pawns on both wings. The resulting eval is still symmetric. Take 2 is the scale factor approach, which is cleaner, so it would be nice if this worked rather than take 1.
15-02-07 sg pawn_attack_threat3 diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 28215 W: 5739 L: 5767 D: 16709
sprt @ 15+0.05 th 1 Allow knight as defender. Retest with SPRT to check for luck in first run
15-02-07 vin passed_blockers2_spsa diff
39792/40000 iterations
81000/80000 games played
80000 @ 15+0.05 th 1 The trial run showed continued movement for rook and knight in particular, so go for a longer run with the previous values as the starting point. Upwards movement for the heavy piece endgame scores hints that this is still a factor there. Also switch to a much nicer table-driven way of expressing the weighting.
15-02-07 SC search_tempo_game_phase diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 19773 W: 3911 L: 3964 D: 11898
sprt @ 15+0.05 th 1 Game phase based tempo value (faster version), take 1. More tempo value for endgames.
15-02-07 SC search_tempo_game_phase diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 17531 W: 3508 L: 3567 D: 10456
sprt @ 15+0.05 th 1 Game phase based tempo value (faster version), take 2. More tempo value for middlegame.
15-02-07 SC search_tempo_king_dista diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 23776 W: 4839 L: 4880 D: 14057
sprt @ 15+0.05 th 1 King distance based tempo value. More tempo if kings are distant.
15-02-07 sg pawn_attack_threat_see diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 23696 W: 4778 L: 4819 D: 14099
sprt @ 15+0.05 th 1 My current implementation detects safe pawn pushes as cheaply as possible, so many cases are not covered. Now try a safety calculation with SEE for the remaining pushes.
15-02-07 jki pdouble diff
LLR: 3.14 (-2.94,2.94) [-1.50,4.50]
Total: 15974 W: 3291 L: 3133 D: 9550
sprt @ 15+0.05 th 1 Bonus for pawn double attack controlling a central square.
15-02-07 jki pdoublesafety diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 6428 W: 1264 L: 1354 D: 3810
sprt @ 15+0.05 th 1 Pawn double attack and king safety try
15-02-07 sni pawn_attack_threat5 diff
3538/3750 iterations
7102/7500 games played
7500 @ 60+0.05 th 1 SPSA tuning of connected pawns values
15-02-07 jki pdouble diff
LLR: 3.24 (-2.94,2.94) [0.00,6.00]
Total: 10449 W: 1837 L: 1674 D: 6938
sprt @ 60+0.05 th 1 LTC: Bonus for pawn double attack controlling a central square.
15-02-08 sg backward_pawn diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 11104 W: 2202 L: 2278 D: 6624
sprt @ 15+0.05 th 1 Double the penalty if a backward pawn is stopped by a pawn double attack
15-02-08 jki pdouble_cf diff
LLR: 3.68 (-2.94,2.94) [-1.50,4.50]
Total: 16176 W: 3290 L: 3113 D: 9773
sprt @ 15+0.05 th 1 Pawn bind bonus also for c- and f-files
15-02-08 jki pdouble2 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 3977 W: 741 L: 837 D: 2399
sprt @ 15+0.05 th 1 Center bind bonus x2
15-02-08 jki pdouble_end diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 25443 W: 5028 L: 5065 D: 15350
sprt @ 15+0.05 th 1 Bind bonus also for endgames.
15-02-08 sg outposts_double diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 12099 W: 2426 L: 2500 D: 7173
sprt @ 15+0.05 th 1 Add 50% more bonus if outpost is defended by two pawns.
15-02-08 sg backward_pawn diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 23324 W: 4585 L: 4628 D: 14111
sprt @ 15+0.05 th 1 Add 50% penalty if backward pawn is stopped by a pawn double attack (Take 2)
15-02-08 sg isolated_pawn diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 13095 W: 2575 L: 2646 D: 7874
sprt @ 15+0.05 th 1 Fix error: add 50% penalty for isolated pawn which is stopped by a pawn double attack
15-02-08 SC search_tempo_king_dista diff
28685/30000 iterations
60000/60000 games played
60000 @ 15+0.05 th 1 Tune king distance based tempo.
15-02-08 SC search_tempo_game_phase diff
19566/20000 iterations
40000/40000 games played
40000 @ 15+0.05 th 1 Tune game phase based tempo evaluation.
15-02-08 jki pdouble_cf diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 13471 W: 2201 L: 2312 D: 8958
sprt @ 60+0.05 th 1 LTC: Pawn bind bonus also for c- and f-files
15-02-08 Roc LooseKnight diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 56073 W: 11200 L: 11152 D: 33721
sprt @ 15+0.05 th 1 Small penalty for loose knight.
15-02-09 sni connected_pawns diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 19367 W: 3863 L: 3916 D: 11588
sprt @ 15+0.05 th 1 Half values for connected pawns in endgame
15-02-09 sni any_safe_push diff
LLR: 2.70 (-2.94,2.94) [-1.50,4.50]
Total: 18233 W: 3705 L: 3557 D: 10971
sprt @ 15+0.05 th 1 Add small bonus for all safe pawn pushes
15-02-09 jki master diff
ELO: 7.50 +-2.0 (95%) LOS: 100.0%
Total: 40000 W: 7286 L: 6423 D: 26291
40000 @ 60+0.05 th 1 First regression test vs. SF6, Pawn Center Bind Bonus (e1185700), using 2moves_v1.pgn
15-02-09 Roc LooseKnight diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 25295 W: 5077 L: 4901 D: 15317
sprt @ 15+0.05 th 1 With S(20,0)
15-02-09 SC search_tempo_game_phase diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 22535 W: 4558 L: 4602 D: 13375
sprt @ 15+0.05 th 1 Tuned value for game phase based tempo evaluation.
15-02-09 SC search_tempo_king_dista diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 6997 W: 1313 L: 1400 D: 4284
sprt @ 15+0.05 th 1 Tuned values for king distance based tempo.
15-02-09 vin passed_blockers2_spsa diff
19390/20000 iterations
40000/40000 games played
40000 @ 15+0.05 th 1 Some of the parameters have now converged but not all, so one final run with the King included and then we re-test. -1 priority.