Stockfish Testing Queue

Pending - 0 tests 0.0 hrs

None

Active - 0 tests

Finished - 97 tests

18-04-08 lan SimpleArctan diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 11947 W: 2307 L: 2381 D: 7259
sprt @ 10+0.1 th 1 Replace the expensive atan with the cheaper x/(1+|x|)
18-03-24 lan nullmove_mobility diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 2773 W: 517 L: 633 D: 1623
sprt @ 10+0.1 th 1 Make null-move pruning a linear function of moveCount (mobility) Author: Dann Corbit, Take 2
18-03-24 lan nullmove_mobility diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 1028 W: 134 L: 255 D: 639
sprt @ 10+0.1 th 1 Make null-move pruning a linear function of moveCount (mobility) Author: Dann Corbit
18-03-13 lan dynamicContempt diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 19654 W: 3879 L: 4076 D: 11699
sprt @ 10+0.1 th 1 Decreasing contempt. Tested locally 197 - 190 - 613 [0.503] 1000 games Elo difference: 2.43 +/- 13.38 SPRT: llr 0.104, lbound -2.94, ubound 2.94
18-03-12 lan dynamicContempt diff
ELO: -1.15 +-2.0 (95%) LOS: 12.5%
Total: 50000 W: 10236 L: 10401 D: 29363
50000 @ 10+0.1 th 1 See how inbuilt offset fares against the one adjusted with the contempt option
18-03-10 lan dynamicContempt diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 3197 W: 614 L: 784 D: 1799
sprt @ 10+0.1 th 1 Tuned values for non-regression. If passes then test against SF8 with fixed number of games
18-03-10 lan tuning_branch diff
14744/25000 iterations
31039/50000 games played
50000 @ 10+0.1 th 1 Tuning the formula for dynamic contempt
18-03-08 lan dynamicContempt diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 13183 W: 2473 L: 2656 D: 8054
sprt @ 10+0.1 th 1 Trying a simple modification of dynamic contempt for non-regression against master
17-11-18 lan kingdanger_extend diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 5659 W: 1021 L: 1123 D: 3515
sprt @ 10+0.1 th 1 Extending some KingDanger definitions. Suggested by Mindbreaker
17-10-31 lan safecheck_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 18558 W: 3277 L: 3370 D: 11911
sprt @ 10+0.1 th 1 Locally tuned safe checks
17-10-25 lan safecheck_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 714 W: 86 L: 246 D: 382
sprt @ 10+0.1 th 1 Testing results of local tuning with a bespoke tuner. Local testing showed +88 Elo STC (+373 -125 =502)
16-11-05 lan numasf diff
ELO: 2.96 +-1.9 (95%) LOS: 99.9%
Total: 40000 W: 6205 L: 5864 D: 27931
40000 @ 10+0.1 th 3 Testing Thomas Zipproth's implementation of numa on a 3 threads. There was no regression on 1 thread.
16-11-04 lan numasf diff
ELO: -0.09 +-2.0 (95%) LOS: 46.6%
Total: 40000 W: 6997 L: 7007 D: 25996
40000 @ 10+0.1 th 1 Testing Thomas Zipproth's implementation of numa on a single thread. If there is no regression we may commit on testing with big machines. Previous test was erroneous. Thanks, Ajith and Joerg for noticing
16-11-02 lan up_kingdanger diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 4216 W: 716 L: 824 D: 2676
sprt @ 10+0.1 th 1 Increase King evaluation by 25% as suggested by Michael B. at Talkchess
16-10-08 lan advanced_pawn_push diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 6232 W: 1117 L: 1216 D: 3899
sprt @ 10+0.1 th 1 Tweak advanced_pawn_push
16-08-22 lan 3-fold_repetition diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 1506 W: 227 L: 347 D: 932
sprt @ 10+0.1 th 1 Bug fix for the 3-fold repetition draw. Test as a Elo-gaining patch
16-08-19 lan repetition_draw diff
LLR: -0.15 (-2.94,2.94) [0.00,5.00]
Total: 3327 W: 620 L: 612 D: 2095
sprt @ 10+0.1 th 1 Another take on repetition draw with larger evaluation (#7 of Stephane Nicolet)
16-08-18 lan Zobrist_side diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 19095 W: 3507 L: 3550 D: 12038
sprt @ 10+0.1 th 1 Deterministic Zobrist side key (suggestion #11 by Stephane Nicolet)
16-08-17 lan kt_factor diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 16015 W: 2125 L: 2230 D: 11660
sprt @ 60+0.6 th 1 LTC: Change factor in kingtropism from 7 to 6
16-08-16 lan kt_factor diff
LLR: 2.95 (-2.94,2.94) [0.00,4.00]
Total: 21308 W: 4085 L: 3853 D: 13370
sprt @ 10+0.1 th 1 Change factor in kingtropism from 7 to 6. Suggested by Stephane Nicolet
16-04-21 lan pp_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 12667 W: 2233 L: 2346 D: 8088
sprt @ 10+0.1 th 1 New local tuning with bespoke tuner 2469 iter / 9876 games until obj. func. < 0.08. Tested 1000 games: +228 -207 =565. Both tuning and testing with TC 10+0.1 s.
16-04-12 lan pp_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 11051 W: 1998 L: 2117 D: 6936
sprt @ 10+0.1 th 1 Locally tuned with further modified tuner and TC 10+0.1 s. Tuning stopped prematurely because of power break at iter 646 / game 2584 with objective function 0.378. Local 1000-game testing at 10+0.1: +223 -220 =557. See if such small number of iterations has any effect.
16-04-08 lan pp_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 39658 W: 7357 L: 7376 D: 24925
sprt @ 10+0.1 th 1 Tuned locally with a bespoke tuner. Stop criterion: loss function < 0.08 (2442 iter / 9768 games). Tested for 1000 games (10+0.1 s): +230 -183 =587
16-03-13 lan queen_values diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 25953 W: 4627 L: 4695 D: 16631
sprt @ 10+0.1 th 1 Checking queen values (QueenValueMg and QueenValueEg) locally tuned with ASP, see Fishcooking topic Re:SPSA
16-02-23 lan nullmove_ext diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 4087 W: 694 L: 802 D: 2591
sprt @ 10+0.1 th 1 Extend nullmove search in relation to maximal search depth. Solves the mate in 15: 8/8/8/2p5/1pp5/brpp4/1pprp2P/qnkbK -- white to move
16-02-14 lan noisy_DOE diff
24578/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Testing with the found by the first DOE run a=57.707 and c=231.498 with A=5000
16-02-14 lan noisy_DOE diff
24719/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Testing with the found by the first DOE run a=57.707 and c=231.498
16-02-13 lan noisy_DOE diff
24443/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Take 8: c=150, R=0.0044
16-02-13 lan noisy_DOE diff
24394/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Take 4: c=300, R=0.00055
16-02-13 lan noisy_DOE diff
24407/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Take 6: c=1, R=50
16-02-13 lan noisy_DOE diff
24299/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Take 9: c=150, R=0.000044
16-02-13 lan noisy_DOE diff
24106/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Take 7: c=1, R=1
16-02-13 lan noisy_DOE diff
24454/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Take 5: c=150, R=0.0022
16-01-31 lan loss_measurement diff
ELO: 1.25 +-4.2 (95%) LOS: 71.9%
Total: 10000 W: 1938 L: 1902 D: 6160
10000 @ 10+0.1 th 1 Measurement of the loss function with QueenValueMg=2300
16-01-31 lan loss_measurement diff
ELO: -28.27 +-6.3 (95%) LOS: 0.0%
Total: 5000 W: 859 L: 1265 D: 2876
5000 @ 10+0.1 th 1 Measurement of the loss function with QueenValueMg=1700
16-01-31 lan loss_measurement diff
ELO: -145.05 +-7.5 (95%) LOS: 0.0%
Total: 5000 W: 484 L: 2458 D: 2058
5000 @ 10+0.1 th 1 Measurement of the loss function with QueenValueMg=5000
16-01-31 lan loss_measurement diff
ELO: -3.37 +-4.2 (95%) LOS: 5.6%
Total: 10000 W: 1818 L: 1915 D: 6267
10000 @ 10+0.1 th 1 Measurement of the loss function with QueenValueMg=2700
16-01-31 lan loss_measurement diff
ELO: -263.65 +-6.9 (95%) LOS: 0.0%
Total: 10000 W: 615 L: 7019 D: 2366
10000 @ 10+0.1 th 1 Measurement of the loss function with QueenValueMg=500
16-01-30 lan loss_measurement diff
ELO: -42.78 +-4.5 (95%) LOS: 0.0%
Total: 10000 W: 1609 L: 2834 D: 5557
10000 @ 10+0.1 th 1 Measurement of the loss function with QueenValueMg=3500
16-01-30 lan loss_measurement diff
ELO: -701.75 +-29.6 (95%) LOS: 0.0%
Total: 5000 W: 24 L: 4851 D: 125
10000 @ 10+0.1 th 1 Measurement of the loss function with QueenValueMg = -500
16-01-16 lan KRPPKRPScaleFactors diff
19788/20000 iterations
40000/40000 games played
40000 @ 20+0.2 th 1 Tune the scale factors for KRPPKRP endgames. Do ranks 7 and 8 really scale as 0 ?
15-09-12 lan OutpostCB diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 21954 W: 4095 L: 4125 D: 13734
sprt @ 15+0.05 th 1 Take 2. Fixed condition and bonus
15-09-12 lan OutpostCB diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 8795 W: 1633 L: 1721 D: 5441
sprt @ 15+0.05 th 1 Outpost on centerbinds(Bonus for outpost supported by centerbinds)
15-05-04 lan half_rook-psqt diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 11602 W: 2180 L: 2297 D: 7125
sprt @ 15+0.05 th 1 Updated hand-tuned rook psqt (half-table)
14-06-10 lan material_imbalance_Jark diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 12493 W: 2032 L: 2105 D: 8356
sprt @ 15+0.05 th 1 Material imbalance: Q vs 3 minors by Jarkko Pesonen
15-02-20 lan end_double_penalty diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 17917 W: 3492 L: 3550 D: 10875
sprt @ 15+0.05 th 1 Increase penalty for doubled pawns only on H file. An idea of Lyudmil Tsvetkov.
15-03-14 lan rook_on_7th_rank diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 69963 W: 13460 L: 13379 D: 43124
sprt @ 15+0.05 th 1 Add a bonus for our rook on rank 7 and enemy king on rank 8. Suggested by 'dragon'
15-03-16 lan bishop_colored_pawns diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 8294 W: 1523 L: 1607 D: 5164
sprt @ 15+0.05 th 1 Penalty for pawns of the same color as bishop. Suggested by 'dragon'
15-03-18 lan bishop_colored_pawns diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 7825 W: 1468 L: 1553 D: 4804
sprt @ 15+0.05 th 1 Take 2. Penalties for pawns on the same-colored square as bishop.
15-03-19 lan hanging_up diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 21816 W: 4174 L: 4255 D: 13387
sprt @ 15+0.05 th 1 Increase the mg value of hanging