Stockfish Testing Queue

Pending - 0 tests 0.0 hrs

None

Active - 0 tests

Finished - 119 tests

19-01-13 lan assorted diff
LLR: -2.95 (-2.94,2.94) [0.50,4.50]
Total: 7610 W: 1656 L: 1783 D: 4171
sprt @ 10+0.1 th 1 Tuned assorted (again). +3 Elo from 1000 local games
19-01-02 lan imbalance diff
LLR: -2.94 (-2.94,2.94) [0.00,4.00]
Total: 35152 W: 7586 L: 7612 D: 19954
sprt @ 10+0.1 th 1 Tuned imbalance tables (again)
18-12-27 lan assorted diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 4770 W: 968 L: 1113 D: 2689
sprt @ 10+0.1 th 1 Tuned assorted bonuses and penalties
18-12-25 lan imbalance diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 20134 W: 4381 L: 4466 D: 11287
sprt @ 10+0.1 th 1 Tuned imbalance
18-12-25 lan imbalance diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 9088 W: 1901 L: 2029 D: 5158
sprt @ 10+0.1 th 1 Tuned imbalance, another candidate
18-12-03 lan imbalance diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 33118 W: 7186 L: 7220 D: 18712
sprt @ 10+0.1 th 1 Tuned imbalance tables, 2nd try
18-11-30 lan assorted diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 2984 W: 625 L: 780 D: 1579
sprt @ 10+0.1 th 1 Locally +11 Elo in 1000 games. Let's see if this pans out
18-11-19 lan safechecks diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 47335 W: 10307 L: 10285 D: 26743
sprt @ 10+0.1 th 1 Post hover-around-optimum tuning
18-11-16 lan safechecks diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 32570 W: 6916 L: 6953 D: 18701
sprt @ 10+0.1 th 1 Safechecks quickie
18-11-16 lan assorted diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 14037 W: 2997 L: 3106 D: 7934
sprt @ 10+0.1 th 1 Tuned assorted bonuses and penalties, 2nd tuning run
18-11-14 lan safechecks diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 8307 W: 1702 L: 1833 D: 4772
sprt @ 10+0.1 th 1 Safechecks tuned (?)
18-11-12 lan safechecks diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 9864 W: 2055 L: 2180 D: 5629
sprt @ 10+0.1 th 1 Safechecks after a new tuning run
18-11-05 lan assorted diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 5573 W: 1177 L: 1320 D: 3076
sprt @ 10+0.1 th 1 Tuned assorted bonuses and penalties
18-11-04 lan KingAttackWeights diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 12208 W: 2578 L: 2694 D: 6936
sprt @ 10+0.1 th 1 Tuned KingAttackWeights. Rook weight increased. Forum topic https://groups.google.com/forum/?fromgroups=#!topic/fishcooking/R0ZzvPt81hA
18-11-03 lan imbalance diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 19912 W: 4267 L: 4353 D: 11292
sprt @ 10+0.1 th 1 Tuned imbalance tables (+0.5 Elo?)
18-11-01 lan safechecks diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 103171 W: 22161 L: 21925 D: 59085
sprt @ 10+0.1 th 1 Yet another set of tuned safechecks
18-10-31 lan safechecks diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 50320 W: 10849 L: 10817 D: 28654
sprt @ 10+0.1 th 1 Another attempt at safechecks tuning
18-10-09 lan razor_margin diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 29639 W: 4733 L: 4794 D: 20112
sprt @ 60+0.6 th 1 LTC Tuned only razor margin
18-10-09 lan razor_margin diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 1330 W: 211 L: 369 D: 750
sprt @ 10+0.1 th 1 Tuned stats and history bonus
18-10-09 lan razor_margin diff
LLR: 2.96 (-2.94,2.94) [0.00,4.00]
Total: 58753 W: 12658 L: 12269 D: 33826
sprt @ 10+0.1 th 1 Tuned only razor margin
18-10-08 lan razor_margin diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 8611 W: 1784 L: 1914 D: 4913
sprt @ 10+0.1 th 1 Tuned razor and futility margins. The second parameter in futility tended to 0 after much tuning and that's why it's omitted. SPRT limits set to tuning rather than simplification because I want a real Elo gain here
18-10-01 lan psqt diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 48784 W: 10350 L: 10325 D: 28109
sprt @ 10+0.1 th 1 Tuned King PSQT
18-04-08 lan SimpleArctan diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 11947 W: 2307 L: 2381 D: 7259
sprt @ 10+0.1 th 1 Replace the expensive atan with the cheaper x/(1+|x|)
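The substitution named in this entry can be sketched as follows. This is a minimal illustration, not the patch itself: the function names are hypothetical, and the 2/pi normalisation of atan is an assumption made only so that both curves share the range (-1, 1).

```python
import math

def scaled_atan(x: float) -> float:
    """atan rescaled to (-1, 1); the 2/pi factor is an assumption for comparison."""
    return (2.0 / math.pi) * math.atan(x)

def soft_sign(x: float) -> float:
    """The cheaper replacement x / (1 + |x|), also bounded by (-1, 1)."""
    return x / (1.0 + abs(x))

if __name__ == "__main__":
    for x in (-4.0, -1.0, 0.0, 0.5, 1.0, 4.0):
        print(f"x={x:5.1f}  atan={scaled_atan(x):+.3f}  soft_sign={soft_sign(x):+.3f}")
```

Both functions are odd, monotonic and saturate at +/-1, but the second needs only an absolute value, an addition and a division. Their slopes at zero differ, so any constants around the call would probably need retuning after the swap.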
18-03-24 lan nullmove_mobility diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 2773 W: 517 L: 633 D: 1623
sprt @ 10+0.1 th 1 Make null-move pruning a linear function of moveCount (mobility). Author: Dann Corbit. Take 2
18-03-24 lan nullmove_mobility diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 1028 W: 134 L: 255 D: 639
sprt @ 10+0.1 th 1 Make null-move pruning a linear function of moveCount (mobility). Author: Dann Corbit
18-03-13 lan dynamicContempt diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 19654 W: 3879 L: 4076 D: 11699
sprt @ 10+0.1 th 1 Decreasing contempt. Tested locally: 197 - 190 - 613 [0.503], 1000 games. Elo difference: 2.43 +/- 13.38. SPRT: llr 0.104, lbound -2.94, ubound 2.94
18-03-12 lan dynamicContempt diff
ELO: -1.15 +-2.0 (95%) LOS: 12.5%
Total: 50000 W: 10236 L: 10401 D: 29363
50000 @ 10+0.1 th 1 See how inbuilt offset fares against the one adjusted with the contempt option
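The ELO, error margin and LOS figures of a fixed-games entry like this one can be approximated from its W/L/D totals. The sketch below uses the standard logistic Elo formula and a normal approximation; the function names are illustrative and this is not necessarily the exact code the framework runs.

```python
import math

def elo_from_score(score: float) -> float:
    """Logistic Elo difference implied by an expected score in (0, 1)."""
    return -400.0 * math.log10(1.0 / score - 1.0)

def summarize(wins: int, losses: int, draws: int):
    """Return (elo, 95% margin, LOS) for a fixed-games match result."""
    n = wins + losses + draws
    score = (wins + 0.5 * draws) / n
    # Per-game variance of the score, with win=1, draw=0.5, loss=0.
    var = (wins + 0.25 * draws) / n - score * score
    half_width = 1.96 * math.sqrt(var / n)
    margin = (elo_from_score(score + half_width)
              - elo_from_score(score - half_width)) / 2.0
    # Likelihood of superiority: normal approximation that ignores draws.
    los = 0.5 * (1.0 + math.erf((wins - losses) / math.sqrt(2.0 * (wins + losses))))
    return elo_from_score(score), margin, los

if __name__ == "__main__":
    elo, margin, los = summarize(10236, 10401, 29363)
    print(f"ELO: {elo:.2f} +-{margin:.1f} (95%) LOS: {los:.1%}")
```

Fed the totals of this entry (W: 10236 L: 10401 D: 29363), the sketch lands on roughly the same figures as the ELO line above.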
18-03-10 lan dynamicContempt diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 3197 W: 614 L: 784 D: 1799
sprt @ 10+0.1 th 1 Tuned values for non-regression. If passes then test against SF8 with fixed number of games
18-03-10 lan tuning_branch diff
14744/25000 iterations
31039/50000 games played
50000 @ 10+0.1 th 1 Tuning the formula for dynamic contempt
18-03-08 lan dynamicContempt diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 13183 W: 2473 L: 2656 D: 8054
sprt @ 10+0.1 th 1 Trying a simple modification of dynamic contempt for non-regression against master
17-11-18 lan kingdanger_extend diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 5659 W: 1021 L: 1123 D: 3515
sprt @ 10+0.1 th 1 Extending some KingDanger definitions. Suggested by Mindbreaker
17-10-31 lan safecheck_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 18558 W: 3277 L: 3370 D: 11911
sprt @ 10+0.1 th 1 Locally tuned safe checks
17-10-25 lan safecheck_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 714 W: 86 L: 246 D: 382
sprt @ 10+0.1 th 1 Testing results of local tuning with a bespoke tuner. Local testing showed +88 Elo STC (+373 -125 =502)
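The LLR line of an sprt entry can be approximated with a BayesElo-based log-likelihood ratio. The sketch below is an assumption about how such a value is obtained, not the framework's actual source: the function names are illustrative, the bracketed pair is read as [elo0, elo1] in BayesElo units, and alpha = beta = 0.05 is inferred from the printed bounds (-2.94,2.94).

```python
import math

def bayeselo_to_probs(elo: float, drawelo: float):
    """Win/draw/loss probabilities under the BayesElo model."""
    win = 1.0 / (1.0 + 10.0 ** ((-elo + drawelo) / 400.0))
    loss = 1.0 / (1.0 + 10.0 ** ((elo + drawelo) / 400.0))
    return win, 1.0 - win - loss, loss

def sprt_llr(wins: int, losses: int, draws: int, elo0: float, elo1: float) -> float:
    """LLR of H1 (elo1) against H0 (elo0); needs wins, losses and draws all > 0."""
    n = wins + losses + draws
    w, l = wins / n, losses / n
    # Estimate drawelo from the sample itself.
    drawelo = 200.0 * math.log10((1.0 - l) / l * (1.0 - w) / w)
    w0, d0, l0 = bayeselo_to_probs(elo0, drawelo)
    w1, d1, l1 = bayeselo_to_probs(elo1, drawelo)
    return (wins * math.log(w1 / w0)
            + draws * math.log(d1 / d0)
            + losses * math.log(l1 / l0))

def sprt_bounds(alpha: float = 0.05, beta: float = 0.05):
    """Wald's stopping bounds (lower, upper) for the LLR."""
    return math.log(beta / (1.0 - alpha)), math.log((1.0 - beta) / alpha)

if __name__ == "__main__":
    llr = sprt_llr(86, 246, 382, elo0=0.0, elo1=4.0)
    lower, upper = sprt_bounds()
    print(f"LLR: {llr:.2f} ({lower:.2f},{upper:.2f})")
```

With this entry's totals (W: 86 L: 246 D: 382) and bounds [0.00,4.00], the result should come out close to the printed -2.95. A test stops as soon as the LLR crosses either bound, which is why nearly every finished sprt entry shows a value just past -2.94 or 2.94.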
16-11-05 lan numasf diff
ELO: 2.96 +-1.9 (95%) LOS: 99.9%
Total: 40000 W: 6205 L: 5864 D: 27931
40000 @ 10+0.1 th 3 Testing Thomas Zipproth's implementation of numa on 3 threads. There was no regression on 1 thread.
16-11-04 lan numasf diff
ELO: -0.09 +-2.0 (95%) LOS: 46.6%
Total: 40000 W: 6997 L: 7007 D: 25996
40000 @ 10+0.1 th 1 Testing Thomas Zipproth's implementation of numa on a single thread. If there is no regression we may commit on testing with big machines. Previous test was erroneous. Thanks, Ajith and Joerg, for noticing
16-11-02 lan up_kingdanger diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 4216 W: 716 L: 824 D: 2676
sprt @ 10+0.1 th 1 Increase King evaluation by 25% as suggested by Michael B. at Talkchess
16-10-08 lan advanced_pawn_push diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 6232 W: 1117 L: 1216 D: 3899
sprt @ 10+0.1 th 1 Tweak advanced_pawn_push
16-08-22 lan 3-fold_repetition diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 1506 W: 227 L: 347 D: 932
sprt @ 10+0.1 th 1 Bug fix for the 3-fold repetition draw. Test as an Elo-gaining patch
16-08-19 lan repetition_draw diff
LLR: -0.15 (-2.94,2.94) [0.00,5.00]
Total: 3327 W: 620 L: 612 D: 2095
sprt @ 10+0.1 th 1 Another take on repetition draw with larger evaluation (#7 of Stephane Nicolet)
16-08-18 lan Zobrist_side diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 19095 W: 3507 L: 3550 D: 12038
sprt @ 10+0.1 th 1 Deterministic Zobrist side key (suggestion #11 by Stephane Nicolet)
16-08-17 lan kt_factor diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 16015 W: 2125 L: 2230 D: 11660
sprt @ 60+0.6 th 1 LTC: Change factor in kingtropism from 7 to 6
16-08-16 lan kt_factor diff
LLR: 2.95 (-2.94,2.94) [0.00,4.00]
Total: 21308 W: 4085 L: 3853 D: 13370
sprt @ 10+0.1 th 1 Change factor in kingtropism from 7 to 6. Suggested by Stephane Nicolet
16-04-21 lan pp_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 12667 W: 2233 L: 2346 D: 8088
sprt @ 10+0.1 th 1 New local tuning with bespoke tuner 2469 iter / 9876 games until obj. func. < 0.08. Tested 1000 games: +228 -207 =565. Both tuning and testing with TC 10+0.1 s.
16-04-12 lan pp_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 11051 W: 1998 L: 2117 D: 6936
sprt @ 10+0.1 th 1 Locally tuned with a further modified tuner and TC 10+0.1 s. Tuning stopped prematurely because of a power outage at iter 646 / game 2584 with objective function 0.378. Local 1000-game testing at 10+0.1: +223 -220 =557. See if such a small number of iterations has any effect.
16-04-08 lan pp_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 39658 W: 7357 L: 7376 D: 24925
sprt @ 10+0.1 th 1 Tuned locally with a bespoke tuner. Stop criterion: loss function < 0.08 (2442 iter / 9768 games). Tested for 1000 games (10+0.1 s): +230 -183 =587
16-03-13 lan queen_values diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 25953 W: 4627 L: 4695 D: 16631
sprt @ 10+0.1 th 1 Checking queen values (QueenValueMg and QueenValueEg) locally tuned with ASP, see Fishcooking topic Re:SPSA
16-02-23 lan nullmove_ext diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 4087 W: 694 L: 802 D: 2591
sprt @ 10+0.1 th 1 Extend nullmove search in relation to maximal search depth. Solves the mate in 15: 8/8/8/2p5/1pp5/brpp4/1pprp2P/qnkbK -- white to move
16-02-14 lan noisy_DOE diff
24578/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Testing with the values found by the first DOE run: a=57.707 and c=231.498, with A=5000
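If a, c and A here are the usual SPSA gain parameters (an assumption; the entry does not say so), they would shape the step and perturbation sizes roughly as sketched below. The decay exponents alpha=0.602 and gamma=0.101 are the commonly recommended SPSA defaults, not values taken from this listing, and the function name is hypothetical.

```python
def spsa_gains(k: int, a: float, c: float, A: float,
               alpha: float = 0.602, gamma: float = 0.101):
    """Step size a_k and perturbation size c_k at iteration k (k = 0, 1, ...)."""
    a_k = a / (A + k + 1) ** alpha   # A damps the early steps
    c_k = c / (k + 1) ** gamma       # perturbation shrinks slowly
    return a_k, c_k

if __name__ == "__main__":
    for k in (0, 1000, 24999):
        a_k, c_k = spsa_gains(k, a=57.707, c=231.498, A=5000)
        print(f"k={k:5d}  a_k={a_k:.4f}  c_k={c_k:.3f}")
```

Under this reading, a large A such as 5000 keeps the first few thousand steps nearly uniform in size instead of letting the earliest iterations dominate.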
16-02-14 lan noisy_DOE diff
24719/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Testing with the values found by the first DOE run: a=57.707 and c=231.498
16-02-13 lan noisy_DOE diff
24443/25000 iterations
50000/50000 games played
50000 @ 2+0.02 th 1 Take 8: c=150, R=0.0044