Stockfish Testing Queue

Finished - 3322 tests

18-08-15 sg lmr_best_move2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 10568 W: 1690 L: 1771 D: 7107
sprt @ 60+0.6 th 1 LTC: At PV nodes do less reduction if best value < draw <= static eval.
18-08-14 fau FauMob3 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 48620 W: 7992 L: 7992 D: 32636
sprt @ 60+0.6 th 1 LTC: mobility tuned values after all games, half the changes, with a few manual tweaks to some values
18-08-14 sni game_phase5 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 11269 W: 1798 L: 1876 D: 7595
sprt @ 60+0.6 th 1 LTC: Use pawns in game phase: pawns/2 - 6
18-08-14 sg lmr_best_move2 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 35317 W: 5781 L: 5765 D: 23771
sprt @ 60+0.6 th 1 LTC: Fixed version. Less reduction if value of best move < draw <= static eval.
18-08-14 sni connected11 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 29231 W: 4694 L: 4756 D: 19781
sprt @ 60+0.6 th 1 LTC: Tweak connected pawns seed[] array : +5
18-08-14 sni game_phase5 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 16967 W: 2745 L: 2801 D: 11421
sprt @ 60+0.6 th 1 LTC: Cap the phase to [0..128] (suggested by Stefan). Submitted with priority -1.
18-08-14 IIv king_psqt diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 14214 W: 2230 L: 2339 D: 9645
sprt @ 60+0.6 th 1 LTC: Tuned values.
18-08-14 Gua MidgameLimit2 diff
LLR: 0.13 (-2.94,2.94) [0.00,4.00]
Total: 37008 W: 6481 L: 6352 D: 24175
sprt @ 60+0.6 th 1 Speculative LTC MidgameLimit 15308
18-08-14 SC pawnTweaks diff
LLR: 0.45 (-2.94,2.94) [0.00,4.00]
Total: 1000 W: 185 L: 158 D: 657
sprt @ 60+0.6 th 1 -5, 105/100. Speculative LTC
18-08-13 Viz capture_history2~1 diff
LLR: 2.96 (-2.94,2.94) [0.00,4.00]
Total: 43251 W: 7425 L: 7131 D: 28695
sprt @ 60+0.6 th 1 LTC for 180k games yellowish run with LLR close to 0. Double weight of capture history
18-08-13 sg prune_draw_new diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 31578 W: 5350 L: 5346 D: 20882
sprt @ 60+0.6 th 1 LTC: No move count pruning if best move is draw.
18-08-12 sni master diff
ELO: 40.88 +-1.9 (95%) LOS: 100.0%
Total: 40000 W: 8754 L: 4069 D: 27177
40000 @ 60+0.6 th 1 Regression/progression test against SF9 after "Combo of several promising parameter tweaks" of August, 12th
18-08-13 sni mob_initiative5 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 38226 W: 6572 L: 6540 D: 25114
sprt @ 60+0.6 th 1 LTC: Pawn mobility and strong pawn center (A,B)=(3,3)
18-08-13 Viz QuadTuned diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 34271 W: 5852 L: 5753 D: 22666
sprt @ 60+0.6 th 1 LTC for Rocky640 SPSA tuned after 78000 games
18-08-12 sni no_captures5 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 20634 W: 3482 L: 3522 D: 13630
sprt @ 60+0.6 th 1 Speculative LTC: deprecate captures at root by 6.
18-08-12 SC simpNLpawn diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 25869 W: 4259 L: 4455 D: 17155
sprt @ 60+0.6 th 1 Quadratic fit 1 pawn upwards.
18-08-12 sni pr_1725~1 diff
LLR: 2.97 (-2.94,2.94) [0.00,5.00]
Total: 19683 W: 3409 L: 3206 D: 13068
sprt @ 60+0.6 th 1 LTC: Take 4, tweak bishop values like in PR #1733
18-08-12 SC PVextension diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 16859 W: 2795 L: 2851 D: 11213
sprt @ 60+0.6 th 1 Only extend: PV above beta / 40000. Speculative LTC
18-08-11 SC permatune5 diff
LLR: 2.96 (-2.94,2.94) [0.00,4.00]
Total: 23761 W: 4155 L: 3923 D: 15683
sprt @ 60+0.6 th 1 Several promising parameter tweaks at once. LTC
18-08-11 jdo simple_razor diff
ELO: 0.23 +-2.1 (95%) LOS: 58.6%
Total: 36412 W: 6168 L: 6144 D: 24100
40000 @ 60+0.6 th 1 Both versions of razoring simplification passed [-3, 1] and framework is empty, so at low prio -5 let's collect more data - which is better?
18-08-11 And razor_ether diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 16740 W: 2837 L: 2937 D: 10966
sprt @ 60+0.6 th 1 LTC: Simplify to just depth 1 razoring, maintain the PvNode condition. Test with [0, 4], despite being a simplification, to see if this is better than the [-3, 1] running now.
18-08-10 jdo simple_razor diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 15572 W: 2678 L: 2549 D: 10345
sprt @ 60+0.6 th 1 Fix of green STC simplification - possibility 1
18-08-10 jdo simple_razor diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 62670 W: 10749 L: 10697 D: 41224
sprt @ 60+0.6 th 1 Fix, possibility 2
18-08-10 jdo simple_razor diff
LLR: 1.56 (-2.94,2.94) [-3.00,1.00]
Total: 4265 W: 768 L: 693 D: 2804
sprt @ 60+0.6 th 1 Simplest razoring: depth 1 only, no distinction between PV / NonPV
18-08-10 SC permatune2 diff
LLR: -2.94 (-2.94,2.94) [0.00,4.00]
Total: 148267 W: 25685 L: 25353 D: 97229
sprt @ 60+0.6 th 1 LTC
18-08-09 Voy pawnValue diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 45444 W: 7732 L: 7739 D: 29973
sprt @ 60+0.6 th 1 Speculative LTC on this Yellow patch... low throughput
18-08-08 sni master diff
ELO: 37.78 +-1.9 (95%) LOS: 100.0%
Total: 40000 W: 8556 L: 4224 D: 27220
40000 @ 60+0.6 th 1 Regression/progression test against SF9 after "First check threshold in space evaluation" of August, 8th
18-08-09 xor doublePass diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 30717 W: 5256 L: 5151 D: 20310
sprt @ 60+0.6 th 1 LTC: remove condition for passed pawns.
18-08-09 sg lmr_tweak3a diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 29570 W: 5083 L: 4976 D: 19511
sprt @ 60+0.6 th 1 LTC: Replace capture stat score < 0 rule with last move count > 15 rule for all moves. Rebased to current master
18-08-09 sg tuned_stats_bonus2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 25754 W: 4371 L: 4390 D: 16993
sprt @ 60+0.6 th 1 LTC: 139K values with 50% more change.. Rebased to current master
18-08-09 Gua GameStage1 diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 14179 W: 2507 L: 2326 D: 9346
sprt @ 60+0.6 th 1 speculative LTC (stable yellow +1.64 elo)
18-08-08 sg tuned_stats_bonus diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 74377 W: 12853 L: 12673 D: 48851
sprt @ 60+0.6 th 1 LTC: Try average of master and 139K values.
18-08-07 Voy exactPVt diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 57482 W: 9831 L: 9721 D: 37930
sprt @ 60+0.6 th 1 Since the queue is dry...try this yellow at LTC...low tp
18-08-08 Gua GameStage diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 35324 W: 6320 L: 6049 D: 22955
sprt @ 60+0.6 th 1 LTC
18-08-07 sg tuned_stats_bonus diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 42810 W: 7308 L: 7258 D: 28244
sprt @ 60+0.6 th 1 The final values failed bad. But the stat bonus values after 139K games failed yellow at STC. So try them at LTC (low throughput)
18-08-07 sg tuned_stats_bonus diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 27105 W: 4552 L: 4566 D: 17987
sprt @ 60+0.6 th 1 STC failed fast but i want test final tuned values at LTC. (Low throughput)
18-08-07 SC risk12 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 39688 W: 6863 L: 6824 D: 26001
sprt @ 60+0.6 th 1 -20% of original values. LTC for sni
18-08-04 sni risk12~4 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 21229 W: 3569 L: 3607 D: 14053
sprt @ 60+0.6 th 1 LTC: Only for depth < 5
18-08-05 fau KingFau3 diff
LLR: 2.96 (-2.94,2.94) [0.00,4.00]
Total: 147079 W: 25584 L: 24947 D: 96548
sprt @ 60+0.6 th 1 Speculative LTC: Final version of King's Psqt tweak, with few manual fixes.
18-08-06 sni risk12~5 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 23208 W: 3892 L: 3922 D: 15394
sprt @ 60+0.6 th 1 LTC: Only for depth > 0
18-08-04 31m KingFau2 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 217624 W: 37358 L: 36804 D: 143462
sprt @ 60+0.6 th 1 For @fauzi2: Test number 2, after 370k.
18-08-05 sni risk12 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 33943 W: 5698 L: 5685 D: 22560
sprt @ 60+0.6 th 1 LTC: Another matrix (3)
18-08-05 sni risk12 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 25050 W: 4170 L: 4193 D: 16687
sprt @ 60+0.6 th 1 LTC: Use the new safety margin for null move pruning (suggested by Stefano Cardanobile)
18-08-05 sni risk12~1 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 14503 W: 2430 L: 2495 D: 9578
sprt @ 60+0.6 th 1 LTC: -40% of original values (getting a little bit desperate)
18-08-05 sni risk12 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 12802 W: 2087 L: 2159 D: 8556
sprt @ 60+0.6 th 1 LTC: +40% of original values
18-08-04 sni risk12~2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 61358 W: 10448 L: 10323 D: 40587
sprt @ 60+0.6 th 1 LTC: Only for depth < 3
18-08-04 sni risk12~1 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 6010 W: 961 L: 1060 D: 3989
sprt @ 60+0.6 th 1 LTC: Only for depth < 2
18-08-04 sni risk12 diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 21561 W: 3622 L: 3658 D: 14281
sprt @ 60+0.6 th 1 LTC for "Back to the drawing board". I expect this to fail, but it could hopefully give some insight for versions with depth restriction(s), ie closer to master.
18-08-04 sni risk12~2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 7044 W: 1160 L: 1255 D: 4629
sprt @ 60+0.6 th 1 LTC for "Back to the drawing board, take 3". I expect this to fail, but it could hopefully give some insight for versions with depth restriction(s), ie closer to master.
18-08-04 31m risk11 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 6528 W: 1074 L: 1171 D: 4283
sprt @ 60+0.6 th 1 For @snicolet: Only for depths 0,2,4.