Stockfish Testing Queue

Finished - 45531 tests

15-02-28 jos fianchetto diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 20851 W: 4087 L: 4137 D: 12627
sprt @ 15+0.05 th 1 Kingside fianchetto, take 2. Additionally check for a defended pawn on g3/g6.
15-02-28 vin doubled_passed_pawns diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 14555 W: 2929 L: 2783 D: 8843
sprt @ 15+0.05 th 1 Uh, actually use the correct reweighted values this time instead of greatly increasing the Eg penalty. Final try.
15-02-28 sni sorting_moves4 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 7506 W: 1440 L: 1526 D: 4540
sprt @ 15+0.05 th 1 It is often good to capture pawns with pieces rather than with pawns to avoid deteriorating the pawn structure, especially in endgame. Since the see() call will delay the bad captures till the end of the move list anyway, it makes sense to try ordering captures by MVV/MVA instead of MVV/LVA.
15-02-28 sni sorting_moves5 diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 5643 W: 1104 L: 1196 D: 3343
sprt @ 15+0.05 th 1 Sorting captures in MVV/LVA/history order
15-02-28 zar doubled_tuning diff
33052/30000 iterations
63000/60000 games played
60000 @ 15+0.05 th 1 SPSA tuning for doubled pawns
15-02-28 hxi phase_min diff
LLR: -3.14 (-2.94,2.94) [-1.50,4.50]
Total: 6257 W: 1172 L: 1268 D: 3817
sprt @ 15+0.05 th 1 game phase based on side with less material
15-02-28 jos threatened_byPawn diff
ELO: 0.46 +-2.2 (95%) LOS: 65.8%
Total: 40000 W: 8478 L: 8425 D: 23097
40000 @ 9+0.05 th 1 Measure of fine-tuned values for Threat and ThreatenedByPawn arrays. (Final try.)
15-03-01 zar order diff
ELO: -1.97 +-2.1 (95%) LOS: 3.5%
Total: 41000 W: 8088 L: 8320 D: 24592
40000 @ 15+0.05 th 1 Add psqt weight to sort quites, check value (fixed)
15-03-01 Roc OppCastleV2 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 12544 W: 2462 L: 2534 D: 7548
sprt @ 15+0.05 th 1 2 weights have been adjusted when kings are in opposite castle 3x2 corners. Take 2
15-03-01 vin doubled_passed_pawns diff
LLR: -3.94 (-2.94,2.94) [0.00,6.00]
Total: 30360 W: 5031 L: 5023 D: 20306
sprt @ 60+0.05 th 1 Re-test at LTC after correct version of patch passed at STC.
15-03-01 jki avail diff
ELO: -13.14 +-6.9 (95%) LOS: 0.0%
Total: 3200 W: 461 L: 582 D: 2157
20000 @ 15+0.05 th 16 Examine threads in random order when splitting
15-03-01 jos fianchetto diff
LLR: -3.47 (-2.94,2.94) [-1.50,4.50]
Total: 6443 W: 1242 L: 1350 D: 3851
sprt @ 15+0.05 th 1 Check for pawns on f2, g3 and h2, slightly increased bonus. Take 3.
15-03-01 vin time_predictor diff
24048/25000 iterations
46725/50000 games played
50000 @ 30+0.05 th 1 Try using the more consistent estimated node growth rather than BestMoveChanges as a measure of PV instability. See the old thread "Comparative analysis of "big think" positions - suggests new metric?" Difficult to work out the 'right' parameters so begin with a tuning run from a can't-be-awful estimate based on bench output. As this is a time management patch use slightly longer TC.
15-03-01 jki avail2 diff
ELO: 2.40 +-2.8 (95%) LOS: 95.2%
Total: 19000 W: 3166 L: 3035 D: 12799
20000 @ 15+0.05 th 16 Orthodox helpful master concept
15-03-01 jos bishop_pawns diff
LLR: -3.37 (-2.94,2.94) [-3.00,1.00]
Total: 155176 W: 30329 L: 30794 D: 94053
sprt @ 15+0.05 th 1 Remove the little helper "pawns_on_same_color_squares" and directly compute it when needed. Small simplification and speedup as well.
15-03-01 vin blocked_centre diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 3322 W: 627 L: 725 D: 1970
sprt @ 15+0.05 th 1 Put in the only value from the SPSA session that showed some meaningful change. Adjust stormdanger when centre is open.
15-03-01 jki nolocks diff
ELO: 6.64 +-2.8 (95%) LOS: 100.0%
Total: 20000 W: 3482 L: 3100 D: 13418
20000 @ 15+0.05 th 16 Retire global thread lock
15-03-02 vin time_predictor diff
7122/25000 iterations
14350/50000 games played
50000 @ 30+0.05 th 1 Second run with corrected larger c-value as suggested by Nicklas Persson. Otherwise it'll take forever to converge..
15-03-02 zar one_lever diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 35125 W: 6914 L: 6925 D: 21286
sprt @ 15+0.05 th 1 Don't bonus lever with two opponents.
15-03-02 vin time_predictor diff
25188/25000 iterations
50000/50000 games played
50000 @ 30+0.05 th 1 Restart SPSA run as one parameter was going to hit the lower limit. (Thanks Binky) Also sync with latest master.
15-03-02 vin structural_mobility diff
LLR: -3.31 (-2.94,2.94) [-1.50,4.50]
Total: 21025 W: 4118 L: 4180 D: 12727
sprt @ 15+0.05 th 1 Try bonus for pawns that are more mobile structurally (e.g. candidate passers) as opposed to currently mobile (the current safe pawn push bonus)
15-03-02 sg long_chain diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 8225 W: 1708 L: 1578 D: 4939
sprt @ 15+0.05 th 1 Add bonus for inner pawns of a long chain. (even lower bonus)
15-03-02 jos matimb diff
ELO: -3.46 +-2.1 (95%) LOS: 0.1%
Total: 44000 W: 9336 L: 9774 D: 24890
40000 @ 9+0.05 th 1 One last try to improve upon existing values.
15-03-02 lbr 54f8a9cb diff
ELO: -8.27 +-58.9 (95%) LOS: 39.1%
Total: 42 W: 6 L: 7 D: 29
20000 @ 15+0.05 th 1 SF 5: 4 vs. 2 threads
15-03-03 Fis TTflipEval diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 11453 W: 2226 L: 2301 D: 6926
sprt @ 15+0.05 th 1 Use the NULL move flip trick on TT evals also. 2MB hash
15-03-03 sg long_chain diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 15665 W: 2575 L: 2603 D: 10487
sprt @ 60+0.05 th 1 LTC: Add bonus for inner pawns of a long chain. (even lower bonus)
15-03-03 Fis TTflipEval diff
LLR: -3.09 (-2.94,2.94) [-1.50,4.50]
Total: 10331 W: 1934 L: 2017 D: 6380
sprt @ 15+0.05 th 1 One more try also refreshing the flipped TT entries. 2MB
15-03-03 vin time_predictor diff
26652/25000 iterations
50000/50000 games played
50000 @ 30+0.05 th 1 One parameter has converged it seems, so one more run to stabilise the other. Priority -1 so as not to hold up all the eval/threading work.
15-03-03 sni king2 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 8113 W: 1654 L: 1739 D: 4720
sprt @ 15+0.05 th 1 Give less importance to the pawn structure when the kings are separated
15-03-03 sni king2 diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 11459 W: 2310 L: 2385 D: 6764
sprt @ 15+0.05 th 1 Take 2. Less reduction with king separation.
15-03-03 sni king2 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 12731 W: 2514 L: 2586 D: 7631
sprt @ 15+0.05 th 1 Take 3. Quadratic reduction.
15-03-04 Fis tt_iteration2 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 2442 W: 433 L: 533 D: 1476
sprt @ 15+0.05 th 1 A modified version of this http://tests.stockfishchess.org/tests/view/54f1b4af0ebc594fbf9a9735 patch. Full credit to Marco and http://www.talkchess.com/forum/viewtopic.php?t=55501 2MB
15-03-04 sg long_chain diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 32674 W: 6534 L: 6338 D: 19802
sprt @ 15+0.05 th 1 Add bonus for inner pawns of a long chain. The last value 1/16 passed STC fast but failed LTC. Try now a higher value between 1/8 and 1/16.
15-03-04 jos outpost3 diff
ELO: 0.80 +-1.8 (95%) LOS: 80.3%
Total: 46000 W: 7775 L: 7669 D: 30556
40000 @ 9+0.05 th 1 Measure tuned outpost values with 8-moves book.
15-03-04 sg long_chain diff
LLR: -2.97 (-2.94,2.94) [0.00,6.00]
Total: 25573 W: 4245 L: 4226 D: 17102
sprt @ 60+0.05 th 1 LTC: Add bonus for inner pawns of a long chain. The last value 1/16 passed STC fast but failed LTC. Try now a higher value between 1/8 and 1/16.
15-03-04 jki cb2111f0b62af diff
ELO: -0.43 +-2.0 (95%) LOS: 33.5%
Total: 40000 W: 6594 L: 6643 D: 26763
40000 @ 60+0.05 th 1 c++11 migration, regression test
15-03-04 jki cb2111f0b62af diff
ELO: -32.20 +-8.1 (95%) LOS: 0.0%
Total: 2229 W: 252 L: 458 D: 1519
20000 @ 60+0.05 th 4 c++11 migration, regression test, 4 threads
15-03-05 vin time_predictor_a diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 1493 W: 219 L: 319 D: 955
sprt @ 30+0.05 th 1 Test of first possible metric (overall node growth) from statistical fit of node growth for time management. Test at same TC as was used for tuning initially.
15-03-05 sni king3 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 4848 W: 959 L: 1053 D: 2836
sprt @ 15+0.05 th 1 Reward semi-open and open files on opponent king
15-03-05 jos no_is_draw diff
ELO: -4.66 +-3.0 (95%) LOS: 0.1%
Total: 20000 W: 3852 L: 4120 D: 12028
20000 @ 15+0.05 th 1 Can we drop the check for a draw at the beginning of qsearch? (We shouldn't miss a 3-fold rep because we always check in main search.)
15-03-06 jki spin diff
ELO: -5.82 +-10.7 (95%) LOS: 14.4%
Total: 1314 W: 204 L: 226 D: 884
10000 @ 15+0.05 th 16 c++11 regression test, spinlocks activated
15-03-06 jos outpost3 diff
ELO: 0.39 +-1.9 (95%) LOS: 65.5%
Total: 44000 W: 7594 L: 7545 D: 28861
40000 @ 9+0.05 th 1 Now measure some asymmetrical values. Also with 8moves book to get a fair comparison to the symmetrical values.
15-03-06 lbr 27a18772 diff
ELO: 11.94 +-3.2 (95%) LOS: 100.0%
Total: 20000 W: 4883 L: 4196 D: 10921
20000 @ 9+0.03 th 1 quick test to confirm scaling regression interval. 1 thread.
15-03-06 lbr 27a18772 diff
ELO: 7.07 +-3.1 (95%) LOS: 100.0%
Total: 20000 W: 4276 L: 3869 D: 11855
20000 @ 9+0.03 th 3 quick test to confirm scaling regression interval. 3 threads.
15-03-06 lbr 27a18772 diff
ELO: 3.39 +-3.2 (95%) LOS: 98.0%
Total: 17226 W: 3422 L: 3254 D: 10550
20000 @ 9+0.03 th 7 quick test to confirm scaling regression interval. 7 threads.
15-03-07 lbr 2eec7103 diff
ELO: 0.90 +-3.1 (95%) LOS: 71.1%
Total: 19369 W: 4050 L: 4000 D: 11319
20000 @ 15+0.05 th 1 confirm scaling regression. 1 thread.
15-03-07 lbr 2eec7103 diff
ELO: 1.58 +-3.2 (95%) LOS: 83.5%
Total: 17564 W: 3409 L: 3329 D: 10826
20000 @ 15+0.05 th 3 confirm scaling regression. 3 thread.
15-03-07 lbr 27a18772 diff
ELO: 8.50 +-3.3 (95%) LOS: 100.0%
Total: 20000 W: 4961 L: 4472 D: 10567
20000 @ 9+0.03 th 1 bissect 766fb9c6..27a18772 1 thread
15-03-07 lbr 27a18772 diff
ELO: 10.48 +-3.1 (95%) LOS: 100.0%
Total: 20000 W: 4586 L: 3983 D: 11431
20000 @ 9+0.03 th 3 bissect 766fb9c6..27a18772 3 threads
15-03-07 lbr 27a18772 diff
ELO: 1.84 +-3.0 (95%) LOS: 88.2%
Total: 20000 W: 4066 L: 3960 D: 11974
20000 @ 9+0.03 th 7 bissect 766fb9c6..27a18772 7 threads