Stockfish Testing Queue

Finished - 20740 tests

21-03-14 My rBeta_tweak2 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 14877 W: 2742 L: 2808 D: 9327
sprt @ 15+0.05 th 1 Try the same concept as aspiration delta. Split rBeta margin according to depth. Likely last take.
21-03-14 sn LMR_with_eval diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 5315 W: 967 L: 1059 D: 3289
sprt @ 15+0.05 th 1 Increase LMR reduction for positions which seem to be easy to prove/disprove (400). Low priority.
21-03-14 sn LMR_with_eval diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 2779 W: 457 L: 555 D: 1767
sprt @ 15+0.05 th 1 Increase LMR reduction for positions which seem to be easy to prove/disprove (300). Low priority.
21-03-14 sn LMR_with_eval diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 1714 W: 267 L: 367 D: 1080
sprt @ 15+0.05 th 1 Increase LMR reduction for positions which seem to be easy to prove/disprove (200). Low priority.
21-03-14 sn LMR_with_eval diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 721 W: 81 L: 183 D: 457
sprt @ 15+0.05 th 1 Increase LMR reduction for positions which seem to be easy to prove/disprove (100). Low priority.
19-03-14 sg rook_ending diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 36603 W: 6794 L: 6804 D: 23005
sprt @ 15+0.05 th 1 drawish rook ending (Take 4)
21-03-14 Fi tt_move diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 14221 W: 2602 L: 2670 D: 8949
sprt @ 15+0.05 th 1 Value tt entries w/ moves more. Test w/ 16MB to simulate more normal hash pressure.
21-03-14 My rBeta_tweak1 diff
LLR: -1.59 (-2.94,2.94) [-1.50,4.50]
Total: 3954 W: 700 L: 746 D: 2508
sprt @ 15+0.05 th 1 Introduce a factor of - depth / 8 to the singular extension margin.
20-03-14 sn LMR_with_eval diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 16790 W: 2566 L: 2592 D: 11632
sprt @ 60+0.05 th 1 LTC : recurse to qsearch() in LMR if reduced depth is less than ONE_PLY
19-03-14 lu asp_start_retry diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 40265 W: 7322 L: 7323 D: 25620
sprt @ 15+0.05 th 1 aspiration window start size based on previous score: Ralph's patch with very different values loosely based on local testing (while the framework is empty)
20-03-14 jo iid_tweak diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 38473 W: 7161 L: 7166 D: 24146
sprt @ 15+0.05 th 1 As we arrive here with under 1% as PvNode && !ttMove, I simplified the precondition. As this noticeably affects functionality, I test in standard mode. Take 2.
20-03-14 sn LMR_with_eval diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 18245 W: 3461 L: 3308 D: 11476
sprt @ 15+0.05 th 1 Recurse to qsearch() in LMR if reduced depth is less than ONE_PLY
20-03-14 jo iid_tweak^ diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 21029 W: 3838 L: 3888 D: 13303
sprt @ 15+0.05 th 1 As we arrive here with under 1% as PvNode && !ttMove, I simplified the precondition. As this noticeably affects functionality, I test in standard mode. Take 1.
20-03-14 My rBeta_tweak diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 16402 W: 2536 L: 2563 D: 11303
sprt @ 60+0.05 th 1 LTC: Introduce a factor of depth / 4 to the singular extension margin.
20-03-14 My rBeta_tweak diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 9889 W: 1883 L: 1752 D: 6254
sprt @ 15+0.05 th 1 Introduce a factor of depth / 4 to the singular extension margin.
20-03-14 sn probcut_tweak diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 36543 W: 6869 L: 6878 D: 22796
sprt @ 15+0.05 th 1 Harden probcut precondition, and reduction = 4 plies (current master)
20-03-14 sn probcut_tweak diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 8799 W: 1568 L: 1650 D: 5581
sprt @ 15+0.05 th 1 Harden probcut precondition, and reduction = 4.5 plies
19-03-14 sg rook_ending diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 17126 W: 3191 L: 3251 D: 10684
sprt @ 15+0.05 th 1 drawish rook ending (Take 3)
19-03-14 sg rook_ending diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 20497 W: 3759 L: 3811 D: 12927
sprt @ 15+0.05 th 1 drawish rook ending (Take 2)
19-03-14 sg rook_ending diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 12410 W: 2233 L: 2306 D: 7871
sprt @ 15+0.05 th 1 drawish rook ending (Take 1)
19-03-14 sn LMR_captures diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 7858 W: 1422 L: 1507 D: 4929
sprt @ 15+0.05 th 1 LMR for captures, with reduction 3 plies less
19-03-14 Fi asp_interpolate diff
LLR: -2.94 (-2.94,2.94) [0.00,6.00]
Total: 65148 W: 10066 L: 9875 D: 45207
sprt @ 60+0.05 th 1 Interpolate aspiration over a certain depth range. It failed STC w/ a neutral score but we know this is TC dependent. I would like to try LTC anyway w/ low priority. If don't agree don't approve.
19-03-14 Fi asp_previous diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 3249 W: 548 L: 644 D: 2057
sprt @ 15+0.05 th 1 Bias delta based on delta from previous iteration. Take 2
19-03-14 Fi asp_previous diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 6087 W: 1097 L: 1186 D: 3804
sprt @ 15+0.05 th 1 Bias delta based on delta from previous iteration. Take 1
18-03-14 mc split_moves diff
ELO: -10.83 +-7.3 (95%) LOS: 0.2%
Total: 2567 W: 344 L: 424 D: 1799
20000 @ 15+0.05 th 16 Hit the bullet and test split_moves for a gain: we need to use at least 16 threads!
18-03-14 sn LMR_captures diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 4159 W: 730 L: 824 D: 2605
sprt @ 15+0.05 th 1 LMR : do it for captures too & retire cutNode difference
18-03-14 My tune_asp_del diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 36648 W: 6806 L: 6815 D: 23027
sprt @ 15+0.05 th 1 First and last try at tuning. Increase depth & delta by 2.
18-03-14 rs lmr_cap_qsearch diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 4284 W: 758 L: 852 D: 2674
sprt @ 15+0.05 th 1 lmr captures on qsearch fail low
17-03-14 sn noise_of_sf140314 diff
ELO: 3.13 +-8.3 (95%) LOS: 77.0%
Total: 2000 W: 306 L: 288 D: 1406
2000 @ 60+0.05 th 1 measuring the noise of sf140314 against itself (2000@60'')
18-03-14 Fi asp_interpolate diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 42536 W: 7868 L: 7862 D: 26806
sprt @ 15+0.05 th 1 Interpolate aspiration over a certain depth range.
17-03-14 sn noise_of_sf140314 diff
ELO: 9.90 +-8.9 (95%) LOS: 98.5%
Total: 2000 W: 373 L: 316 D: 1311
2000 @ 15+0.05 th 1 measuring the noise of sf140314 against itself (2000@15'')
18-03-14 pe still_first diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 13231 W: 1991 L: 2033 D: 9207
sprt @ 60+0.05 th 1 Direct LTC as looks tc dependent . Chip away more time from unchanging moves. Final attempt.
18-03-14 My attack_pin diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 8444 W: 1523 L: 1606 D: 5315
sprt @ 15+0.05 th 1 Give bonus to attacking pins.
18-03-14 sn LMR_captures diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 2304 W: 388 L: 488 D: 1428
sprt @ 15+0.05 th 1 Use LMR for captures too
18-03-14 mc split_moves diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 16972 W: 2757 L: 2938 D: 11277
sprt @ 15+0.05 th 3 Test "don't allocate a thread if no moves" for no-regression, this is not enough to commit, but at least check patch is not broken. (3 threads used)
17-03-14 Fi asp_delta_change diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 22725 W: 4091 L: 4137 D: 14497
sprt @ 15+0.05 th 1 Update deltas for alpha and beta independently.
17-03-14 mc bigger_asp diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 34039 W: 5252 L: 5200 D: 23587
sprt @ 60+0.05 th 1 Increase aspiration to 18. Directly at LTC because it seems to be TC dependent (see previous asp. patches).
17-03-14 st quadratic_gamephase diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 14527 W: 2596 L: 2663 D: 9268
sprt @ 15+0.05 th 1 2nd order polynomial for game phase instead of straight line, with b=0.009. Take 2 (Final take if it fails).
15-03-14 mc master diff
LLR: 0.57 (-2.94,2.94) [-3.00,1.00]
Total: 16451 W: 2623 L: 2619 D: 11209
sprt @ 15+0.05 th 8 Test for no-regression remove of Max Threads per Split Point restriction. This is a special test that requires 8 threads at least and so is set at high priority because available machines are very few anyhow and will not affect other tests.
17-03-14 hw futility_margin_tune_2 diff
ELO: 2.74 +-2.9 (95%) LOS: 96.7%
Total: 20000 W: 3769 L: 3611 D: 12620
20000 @ 15+0.05 th 1 Another check on tuning futilty margin
17-03-14 rs asp_start diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 17331 W: 2675 L: 2698 D: 11958
sprt @ 60+0.05 th 1 LTC: aspiration window start size based on previous score
17-03-14 sn autocorrect_CUT_ALL_sta diff
LLR: -1.31 (-2.94,2.94) [-1.50,4.50]
Total: 39521 W: 7402 L: 7345 D: 24774
sprt @ 15+0.05 th 1 take 7
17-03-14 mc asp_tuning^ diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 4471 W: 660 L: 741 D: 3070
sprt @ 60+0.05 th 1 LTC: Aspiration test 12 vs 16
15-03-14 mc asp_tuning^ diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 60708 W: 11285 L: 11023 D: 38400
sprt @ 15+0.05 th 1 Aspiration test 12 vs 16
14-03-14 in asp_delta diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 40077 W: 7404 L: 7405 D: 25268
sprt @ 15+0.05 th 1 Another take on aspiration delta
14-03-14 Fi delta_change diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 51169 W: 9483 L: 9455 D: 32231
sprt @ 15+0.05 th 1 Increase delta a bit slower on fail high/low. Take 1
17-03-14 st quadratic_gamephase diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 11653 W: 2126 L: 2201 D: 7326
sprt @ 15+0.05 th 1 Use a 2nd order polynomial to interpolate through the game phases limits instead of a straight line. There's a free parameter b to tune. Here I choose b=0.013. Take 1.
16-03-14 My rBeta diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 17800 W: 3299 L: 3358 D: 11143
sprt @ 15+0.05 th 1 Use a fixed value subtraction for rBeta in singular extension search.
16-03-14 sn autocorrect_CUT_ALL_sta diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 904 W: 107 L: 208 D: 589
sprt @ 15+0.05 th 1 take 5
16-03-14 sn autocorrect_CUT_ALL_sta diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 1796 W: 266 L: 365 D: 1165
sprt @ 15+0.05 th 1 take 4