Stockfish Testing Queue

Finished - 38987 tests

14-02-04 lbr simplify_null diff
LLR: 2.95 (-2.94,2.94) [-4.00,0.00]
Total: 39483 W: 7325 L: 7305 D: 24853
sprt @ 15+0.05 th 1 simplification: remove verification search
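The `(-2.94,2.94)` pair on each LLR line is consistent with the standard SPRT stopping bounds for error rates alpha = beta = 0.05, since ln(0.95/0.05) is about 2.944. The listing does not state the error rates, so the 0.05 values below are an assumption inferred from those bounds, and the function names are hypothetical. A minimal sketch of how a reported LLR maps to a pass, a fail, or a still-running test:

```python
import math


def sprt_bounds(alpha=0.05, beta=0.05):
    """Return (lower, upper) log-likelihood-ratio stopping bounds.

    alpha and beta are assumed error rates; 0.05 reproduces the
    (-2.94, 2.94) pair shown in the listing.
    """
    lower = math.log(beta / (1.0 - alpha))
    upper = math.log((1.0 - beta) / alpha)
    return lower, upper


def sprt_state(llr, alpha=0.05, beta=0.05):
    """Classify a reported LLR value against the stopping bounds."""
    lower, upper = sprt_bounds(alpha, beta)
    if llr >= upper:
        return "accept H1"   # test passed
    if llr <= lower:
        return "accept H0"   # test failed
    return "continue"        # not enough evidence yet


if __name__ == "__main__":
    print(sprt_bounds())
    for llr in (2.95, -2.95, -0.07):
        print(llr, sprt_state(llr))
```

Under this reading, the bracketed pair such as `[-4.00,0.00]` or `[-1.50,4.50]` would be the two Elo hypotheses being compared; the sketch only covers the stopping rule, not how the LLR itself is computed from the game results.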
14-02-04 dor gives_check diff
LLR: 2.97 (-2.94,2.94) [0.00,6.00]
Total: 42175 W: 6702 L: 6409 D: 29064
sprt @ 60+0.05 th 1 LTC for Th: Speed improvement, optimized common case of pos.gives_check inline to avoid fairly expensive function calls.
14-02-04 hxi scaling diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 44713 W: 8241 L: 8021 D: 28451
sprt @ 15+0.05 th 1 scaling with 2 and 3 pawns
14-02-04 uri lessnull2 diff
ELO: 1.26 +-2.9 (95%) LOS: 80.2%
Total: 20168 W: 3728 L: 3655 D: 12785
40000 @ 40/15 th 1 similar try to my previous try with different parameters.
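For fixed-length tests, the `ELO`, `+-` and `LOS` figures can be approximately reproduced from the `W`/`L`/`D` counts. The sketch below assumes the usual logistic Elo model, a normal approximation for the 95% interval, and the common (W - L) / sqrt(W + L) approximation for likelihood of superiority. The listing does not say which formulas the site uses, so these are assumptions, and `summarize` is a hypothetical helper name.

```python
import math


def elo_from_score(score):
    """Logistic Elo difference for an expected score in (0, 1)."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def summarize(wins, losses, draws):
    """Return (elo, margin95, los) for a W/L/D record.

    Assumed formulas: logistic Elo, normal-approximation 95% interval,
    and LOS = Phi((W - L) / sqrt(W + L)).
    """
    n = wins + losses + draws
    score = (wins + 0.5 * draws) / n
    # Per-game variance of the score around its mean.
    var = (wins * (1.0 - score) ** 2
           + losses * score ** 2
           + draws * (0.5 - score) ** 2) / n
    half_width = 1.959964 * math.sqrt(var / n)
    elo = elo_from_score(score)
    margin = (elo_from_score(score + half_width)
              - elo_from_score(score - half_width)) / 2.0
    los = 0.5 * (1.0 + math.erf((wins - losses)
                                / math.sqrt(2.0 * (wins + losses))))
    return elo, margin, los


if __name__ == "__main__":
    elo, margin, los = summarize(3728, 3655, 12785)
    print(f"ELO: {elo:.2f} +-{margin:.1f} (95%) LOS: {los * 100:.1f}%")
```

Run on the lessnull2 record above (W: 3728 L: 3655 D: 12785), these formulas give values that agree with the reported ELO: 1.26 +-2.9 and LOS: 80.2% to the displayed precision.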
14-02-04 inf simplify_null diff
Pending...
sprt @ 60+0.05 th 1 LTC: simplification: remove verification search
14-02-04 inf simplify_null diff
LLR: 2.95 (-2.94,2.94) [-4.00,0.00]
Total: 27543 W: 4278 L: 4209 D: 19056
sprt @ 60+0.05 th 1 LTC: simplification: remove verification search
14-02-04 rst qsearch_disco diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 63171 W: 11656 L: 11597 D: 39918
sprt @ 15+0.05 th 1 don't prune discovered checks in qsearch
14-02-04 rst search_disco diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 12864 W: 2327 L: 2399 D: 8138
sprt @ 15+0.05 th 1 don't lmr discovered and double checks
14-02-04 inf razor_margin diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 26599 W: 5054 L: 4879 D: 16666
sprt @ 15+0.05 th 1 Final Take: return v; 50K iterations
14-02-05 luc depth_based_aspiration diff
ELO: -0.90 +-3.1 (95%) LOS: 28.6%
Total: 20000 W: 4210 L: 4262 D: 11528
20000 @ 5+0.05 th 1 evaluate simple conservative formula at very fast TC (after tuning)
14-02-05 jos razor_margin diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 28108 W: 4361 L: 4335 D: 19412
sprt @ 60+0.05 th 1 LTC for infinity: Final Take: return v; 50K iterations
14-02-05 hxi scaling diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 68720 W: 12480 L: 12409 D: 43831
sprt @ 15+0.05 th 1 scaling with 2 and 3 pawns - take 2
14-02-06 inf scaling diff
LLR: -2.97 (-2.94,2.94) [0.00,6.00]
Total: 29621 W: 4541 L: 4510 D: 20570
sprt @ 60+0.05 th 1 LTC for hx: scaling with 2 and 3 pawns
14-02-06 hxi scaling diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 60928 W: 11256 L: 10994 D: 38678
sprt @ 15+0.05 th 1 scaling with 2 and 3 pawns - take 3
14-02-06 dor only_move diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 11768 W: 2641 L: 2716 D: 6411
sprt @ 15+0.05 th 1 easy move with 1 pawn adv. - low priority retest with 2moves_v1 book, previously failed after 26259 games. Take 2 as I made a mistake in parameters before.
14-02-06 sg lmr_exclude diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 7564 W: 1337 L: 1422 D: 4805
sprt @ 15+0.05 th 1 LMR: less reductions for passed pawn pushes
14-02-06 sg lmr_exclude diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 7274 W: 1283 L: 1369 D: 4622
sprt @ 15+0.05 th 1 LMR: exclude passed pawn pushes
14-02-06 dor c4533e0d94 diff
ELO: 1.98 +-2.9 (95%) LOS: 90.8%
Total: 20000 W: 3754 L: 3640 D: 12606
20000 @ 15+0.05 th 1 Verify possible regression from Mindbreaker's list
14-02-06 sg scaling diff
LLR: -2.97 (-2.94,2.94) [0.00,6.00]
Total: 145541 W: 22550 L: 22000 D: 100991
sprt @ 60+0.05 th 1 LTC for hx: scaling with 2 and 3 pawns - take 3
14-02-07 uri lessnull2 diff
ELO: -15.38 +-3.8 (95%) LOS: 0.0%
Total: 12000 W: 1948 L: 2479 D: 7573
40000 @ 15+0.05 th 1 limit using null move pruning (simpler version than previous version because I use number of nodes to decide about big depth not to prune only at the beginning of the iteration).
14-02-07 uri lessnull2 diff
ELO: -3.06 +-2.2 (95%) LOS: 0.3%
Total: 35777 W: 6412 L: 6727 D: 22638
40000 @ 15+0.05 th 1 I multiply by ONE_PLY, which I forgot in the previous version. The previous version was not my intention, but it has a smaller bench, so maybe it is better; it may be interesting to test both the previous version and my intention for comparison.
14-02-07 Tha checkinfo diff
LLR: -0.07 (-2.94,2.94) [-1.50,4.50]
Total: 238 W: 39 L: 41 D: 158
sprt @ 15+0.05 th 1 Pass CheckInfo by reference rather than const reference for speed gain
14-02-07 inf from_null diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 14452 W: 2590 L: 2658 D: 9204
sprt @ 15+0.05 th 1 Reintroduce fromNull (from Mindbreaker's potential regressions list -- edce2a8 -- -29elo?). Instead of testing old patch vs parent master, I decided to try and reintroduce it into latest master.
14-02-07 vdb measure_razor_margins diff
ELO: -4.38 +-2.1 (95%) LOS: 0.0%
Total: 38796 W: 6933 L: 7422 D: 24441
40000 @ 15+0.05 th 1 I suspect that the use of razor_margins(depth) both as a precondition for razoring and as actual margins is wrong. To measure this I have set the margins to zero but kept the precondition.
14-02-07 jos razor_margin^ diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 29474 W: 5444 L: 5472 D: 18558
sprt @ 15+0.05 th 1 Reducing at remaining depth == 1 is much less risky than at depth == 3. So I start with a much smaller margin. Take 1.
14-02-07 jos razor_margin diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 15034 W: 2732 L: 2798 D: 9504
sprt @ 15+0.05 th 1 New razor_margin, take 2.
14-02-07 vdb measure_razor_margins2 diff
ELO: 2.61 +-2.1 (95%) LOS: 99.3%
Total: 40000 W: 7565 L: 7264 D: 25171
40000 @ 15+0.05 th 1 This time I have set the razor margins to half of the precondition margins.
14-02-07 gli measure_razor_margins2 diff
LLR: -2.97 (-2.94,2.94) [0.00,6.00]
Total: 21267 W: 3272 L: 3278 D: 14717
sprt @ 60+0.05 th 1 LTC SPRT for VD: This time I have set the razor margins to half of the precondition margins.
14-02-07 luc 9e327f28c21320da9ea849a diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 32167 W: 5927 L: 5948 D: 20292
sprt @ 15+0.05 th 1 reset delta based on depth and BestMoveChanges - take 1
14-02-07 rst null_verification diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 8115 W: 1448 L: 1532 D: 5135
sprt @ 15+0.05 th 1 make null move verification search more accurate so that 8/8/8/2p5/1pp5/brpp4/1pprp2P/qnkbK3 w - - 0 1 can be solved fast
14-02-07 luc tricky_delta diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 42950 W: 7976 L: 7969 D: 27005
sprt @ 15+0.05 th 1 reset delta based on depth and BestMoveChanges - take 2
14-02-07 inf razor_futility_margin diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 15686 W: 2878 L: 2942 D: 9866
sprt @ 15+0.05 th 1 Tune razor_margin and futility_margin. This time I try using CLOP instead of SPSA.
14-02-08 vdb measure_razor_margins3 diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 17772 W: 3307 L: 3366 D: 11099
sprt @ 15+0.05 th 1 Razor margins are now roughly 2/3 of precondition margins. Last attempt in this series.
14-02-07 vdb measure_razor_margins3 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 57336 W: 10565 L: 10521 D: 36250
sprt @ 15+0.05 th 1 Razor margins are now roughly 1/3 of precondition margins.
14-02-07 rst null_verification diff
LLR: -1.53 (-2.94,2.94) [-4.00,0.00]
Total: 53660 W: 9794 L: 10062 D: 33804
sprt @ 15+0.05 th 1 solves 8/8/8/2p5/1pp5/brpp4/1pprp2P/qnkbK3 w - - 0 1 at depth 98. Take 2. Simplification test this time.
14-02-08 pec tm diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 21844 W: 3960 L: 4009 D: 13875
sprt @ 15+0.05 th 1 Roughly proportional increase in thinking time with faster decay of pv instability time
14-02-08 rst null_verification diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 7968 W: 1436 L: 1520 D: 5012
sprt @ 15+0.05 th 1 solves 8/8/8/2p5/1pp5/brpp4/1pprp2P/qnkbK3 w - - 0 1 at depth 96. Double verification search. Take 3 (final)
14-02-08 hwi see_king_capture diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 81337 W: 15060 L: 14745 D: 51532
sprt @ 15+0.05 th 1 Another attempted see optimization.
14-02-08 vdb measure_razor_margins2 diff
ELO: 0.76 +-1.9 (95%) LOS: 78.3%
Total: 40000 W: 6249 L: 6162 D: 27589
40000 @ 60+0.05 th 1 This passed STC with a clear elo gain. ELO: 2.61 +-2.1 (95%) LOS: 99.3%. Yet it failed SPRT(0,6) at LTC rather quickly. Is this the mythical non-scalable patch? I would like to make a more precise measurement of the actual elo at LTC. Low priority.
14-02-08 mco 0a1092f64eb9eb232094 diff
LLR: 2.96 (-2.94,2.94) [-4.00,0.00]
Total: 38903 W: 6482 L: 6452 D: 25969
sprt @ 15+0.05 th 3 Verify racy simplification (SMP with 3 threads)
14-02-08 luc tricky_delta diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 36496 W: 6735 L: 6745 D: 23016
sprt @ 15+0.05 th 1 reset delta based on depth and BestMoveChanges - take 3 (low priority based on take 1)
14-02-08 uri fix_null_bug diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 35355 W: 6400 L: 6414 D: 22541
sprt @ 40/15 th 1 This change causes Stockfish not to be blind to zugzwangs. (I guess that at 40/15 it can get the needed depth in endgames to detect some zugzwangs and play better moves.)
14-02-08 inf razor_margin diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 8317 W: 1493 L: 1576 D: 5248
sprt @ 15+0.05 th 1 Try tuning with CLOP @ 10+0.05 to see if it scales better.
14-02-09 sg lmr_exclude diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 16541 W: 2993 L: 3055 D: 10493
sprt @ 15+0.05 th 1 LMR: less reduction (half ply) for double checks
14-02-09 sg lmr_exclude diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 14438 W: 2598 L: 2666 D: 9174
sprt @ 15+0.05 th 1 LMR: less reduction (half ply) for followup moves
14-02-09 inf razor_margin diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 7089 W: 1286 L: 1373 D: 4430
sprt @ 15+0.05 th 1 Try to make razor_margin more quadratic with respect to <depth> and see if it scales better. Also, use vd's tweaked pre-condition.
14-02-09 lbr attack_unit diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 20345 W: 4581 L: 4630 D: 11134
sprt @ 15+0.05 th 1 do not cap attackUnit
14-02-09 inf razor_margin diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 7107 W: 1444 L: 1532 D: 4131
sprt @ 5+0.05 th 1 Shorter TC: Try to make razor_margin more quadratic with respect to <depth> and see if it scales better. Also, use vd's tweaked pre-condition.
14-02-09 rst null_tweak diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 3129 W: 517 L: 613 D: 1999
sprt @ 15+0.05 th 1 attempt to smuggle Uri's max depth condition for nullmove search. Take 1
14-02-09 inf razor_margin diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 1800 W: 322 L: 425 D: 1053
sprt @ 5+0.05 th 1 Short TC: Extreme Try: Try to make razor_margin more exponential with respect to <depth> and see if it scales better. Also, use vd's tweaked pre-condition.