Stockfish Testing Queue

Finished - 21870 tests

07-02-14 ur lessnull2 diff
ELO: -3.06 +-2.2 (95%) LOS: 0.3%
Total: 35777 W: 6412 L: 6727 D: 22638
40000 @ 15+0.05 th 1 I multiply by ONE_PLY that I forgot in the previous version (the previous version was not my intention but it has smaller bench so maybe it is better so it may be interesting to test both the previous version and also my intention for comparison).
07-02-14 in razor_futility_margin diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 15686 W: 2878 L: 2942 D: 9866
sprt @ 15+0.05 th 1 Tune razor_margin and futility_margin. This time I try using CLOP instead of SPSA.
07-02-14 rs null_verification diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 8115 W: 1448 L: 1532 D: 5135
sprt @ 15+0.05 th 1 make null move verification search more accurate so that 8/8/8/2p5/1pp5/brpp4/1pprp2P/qnkbK3 w - - 0 1 can be solved fast
07-02-14 gl measure_razor_margins2 diff
LLR: -2.97 (-2.94,2.94) [0.00,6.00]
Total: 21267 W: 3272 L: 3278 D: 14717
sprt @ 60+0.05 th 1 LTC SPRT for VD: This time I have set the razor margins to half of the precondition margins.
07-02-14 ur lessnull2 diff
ELO: -15.38 +-3.8 (95%) LOS: 0.0%
Total: 12000 W: 1948 L: 2479 D: 7573
40000 @ 15+0.05 th 1 limit using null move pruning(simpler version then previous version because I use number of nodes to decide about big depth not to prune only at the beginning of the iteration.
07-02-14 vd measure_razor_margins2 diff
ELO: 2.61 +-2.1 (95%) LOS: 99.3%
Total: 40000 W: 7565 L: 7264 D: 25171
40000 @ 15+0.05 th 1 This time I have set the razor margins to half of the precondition margins.
06-02-14 do c4533e0d94 diff
ELO: 1.98 +-2.9 (95%) LOS: 90.8%
Total: 20000 W: 3754 L: 3640 D: 12606
20000 @ 15+0.05 th 1 Verify possible regression from Mindbreaker's list
07-02-14 jo razor_margin diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 15034 W: 2732 L: 2798 D: 9504
sprt @ 15+0.05 th 1 New razor_margin, take 2.
07-02-14 vd measure_razor_margins diff
ELO: -4.38 +-2.1 (95%) LOS: 0.0%
Total: 38796 W: 6933 L: 7422 D: 24441
40000 @ 15+0.05 th 1 I suspect that the use of razor_margins(depth) both as a precondition for razoring and as actual margins is wrong. To measure this I have set the margins to zero but kept the precondition.
07-02-14 Th checkinfo diff
LLR: -0.07 (-2.94,2.94) [-1.50,4.50]
Total: 238 W: 39 L: 41 D: 158
sprt @ 15+0.05 th 1 Pass CheckInfo by reference rather than const reference for speed gain
06-02-14 sg lmr_exclude diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 7274 W: 1283 L: 1369 D: 4622
sprt @ 15+0.05 th 1 LMR: exclude passed pawn pushes
06-02-14 sg lmr_exclude diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 7564 W: 1337 L: 1422 D: 4805
sprt @ 15+0.05 th 1 LMR: less reductions for passed pawn pushes
06-02-14 hx scaling diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 60928 W: 11256 L: 10994 D: 38678
sprt @ 15+0.05 th 1 scaling with 2 and 3 pawns - take 3
06-02-14 do only_move diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 11768 W: 2641 L: 2716 D: 6411
sprt @ 15+0.05 th 1 easy move with 1 pawn adv. - low priority retest with 2moves_v1 book, previously failed after 26259 games. Take 2 as I made mistake in parameters before.
03-02-14 do storm_blocked diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 8504 W: 1911 L: 1996 D: 4597
sprt @ 15+0.05 th 1 Don't count blocked files in kingRing - low priority retest with 2moves_v1 book, previously failed after 9826 games
03-02-14 do stsu diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 1915 W: 434 L: 542 D: 939
sprt @ 15+0.05 th 1 check rammed (last check, since 2 tests failed miserably and one neutral, this must be a gem) - low priority retest with 2moves_v1 book, previously failed after 1329 games
03-02-14 do only_move diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 20220 W: 3740 L: 3792 D: 12688
sprt @ 15+0.05 th 1 easy move with 1 pawn adv. - low priority retest with 2moves_v1 book, previously failed after 26259 games
03-02-14 do backward3 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 17013 W: 3868 L: 3927 D: 9218
sprt @ 15+0.05 th 1 Retry this old idea. Slightly changed - low priority retest with 2moves_v1 book, previously failed after 22358 games.
03-02-14 do more_ks^ diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 57181 W: 13344 L: 13057 D: 30780
sprt @ 15+0.05 th 1 Always compute KSretire KingExposed[] and merge its values into KPSQT - low priority retest with 2moves_v1 book, previously failed after 20359 games
05-02-14 hx scaling diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 68720 W: 12480 L: 12409 D: 43831
sprt @ 15+0.05 th 1 scaling with 2 and 3 pawns - take 2
06-02-14 in scaling diff
LLR: -2.97 (-2.94,2.94) [0.00,6.00]
Total: 29621 W: 4541 L: 4510 D: 20570
sprt @ 60+0.05 th 1 LTC for hx: scaling with 2 and 3 pawns
04-02-14 ur lessnull2 diff
ELO: 1.26 +-2.9 (95%) LOS: 80.2%
Total: 20168 W: 3728 L: 3655 D: 12785
40000 @ 40/15 th 1 similiar try to my previous try with different parameters.
03-02-14 ur lessnull1 diff
ELO: -0.77 +-2.1 (95%) LOS: 24.0%
Total: 36965 W: 6671 L: 6753 D: 23541
40000 @ 40/15 th 1 another try to avoid null move pruning in the first plies(mainly in endgame). The bench is slightly smaller and it can solve 8/8/8/2p5/1pp5/brpp4/1pprp2P/qnkbK3 w - - 0 1. If I see that 0 is inside the error bounds then I am going to try more 40,000 games match in order to try to tune parameters
05-02-14 jo razor_margin diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 28108 W: 4361 L: 4335 D: 19412
sprt @ 60+0.05 th 1 LTC for infinity: Final Take: return v; 50K iterations
04-02-14 hx scaling diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 44713 W: 8241 L: 8021 D: 28451
sprt @ 15+0.05 th 1 scaling with 2 and 3 pawns
04-02-14 in razor_margin diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 26599 W: 5054 L: 4879 D: 16666
sprt @ 15+0.05 th 1 Final Take: return v; 50K iterations
04-02-14 rs qsearch_disco diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 63171 W: 11656 L: 11597 D: 39918
sprt @ 15+0.05 th 1 don't prune discovered checks in qsearch
04-02-14 in simplify_null diff
LLR: 2.95 (-2.94,2.94) [-4.00,0.00]
Total: 27543 W: 4278 L: 4209 D: 19056
sprt @ 60+0.05 th 1 LTC: simplification: remove verification search
05-02-14 lu depth_based_aspiration diff
ELO: -0.90 +-3.1 (95%) LOS: 28.6%
Total: 20000 W: 4210 L: 4262 D: 11528
20000 @ 5+0.05 th 1 evaluate simple conservative formula at very fast TC (after tuning)
01-02-14 jk ppsqt3 diff
ELO: 1.65 +-1.7 (95%) LOS: 97.1%
Total: 50000 W: 7971 L: 7733 D: 34296
50000 @ 60+0.05 th 1 LTC: ppsqt: H-file penalty and center bonus
04-02-14 rs search_disco diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 12864 W: 2327 L: 2399 D: 8138
sprt @ 15+0.05 th 1 don't lmr discovered and double checks
04-02-14 do gives_check diff
LLR: 2.97 (-2.94,2.94) [0.00,6.00]
Total: 42175 W: 6702 L: 6409 D: 29064
sprt @ 60+0.05 th 1 LTC for Th: Speed improvement, optimized common case of pos.gives_check inline to avoid fairly expensive function calls.
04-02-14 in simplify_null diff
Pending...
sprt @ 60+0.05 th 1 LTC: simplification: remove verification search
04-02-14 lb simplify_null diff
LLR: 2.95 (-2.94,2.94) [-4.00,0.00]
Total: 39483 W: 7325 L: 7305 D: 24853
sprt @ 15+0.05 th 1 simplification: remove verification search
04-02-14 in razor_margin diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 55607 W: 10330 L: 10290 D: 34987
sprt @ 15+0.05 th 1 Take 4: return v; 25K iterations
03-02-14 Th gives_check diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 12441 W: 2333 L: 2196 D: 7912
sprt @ 15+0.05 th 1 Speed improvement, optimized common case of pos.gives_check inline to avoid fairly expensive function calls.
04-02-14 in pv_instability diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 3806 W: 627 L: 721 D: 2458
sprt @ 15+0.05 th 1 Take 3: Decay PV faster when depth is greater
03-02-14 do king_pawn_attacks diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 5292 W: 1196 L: 1291 D: 2805
sprt @ 15+0.05 th 1 Further push along Chris pawn on king attack idea (take 2)retire KingExposed[] and merge its values into KPSQT - low priority retest with 2moves_v1 book, previously passed after 30171 games
03-02-14 do king_safety_trigger4 diff
LLR: 2.97 (-2.94,2.94) [-1.50,4.50]
Total: 19406 W: 4611 L: 4439 D: 10356
sprt @ 15+0.05 th 1 take 1retire KingExposed[] and merge its values into KPSQT - low priority retest with 2moves_v1 book, previously passed after 17938 games
03-02-14 do pawn_dist diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 7330 W: 1688 L: 1777 D: 3865
sprt @ 15+0.05 th 1 Try to improve on 'pawns on both wing' patch. Increase bonus to 20.retire KingExposed[] and merge its values into KPSQT - low priority retest with 2moves_v1 book, previously passed after 18331 games
03-02-14 do onepawn diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 15115 W: 3527 L: 3370 D: 8218
sprt @ 15+0.05 th 1 Scale down evaluation when only one pawn leftretire KingExposed[] and merge its values into KPSQT - low priority retest with 2moves_v1 book, previously passed after 11921 games
03-02-14 do king_exposed diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 7525 W: 1821 L: 1686 D: 4018
sprt @ 15+0.05 th 1 retire KingExposed[] and merge its values into KPSQT - low priority retest with 2moves_v1 book, previously passed after 5348 games
03-02-14 in razor_margin diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 13821 W: 2496 L: 2565 D: 8760
sprt @ 15+0.05 th 1 Take 3: Try to tune around (v + razor_margin); 25K iterations
02-02-14 ur zugzwang_detect diff
ELO: -2.66 +-2.1 (95%) LOS: 0.6%
Total: 40000 W: 7255 L: 7561 D: 25184
40000 @ 15+0.05 th 1 I try to be less aggressive in null move pruning when the number of pieces of the side to move is small. I want also to measure being more aggressive when the number of pieces is big.
03-02-14 jo razoring^^ diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 7715 W: 1369 L: 1454 D: 4892
sprt @ 15+0.05 th 1 Razoring, Take 1.
03-02-14 jo razoring diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 5952 W: 1046 L: 1135 D: 3771
sprt @ 15+0.05 th 1 Razoring, Take 3.
03-02-14 do blocked_storm_pawn_radi diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 24012 W: 5643 L: 5458 D: 12911
sprt @ 15+0.05 th 1 quickly passed patch (6239 games) - retest with 2moves_v1 book (my last test of this book)
02-02-14 ur zugzwang_detect diff
ELO: -3.55 +-2.1 (95%) LOS: 0.0%
Total: 40000 W: 7161 L: 7570 D: 25269
40000 @ 15+0.05 th 1 testing the opposite direction of more aggressive null move pruning in the middle game, It is possible that both of them are positive and if I see positive results in both tests I am going to use SPRT for making both changes
03-02-14 do rook_eval diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 10814 W: 2459 L: 2537 D: 5818
sprt @ 15+0.05 th 1 quickly failed patch (4235 games) - retest with 2moves_v1 book
03-02-14 in razor_margin diff
ELO: -2.71 +-2.1 (95%) LOS: 0.5%
Total: 40000 W: 7190 L: 7502 D: 25308
40000 @ 15+0.05 th 1 v + razor_margin(depth) / original values: Verify if regression.