Stockfish Testing Queue

Finished - 16742 tests

14-03-17 pb verification_search4' diff
ELO: 1.51 +-2.1 (95%) LOS: 92.3%
Total: 19561 W: 1825 L: 1740 D: 15996
20000 @ 60+0.6 th 1 My try on make SF more aware of zugzwang based on verification search. On pos FEN: 1k3b1q/pP2p1p1/P1K1P1Pp/7P/2B5/8/8/8 w - - 0 1 it finds Bb5 in 11 secs. (lower throughput.)
15-03-17 Gu QRBKSet-3 diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 55125 W: 9822 L: 9715 D: 35588
sprt @ 10+0.1 th 1 take 3
15-03-17 Gu QRBKSet-2 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 42904 W: 7623 L: 7568 D: 27713
sprt @ 10+0.1 th 1 v2
15-03-17 Ro PassiveLever diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 19791 W: 3447 L: 3489 D: 12855
sprt @ 10+0.1 th 1 More precise version, do not consider a square strongly defended by a pawn if it is levered.
15-03-17 jo KnightSet2 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 5396 W: 945 L: 1048 D: 3403
sprt @ 10+0.1 th 1 Try the same for knights. (Test against passed PawnsSet.)
15-03-17 Gu Imbalance diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 4867 W: 801 L: 905 D: 3161
sprt @ 10+0.1 th 1 take 3
15-03-17 Gu Imbalance diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 11026 W: 1927 L: 2006 D: 7093
sprt @ 10+0.1 th 1 take 2 (fixed)
14-03-17 SC dynamicContemptCrunchy diff
ELO: 0.68 +-1.0 (95%) LOS: 91.5%
Total: 242036 W: 60329 L: 59853 D: 121854
200000 @ 2+0.02 th 1 Try to debug a fishtest problem, see https://groups.google.com/forum/#!msg/fishcooking/C3KorRmYmWM/g6MJjhI3BgAJ
15-03-17 Gu Imbalance diff
LLR: -0.84 (-2.94,2.94) [0.00,5.00]
Total: 907 W: 157 L: 189 D: 561
sprt @ 10+0.1 th 1 Imbalance tweak
15-03-17 El closedB diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 24180 W: 4292 L: 4315 D: 15573
sprt @ 10+0.1 th 1 Use moveable pawns in evaluate_initiative
14-03-17 Fi skipQuiets2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 70513 W: 12681 L: 12508 D: 45324
sprt @ 10+0.1 th 1 Don't sort quiets that will be skipped.
15-03-17 Ro PassiveLever diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 50116 W: 8947 L: 8861 D: 32308
sprt @ 10+0.1 th 1 penalty for a levered pawn which should not take because would leave another pawn unprotected
15-03-17 Ro ActiveLever diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 21425 W: 3757 L: 3792 D: 13876
sprt @ 10+0.1 th 1 Bonus for a lever when a take by the enemy would isolate the taker.
13-03-17 SC dynamicContempt diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 14600 W: 1866 L: 1937 D: 10797
sprt @ 60+0.6 th 1 Only positive contempt. How does it scale? LTC with prior -1 in case framework goes idle.
14-03-17 Gu QRBKSet-1 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 6783 W: 1143 L: 1239 D: 4401
sprt @ 10+0.1 th 1 set for all pieces
13-03-17 II tune_eval diff
25908/40000 iterations
52690/80000 games played
80000 @ 10+0.1 th 1 Tune evaluation #2. 20% lower ck values this time.
14-03-17 SC dynamicContempt diff
LLR: -4.71 (-2.94,2.94) [0.00,5.00]
Total: 23501 W: 4254 L: 4353 D: 14894
sprt @ 10+0.1 th 1 Try to scale with depth, so that we can properly order the moves before using dynamic contempt too much.
14-03-17 Fi skipQuiets2 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 20130 W: 3525 L: 3566 D: 13039
sprt @ 10+0.1 th 1 Take 2
14-03-17 Gu QueenSet3 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 38118 W: 6761 L: 6726 D: 24631
sprt @ 10+0.1 th 1 v3
09-03-17 sg master diff
ELO: 10.84 +-1.6 (95%) LOS: 100.0%
Total: 40000 W: 4817 L: 3569 D: 31614
40000 @ 60+0.6 th 1 Regression test until "Helper functions to count material for both sides"
14-03-17 sg mate_history diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 10515 W: 1830 L: 1911 D: 6774
sprt @ 10+0.1 th 1 Try the opposite and double the update weight. Take 3
14-03-17 vd pqstquiets diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 15544 W: 2703 L: 2763 D: 10078
sprt @ 10+0.1 th 1 scale up further.
14-03-17 Gu QueenSetTunes diff
9162/10000 iterations
18765/20000 games played
20000 @ 20+0.2 th 1 tunes for this yellow: http://tests.stockfishchess.org/tests/view/58c70afb0ebc59035df32d58
09-03-17 sn contempt_for_initiative diff
ELO: 113.25 +-2.4 (95%) LOS: 100.0%
Total: 40000 W: 16109 L: 3514 D: 20377
40000 @ 10+0.1 th 1 40000 STC games with contempt = 0 against SF7. Half throughput.
13-03-17 SC dynamicContempt diff
LLR: -3.87 (-2.94,2.94) [0.00,5.00]
Total: 58582 W: 10722 L: 10635 D: 37225
sprt @ 10+0.1 th 1 Dynamic contempt is somewhat messing up time management. Try to be move slower.
14-03-17 sg mate_history diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 6520 W: 1123 L: 1221 D: 4176
sprt @ 10+0.1 th 1 Half update weight. Take 2
14-03-17 pb strikeBack diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 12691 W: 2219 L: 2291 D: 8181
sprt @ 10+0.1 th 1 Take 3
14-03-17 El nmp_sc3 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 8167 W: 1068 L: 1159 D: 5940
sprt @ 10+0.1 th 1 Take 3: include more cases. Also use 8moves book to give focus on endgames where this issue is more pressing.
14-03-17 pb strikeBack diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 28139 W: 4977 L: 4984 D: 18178
sprt @ 10+0.1 th 1 Take 2: more accurate & double hits by capture escape detection
13-03-17 jo outpost_new diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 52199 W: 9311 L: 9291 D: 33597
sprt @ 10+0.1 th 1 Check some locally tuned values.
14-03-17 vd pqstquiets diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 9051 W: 1541 L: 1628 D: 5882
sprt @ 10+0.1 th 1 non-positives only
14-03-17 vd pqstquiets diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 17100 W: 2913 L: 2967 D: 11220
sprt @ 10+0.1 th 1 more
13-03-17 Gu QueenSet diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 47945 W: 8539 L: 8463 D: 30943
sprt @ 10+0.1 th 1 take 1b
14-03-17 Fi ff5a7bd8693d0fd87d7a84a diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 20423 W: 3710 L: 3748 D: 12965
sprt @ 10+0.1 th 1 Put quiets that may be skipped at the end.
14-03-17 vd pqstquiets diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 4042 W: 665 L: 773 D: 2604
sprt @ 10+0.1 th 1 less
13-03-17 SC dynamicContempt diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 31362 W: 5729 L: 5720 D: 19913
sprt @ 10+0.1 th 1 Or move faster.
13-03-17 II knight_mobility diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 44827 W: 7872 L: 7878 D: 29077
sprt @ 10+0.1 th 1 Knight mobility bonus. Tuned values.
13-03-17 Gu ExperimentalSet diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 35261 W: 6359 L: 6334 D: 22568
sprt @ 10+0.1 th 1 ExperimentalSet (to give a large penalty, for all pieces except the queen, if after the pawn transformation their number exceeds 2)
13-03-17 Fi span4 diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 27519 W: 4861 L: 4870 D: 17788
sprt @ 10+0.1 th 1 Final attempt
13-03-17 pb strikeBack diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 29151 W: 5168 L: 5170 D: 18813
sprt @ 10+0.1 th 1 Another exotic experiment on the idea of using an alternative strikeBack square for see.
13-03-17 Ro PawnPushTweak diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 24687 W: 4324 L: 4397 D: 15966
sprt @ 10+0.1 th 1 Using the new stronglyProtected idea for the ThreatByPawnPush
13-03-17 vd pqstquiets diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 15079 W: 2719 L: 2780 D: 9580
sprt @ 10+0.1 th 1 take 2
13-03-17 Fi partialSort diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 9227 W: 1579 L: 1665 D: 5983
sprt @ 10+0.1 th 1 Use std::partial_sort() to sort quiets.
09-03-17 sn contempt_for_initiative diff
ELO: 115.48 +-2.4 (95%) LOS: 100.0%
Total: 40000 W: 16485 L: 3659 D: 19856
40000 @ 10+0.1 th 1 40000 STC games with contempt = 30 against SF7. Half throughput.
13-03-17 Gu QueenSet2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 4612 W: 752 L: 857 D: 3003
sprt @ 10+0.1 th 1 take 2
13-03-17 Gu QueenSet diff
LLR: -0.31 (-2.94,2.94) [0.00,5.00]
Total: 4302 W: 756 L: 751 D: 2795
sprt @ 10+0.1 th 1 take 1
13-03-17 vd noscore diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 304 W: 12 L: 170 D: 122
sprt @ 10+0.1 th 1 surprisingly bench is reasonable
12-03-17 II tune_eval diff
36912/40000 iterations
79526/80000 games played
80000 @ 10+0.1 th 1 Tune evaluation #1. I'll try to retune the whole evaluation function, tuning 8-32 parameters in one session. Throughput 400.
13-03-17 Ro weakTweak diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 27978 W: 4909 L: 4917 D: 18152
sprt @ 10+0.1 th 1 Take 2
13-03-17 El nmp_sc3 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 2343 W: 340 L: 454 D: 1549
sprt @ 10+0.1 th 1 Take 2: leave out the staticEval check