Stockfish Testing Queue

Finished - 2697 tests

16-10-17 xo movestogo3cfix diff
ELO: 8.86 +-4.6 (95%) LOS: 100.0%
Total: 10000 W: 2423 L: 2168 D: 5409
10000 @ 10/2 th 1 fix1 may have been the best; retest against a master that includes the low overhead, with adjudication off.
15-10-17 sn mtg3' diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 176471 W: 31256 L: 31397 D: 113818
sprt @ 20+0 th 1 Use ratio=min(0.5, ratio), moveOverhead=30 and tweak for sudden death. Tested at time control 20+0 (sudden death, 20 seconds), without adjudication rules.
15-10-17 xo movestogo3cfix3 diff
ELO: 11.64 +-4.5 (95%) LOS: 100.0%
Total: 10000 W: 2400 L: 2065 D: 5535
10000 @ 10/2 th 1 Try fix3: instead of adding some overhead for 5 or 10 movestogo, use the whole value of movestogo/2. Test and master have overhead = 20 and adjudication disabled.
15-10-17 xo movestogo3cfix2 diff
ELO: 17.04 +-4.5 (95%) LOS: 100.0%
Total: 10000 W: 2479 L: 1989 D: 5532
10000 @ 10/2 th 1 Run best test against base that also includes the lower move overhead and no adjudication, so that test and base are comparable.
15-10-17 II sudden_death diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 65912 W: 11739 L: 11694 D: 42479
sprt @ 10+0.1 th 1 (standard STC now) Last try: the previous tests make it clear that aggressive time usage is unfortunately important for Elo performance. As this also has a slight impact on the increment case (which should be tested separately), this is my final recommendation for a possible reduction of time losses in the sudden death case.
15-10-17 sn mtg3 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 15671 W: 2759 L: 2630 D: 10282
sprt @ 60/15 th 1 Use ratio=min(0.5, ratio), moveOverhead=30. Tested at time control 60/15 (60 moves in 15 seconds), without adjudication rules.
15-10-17 II sudden_death diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 8636 W: 1646 L: 1504 D: 5486
sprt @ 16+0 th 1 Last try: the previous tests make it clear that aggressive time usage is unfortunately important for Elo performance. As this also has a slight impact on the increment case (which should be tested separately), this is my final recommendation for a possible reduction of time losses in the sudden death case.
15-10-17 sn mtg3 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 8877 W: 1713 L: 1571 D: 5593
sprt @ 10+0.1 th 1 Use ratio=min(0.5, ratio), moveOverhead=30. Tested at time control 10+0.1, without adjudication rules.
14-10-17 xo movestogo3cfix2 diff
ELO: 8.86 +-4.5 (95%) LOS: 100.0%
Total: 10000 W: 2311 L: 2056 D: 5633
10000 @ 10/2 th 1 fix2 for the fails deliberately created in 3c. The first fix looks reasonable (<1% time fails instead of 3-4%); try to improve on it. Uses movestogo values up to 10 and a double so that halves can be included. Overhead 20 and adjudication disabled as before to provoke the fails.
14-10-17 II mtg' diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 113963 W: 20008 L: 20042 D: 73913
sprt @ 60/15 th 1 Try to reduce possibility of time losses, take 2 - cleaner solution.
14-10-17 xo movestogo3cfix diff
ELO: 8.62 +-4.6 (95%) LOS: 100.0%
Total: 10000 W: 2411 L: 2163 D: 5426
10000 @ 10/2 th 1 take3cfix: last test (3c) had 2.3% time fails. Add mtg calc to try to reduce these.
14-10-17 sn mtg2 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 18250 W: 3250 L: 3125 D: 11875
sprt @ 60/15 th 1 Use MoveOverhead=0, but also ratio=min(0.25,ratio). Tested at time control 60/15 and without adjudication rules.
14-10-17 II mtg' diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 32371 W: 5728 L: 5626 D: 21017
sprt @ 60/15 th 1 Test 60/x time control after report of time losses.
12-10-17 sn pawn_mobility13 diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 20527 W: 3669 L: 3460 D: 13398
sprt @ 10+0.1 th 1 Pawn mobility
10-10-17 Ro Bishop4 diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 16552 W: 2136 L: 2010 D: 12406
sprt @ 60+0.6 th 1 LTC: remove 2 conditions and increase ThreatbyPawn to compensate. Low throughput.
09-10-17 cr queenmob2 diff
LLR: 2.94 (-2.94,2.94) [0.00,5.00]
Total: 62855 W: 11280 L: 10893 D: 40682
sprt @ 10+0.1 th 1 Queen mobility with a modified rule, to not count squares that are attacked by 2 enemies. Code by Rocky
09-10-17 Ro Bishop4 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 14236 W: 2615 L: 2483 D: 9138
sprt @ 10+0.1 th 1 Remove 2 conditions and increase ThreatbyPawn to compensate. Low throughput.
06-10-17 SC KRKN diff
LLR: 3.65 (-2.94,2.94) [-3.00,1.00]
Total: 155890 W: 27400 L: 27468 D: 101022
sprt @ 10+0.1 th 1 We are evaluating KRKN on average 1/2 pawn better. Remove this.
08-10-17 An skipEarlyPruning2 diff
LLR: 2.97 (-2.94,2.94) [0.00,5.00]
Total: 32575 W: 5907 L: 5645 D: 21023
sprt @ 10+0.1 th 1 Only for the quiets tried by LMR, and require a very good static eval.
02-10-17 sg master diff
ELO: 32.61 +-1.6 (95%) LOS: 100.0%
Total: 40000 W: 6431 L: 2688 D: 30881
40000 @ 60+0.6 th 1 Regression test until "Good bishops on the main diagonals"
05-10-17 Gu ScaleFactor diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 22994 W: 2906 L: 2788 D: 17300
sprt @ 60+0.6 th 1 The earlier test showed a similar result over 30k games at bounds [0, 4]. Speculative LTC test with bounds (-3, 1). Low throughput.
04-10-17 Ro BlockedBishop2 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 34432 W: 4394 L: 4291 D: 25747
sprt @ 60+0.6 th 1 LTC for the green (-3, 1), low throughput
05-10-17 sg pvExact diff
LLR: 2.94 (-2.94,2.94) [0.00,5.00]
Total: 11628 W: 2112 L: 1940 D: 7576
sprt @ 10+0.1 th 1 The version with ttValue in the window failed yellow. Elberto is currently testing ttValue > alpha, which is struggling, so also try the opposite case, ttValue < beta.
28-09-17 vd furtherExcept diff
LLR: 2.95 (-2.94,2.94) [-4.00,0.00]
Total: 74535 W: 11934 L: 12013 D: 50588
sprt @ 5+0.05 th 5 stc threaded
03-10-17 Ro BlockedBishop2 diff
LLR: 2.97 (-2.94,2.94) [-3.00,1.00]
Total: 31433 W: 5599 L: 5495 D: 20339
sprt @ 10+0.1 th 1 A fix for the previous attempt (parked with -1), since the bonus was not given for a centered bishop.
30-09-17 sn bdiag3 diff
LLR: 2.97 (-2.94,2.94) [0.00,5.00]
Total: 83978 W: 10685 L: 10303 D: 62990
sprt @ 60+0.6 th 1 LTC: Go back to ElbertoOne's green original patch (main diagonals only), but with a pure midgame bonus
30-09-17 sn bdiag3 diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 10801 W: 1955 L: 1786 D: 7060
sprt @ 10+0.1 th 1 Go back to ElbertoOne's green original patch (main diagonals only), but with a pure midgame bonus
30-09-17 Vo xbr diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 25801 W: 3306 L: 3108 D: 19387
sprt @ 60+0.6 th 1 LTC: ExactBoundReduction take 2.
29-09-17 Vo xbr diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 59004 W: 10621 L: 10249 D: 38134
sprt @ 10+0.1 th 1 ExactBoundReduction take 2.
29-09-17 El bdiag diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 22403 W: 4052 L: 3834 D: 14517
sprt @ 10+0.1 th 1 Bonus for a bishop on a long diagonal when the center squares are not occupied by pawns
26-09-17 vd furtherExcept diff
LLR: 3.38 (-2.94,2.94) [-3.00,1.00]
Total: 213377 W: 37463 L: 37641 D: 138273
sprt @ 10+0.1 th 1 stc, take 2
28-09-17 vd furtherExcept diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 52063 W: 9192 L: 9123 D: 33748
sprt @ 10+0.1 th 1 stc, take 3
28-09-17 pb correctionFactor diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 22921 W: 4134 L: 3914 D: 14873
sprt @ 10+0.1 th 1 Dynamic correction factor according to game phase (bench should be correct now)
27-09-17 sn opb diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 4931 W: 927 L: 783 D: 3221
sprt @ 10+0.1 th 1 Opposite bishops and queens
26-09-17 pb captureKiller2 diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 89138 W: 15869 L: 15374 D: 57895
sprt @ 10+0.1 th 1 Another shot at Stefan's Killer Captures idea...
21-09-17 Gu statScore2 diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 19973 W: 2662 L: 2480 D: 14831
sprt @ 60+0.6 th 1 statScore tweak v2 (LTC)
21-09-17 Gu statScore2 diff
LLR: 2.95 (-2.94,2.94) [0.00,4.00]
Total: 57762 W: 10533 L: 10181 D: 37048
sprt @ 10+0.1 th 1 statScore tweak v2
20-09-17 El avoidrep diff
LLR: 2.97 (-2.94,2.94) [0.00,5.00]
Total: 24081 W: 4450 L: 4222 D: 15409
sprt @ 10+0.1 th 1 Maybe last (and most radical) attempt: no reduction for moves that avoid a repetition
18-09-17 vd timeExcept2 diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 138622 W: 25290 L: 25371 D: 87961
sprt @ 10+0.1 th 1 stc, take 2
16-09-17 vd timeExcept diff
LLR: 2.96 (-2.94,2.94) [-4.00,0.00]
Total: 90618 W: 18637 L: 18817 D: 53164
sprt @ 5+0.05 th 1 stc threaded take 3
14-09-17 Vo justChecks4 diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 83813 W: 15420 L: 14937 D: 53456
sprt @ 10+0.1 th 1 One more shot at this...
13-09-17 sn weak_pawns_and_majors diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 66673 W: 8672 L: 8341 D: 49660
sprt @ 60+0.6 th 1 Bonus=S(5,25). Speculative LTC.
14-09-17 sn factor diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 41000 W: 7504 L: 7205 D: 26291
sprt @ 10+0.1 th 1 Pretend that SF knows the future: anticipating the endgame values by shifting the game phase.
14-09-17 vd timeExcept diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 25219 W: 4685 L: 4571 D: 15963
sprt @ 10+0.1 th 1 stc
11-09-17 Vo justChecks2 diff
LLR: 3.22 (-2.94,2.94) [0.00,5.00]
Total: 51192 W: 9586 L: 9228 D: 32378
sprt @ 10+0.1 th 1 stc
10-09-17 SC da063685749006310498115 diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 64455 W: 8384 L: 8320 D: 47751
sprt @ 60+0.6 th 1 Just add opponent's move count to statScore instead of ad-hoc logic. LTC, please see forum for further discussion.
10-09-17 II MO_normal diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 32029 W: 6639 L: 6538 D: 18852
sprt @ 4+0.1 th 1 Try a more secure usage of Move Overhead, in the spirit of the old TM code.
08-09-17 sg root_order diff
LLR: 2.97 (-2.94,2.94) [0.00,5.00]
Total: 94816 W: 17662 L: 17127 D: 60027
sprt @ 10+0.1 th 1 Add depth in bestMoveCount update (Set other test to prio -1)
10-09-17 II MO' diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 35262 W: 7426 L: 7331 D: 20505
sprt @ 4+0.1 th 1 Regression test for PR #1248
08-09-17 SC da063685749006310498115 diff
LLR: 3.09 (-2.94,2.94) [-3.00,1.00]
Total: 64485 W: 11911 L: 11858 D: 40716
sprt @ 10+0.1 th 1 Just add opponent's move count to statScore instead of ad-hoc logic
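
The ELO, +- and LOS figures on the fixed-game entries above can be reproduced approximately from the W/L/D totals alone. The sketch below uses the standard logistic Elo formula, a normal approximation for the 95% interval, and the usual wins-versus-losses approximation for LOS. It is an illustration and not necessarily the exact code the testing framework runs; the function names are hypothetical. The SPRT bounds helper assumes alpha = beta = 0.05, which is consistent with the (-2.94,2.94) bounds shown on the sprt entries.

```python
import math


def elo_from_score(score):
    """Logistic Elo difference implied by an expected score in (0, 1)."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def summarize(wins, losses, draws, z=1.96):
    """Return (elo, half_width, los) for a W/L/D record.

    half_width is half of the Elo interval obtained by moving the score
    z standard errors up and down (z=1.96 gives roughly 95%).
    """
    games = wins + losses + draws
    w, l, d = wins / games, losses / games, draws / games
    score = w + 0.5 * d
    # Per-game variance of the result (win=1, draw=0.5, loss=0).
    var = w * (1.0 - score) ** 2 + l * score ** 2 + d * (0.5 - score) ** 2
    stderr = math.sqrt(var / games)
    elo = elo_from_score(score)
    upper = elo_from_score(score + z * stderr)
    lower = elo_from_score(score - z * stderr)
    # Likelihood of superiority: draws carry no information here.
    los = 0.5 * (1.0 + math.erf((wins - losses) / math.sqrt(2.0 * (wins + losses))))
    return elo, (upper - lower) / 2.0, los


def sprt_bounds(alpha=0.05, beta=0.05):
    """Wald SPRT stopping bounds on the log-likelihood ratio (assumed error rates)."""
    return math.log(beta / (1.0 - alpha)), math.log((1.0 - beta) / alpha)


if __name__ == "__main__":
    # Totals from the first entry in the list: W: 2423 L: 2168 D: 5409
    elo, err, los = summarize(2423, 2168, 5409)
    print(f"ELO: {elo:.2f} +-{err:.1f} (95%) LOS: {los:.1%}")
    print("SPRT bounds:", sprt_bounds())
```

For the first entry (W: 2423 L: 2168 D: 5409) this gives values close to the listed ELO: 8.86 +-4.6 with LOS near 100%. It does not attempt to reproduce the LLR values on the sprt entries, which depend on the Elo model used by the framework.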