Stockfish Testing Queue

Pending - 0 tests 0.0 hrs

None

Active - 0 tests

Finished - 843 tests

29-10-17 lb noendgame diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 13751 W: 2441 L: 2622 D: 8688
sprt @ 10+0.1 th 1 Remove all specific endgame knowledge, leaving only general-purpose scaling rules (and, of course, perfect knowledge of draws by chess rules, and syzygy endgames). Suggested by Marco in PR #1280. This test is running with adjudication disabled (see hack in UCI::value()), to make sure that the inability to convert difficult endgames is penalized. It is, of course, running without syzygy (impossible to test in fishtest).
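
The LLR figures in this list can be approximately recomputed from the W/L/D counts. The sketch below assumes the BayesElo-based SPRT (with the draw parameter estimated from the sample itself) that fishtest appears to have used at the time, and that the bounds in round brackets come from alpha = beta = 0.05. All function names are illustrative and not taken from the fishtest code.

```python
import math

def bayeselo_to_probs(elo, drawelo):
    """Win/loss/draw probabilities under the BayesElo model."""
    p_win = 1.0 / (1.0 + 10.0 ** ((-elo + drawelo) / 400.0))
    p_loss = 1.0 / (1.0 + 10.0 ** ((elo + drawelo) / 400.0))
    return p_win, p_loss, 1.0 - p_win - p_loss

def estimate_drawelo(wins, losses, draws):
    """Estimate the BayesElo draw parameter from observed frequencies."""
    n = wins + losses + draws
    p_win, p_loss = wins / n, losses / n
    return 200.0 * math.log10((1 - p_loss) / p_loss * (1 - p_win) / p_win)

def sprt_llr(wins, losses, draws, elo0, elo1):
    """Log-likelihood ratio of H1 (elo1) against H0 (elo0)."""
    drawelo = estimate_drawelo(wins, losses, draws)
    w0, l0, d0 = bayeselo_to_probs(elo0, drawelo)
    w1, l1, d1 = bayeselo_to_probs(elo1, drawelo)
    return (wins * math.log(w1 / w0)
            + losses * math.log(l1 / l0)
            + draws * math.log(d1 / d0))

def sprt_bounds(alpha=0.05, beta=0.05):
    """Stopping bounds: reject below the lower, accept above the upper."""
    return math.log(beta / (1 - alpha)), math.log((1 - beta) / alpha)

# Counts and [elo0, elo1] taken from the noendgame test above.
lower, upper = sprt_bounds()
llr = sprt_llr(2441, 2622, 8688, elo0=-3.0, elo1=1.0)
print(f"LLR: {llr:.2f} ({lower:.2f},{upper:.2f})")
```

Under these assumptions the result lands within a few hundredths of the reported -2.95, and the bounds evaluate to about -2.94 and 2.94.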
07-09-17 lb stopFaster diff
LLR: 1.76 (-2.94,2.94) [-3.00,1.00]
Total: 128000 W: 23237 L: 23363 D: 81400
sprt @ 10+0.1 th 1 check time 2x more often
27-08-17 lb qsdraw diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 10469 W: 1866 L: 1947 D: 6656
sprt @ 10+0.1 th 1 Don't check draw in deep QS
21-08-17 lb noeasy diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 15490 W: 2862 L: 3048 D: 9580
sprt @ 10+0.1 th 1 retire easy move
20-08-17 lb time diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 16371 W: 3092 L: 2963 D: 10316
sprt @ 40/10 th 1 Restore safety margin of 60ms. Tournament tc.
19-08-17 lb time diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 59664 W: 10945 L: 10891 D: 37828
sprt @ 16+0 th 1 Restore safety margin of 60ms. Sudden death tc.
19-08-17 lb time diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 58008 W: 10674 L: 10617 D: 36717
sprt @ 10+0.1 th 1 Restore safety margin of 60ms. Previous code used 60ms incompressible safety margin, plus an additional 30ms for each "move to go". This patch is also a simplification, removing a small wart.
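
To make the arithmetic concrete, here is a minimal sketch of the reserve described above: 60ms plus 30ms per "move to go". The flat 60ms alternative is an assumed reading of "Restore safety margin of 60ms", and both function names are hypothetical rather than taken from the Stockfish source.

```python
def reserve_per_move_to_go(moves_to_go, base_ms=60, per_move_ms=30):
    """Reserve as described for the previous code: fixed part plus a per-move part."""
    return base_ms + per_move_ms * moves_to_go

def reserve_flat(base_ms=60):
    """Assumed reading of the patch: a fixed margin, independent of moves to go."""
    return base_ms

# Compare the two reserves for a few "moves to go" values.
for moves_to_go in (1, 10, 40):
    print(moves_to_go, reserve_per_move_to_go(moves_to_go), reserve_flat())
```

At a 40/10 tournament time control, the per-move formula would set aside 60 + 30 * 40 = 1260ms of the 10s budget at the start of the period, which may be why the change was also tested at tournament and sudden death time controls.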
15-08-17 lb stats16 diff
LLR: 2.96 (-2.94,2.94) [0.00,4.00]
Total: 31489 W: 4532 L: 4295 D: 22662
sprt @ 40+0.4 th 1 16bit stats, 3rd test. Verify that we really have an elo gain with strong hash pressure. This time use longer time control, and larger hash to ensure that the hash size is big w.r.t. CPU cache sizes, and it's not an artificial effect that only works with microscopic hash sizes but doesn't scale.
13-08-17 lb stats16 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 73542 W: 13058 L: 13026 D: 47458
sprt @ 10+0.1 th 1 16bit stats, 2nd test. Now verify no regression with low hash pressure.
09-08-17 lb stats16 diff
LLR: 2.96 (-2.94,2.94) [0.00,4.00]
Total: 258430 W: 46977 L: 45943 D: 165510
sprt @ 10+0.1 th 1 16bit stats: does reducing memory footprint by 1.2MB translate into a measurable speed-up? Let's find out in 2 tests: (1) Hash=2 (high pressure) (2) Hash=8 (low pressure).
05-08-17 lb futility^ diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 61689 W: 10966 L: 10915 D: 39808
sprt @ 10+0.1 th 1 futility: 1 depth lower
05-08-17 lb futility diff
LLR: -3.50 (-2.94,2.94) [0.00,4.00]
Total: 43380 W: 7795 L: 7833 D: 27752
sprt @ 10+0.1 th 1 futility: 1 depth higher
05-08-17 lb singular3 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 12208 W: 2085 L: 2200 D: 7923
sprt @ 10+0.1 th 1 singular tweak: take 3
05-08-17 lb singular4 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 7098 W: 1241 L: 1374 D: 4483
sprt @ 10+0.1 th 1 singular tweak: take 4
05-08-17 lb singular diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 4806 W: 817 L: 958 D: 3031
sprt @ 10+0.1 th 1 singular tweak
05-08-17 lb singular2 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 9144 W: 1513 L: 1638 D: 5993
sprt @ 10+0.1 th 1 singular tweak: take 2
01-07-17 lb singular diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 6954 W: 1223 L: 1319 D: 4412
sprt @ 10+0.1 th 1 don't update stats with partial search
26-06-17 lb recursive_singular diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 31680 W: 5701 L: 5598 D: 20381
sprt @ 10+0.1 th 1 Allow recursive SE
29-01-17 lb master diff
ELO: 8.82 +-1.5 (95%) LOS: 100.0%
Total: 40000 W: 4595 L: 3580 D: 31825
40000 @ 60+0.6 th 1 Regression test until "Simplify TT penalty stat"
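
For fixed-length tests like the one above, the ELO, error margin and LOS figures can be derived from the W/L/D counts. The sketch below assumes the usual logistic Elo conversion, a normal approximation for the 95% interval, and the common LOS formula based on decisive games only. It is an illustration with hypothetical function names, not the fishtest implementation.

```python
import math

def elo_from_score(score):
    """Logistic Elo difference for an expected score in (0, 1)."""
    return -400.0 * math.log10(1.0 / score - 1.0)

def elo_summary(wins, losses, draws, z=1.96):
    """Return (elo, half-width of the interval, likelihood of superiority)."""
    n = wins + losses + draws
    w, l, d = wins / n, losses / n, draws / n
    score = w + 0.5 * d
    # Per-game variance of the score (win = 1, draw = 0.5, loss = 0).
    var = w * (1 - score) ** 2 + d * (0.5 - score) ** 2 + l * score ** 2
    stderr = math.sqrt(var / n)
    elo = elo_from_score(score)
    margin = (elo_from_score(score + z * stderr)
              - elo_from_score(score - z * stderr)) / 2
    # LOS ignores draws: normal approximation on wins minus losses.
    los = 0.5 * (1 + math.erf((wins - losses) / math.sqrt(2 * (wins + losses))))
    return elo, margin, los

# Counts taken from the regression test above.
elo, margin, los = elo_summary(4595, 3580, 31825)
print(f"ELO: {elo:.2f} +-{margin:.1f} (95%) LOS: {los:.1%}")
```

With these counts the sketch should reproduce the reported 8.82 +-1.5 and a LOS of 100.0%.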
29-01-17 lb hinder diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 7609 W: 1331 L: 1502 D: 4776
sprt @ 10+0.1 th 1 last try
28-01-17 lb hinder diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 35434 W: 6410 L: 6628 D: 22396
sprt @ 10+0.1 th 1 remove HinderPassedPawn, compensating in Passed[MG][]
28-01-17 lb hinder diff
LLR: -0.02 (-2.94,2.94) [-3.00,1.00]
Total: 89 W: 11 L: 12 D: 66
sprt @ 10+0.1 th 1 do we need HinderPassedPawn ?
28-01-17 lb ring diff
LLR: -0.02 (-2.94,2.94) [-3.00,1.00]
Total: 23 W: 3 L: 4 D: 16
sprt @ 10+0.1 th 1 simplify king ring
24-01-17 lb bonus diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 4768 W: 810 L: 915 D: 3043
sprt @ 10+0.1 th 1 depth^1.9
24-01-17 lb bonus^ diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 2753 W: 423 L: 536 D: 1794
sprt @ 10+0.1 th 1 depth^1.8
22-01-17 lb tune diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 921 W: 121 L: 244 D: 556
sprt @ 10+0.1 th 1 try tuned values
22-01-17 lb tune diff
36508/40000 iterations
74996/80000 games played
80000 @ 20+0.2 th 1 tune history
13-01-17 lb lazy diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 40683 W: 7223 L: 7449 D: 26011
sprt @ 10+0.1 th 1 simplify the lazy eval, but respecting the normal eval logic (mg/eg blending + tempo)
12-01-17 lb counterMoves diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 21975 W: 3177 L: 3360 D: 15438
sprt @ 30+0.3 th 1 do we need counterMoves ? Rerun STC at 30+0.3 (Throughput x1/3), because I suspect counterMoves do not scale, and CMH takes over at longer tc.
11-01-17 lb counterMoves diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 27570 W: 4844 L: 5047 D: 17679
sprt @ 10+0.1 th 1 do we need counterMoves ?
10-01-17 lb counterMoves diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 85688 W: 11144 L: 10985 D: 63559
sprt @ 60+0.6 th 1 LTC: Use (from,to) instead of (pc,to) for MoveStats
10-01-17 lb counterMoves diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 39202 W: 7113 L: 6823 D: 25266
sprt @ 10+0.1 th 1 Use (from,to) instead of (pc,to) for MoveStats
08-01-17 lb history diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 26565 W: 3519 L: 3406 D: 19640
sprt @ 60+0.6 th 1 LTC: do we still need HistoryStats ?
08-01-17 lb history diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 6780 W: 1122 L: 1289 D: 4369
sprt @ 10+0.1 th 1 take 2: double FromToStats to compensate for the removal of HistoryStats
08-01-17 lb history diff
LLR: 3.44 (-2.94,2.94) [-3.00,1.00]
Total: 120831 W: 21572 L: 21594 D: 77665
sprt @ 10+0.1 th 1 do we still need HistoryStats ?
07-01-17 lb tune diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 36974 W: 4822 L: 4819 D: 27333
sprt @ 60+0.6 th 1 LTC: test tuned values. SPRT(0,5), because (0,4) is really too costly.
06-01-17 lb tune diff
LLR: 2.95 (-2.94,2.94) [0.00,4.00]
Total: 224957 W: 40981 L: 40060 D: 143916
sprt @ 10+0.1 th 1 test tuned values: rescheduling, as it was stopped by biffhero (I don't know if this is the known fishtest bug causing wrong bench from time to time...)
06-01-17 lb tune diff
LLR: -0.09 (-2.94,2.94) [0.00,4.00]
Total: 2065 W: 365 L: 363 D: 1337
sprt @ 10+0.1 th 1 test tuned values
05-01-17 lb tune diff
42643/40000 iterations
87272/90000 games played
90000 @ 10+0.1 th 1 tune checks
04-01-17 lb kingDanger diff
LLR: -2.94 (-2.94,2.94) [-3.00,1.00]
Total: 8859 W: 1545 L: 1717 D: 5597
sprt @ 10+0.1 th 1 take 2: compensate +10% shelter
03-01-17 lb kingDanger diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 17322 W: 3144 L: 3332 D: 10846
sprt @ 10+0.1 th 1 simplify kingDanger main formula
03-01-17 lb safe diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 7009 W: 1211 L: 1380 D: 4418
sprt @ 10+0.1 th 1 take 5
03-01-17 lb checks diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 14537 W: 2569 L: 2633 D: 9335
sprt @ 10+0.1 th 1 decouple checks and attacks
02-01-17 lb safe diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 5003 W: 850 L: 1016 D: 3137
sprt @ 10+0.1 th 1 take 4
02-01-17 lb safe diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 2911 W: 488 L: 652 D: 1771
sprt @ 10+0.1 th 1 take 3
02-01-17 lb safe diff
LLR: -2.98 (-2.94,2.94) [-3.00,1.00]
Total: 1786 W: 272 L: 436 D: 1078
sprt @ 10+0.1 th 1 take 2
02-01-17 lb safe diff
LLR: -2.94 (-2.94,2.94) [-3.00,1.00]
Total: 988 W: 146 L: 312 D: 530
sprt @ 10+0.1 th 1 simplify checks
01-01-17 lb otherChecks diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 3997 W: 678 L: 787 D: 2532
sprt @ 10+0.1 th 1 treat other checks consistently with safe checks. take 4, and last try on this. use a quarter bonus this time, as 1/2 failed and 3/4 failed very quickly.
31-12-16 lb otherChecks diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 26023 W: 4491 L: 4690 D: 16842
sprt @ 10+0.1 th 1 are other checks even useful ?
31-12-16 lb otherChecks diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 431 W: 52 L: 185 D: 194
sprt @ 10+0.1 th 1 treat other checks consistently: take 2