Stockfish Testing Queue

Pending - 0 tests 0.0 hrs

None

Active - 0 tests

Finished - 910 tests

26-04-17 jo outpost_depth1 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 17355 W: 3060 L: 3158 D: 11137
sprt @ 10+0.1 th 1 Now with some more tuning on pawns, each of them with not less than 1 million games.
23-04-17 jo lazy_skip2 diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 20272 W: 3171 L: 3215 D: 13886
sprt @ 5+0.05 th 7 Also skip lower depths than the main thread already finished searching. Seems to be a non-issue at shallow depths.
22-04-17 jo lazy_random diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 7735 W: 1168 L: 1260 D: 5307
sprt @ 5+0.05 th 7 Now correctly search all shuffled root moves, but only for the first iteration. The effect is that more nodes are being searched. Whether this is good or bad ... ?
16-04-17 jo lazy_random diff
LLR: -1.72 (-2.94,2.94) [0.00,5.00]
Total: 32000 W: 5176 L: 5123 D: 21701
sprt @ 5+0.05 th 7 Does it help to shuffle the root moves of the helper threads?
12-04-17 jo outpost_depth1 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 270128 W: 49716 L: 48945 D: 171467
sprt @ 10+0.1 th 1 Final try with some more tuned values.
13-04-17 jo lazyOddEven diff
ELO: -8.57 +-6.8 (95%) LOS: 0.7%
Total: 3000 W: 419 L: 493 D: 2088
3000 @ 5+0.05 th 11 Try a odd-even distribution of the root moves for higher threads. (Compare with tries of Stéphane.)
10-04-17 jo sf2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 3460 W: 565 L: 675 D: 2220
sprt @ 10+0.1 th 1 Scale more. Take 2.
10-04-17 jo sf2 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 16177 W: 2962 L: 3018 D: 10197
sprt @ 10+0.1 th 1 Endgames with still lots of pawns tend to be drawish. Take 1.
07-04-17 jo outpost_depth1 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 31397 W: 5653 L: 5703 D: 20041
sprt @ 10+0.1 th 1 Result of a tuning at fixed depth 1 with a much narrower range.
06-04-17 jo outpost_depth1 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 7283 W: 1290 L: 1422 D: 4571
sprt @ 10+0.1 th 1 A last fixed depth 1 tuning experiment with values after more than 1.2 million games played. (most likely still not enough!)
05-04-17 jo threats_tuning2 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 40621 W: 7317 L: 7335 D: 25969
sprt @ 10+0.1 th 1 Take 2 with updated values.
03-04-17 jo singular2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 19798 W: 3623 L: 3663 D: 12512
sprt @ 10+0.1 th 1 Tweak singular extension node.
02-04-17 jo imbalance1 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 5160 W: 897 L: 1001 D: 3262
sprt @ 10+0.1 th 1 Check values before a possible tuning try in the framework.
01-04-17 jo issue760 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 27071 W: 3502 L: 3390 D: 20179
sprt @ 60+0.6 th 1 Since the framework is almost idle, run a non-regression test at LTC for fixing issue #760. (STC test: http://tests.stockfishchess.org/tests/view/5846c4dc0ebc5903140c5780)
30-03-17 jo tte_depth diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 9383 W: 1645 L: 1730 D: 6008
sprt @ 10+0.1 th 1 Another TT experiment. The difference in draft should not exceed a certain limit.
30-03-17 jo threats_tuning2 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 23518 W: 4235 L: 4311 D: 14972
sprt @ 10+0.1 th 1 Check some early ThreatByKing values.
30-03-17 jo tte_depth diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 11064 W: 1947 L: 2025 D: 7092
sprt @ 10+0.1 th 1 TT entries need to have a minimum draft. (Test against passed excludedMove patch by VoyagerOne.)
29-03-17 jo spt1 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 6160 W: 1046 L: 1145 D: 3969
sprt @ 10+0.1 th 1 Less pruning towards endgame. Take 1.
28-03-17 jo imbalance1 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 13028 W: 2351 L: 2421 D: 8256
sprt @ 10+0.1 th 1 Now with rook values.
24-03-17 jo simplify_threat diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 42557 W: 5634 L: 5542 D: 31381
sprt @ 60+0.6 th 1 LTC: Since the tuning was pretty neutral, let's see if we can simplify the scoring of ThreatBySafePawn.
24-03-17 jo imbalance1 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 28483 W: 5183 L: 5187 D: 18113
sprt @ 10+0.1 th 1 Now with bishop values.
23-03-17 jo simplify_threat diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 18895 W: 3361 L: 3237 D: 12297
sprt @ 10+0.1 th 1 Since the tuning was pretty neutral, let's see if we can simplify the scoring of ThreatBySafePawn.
23-03-17 jo threats_tuning1 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 42171 W: 7568 L: 7581 D: 27022
sprt @ 10+0.1 th 1 Check some values for ThreatBySafePawn.
22-03-17 jo imbalance1 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 24955 W: 4524 L: 4543 D: 15888
sprt @ 10+0.1 th 1 Now with knight values added.
18-03-17 jo imbalance1 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 17648 W: 3129 L: 3180 D: 11339
sprt @ 10+0.1 th 1 Re-add Linear imbalance table.
17-03-17 jo no_pruning_when_mate diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 29844 W: 5335 L: 5334 D: 19175
sprt @ 10+0.1 th 1 Final take. (beta < VALUE_MATE_IN_MAX_PLY)
17-03-17 jo no_pruning_when_mate diff
LLR: -0.27 (-2.94,2.94) [0.00,5.00]
Total: 2263 W: 405 L: 407 D: 1451
sprt @ 10+0.1 th 1 Besides bestValue, make sure alpha is greater than mated score, too. Take 2. (Although bench is the same it's a functional change.)
17-03-17 jo no_pruning_when_mate diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 4295 W: 719 L: 826 D: 2750
sprt @ 10+0.1 th 1 Helps resolving mates. Does it eventually have a positive effect, in general?
17-03-17 jo BishopSet diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 26873 W: 4871 L: 4882 D: 17120
sprt @ 10+0.1 th 1 Fixed version.
16-03-17 jo BishopSet diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 1679 W: 286 L: 407 D: 986
sprt @ 10+0.1 th 1 Try the bishop set. Last try on this. Tuning hints to re-adding the formerly LinearCoefficient might be useful.
14-03-17 jo nmp_exp diff
ELO: 0.16 +-2.1 (95%) LOS: 55.8%
Total: 20000 W: 1912 L: 1903 D: 16185
20000 @ 60+0.6 th 1 Add some preconditions and be more cautious when to allow null-move pruning. Check for elo-gain/-loss at LTC. (Test with 8-moves book and lower throughput.)
15-03-17 jo KnightSet2 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 5396 W: 945 L: 1048 D: 3403
sprt @ 10+0.1 th 1 Try the same for knights. (Test against passed PawnsSet.)
13-03-17 jo outpost_new diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 52199 W: 9311 L: 9291 D: 33597
sprt @ 10+0.1 th 1 Check some locally tuned values.
10-03-17 jo PawnsSet diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 100478 W: 13055 L: 12615 D: 74808
sprt @ 60+0.6 th 1 LTC: Final values and last try.
10-03-17 jo PawnsSet diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 16155 W: 2980 L: 2787 D: 10388
sprt @ 10+0.1 th 1 Final values and last try.
08-03-17 jo PawnsSet diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 62328 W: 11045 L: 10909 D: 40374
sprt @ 10+0.1 th 1 Another try with some handmade values.
07-03-17 jo PawnsSet diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 10153 W: 1763 L: 1845 D: 6545
sprt @ 10+0.1 th 1 Now with some quickly tuned values. Take 2.
05-03-17 jo PawnsSet diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 49136 W: 8656 L: 8576 D: 31904
sprt @ 10+0.1 th 1 Try GuardianRM's idea in imbalance calculation. Start with some guessed values.
05-03-17 jo aspiration_eg diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 8549 W: 1499 L: 1588 D: 5462
sprt @ 10+0.1 th 1 Open aspiration window when material gets low.
01-03-17 jo tune_outpost2 diff
58539/60000 iterations
119999/120000 games played
120000 @ 10+0.1 th 1 Retry the fresh tuning on the framework, STC, lower throughput. Let's see, if the increased ck values work in practice.
04-03-17 jo outpost_new diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 11150 W: 1950 L: 2069 D: 7131
sprt @ 10+0.1 th 1 Check the values from fishtest.
03-03-17 jo rook_bonus_new diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 26656 W: 4705 L: 4771 D: 17180
sprt @ 10+0.1 th 1 Also check new values for RookOnFile bonus.
28-02-17 jo outpost_new diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 20534 W: 3616 L: 3703 D: 13215
sprt @ 10+0.1 th 1 Values from a fresh SPSA tuning starting from zero.
28-02-17 jo pawn_push_lmr diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 11339 W: 1934 L: 2011 D: 7394
sprt @ 10+0.1 th 1 Less LMR for pawn pushes with low material.
24-02-17 jo initiative diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 2972 W: 492 L: 605 D: 1875
sprt @ 10+0.1 th 1 Initiative tweak.
17-02-17 jo matimb_fix diff
ELO: -3.66 +-2.4 (95%) LOS: 0.2%
Total: 30000 W: 5642 L: 5958 D: 18400
30000 @ 7+0.07 th 1 Ready for going sprt?
08-02-17 jo matimb_fix diff
ELO: -1.53 +-2.4 (95%) LOS: 11.0%
Total: 30000 W: 5752 L: 5884 D: 18364
30000 @ 7+0.07 th 1 Now with updated queen values. Interestingly, some of them show no real sign of convergence and may contribute nothing at all.)
04-02-17 jo space_eg diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 2221 W: 373 L: 525 D: 1323
sprt @ 10+0.1 th 1 Try an endgame bonus in space eval.
04-02-17 jo entry_points diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 5067 W: 841 L: 945 D: 3281
sprt @ 10+0.1 th 1 Try a variation of Stéphane's patch.
02-02-17 jo udm1 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 9071 W: 1539 L: 1626 D: 5906
sprt @ 10+0.1 th 1 My try on piece development. Take 1.