Stockfish Testing Queue

Pending - 0 tests 0.0 hrs

None

Active - 1 tests

23-06-17 jo aspiration_full diff
LLR: 0.24 (-2.94,2.94) [0.00,4.00]
Total: 57512 W: 10298 L: 10092 D: 37122
sprt @ 10+0.1 th 1 Aspiration change. (Still ok to test as parameter tweak?!)

Finished - 923 tests

19-06-17 jo statsFix diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 5410 W: 959 L: 1062 D: 3389
sprt @ 10+0.1 th 1 Reset stat scores for TT move and captures and also decrease reduction for good moves in these cases.Take 2.
19-06-17 jo qimbalance diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 15608 W: 2810 L: 2869 D: 9929
sprt @ 10+0.1 th 1 Retest queen vs 3 minors imbalance tweak.
19-06-17 jo statsFix diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 6478 W: 1129 L: 1227 D: 4122
sprt @ 10+0.1 th 1 Reset stat scores for TT move and captures and also further increase reduction for bad moves in these cases.
17-06-17 jo statsT diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 29004 W: 5297 L: 5298 D: 18409
sprt @ 10+0.1 th 1 Exclude promotions as previous quiet ttMove.
10-06-17 jo null_tweak2 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 31989 W: 5864 L: 5852 D: 20273
sprt @ 10+0.1 th 1 Only allow consecutive null moves if the static eval of the last null move was high enough. Take 2.
10-06-17 jo null_tweak2 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 8375 W: 1525 L: 1615 D: 5235
sprt @ 10+0.1 th 1 Don't allow immediate consecutive null moves. Take 1. Idea is that 1 null move might work but 2 are already too generous.
30-05-17 jo measure_endgames diff
ELO: -5.25 +-2.9 (95%) LOS: 0.0%
Total: 20000 W: 3522 L: 3824 D: 12654
20000 @ 10+0.1 th 1 Endgame experiment. A quick measure of most of the endgames deleted. (Half throughput)
23-05-17 jo imbalance_depth1 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 4727 W: 885 L: 1028 D: 2814
sprt @ 10+0.1 th 1 Another tuning experiment.
20-05-17 jo lmr_threshold1 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 13776 W: 2445 L: 2512 D: 8819
sprt @ 10+0.1 th 1 LMR move count threshold tweak.
20-05-17 jo outpost_depth1 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 24200 W: 4360 L: 4434 D: 15406
sprt @ 10+0.1 th 1 Almost passed attempt plus changed Lever values.
12-05-17 jo lazy_skip2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 2416 W: 345 L: 458 D: 1613
sprt @ 5+0.05 th 7 Two ideas combined.
08-05-17 jo no_protector diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 46587 W: 8428 L: 8665 D: 29494
sprt @ 10+0.1 th 1 Simplify away Protector eval and try to compensate by modifying piece values. Take 1.
03-05-17 jo issue502 diff
LLR: 2.95 (-2.94,2.94) [-4.00,0.00]
Total: 41443 W: 5339 L: 5299 D: 30805
sprt @ 60+0.6 th 1 Make sure the final solution for issue #502 doesn't regress! See PR #1074
26-04-17 jo outpost_depth1 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 17355 W: 3060 L: 3158 D: 11137
sprt @ 10+0.1 th 1 Now with some more tuning on pawns, each of them with not less than 1 million games.
23-04-17 jo lazy_skip2 diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 20272 W: 3171 L: 3215 D: 13886
sprt @ 5+0.05 th 7 Also skip lower depths than the main thread already finished searching. Seems to be a non-issue at shallow depths.
22-04-17 jo lazy_random diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 7735 W: 1168 L: 1260 D: 5307
sprt @ 5+0.05 th 7 Now correctly search all shuffled root moves, but only for the first iteration. The effect is that more nodes are being searched. Whether this is good or bad ... ?
16-04-17 jo lazy_random diff
LLR: -1.72 (-2.94,2.94) [0.00,5.00]
Total: 32000 W: 5176 L: 5123 D: 21701
sprt @ 5+0.05 th 7 Does it help to shuffle the root moves of the helper threads?
12-04-17 jo outpost_depth1 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 270128 W: 49716 L: 48945 D: 171467
sprt @ 10+0.1 th 1 Final try with some more tuned values.
13-04-17 jo lazyOddEven diff
ELO: -8.57 +-6.8 (95%) LOS: 0.7%
Total: 3000 W: 419 L: 493 D: 2088
3000 @ 5+0.05 th 11 Try a odd-even distribution of the root moves for higher threads. (Compare with tries of St├ęphane.)
10-04-17 jo sf2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 3460 W: 565 L: 675 D: 2220
sprt @ 10+0.1 th 1 Scale more. Take 2.
10-04-17 jo sf2 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 16177 W: 2962 L: 3018 D: 10197
sprt @ 10+0.1 th 1 Endgames with still lots of pawns tend to be drawish. Take 1.
07-04-17 jo outpost_depth1 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 31397 W: 5653 L: 5703 D: 20041
sprt @ 10+0.1 th 1 Result of a tuning at fixed depth 1 with a much narrower range.
06-04-17 jo outpost_depth1 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 7283 W: 1290 L: 1422 D: 4571
sprt @ 10+0.1 th 1 A last fixed depth 1 tuning experiment with values after more than 1.2 million games played. (most likely still not enough!)
05-04-17 jo threats_tuning2 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 40621 W: 7317 L: 7335 D: 25969
sprt @ 10+0.1 th 1 Take 2 with updated values.
03-04-17 jo singular2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 19798 W: 3623 L: 3663 D: 12512
sprt @ 10+0.1 th 1 Tweak singular extension node.
02-04-17 jo imbalance1 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 5160 W: 897 L: 1001 D: 3262
sprt @ 10+0.1 th 1 Check values before a possible tuning try in the framework.
01-04-17 jo issue760 diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 27071 W: 3502 L: 3390 D: 20179
sprt @ 60+0.6 th 1 Since the framework is almost idle, run a non-regression test at LTC for fixing issue #760. (STC test: http://tests.stockfishchess.org/tests/view/5846c4dc0ebc5903140c5780)
30-03-17 jo tte_depth diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 9383 W: 1645 L: 1730 D: 6008
sprt @ 10+0.1 th 1 Another TT experiment. The difference in draft should not exceed a certain limit.
30-03-17 jo threats_tuning2 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 23518 W: 4235 L: 4311 D: 14972
sprt @ 10+0.1 th 1 Check some early ThreatByKing values.
30-03-17 jo tte_depth diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 11064 W: 1947 L: 2025 D: 7092
sprt @ 10+0.1 th 1 TT entries need to have a minimum draft. (Test against passed excludedMove patch by VoyagerOne.)
29-03-17 jo spt1 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 6160 W: 1046 L: 1145 D: 3969
sprt @ 10+0.1 th 1 Less pruning towards endgame. Take 1.
28-03-17 jo imbalance1 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 13028 W: 2351 L: 2421 D: 8256
sprt @ 10+0.1 th 1 Now with rook values.
24-03-17 jo simplify_threat diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 42557 W: 5634 L: 5542 D: 31381
sprt @ 60+0.6 th 1 LTC: Since the tuning was pretty neutral, let's see if we can simplify the scoring of ThreatBySafePawn.
24-03-17 jo imbalance1 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 28483 W: 5183 L: 5187 D: 18113
sprt @ 10+0.1 th 1 Now with bishop values.
23-03-17 jo simplify_threat diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 18895 W: 3361 L: 3237 D: 12297
sprt @ 10+0.1 th 1 Since the tuning was pretty neutral, let's see if we can simplify the scoring of ThreatBySafePawn.
23-03-17 jo threats_tuning1 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 42171 W: 7568 L: 7581 D: 27022
sprt @ 10+0.1 th 1 Check some values for ThreatBySafePawn.
22-03-17 jo imbalance1 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 24955 W: 4524 L: 4543 D: 15888
sprt @ 10+0.1 th 1 Now with knight values added.
18-03-17 jo imbalance1 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 17648 W: 3129 L: 3180 D: 11339
sprt @ 10+0.1 th 1 Re-add Linear imbalance table.
17-03-17 jo no_pruning_when_mate diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 29844 W: 5335 L: 5334 D: 19175
sprt @ 10+0.1 th 1 Final take. (beta < VALUE_MATE_IN_MAX_PLY)
17-03-17 jo no_pruning_when_mate diff
LLR: -0.27 (-2.94,2.94) [0.00,5.00]
Total: 2263 W: 405 L: 407 D: 1451
sprt @ 10+0.1 th 1 Besides bestValue, make sure alpha is greater than mated score, too. Take 2. (Although bench is the same it's a functional change.)
17-03-17 jo no_pruning_when_mate diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 4295 W: 719 L: 826 D: 2750
sprt @ 10+0.1 th 1 Helps resolving mates. Does it eventually have a positive effect, in general?
17-03-17 jo BishopSet diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 26873 W: 4871 L: 4882 D: 17120
sprt @ 10+0.1 th 1 Fixed version.
16-03-17 jo BishopSet diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 1679 W: 286 L: 407 D: 986
sprt @ 10+0.1 th 1 Try the bishop set. Last try on this. Tuning hints to re-adding the formerly LinearCoefficient might be useful.
14-03-17 jo nmp_exp diff
ELO: 0.16 +-2.1 (95%) LOS: 55.8%
Total: 20000 W: 1912 L: 1903 D: 16185
20000 @ 60+0.6 th 1 Add some preconditions and be more cautious when to allow null-move pruning. Check for elo-gain/-loss at LTC. (Test with 8-moves book and lower throughput.)
15-03-17 jo KnightSet2 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 5396 W: 945 L: 1048 D: 3403
sprt @ 10+0.1 th 1 Try the same for knights. (Test against passed PawnsSet.)
13-03-17 jo outpost_new diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 52199 W: 9311 L: 9291 D: 33597
sprt @ 10+0.1 th 1 Check some locally tuned values.
10-03-17 jo PawnsSet diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 100478 W: 13055 L: 12615 D: 74808
sprt @ 60+0.6 th 1 LTC: Final values and last try.
10-03-17 jo PawnsSet diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 16155 W: 2980 L: 2787 D: 10388
sprt @ 10+0.1 th 1 Final values and last try.
08-03-17 jo PawnsSet diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 62328 W: 11045 L: 10909 D: 40374
sprt @ 10+0.1 th 1 Another try with some handmade values.
07-03-17 jo PawnsSet diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 10153 W: 1763 L: 1845 D: 6545
sprt @ 10+0.1 th 1 Now with some quickly tuned values. Take 2.