Stockfish Testing Queue

Pending - 0 tests 0.0 hrs

None

Active - 0 tests

Finished - 286 tests

25-03-17 II rook_mobility diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 27386 W: 4961 L: 5024 D: 17401
sprt @ 10+0.1 th 1 RookMobility after further tuning.
23-03-17 II rook_mobility diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 77998 W: 10097 L: 10042 D: 57859
sprt @ 60+0.6 th 1 LTC. Rook mobility. Tuned values.
23-03-17 II pawn_seed diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 67689 W: 12068 L: 11996 D: 43625
sprt @ 10+0.1 th 1 Take 2. Some of the tuned values (fixed).
23-03-17 II pawn_seed diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 14153 W: 2418 L: 2526 D: 9209
sprt @ 10+0.1 th 1 Pawn seed parameter tweak. Tuned values.
19-03-17 II rook_mobility diff
LLR: 2.95 (-2.94,2.94) [0.00,4.00]
Total: 211425 W: 38262 L: 37391 D: 135772
sprt @ 10+0.1 th 1 Rook mobility. Tuned values.
22-03-17 II tune_seed diff
35375/40000 iterations
72108/80000 games played
80000 @ 10+0.1 th 1 Tune pawn seed with pretty high ck values and Rk=0.0005.
20-03-17 II tune_eval diff
19773/20000 iterations
40000/40000 games played
40000 @ 10+0.1 th 1 Continue tuning RookMobility, now with ck=30, Rk=0.0005.
17-03-17 II tune_eval diff
37433/40000 iterations
78012/80000 games played
80000 @ 10+0.1 th 1 Tune evaluation #3. As the first two tunings were neutral, try higher ck values but compensate with a lower Rk (ck=20, Rk=0.0005).
18-03-17 II delta diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 26155 W: 4531 L: 4600 D: 17024
sprt @ 10+0.1 th 1 Can we increase delta further?
15-03-17 II bishop_mobility diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 32220 W: 5606 L: 5654 D: 20960
sprt @ 10+0.1 th 1 Bishop MobilityBonus. The tuning patch was stopped by one worker, but let's check the values after 52K games.
13-03-17 II tune_eval diff
25908/40000 iterations
52690/80000 games played
80000 @ 10+0.1 th 1 Tune evaluation #2. 20% lower ck values this time.
13-03-17 II knight_mobility diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 44827 W: 7872 L: 7878 D: 29077
sprt @ 10+0.1 th 1 Knight mobility bonus. Tuned values.
12-03-17 II tune_eval diff
36912/40000 iterations
79526/80000 games played
80000 @ 10+0.1 th 1 Tune evaluation #1. I'll try to retune the whole evaluation function, tuning 8-32 parameters in one session. Throughput 400.
11-03-17 II outpost diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 25668 W: 4567 L: 4636 D: 16465
sprt @ 10+0.1 th 1 Tuned outpost values.
11-03-17 II nmp diff
LLR: -3.18 (-2.94,2.94) [0.00,5.00]
Total: 9097 W: 1562 L: 1658 D: 5877
sprt @ 10+0.1 th 1 NullMove Search. Tuned values. Last try.
10-03-17 II tune_nmp diff
29150/30000 iterations
59035/60000 games played
60000 @ 10+0.1 th 1 Trying to tune NullMove search, following ElbertoOne's idea.
04-03-17 II tune_outpost diff
58622/60000 iterations
119450/120000 games played
120000 @ 10+0.1 th 1 Fine tuning outpost values, checking theory obtained from the built-in simulator.
10-03-17 II nmp diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 14915 W: 2627 L: 2689 D: 9599
sprt @ 10+0.1 th 1 NullMove tweak; testing for a good starting point for tuning.
10-03-17 II tune_nmp diff
475/20000 iterations
994/40000 games played
40000 @ 10+0.1 th 1 Trying to tune NullMove search, following ElbertoOne's idea.
29-01-17 II update diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 4538 W: 747 L: 912 D: 2879
sprt @ 10+0.1 th 1 Stats simplification, take 2.
26-01-17 II tmm_simple diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 6025 W: 995 L: 1161 D: 3869
sprt @ 10+0.1 th 1 SCSPSA Test #2 - time management simplification.
26-01-17 II psqt diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 21577 W: 3891 L: 3974 D: 13712
sprt @ 10+0.1 th 1 SCSPSA Test #1 - king PSQT.
22-01-17 II update diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 3511 W: 548 L: 710 D: 2253
sprt @ 10+0.1 th 1 Stats simplification (joint idea by Stefan Geschwentner and Ivan Ivec). First untuned guess.
18-01-17 II update diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 3543 W: 596 L: 706 D: 2241
sprt @ 10+0.1 th 1 SF statistics, take 1.
17-01-17 II lazy_winning diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 12259 W: 2171 L: 2244 D: 7844
sprt @ 10+0.1 th 1 Use lazy evaluation only when winning.
17-01-17 II lazy_losing diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 7574 W: 1321 L: 1414 D: 4839
sprt @ 10+0.1 th 1 Using lazy evaluation only when losing.
11-01-17 II c_lazy_eval diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 3844 W: 591 L: 753 D: 2500
sprt @ 10+0.1 th 1 Apply lazy eval globally. Take 2.
11-01-17 II c_lazy_eval diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 190 W: 2 L: 174 D: 14
sprt @ 10+0.1 th 1 Continuous and simple lazy eval. Take 1.
03-12-16 II fmc diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 99435 W: 17212 L: 17533 D: 64690
sprt @ 10+0.1 th 1 Move count pruning, take 2.
06-12-16 II fmc diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 1934 W: 281 L: 441 D: 1212
sprt @ 10+0.1 th 1 Last try on this.
05-12-16 II fmc diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 15637 W: 2608 L: 2789 D: 10240
sprt @ 10+0.1 th 1 Tuned values.
04-12-16 II fmc_tune diff
27857/30000 iterations
56437/60000 games played
60000 @ 10+0.1 th 1 My local test at 160+1.6 shows that a linear fit is not a bad idea. So, trying to tune FutilityMoveCounts in a simple form.
04-12-16 II fmc diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 18125 W: 3103 L: 3289 D: 11733
sprt @ 10+0.1 th 1 Take 3. As the previous test is struggling, I lowered the priority, hoping for a better test.
18-11-16 II fmc diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 16470 W: 2894 L: 2995 D: 10581
sprt @ 10+0.1 th 1 Increasing FutilityMoveCounts by 50%.
29-10-16 II probcut_margin diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 13332 W: 2310 L: 2421 D: 8601
sprt @ 10+0.1 th 1 Take 3.
29-10-16 II probcut_margin diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 14731 W: 2582 L: 2688 D: 9461
sprt @ 10+0.1 th 1 Take 2.
29-10-16 II probcut_margin diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 14963 W: 2556 L: 2619 D: 9788
sprt @ 10+0.1 th 1 ProbCut margin. Test against passed_pawns patch.
15-10-16 II not_imp_red diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 29012 W: 5195 L: 5197 D: 18620
sprt @ 10+0.1 th 1 Reductions when not improving. This idea doesn't seem very Elo-sensitive, but I'll give it one more try anyway.
12-10-16 II not_imp_red diff
LLR: -3.40 (-2.94,2.94) [0.00,5.00]
Total: 59831 W: 10559 L: 10453 D: 38819
sprt @ 10+0.1 th 1 Reductions if not improving.
07-10-16 II piece_values diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 2928 W: 512 L: 661 D: 1755
sprt @ 10+0.1 th 1 I read in a scientific paper that the usage of rooks and bishops in chess games has increased steadily over the last 100 years. So, trying to increase the values for bishops and rooks and decrease the values for knights and queens.
07-10-16 II tmm_simple diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 11269 W: 1892 L: 2066 D: 7311
sprt @ 10+0.1 th 1 One more attempt with time management, after extensive local tuning.
08-09-16 II SMP diff
LLR: -0.12 (-2.94,2.94) [0.00,4.00]
Total: 3000 W: 493 L: 490 D: 2017
sprt @ 5+0.05 th 7 I would like to rewrite the Half/HighDensity matrix in a compact form suitable for use with an 'infinite' number of threads, but first I would like to try one more variant.
27-08-16 II deep_search diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 6465 W: 1165 L: 1263 D: 4037
sprt @ 10+0.1 th 1 Take 3.
26-08-16 II deep_search diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 10531 W: 1877 L: 1957 D: 6697
sprt @ 10+0.1 th 1 As the tuning has its limitations, and the test is pretty neutral at STC, I'll take a few more tries here.
26-08-16 II deep_search diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 28147 W: 5200 L: 5204 D: 17743
sprt @ 10+0.1 th 1 Tuned values.
25-08-16 II deep_search_tune diff
14700/15000 iterations
30000/30000 games played
30000 @ 10+0.1 th 1 Two parameters require further tuning with lower ck values.
24-08-16 II deep_search_tune diff
28697/30000 iterations
58105/60000 games played
60000 @ 10+0.1 th 1 This patch solves some difficult and mating positions several times faster than master. So, trying to tune and test in the framework.
12-08-16 II rhist diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 4277 W: 746 L: 854 D: 2677
sprt @ 10+0.1 th 1 It's fascinating to see the high Elo sensitivity of history reductions. Probably the last try.
12-08-16 II rhist diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 4748 W: 832 L: 937 D: 2979
sprt @ 10+0.1 th 1 Depth dependent rHist, trying the opposite.
12-08-16 II rhist diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 6694 W: 1220 L: 1317 D: 4157
sprt @ 10+0.1 th 1 Trying depth dependent history reductions.
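
The SPRT records in this list each give an LLR, its stopping bounds, an Elo interval in square brackets, and W/L/D totals. As a rough aid for reading them, here is a minimal Python sketch. It assumes the (-2.94,2.94) bounds come from a standard Wald SPRT with alpha = beta = 0.05 (the page does not state this, but the numbers match) and it uses the usual logistic Elo formula for the point estimate. The function names are hypothetical, and this is not the framework's own code.

```python
import math


def sprt_bounds(alpha=0.05, beta=0.05):
    """Wald SPRT stopping bounds for the log-likelihood ratio.

    alpha and beta are assumed error rates; 0.05 each reproduces
    the (-2.94, 2.94) bounds shown in the records.
    """
    lower = math.log(beta / (1 - alpha))
    upper = math.log((1 - beta) / alpha)
    return lower, upper


def elo_from_wld(wins, losses, draws):
    """Point estimate of the Elo difference from game totals.

    Uses the logistic model: score = 1 / (1 + 10 ** (-elo / 400)).
    """
    games = wins + losses + draws
    score = (wins + 0.5 * draws) / games
    return -400 * math.log10(1 / score - 1)


if __name__ == "__main__":
    lower, upper = sprt_bounds()
    print(f"bounds: ({lower:.2f}, {upper:.2f})")
    # Totals from the 19-03-17 rook_mobility record (the one with LLR 2.95).
    print(f"elo: {elo_from_wld(38262, 37391, 135772):+.2f}")
```

A test stops as soon as the LLR leaves the interval between the two bounds: crossing the upper bound accepts the patch, crossing the lower bound rejects it. The point estimate ignores the error bars, so for short tests it should be read with caution.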