Stockfish Testing Queue

Finished - 27341 tests

04-10-15 sg new_history diff
LLR: -3.76 (-2.94,2.94) [0.00,5.00]
Total: 51839 W: 9511 L: 9448 D: 32880
sprt @ 15+0.05 th 1 First attempt was neutral. So double up weight at move ordering.
04-10-15 SC scale_factor_tunable diff
LLR: 2.96 (-2.94,2.94) [0.00,4.00]
Total: 36089 W: 5589 L: 5331 D: 25169
sprt @ 60+0.05 th 1 Values after 183k iterations. Let us see. LTC.
04-10-15 sn 4men_probe_in_qsearch diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 37998 W: 6911 L: 6874 D: 24213
sprt @ 15+0.05 th 1 4-Syzygy vs 4-Syzygy: test the effect of probing the 4 men tables in qsearch.
04-10-15 Vo YellowCombo diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 46036 W: 7046 L: 6756 D: 32234
sprt @ 60+0.05 th 1 LTC: http://tests.stockfishchess.org/tests/view/560c959f0ebc597e4f23e409 , http://tests.stockfishchess.org/tests/view/560a1ae60ebc597e4f23e36e
04-10-15 My AU diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 10038 W: 1733 L: 1816 D: 6489
sprt @ 15+0.05 th 1 Fixed bench
04-10-15 jo lazy_smp2 diff
LLR: -0.01 (-2.94,2.94) [-2.00,5.00]
Total: 148 W: 16 L: 16 D: 116
sprt @ 180+2 th 7 Lazy SMP. 7 Threads XLTC. This should already give a good hint about scalability. Resubmitted as sprt[-2, 5] test, so that more machines are able to participate. Test should stop if it turns out to be much weaker or stronger, otherwise we can stop after 5,000 or 10,000 games manually.
04-10-15 My QCC diff
LLR: -2.94 (-2.94,2.94) [0.00,5.00]
Total: 3160 W: 536 L: 648 D: 1976
sprt @ 15+0.05 th 1 Larger bonus for Queen contact checks where the King is on the edge of the board.
04-10-15 sn good_knight4 diff
LLR: -1.66 (-2.94,2.94) [0.00,5.00]
Total: 3823 W: 675 L: 729 D: 2419
sprt @ 15+0.05 th 1 Take 4, bonus=S(0,5)
04-10-15 Ro tune_check diff
19070/20000 iterations
40000/40000 games played
40000 @ 30+0.05 th 1 Tuning the new check bonus
04-10-15 sg new_history diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 36106 W: 6494 L: 6466 D: 23146
sprt @ 15+0.05 th 1 Introduce new history table based on from square.
04-10-15 Vo YellowCombo diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 21802 W: 4107 L: 3887 D: 13808
sprt @ 15+0.05 th 1 http://tests.stockfishchess.org/tests/view/560c959f0ebc597e4f23e409 , http://tests.stockfishchess.org/tests/view/560a1ae60ebc597e4f23e36e
04-10-15 sn bad_knight diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 3954 W: 661 L: 770 D: 2523
sprt @ 15+0.05 th 1 Bad knight
04-10-15 Ro UnprotectedPhalanx diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 11908 W: 2152 L: 2226 D: 7530
sprt @ 15+0.05 th 1 UP_20151003_1
03-10-15 Ro SafeSentry diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 14676 W: 2609 L: 2672 D: 9395
sprt @ 15+0.05 th 1 Fixed bench
03-10-15 Vo BalanceStatFA diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 11836 W: 2106 L: 2181 D: 7549
sprt @ 15+0.05 th 1 One last shot...prior test gave a good clue what's going on. I think this version will work.
03-10-15 Ro SemiBackward2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 5227 W: 908 L: 1011 D: 3308
sprt @ 15+0.05 th 1 Fixed array index
03-10-15 sg checked diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 63017 W: 11448 L: 11389 D: 40180
sprt @ 15+0.05 th 1 Ok the patch have an effect on endgame. Try now the opposite and double up endgame score.
03-10-15 Ro SemiBackward2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 5964 W: 1089 L: 1189 D: 3686
sprt @ 15+0.05 th 1 SB_20151003_1
03-10-15 II reduction_tune diff
9855/10000 iterations
20000/20000 games played
20000 @ 30+0.05 th 1 Tuning moves 7-12, session 1. Read more under comments.
03-10-15 SC scale_factor_tunable diff
LLR: 2.96 (-2.94,2.94) [0.00,4.00]
Total: 45401 W: 8590 L: 8274 D: 28537
sprt @ 15+0.05 th 1 Values after 183k iterations. Let us see.
03-10-15 sn second_push3 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 44242 W: 8155 L: 8159 D: 27928
sprt @ 15+0.05 th 1 Add some more weight to the proximity of our king to the squares in front of the passed pawn. Take 3, weight = 5/4
03-10-15 Vo BalanceStatFA diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 12566 W: 2234 L: 2306 D: 8026
sprt @ 15+0.05 th 1 STC: Final attempt at this.
03-10-15 mb lazy_smp2 diff
ELO: -0.00 +-54.9 (95%) LOS: 50.0%
Total: 25 W: 2 L: 2 D: 21
15000 @ 600+0.05 th 7 Lazy SMP. 7 Threads XLLTC.
03-10-15 sg checked diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 25362 W: 4527 L: 4597 D: 16238
sprt @ 15+0.05 th 1 Because safe checks already rewarded in middlegame clear the new checked bonus there
03-10-15 sg checked diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 15191 W: 2745 L: 2850 D: 9596
sprt @ 15+0.05 th 1 we discussed at the repo of the checked bonus patch how the endgame value effects the patch. Local testing seems to indicate that for the endgame the bonus is useless. So try a checked bonus of S(20,0)
03-10-15 jo lazy_smp diff
ELO: -9.17 +-5.5 (95%) LOS: 0.1%
Total: 5000 W: 756 L: 888 D: 3356
5000 @ 15+0.05 th 3 Are we now competitive with 3 Threads (-13 elo)? Further simplified the changing of the search depth of the helper threads, and also restored the old iterative deepening loop logic.
26-09-15 SC scale_factor_tuning diff
89027/100000 iterations
184197/200000 games played
200000 @ 60+0.05 th 1 As we have a lot of machines active, let me submit a very long LTC tuning session on rarely considered parameters, just in case we left some ELO lying around. Low throughput, such that it kicks in only if no Priority 0 stuff is waiting. Maybe we can get an answer before next TCEC stage.
02-10-15 II reduction_tune diff
9899/10000 iterations
20000/20000 games played
20000 @ 30+0.05 th 1 Tuning first 6 moves - session 3. Read more under comments.
02-10-15 Vo BalanceStat3b diff
LLR: -0.55 (-2.94,2.94) [0.00,5.00]
Total: 86539 W: 15900 L: 15553 D: 55086
sprt @ 15+0.05 th 1 STC: v.3b
30-09-15 sn king_separation3 diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 115087 W: 17578 L: 17034 D: 80475
sprt @ 60+0.05 th 1 Try king separation (take 4, weight = 8). Tested against master with Stefan's and Jonathan's patches: LTC
02-10-15 Vo BalanceStat4 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 18127 W: 2726 L: 2780 D: 12621
sprt @ 60+0.05 th 1 LTC: v.4
03-10-15 Vo BalanceStatLMR diff
LLR: -2.61 (-2.94,2.94) [0.00,5.00]
Total: 7854 W: 1399 L: 1476 D: 4979
sprt @ 15+0.05 th 1 stc
02-10-15 Vo BalanceStat2 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 35331 W: 5336 L: 5327 D: 24668
sprt @ 60+0.05 th 1 LTC: v.2
02-10-15 Vo BalanceStat4 diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 30755 W: 5726 L: 5469 D: 19560
sprt @ 15+0.05 th 1 STC: v.4
02-10-15 mb lazy_smp2 diff
ELO: 34.41 +-9.9 (95%) LOS: 100.0%
Total: 1094 W: 184 L: 76 D: 834
5000 @ 60+0.05 th 23 New version of Lazy SMP. 23 Threads. LTC, with more hash.
02-10-15 mb master_tlhcm diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 7577 W: 1084 L: 1177 D: 5316
sprt @ 15+0.05 th 7 Moves history and counter moves to Thread. I'd like to test with 7 threads because that's where we saw a improvement in lazy_smp vs master. I want to check if this change is what worked for lazy smp, and if it works also for YBWC.
01-10-15 Vo BalanceStat2 diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 73354 W: 13686 L: 13244 D: 46424
sprt @ 15+0.05 th 1 STC: v.2
02-10-15 Vo BalanceStat3 diff
LLR: -3.42 (-2.94,2.94) [0.00,5.00]
Total: 47464 W: 8659 L: 8601 D: 30204
sprt @ 15+0.05 th 1 STC: v.3
01-10-15 mb lazy_smp2 diff
ELO: 41.21 +-30.0 (95%) LOS: 99.7%
Total: 144 W: 29 L: 12 D: 103
5000 @ 60+0.05 th 15 New version of Lazy SMP. 15 Threads. LTC. Lower priority.
01-10-15 Ro EndgameAttackUnits diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 40494 W: 5730 L: 5766 D: 28998
sprt @ 15+0.05 th 1 16 ??
01-10-15 Vo BalanceStat diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 47876 W: 7208 L: 7153 D: 33515
sprt @ 60+0.05 th 1 LTC
01-10-15 Ro EndgameAttackUnits diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 43135 W: 6131 L: 6159 D: 30845
sprt @ 15+0.05 th 1 25
01-10-15 aj lazy_smp2 diff
ELO: -7.30 +-5.7 (95%) LOS: 0.6%
Total: 5000 W: 817 L: 922 D: 3261
5000 @ 15+0.05 th 2 New version of Lazy SMP. Measure impact at 2 threads
01-10-15 Ro EndgameAttackUnits diff
LLR: -2.94 (-2.94,2.94) [0.00,4.00]
Total: 34823 W: 4864 L: 4916 D: 25043
sprt @ 15+0.05 th 1 What happens if we change the endgame weight to 50 ?
01-10-15 sn h_file diff
LLR: -2.68 (-2.94,2.94) [0.00,4.00]
Total: 24746 W: 4502 L: 4559 D: 15685
sprt @ 15+0.05 th 1 Try to open A or H file on the opponent king, even at the cost of a doubled B or G pawn.
01-10-15 sn second_push2 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 13174 W: 2374 L: 2486 D: 8314
sprt @ 15+0.05 th 1 Add some more weight to the proximity of our king to the squares in front of the passed pawn. Take 2, weight = 7/4
01-10-15 Ro EndgameAttackUnits diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 5010 W: 612 L: 747 D: 3651
sprt @ 15+0.05 th 1 75
21-09-15 My RCC diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 61776 W: 9349 L: 9289 D: 43138
sprt @ 60+0.05 th 1 Retire rook contact checks, this time compensate by bumping rook checks.
30-09-15 Vo BalanceStat diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 10989 W: 2060 L: 1888 D: 7041
sprt @ 15+0.05 th 1 STC
29-09-15 sn king_separation3 diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 116153 W: 21616 L: 20990 D: 73547
sprt @ 15+0.05 th 1 Try king separation (take 4, weight = 8). Tested against master with Stefan's and Jonathan's patches.