Stockfish Testing Queue

Pending - 0 tests 0.0 hrs

None

Active - 0 tests

Finished - 387 tests

16-08-17 II pieceValues diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 8798 W: 1526 L: 1653 D: 5619
sprt @ 10+0.1 th 1 Take 3.
14-08-17 II pieceValues diff
LLR: -2.94 (-2.94,2.94) [0.00,4.00]
Total: 42322 W: 7623 L: 7635 D: 27064
sprt @ 10+0.1 th 1 Take 2.
14-08-17 II pieceValues diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 17242 W: 3118 L: 3216 D: 10908
sprt @ 10+0.1 th 1 I think that QueenValueMg=2400 is more than enough for VLTC. Try to pass STC.
14-08-17 II tmm_simple diff
ELO: -2.02 +-7.2 (95%) LOS: 29.2%
Total: 5000 W: 1391 L: 1420 D: 2189
5000 @ 1+0.01 th 1 tmm_simple vs tmm_simple to check the ratio of time losses at 1+0.01
14-08-17 II master diff
ELO: 1.95 +-7.7 (95%) LOS: 69.1%
Total: 5000 W: 1598 L: 1570 D: 1832
5000 @ 1+0.01 th 1 master vs master to check the ratio of time losses at 1+0.01
10-08-17 II STCfish diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 17623 W: 3123 L: 3174 D: 11326
sprt @ 10+0.1 th 1 I understand that this is not committable, but I collected some "good" patches from July and August, and I want to test, at low throughput, whether we left something for the garbage collectors.
05-08-17 II tmm_simple'' diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 92716 W: 16496 L: 16341 D: 59879
sprt @ 40/10+0.1 th 1 Check whether we have a good constant for increment usage in x/y+z TC, take 1.
08-08-17 II extra_queens diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 25966 W: 4698 L: 4713 D: 16555
sprt @ 10+0.1 th 1 Take 4: try this idea in the case of more than one queen per side.
07-08-17 II extra_queens diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 26654 W: 4786 L: 4798 D: 17070
sprt @ 10+0.1 th 1 Take 3: compensate with higher queen values
07-08-17 II extra_queens diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 17820 W: 3139 L: 3189 D: 11492
sprt @ 10+0.1 th 1 Take 2: more penalty
07-08-17 II extra_queens diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 14041 W: 2459 L: 2525 D: 9057
sprt @ 10+0.1 th 1 Penalty for extra queens based on number of pawns
06-08-17 II tmm_simple'' diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 35388 W: 6235 L: 6272 D: 22881
sprt @ 40/10+0.1 th 1 Take 1 is struggling, try also this value.
05-08-17 II tmm_simple2 diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 31097 W: 5813 L: 6027 D: 19257
sprt @ 40/10 th 1 Can this continuous version be even better?
05-08-17 II tmm_simple' diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 2559 W: 502 L: 368 D: 1689
sprt @ 80/20 th 1 This should be a significant improvement in x/y time controls when x is huge. Test with x=80. It's a non-functional change for x<=50.
05-08-17 II tmm_simple2 diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 29367 W: 5363 L: 5572 D: 18432
sprt @ 40/10 th 1 Take 2 - do we really need that else part - test on top of PR #1192.
04-08-17 II tmm_simple2 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 25530 W: 4748 L: 4816 D: 15966
sprt @ 40/10 th 1 Trying to improve the movestogo case, take 1, test on top of PR #1192.
03-08-17 II tmm_simple diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 5913 W: 1217 L: 1069 D: 3627
sprt @ 15+0 th 1 For completeness, let's also see sudden death performance.
02-08-17 II tmm_simple diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 75356 W: 9690 L: 9640 D: 56026
sprt @ 60+0.6 th 1 LTC: Take 2: as some unstable machines are losing on time, try increasing Move Overhead. The effect should be negligible at higher time controls, but I'm not sure about STC.
02-08-17 II tmm_simple diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 25363 W: 4658 L: 4545 D: 16160
sprt @ 10+0.1 th 1 Take 2: as some unstable machines are losing on time, try increasing Move Overhead. The effect should be negligible at higher time controls, but I'm not sure about STC.
01-08-17 II tmm_simple diff
LLR: -0.04 (-2.94,2.94) [-3.00,1.00]
Total: 58345 W: 10571 L: 10673 D: 37101
sprt @ 10+0.1 th 1 One more try with time management, now only with linear and quadratic functions. Throughput 500.
01-08-17 II tmm_simple diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 19377 W: 3650 L: 3526 D: 12201
sprt @ 40/10 th 1 Test also movestogo case.
26-07-17 II knight_psqt diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 45602 W: 8180 L: 8182 D: 29240
sprt @ 10+0.1 th 1 Local tuning experiment (SCSPSA, 320K games, 2+0.02)
26-07-17 II knight_psqt diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 33809 W: 6013 L: 6055 D: 21741
sprt @ 10+0.1 th 1 Take 2, keeping old values for A1/H1
23-07-17 II assorted diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 18631 W: 3296 L: 3389 D: 11946
sprt @ 10+0.1 th 1 I'm locally tuning some values - first check.
21-07-17 II bpair diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 21689 W: 3866 L: 3949 D: 13874
sprt @ 10+0.1 th 1 Test bishop pair bonus.
16-07-17 II imbalance2 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 30420 W: 5501 L: 5554 D: 19365
sprt @ 10+0.1 th 1 PawnSet take 2.
15-07-17 II imbalance2 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 45357 W: 8067 L: 8070 D: 29220
sprt @ 10+0.1 th 1 Testing PawnSet part of imbalance tuning - throughput 400.
15-07-17 II imbalance3 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 34643 W: 6151 L: 6190 D: 22302
sprt @ 10+0.1 th 1 Testing QMI part of imbalance tuning - throughput 400.
15-07-17 II imbalance1 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 27223 W: 4823 L: 4887 D: 17513
sprt @ 10+0.1 th 1 Testing Quadratic part of imbalance tuning - throughput 400.
10-07-17 II imbalance_tune diff
57720/60000 iterations
119548/120000 games played
120000 @ 10+0.1 th 1 Tuning imbalance, taking care of the resolution (fix = reducing ck value for bishop pair)
12-07-17 II imbalance diff
LLR: -3.39 (-2.94,2.94) [0.00,4.00]
Total: 45250 W: 8100 L: 8126 D: 29024
sprt @ 10+0.1 th 1 My imbalance tuning is experimental, with high ck values, so I am trying the values from halfway through the tuning to see which direction this is going.
10-07-17 II imbalance_tune diff
279/60000 iterations
676/120000 games played
120000 @ 10+0.1 th 1 Tuning imbalance, taking care of the resolution.
08-07-17 II quadratic diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 19516 W: 3516 L: 3558 D: 12442
sprt @ 10+0.1 th 1 More quadratic tables - tuned values.
07-07-17 II more_queens diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 21124 W: 2757 L: 2806 D: 15561
sprt @ 60+0.6 th 1 LTC: Dealing with more queens.
02-07-17 II quadratic_tune diff
57965/60000 iterations
119351/120000 games played
120000 @ 10+0.1 th 1 Tune Quadratic tables in one, two, more system
06-07-17 II more_queens diff
LLR: 2.95 (-2.94,2.94) [0.00,5.00]
Total: 38511 W: 7002 L: 6715 D: 24794
sprt @ 10+0.1 th 1 Dealing with more queens.
02-07-17 II queen_tune diff
39036/40000 iterations
80000/80000 games played
80000 @ 10+0.1 th 1 Less ambitious tuning - tuning only extra queen.
05-07-17 II eval diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 40535 W: 7235 L: 7254 D: 26046
sprt @ 10+0.1 th 1 Tuned values.
30-06-17 II tune_eval diff
58697/60000 iterations
119617/120000 games played
120000 @ 10+0.1 th 1 Go on with eval tuning...
30-06-17 II eval diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 119384 W: 15556 L: 15393 D: 88435
sprt @ 60+0.6 th 1 LTC: Tuned values.
29-06-17 II eval diff
LLR: 2.95 (-2.94,2.94) [0.00,4.00]
Total: 65725 W: 11915 L: 11537 D: 42273
sprt @ 10+0.1 th 1 Tuned values.
28-06-17 II tune_eval diff
54672/60000 iterations
118930/120000 games played
120000 @ 10+0.1 th 1 A new try on tuning evaluation parameters.
12-06-17 II nullmove diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 6102 W: 1066 L: 1165 D: 3871
sprt @ 10+0.1 th 1 Take 2 with new structure.
12-06-17 II nullmove diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 4246 W: 729 L: 837 D: 2680
sprt @ 10+0.1 th 1 Log null move - few more tries with new structure
09-06-17 II nullmove1 diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 11852 W: 2089 L: 2164 D: 7599
sprt @ 10+0.1 th 1 Test null move ideas separately 1/2
09-06-17 II nullmove2 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 10466 W: 1856 L: 1977 D: 6633
sprt @ 10+0.1 th 1 Test null move ideas separately 2/2
09-06-17 II nullmove diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 5520 W: 916 L: 1018 D: 3586
sprt @ 10+0.1 th 1 Log null move take 6
09-06-17 II nullmove diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 4759 W: 779 L: 884 D: 3096
sprt @ 10+0.1 th 1 Log null move - take 5
08-06-17 II nullmove diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 11286 W: 2024 L: 2101 D: 7161
sprt @ 10+0.1 th 1 Log null move - take 4
08-06-17 II nullmove diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 8157 W: 1488 L: 1579 D: 5090
sprt @ 10+0.1 th 1 Log null move - take 3 (for me this works well on longer time controls, trying to get something on STC too).
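
For readers who want to sanity-check the ELO, LOS and LLR figures listed above, here is a minimal Python sketch that recomputes them from the W/L/D counts. It is not taken from the testing framework's source. It assumes the ELO line uses the standard logistic formula, that LOS uses the usual normal approximation on wins and losses, and that the LLR is a trinomial log-likelihood ratio under a BayesElo draw model with the draw parameter estimated from the sample. The function names (elo_and_los, sprt_bounds, bayeselo_to_probs, sprt_llr) are hypothetical, and the bounds (-2.94,2.94) are assumed to correspond to alpha = beta = 0.05.

```python
import math


def elo_and_los(wins, losses, draws):
    """Logistic Elo estimate and likelihood of superiority from W/L/D counts."""
    games = wins + losses + draws
    score = (wins + 0.5 * draws) / games
    elo = -400.0 * math.log10(1.0 / score - 1.0)
    # Normal approximation: LOS = Phi((W - L) / sqrt(W + L)).
    los = 0.5 * (1.0 + math.erf((wins - losses) / math.sqrt(2.0 * (wins + losses))))
    return elo, los


def sprt_bounds(alpha=0.05, beta=0.05):
    """Wald SPRT stopping bounds on the log-likelihood ratio."""
    return math.log(beta / (1.0 - alpha)), math.log((1.0 - beta) / alpha)


def bayeselo_to_probs(elo, drawelo):
    """Win/loss/draw probabilities under the BayesElo model (assumed here)."""
    win = 1.0 / (1.0 + 10.0 ** ((-elo + drawelo) / 400.0))
    loss = 1.0 / (1.0 + 10.0 ** ((elo + drawelo) / 400.0))
    return win, loss, 1.0 - win - loss


def sprt_llr(wins, losses, draws, elo0, elo1):
    """Log-likelihood ratio of H1 (elo1) against H0 (elo0), in BayesElo units."""
    games = wins + losses + draws
    w, l = wins / games, losses / games
    # Estimate the draw parameter from the observed sample.
    drawelo = 200.0 * math.log10((1.0 - l) / l * (1.0 - w) / w)
    p0 = bayeselo_to_probs(elo0, drawelo)
    p1 = bayeselo_to_probs(elo1, drawelo)
    counts = (wins, losses, draws)
    return sum(n * math.log(b / a) for n, a, b in zip(counts, p0, p1))


# Fixed-games entry of 14-08-17 (tmm_simple); the page lists ELO: -2.02, LOS: 29.2%.
elo, los = elo_and_los(1391, 1420, 2189)
print(f"ELO: {elo:.2f} LOS: {los:.1%}")

# SPRT entry of 05-08-17 (tmm_simple'); the page lists LLR: 2.96 with bounds [0.00,5.00].
lower, upper = sprt_bounds()
llr = sprt_llr(502, 368, 1689, 0.0, 5.0)
print(f"LLR: {llr:.2f} ({lower:.2f},{upper:.2f})")
```

Under these assumptions, a test stops as passed once the LLR reaches the upper bound and as failed once it reaches the lower bound, which is why almost every finished sprt entry above sits just beyond +-2.94.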