Stockfish Testing Queue

Finished - 21870 tests

21-01-14 mc master diff
ELO: 32.49 +-2.0 (95%) LOS: 100.0%
Total: 40000 W: 8818 L: 5088 D: 26094
40000 @ 60+0.05 th 1 Regression test after SEE simplification (8moves_v3 book)
22-01-14 pe time_trouble diff
ELO: 4.33 +-3.0 (95%) LOS: 99.7%
Total: 20000 W: 4110 L: 3861 D: 12029
20000 @ 15 th 1 Handle time trouble. Take 1. Check if no increment gains hold for STC eqiuvalent TC
21-01-14 jo pp_blockSq diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 41601 W: 7672 L: 7669 D: 26260
sprt @ 15+0.05 th 1 Lesser bonus for pp eval when our king attacks the blockSq
21-01-14 pe time_trouble diff
ELO: -0.36 +-2.0 (95%) LOS: 36.7%
Total: 40000 W: 7239 L: 7280 D: 25481
40000 @ 15+0.05 th 1 Handle time trouble. Take 1. STC test for neutrality.
21-01-14 dr lesser_PV diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 9011 W: 1329 L: 1390 D: 6292
sprt @ 60+0.05 th 1 LTC for : even lesser PV distinctions
21-01-14 in pptune2 diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 69452 W: 13078 L: 13001 D: 43373
sprt @ 15+0.05 th 1 Final Take
22-01-14 pe time_trouble diff
ELO: 18.71 +-3.4 (95%) LOS: 100.0%
Total: 20000 W: 5495 L: 4419 D: 10086
20000 @ 5 th 1 Handle time trouble. Take 1. Check that gain at no increment tc is preserved
20-01-14 pe tm_fix diff
ELO: -2.35 +-2.7 (95%) LOS: 4.3%
Total: 20000 W: 3028 L: 3163 D: 13809
20000 @ 60+0.05 th 1 Measure elo change at LTC
21-01-14 jk pvraz2 diff
ELO: 1.28 +-1.4 (95%) LOS: 96.4%
Total: 100000 W: 21215 L: 20846 D: 57939
100000 @ 5+0.05 th 1 Test value of enabling razoring in PV at very fast time control
19-01-14 jo kingAttackWeights diff
ELO: -2.88 +-2.9 (95%) LOS: 2.7%
Total: 20000 W: 3623 L: 3789 D: 12588
20000 @ 15+0.05 th 1 KS: Measure kingAttackWeights (Knight)
21-01-14 pe time_trouble diff
ELO: 0.28 +-3.1 (95%) LOS: 56.9%
Total: 20000 W: 4223 L: 4207 D: 11570
20000 @ 5+0.05 th 1 Handle time trouble. Take 1
21-01-14 pe time_trouble diff
ELO: 3.86 +-3.4 (95%) LOS: 98.8%
Total: 20000 W: 4981 L: 4759 D: 10260
20000 @ 1+0.05 th 1 Handle time trouble. Take 1. Play with disproportionately large increment
21-01-14 pe time_trouble diff
ELO: 6.53 +-3.5 (95%) LOS: 100.0%
Total: 20000 W: 5366 L: 4990 D: 9644
20000 @ 0.05+0.05 th 1 Handle time trouble. Take 1. Play on increment only.
21-01-14 in pptune2 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 9885 W: 1808 L: 1887 D: 6190
sprt @ 15+0.05 th 1 Take 3
21-01-14 in pptune2 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 4768 W: 867 L: 960 D: 2941
sprt @ 15+0.05 th 1 Take 2: Increasing k-factor for ebonus
19-01-14 jo simpler_pp_eval diff
ELO: -2.21 +-3.0 (95%) LOS: 7.4%
Total: 20000 W: 3779 L: 3906 D: 12315
20000 @ 15+0.05 th 1 PP: PP: Removing more stuff 4
19-01-14 jo simpler_pp_eval diff
ELO: -2.15 +-2.9 (95%) LOS: 7.4%
Total: 20000 W: 3634 L: 3758 D: 12608
20000 @ 15+0.05 th 1 PP: Removing more stuff 1
20-01-14 hw see_simp diff
ELO: -0.39 +-1.9 (95%) LOS: 34.3%
Total: 40000 W: 6190 L: 6235 D: 27575
40000 @ 60+0.05 th 1 Retest of patch to make sure it's not affecting performance.
21-01-14 rs lesser_PV diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 52077 W: 9694 L: 9454 D: 32929
sprt @ 15+0.05 th 1 even lesser PV distinctions
21-01-14 in pp_blockSq diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 19524 W: 3017 L: 3030 D: 13477
sprt @ 60+0.05 th 1 LTC for joachim: Bonus for pp eval when our king attacks the blockSq
20-01-14 rs less_ext diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 17699 W: 3279 L: 3338 D: 11082
sprt @ 15+0.05 th 1 less check extensions
20-01-14 rs more_ext diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 10739 W: 1906 L: 1983 D: 6850
sprt @ 15+0.05 th 1 more check extensions
20-01-14 in pptune2 diff
LLR: -2.97 (-2.94,2.94) [0.00,6.00]
Total: 10727 W: 1654 L: 1707 D: 7366
sprt @ 60+0.05 th 1 LTC: Try a combination of 2 of Joachim's patches that almost passed.
20-01-14 jo pp_blockSq diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 4941 W: 945 L: 828 D: 3168
sprt @ 15+0.05 th 1 Bonus for pp eval when our king attacks the blockSq
19-01-14 jo simpler_pp_eval diff
ELO: -4.00 +-2.7 (95%) LOS: 0.2%
Total: 25000 W: 4637 L: 4925 D: 15438
25000 @ 15+0.05 th 1 PP: Removing more stuff 2
19-01-14 jo simpler_pp_eval diff
ELO: -8.72 +-3.0 (95%) LOS: 0.0%
Total: 20000 W: 3626 L: 4128 D: 12246
20000 @ 15+0.05 th 1 PP: Removing more stuff 3
19-01-14 jo simpler_pp_eval diff
ELO: -1.67 +-2.6 (95%) LOS: 10.8%
Total: 25000 W: 4633 L: 4753 D: 15614
25000 @ 15+0.05 th 1 PP: Removing more stuff
20-01-14 in pptune2 diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 60877 W: 11420 L: 11156 D: 38301
sprt @ 15+0.05 th 1 Try a combination of 2 of Joachim's patches that almost passed.
20-01-14 gl less_pv diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 18380 W: 3354 L: 3411 D: 11615
sprt @ 15+0.05 th 1 Don't prune the root node! Also, fix <= alpha case
20-01-14 in pptune diff
ELO: -16.92 +-3.1 (95%) LOS: 0.0%
Total: 20000 W: 3724 L: 4697 D: 11579
20000 @ 15+0.05 th 1 Further simplify calculation of k-factor. Based on elo estimate tests in the framework by Joachim and some local tests, this should not regress. (Hopefully!)
20-01-14 sg followup_moves diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 6010 W: 1067 L: 1156 D: 3787
sprt @ 15+0.05 th 1 non-capture/promotion check (take 3). Sorry, wrong bench on first attempt.
20-01-14 in less_pv diff
LLR: -0.95 (-2.94,2.94) [0.00,6.00]
Total: 43657 W: 6764 L: 6601 D: 30292
sprt @ 60+0.05 th 1 LTC for glinscott: Remove most PV distinctions
20-01-14 sg followup_moves diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 13064 W: 2316 L: 2387 D: 8361
sprt @ 15+0.05 th 1 non-capture/promotion check (take 4)
20-01-14 in material diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 10336 W: 1594 L: 1648 D: 7094
sprt @ 60+0.05 th 1 LTC: PawnValue - 5
20-01-14 pe tm_fix diff
ELO: -2.26 +-2.9 (95%) LOS: 6.4%
Total: 20000 W: 3571 L: 3701 D: 12728
40000 @ 15+0.05 th 1 Measure elo at an normal STC
20-01-14 in material diff
LLR: -0.45 (-2.94,2.94) [-1.50,4.50]
Total: 731 W: 127 L: 141 D: 463
sprt @ 15+0.05 th 1 LTC: PawnValue - 5
19-01-14 in material diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 30442 W: 5615 L: 5641 D: 19186
sprt @ 15+0.05 th 1 PawnValue - 7
19-01-14 in material diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 17642 W: 3349 L: 3198 D: 11095
sprt @ 15+0.05 th 1 PawnValue - 5
19-01-14 gl less_pv diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 78877 W: 14706 L: 14396 D: 49775
sprt @ 15+0.05 th 1 Remove most PV distinctions
19-01-14 jo tm_fix diff
ELO: 0.10 +-3.1 (95%) LOS: 52.6%
Total: 20000 W: 4090 L: 4084 D: 11826
20000 @ 5+0.05 th 1 and with a bit more time.
20-01-14 pe tm_fix diff
ELO: 14.37 +-3.4 (95%) LOS: 100.0%
Total: 20000 W: 5445 L: 4618 D: 9937
20000 @ 5 th 1 Measure elo at 5+0
19-01-14 jo null_rep diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 14358 W: 2647 L: 2715 D: 8996
sprt @ 15+0.05 th 1 Include null moves in repetition detection.
19-01-14 sg followup_moves diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 5568 W: 1014 L: 1105 D: 3449
sprt @ 15+0.05 th 1 update followup moves: replace ttMove with non-capture/promotion
18-01-14 hw see_simp diff
ELO: -1.98 +-2.6 (95%) LOS: 7.1%
Total: 20000 W: 2971 L: 3085 D: 13944
20000 @ 60+0.05 th 1 Test simplification for regressions
19-01-14 in material diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 12530 W: 2291 L: 2363 D: 7876
sprt @ 15+0.05 th 1 PawnValue - 3 (As joergoster suggested)
19-01-14 in material diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 38685 W: 7272 L: 7275 D: 24138
sprt @ 15+0.05 th 1 PawnValueMg + 2, PawnValueEg + 2
19-01-14 pe move_importance diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 11128 W: 1980 L: 2056 D: 7092
sprt @ 15+0.05 th 1 move importance for pv_instability
18-01-14 jo k_mbonus diff
ELO: -0.31 +-2.6 (95%) LOS: 40.9%
Total: 25000 W: 4600 L: 4622 D: 15778
25000 @ 15+0.05 th 1 PP: Measure k-factor for mbonus
19-01-14 pe tm_fix diff
ELO: 62.85 +-3.8 (95%) LOS: 100.0%
Total: 20000 W: 7850 L: 4271 D: 7879
20000 @ 1+0.01 th 1 Measure elo at very short tc
19-01-14 pe tm_fix diff
ELO: 2.36 +-4.3 (95%) LOS: 86.0%
Total: 10000 W: 2023 L: 1955 D: 6022
10000 @ 15 th 1 See if there are any time losses at 15'' + no increment. There is possibility that recent time management fix weakened engine at TC without increment.