Stockfish Testing Queue

Finished - 31762 tests

15-10-28 IIv spare_inc diff
LLR: 1.19 (-2.94,2.94) [-3.00,1.00]
Total: 77546 W: 11348 L: 11399 D: 54799
sprt @ 60+0.4 th 1 This patch is doing well and will always leave (7500 - Tick) ms on Martin's machine.
15-10-29 lbr no_timer diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 1514 W: 499 L: 327 D: 688
sprt @ 1+0.02 th 7 Retire timer. New approach without no now(). Almost zero increment. => shortest possible tc that should work in fishtest. note that fishtest does not purge time losses anymore.
15-10-29 pec play_on_increment diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 14495 W: 2772 L: 2834 D: 8889
sprt @ 10+0.1 th 1 Allow using 6% more time for most of the game because of better increment play
15-10-29 pec play_on_increment diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 15105 W: 2917 L: 2976 D: 9212
sprt @ 10+0.1 th 1 Allow using more time for most of the game because of better increment play
15-10-29 mco no_timer diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 7073 W: 1761 L: 1607 D: 3705
sprt @ 2+0.05 th 1 Retire timer. New approach. Almost zero increment.
15-10-29 pec play_on_increment diff
LLR: 2.96 (-2.94,2.94) [0.00,5.00]
Total: 3245 W: 745 L: 600 D: 1900
sprt @ 0.25+0.25 th 1 Check that effect is still there if condition moved to another place
15-10-28 SC moreTuned diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 54730 W: 10623 L: 10585 D: 33522
sprt @ 10+0.1 th 1 Results from SPSA tuning session. Keep symmetrized pawn values and remove history tuned values, according to mixed suggestions by VOne and Rocky. Take 4.
15-10-29 lbr polling diff
ELO: -36.07 +-3.9 (95%) LOS: 0.0%
Total: 17265 W: 3891 L: 5677 D: 7697
30000 @ 1.2+0.02 th 7 see patch comments
15-10-28 jhe time_cleanup diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 37830 W: 7193 L: 7420 D: 23217
sprt @ 10+0.1 th 1 Tweak a couple of parameters.
15-10-28 jos cap_available diff
LLR: -2.94 (-2.94,2.94) [0.00,4.00]
Total: 8501 W: 1579 L: 1707 D: 5215
sprt @ 10+0.1 th 1 Don't allow available to become greater than maximum. Supposed to only affect time decision for easy moves.
15-10-28 n_p ComboTune diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 16824 W: 3247 L: 3299 D: 10278
sprt @ 10+0.1 th 1 A combo patch with all tuningpatches finishing yellow since the last … A combo patch with all tuning patches finishing yellow since the last succesful combo pach. Testing with [0,5].
15-10-28 sg passed_pawns diff
LLR: -3.61 (-2.94,2.94) [0.00,5.00]
Total: 34580 W: 6625 L: 6626 D: 21329
sprt @ 10+0.1 th 1 Rework safe squares bonus. Take 2. For the master following values would be equivalent SafeSquares[] = {18, 0, 8, 8, 8, 8, 8}. Use this time only slightly different values. Remark: the two last values are currently never used because there can only occur for pawns on rank 2 or 3 and in this case the multiplicator rr is zero.
15-10-28 sni holes diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 23746 W: 4581 L: 4602 D: 14563
sprt @ 10+0.1 th 1 Take 2 (lower malus)
15-10-28 mco no_timer diff
LLR: -2.61 (-2.94,2.94) [-3.00,1.00]
Total: 9515 W: 1854 L: 2014 D: 5647
sprt @ 2+0.05 th 7 Retire timer. Final version. Almost zero increment.
15-10-28 mco no_timer diff
LLR: -0.18 (-2.94,2.94) [-3.00,1.00]
Total: 3266 W: 550 L: 565 D: 2151
sprt @ 10+0.1 th 7 Retire timer. Final version.
15-10-28 mco no_timer diff
LLR: -1.58 (-2.94,2.94) [-3.00,1.00]
Total: 12432 W: 2384 L: 2492 D: 7556
sprt @ 10+0.1 th 1 Retire timer. Final version.
15-10-28 Voy HPtweak4 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 11599 W: 2192 L: 2267 D: 7140
sprt @ 10+0.1 th 1 Ver. 4
15-10-28 sni holes diff
LLR: -1.08 (-2.94,2.94) [0.00,5.00]
Total: 2133 W: 398 L: 435 D: 1300
sprt @ 10+0.1 th 1 Malus for holes in the pawn structure
15-10-28 mco no_timer diff
LLR: -0.39 (-2.94,2.94) [0.00,4.00]
Total: 1303 W: 192 L: 208 D: 903
sprt @ 1+1 th 7 Retire timer. Test for improvement at very small TC (7 threads)
15-10-28 mco no_timer diff
LLR: -0.41 (-2.94,2.94) [0.00,4.00]
Total: 8138 W: 1675 L: 1667 D: 4796
sprt @ 2+0.05 th 7 Retire timer. Test for improvement at (almost) zero increment. 7 threads.
15-10-28 mco no_timer diff
LLR: 0.67 (-2.94,2.94) [0.00,4.00]
Total: 5585 W: 971 L: 918 D: 3696
sprt @ 1+1 th 1 Retire timer. Test for improvement at very small TC
15-10-26 Fis max_half_time diff
LLR: -0.74 (-2.94,2.94) [-3.00,1.00]
Total: 14283 W: 2766 L: 2832 D: 8685
sprt @ 15 th 1 Fix time losses by never using more than half of remaining time. Suggested by Vadim(author of Gull) No increment.
15-10-28 sni threats8 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 8640 W: 1667 L: 1796 D: 5177
sprt @ 10+0.1 th 1 Also use tuned values by Rocky640 for threats on non-queens pieces
15-10-27 Voy Combo diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 56816 W: 10874 L: 10830 D: 35112
sprt @ 10+0.1 th 1 Combo: threats8 + SWt3 (Sorry wrong bench)
15-10-26 luc spare_inc diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 67730 W: 13137 L: 13100 D: 41493
sprt @ 10+0.1 th 1 Always leave at least a quarter of one increment on the clock. Low priority (as similar patches are being tested), STC
15-10-27 Voy HPtweak3 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 12486 W: 2424 L: 2495 D: 7567
sprt @ 10+0.1 th 1 Tweak-3
15-10-27 jos max_time diff
LLR: -3.33 (-2.94,2.94) [0.00,5.00]
Total: 18681 W: 3213 L: 3276 D: 12192
sprt @ 1+1 th 1 Never use more than 75% of our time. First test with big increment. (See also commit notes)
15-10-27 IIv combo_patch diff
LLR: -2.57 (-2.94,2.94) [0.00,5.00]
Total: 13532 W: 2594 L: 2644 D: 8294
sprt @ 10+0.1 th 1 Combo patch: maximumTime part was +3.3 ELO on STC, and neutral on 60+0.05; delta values are beetwen two tests that were both +0.5 ELO on more than 50K games.
15-10-27 Voy HPtweak diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 61822 W: 11900 L: 11752 D: 38170
sprt @ 10+0.1 th 1 History Pruning Tweak Idea.
15-10-27 Voy HPtweak2 diff
LLR: -2.99 (-2.94,2.94) [0.00,5.00]
Total: 27081 W: 5249 L: 5256 D: 16576
sprt @ 10+0.1 th 1 Version 2.
15-10-27 SC moreTuned diff
LLR: -2.94 (-2.94,2.94) [0.00,4.00]
Total: 14915 W: 2827 L: 2932 D: 9156
sprt @ 10+0.1 th 1 Results from SPSA tuning session. Restore original and isolated pawn values. Take 3.
15-10-27 IIv maxTime diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 9082 W: 1659 L: 1746 D: 5677
sprt @ 10+0.1 th 1 Controlling a maximumTime, Take 4/5. I compared all previous information, and found two logical ideas.
15-10-27 jhe time_cleanup diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 7213 W: 1278 L: 1449 D: 4486
sprt @ 10+0.1 th 1 Restructuring and consolidating of existing TM code.
15-10-27 SC moreTuned diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 21513 W: 4160 L: 4242 D: 13111
sprt @ 10+0.1 th 1 Results from SPSA tuning session. Remove asymmetries, take 2.
15-10-27 Roc RookAttackTweak diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 11962 W: 2265 L: 2381 D: 7316
sprt @ 10+0.1 th 1 Rook contact checks have been recently removed with convincing success. I think it is because the factor for "undefended" in the attackUnits calculation, have increased from 19 (in SF6) to 27 (current master). This "hand-tuned" test increase this a little bit more to 30, and decrease the QCC accordingly.
15-10-27 SC moreTuned diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 7438 W: 1381 L: 1513 D: 4544
sprt @ 10+0.1 th 1 Results from SPSA tuning session. Asymmetries in the pawn evaluation are interesting, as well new weights for cmh vs h in move ordering.
15-10-20 SC moreTuning diff
60500/150000 iterations
123740/300000 games played
300000 @ 40+0.4 th 1 As I was twice lucky with tuning, let me give a third go with other rarely considered parameters. Framework is mostly idle and a lot of machines want to work. Low throughput, so it uses only time when fishtest is idle.
15-10-27 Voy SWt5 diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 13093 W: 2414 L: 2526 D: 8153
sprt @ 10+0.1 th 1 Try 5. Slowly but surely I am finding the optimized values.
15-10-26 Roc FollowUpThreatOnMajor diff
LLR: -2.96 (-2.94,2.94) [0.00,5.00]
Total: 17516 W: 3450 L: 3498 D: 10568
sprt @ 10+0.1 th 1 Consider follow-up threats only on higher pieces
15-10-27 lbr simple diff
LLR: -2.96 (-2.94,2.94) [-3.00,1.00]
Total: 15134 W: 2814 L: 3000 D: 9320
sprt @ 10+0.1 th 1 take 2
15-10-25 Roc ThreatTuneTest diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 82628 W: 16045 L: 15907 D: 50676
sprt @ 10+0.1 th 1 Testing locally tuned values
15-10-27 lbr simple diff
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 4433 W: 783 L: 950 D: 2700
sprt @ 10+0.1 th 1 simplify after 8fd34d77
15-10-26 sni threats8 diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 133016 W: 25836 L: 25518 D: 81662
sprt @ 10+0.1 th 1 Try delta=S(-3,-3) (after inverse parabolic interpolation of the first two runs)
15-10-27 jhe time_cleanup diff
ELO: -2.57 +-6.0 (95%) LOS: 20.2%
Total: 5000 W: 968 L: 1005 D: 3027
5000 @ 10+0.1 th 1 Quick sanity check.
15-10-26 jos time_inc_fix diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 106048 W: 20190 L: 20220 D: 65638
sprt @ 10+0.1 th 1 Never use more than 90% of increment, when playing with increment. This obviously only has effect when playing with increment like in TCEC! Test as no regression.
15-10-26 lbr master diff
ELO: 61.76 +-1.9 (95%) LOS: 100.0%
Total: 40000 W: 10233 L: 3197 D: 26570
40000 @ 60+0.05 th 1 Regression test, rescheduled correctly, to be able to compare to previous ones: 1/ 8moves book 2/ 60+0.05
15-10-27 pec play_on_increment diff
LLR: 3.33 (-2.94,2.94) [-3.00,1.00]
Total: 49343 W: 9449 L: 9358 D: 30536
sprt @ 10+0.1 th 1 No regression an STC; logically it is still should be gain albeit smaller, but proof of no regression at STC test should be enough. As SF lost again on time in TCEC test old elo gaining solution to playing on increment problem http://tests.stockfishchess.org/tests/view/52df0b250ebc59025698f83b
15-10-27 Voy SWt4 diff
LLR: -2.97 (-2.94,2.94) [0.00,5.00]
Total: 29026 W: 5654 L: 5651 D: 17721
sprt @ 10+0.1 th 1 Triple the weights.
15-10-27 aji lazy_log_formula_b diff
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 17704 W: 2489 L: 2364 D: 12851
sprt @ 60+0.4 th 3 Check for no-regression at 3 threads:LTC
15-10-27 Fis NearDraw2 diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 10708 W: 2062 L: 2141 D: 6505
sprt @ 10+0.1 th 1 See github comments. Final try.