Stockfish Testing Queue

Finished - 19396 tests

12-01-14 hw see_king diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 16261 W: 3031 L: 3094 D: 10136
sprt @ 15+0.05 th 1 Changing Position::see to return a low value for illegal captures
12-01-14 ur try2_of_fixing_a_bug diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 19872 W: 3622 L: 3675 D: 12575
sprt @ 15+0.05 th 1 changed the meaning of Signals.failedLowAtRoot to the logical meaning and I do not think that it is logical to consider fail low that happened at iteration 5 for decision about time to stop in iteration 20. It may be logical to consider fail low in iteration 19 for iteration 20 so it is possible that this patch is going to fail.
11-01-14 gl storm_blocked diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 23724 W: 4436 L: 4479 D: 14809
sprt @ 15+0.05 th 1 Same as before, except king safety parameters tuned with 25k games.
11-01-14 mc stsu diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 15176 W: 2765 L: 2831 D: 9580
sprt @ 15+0.05 th 1 Verify a supposed +49 ELO (!) mega patch series
11-01-14 hw king_val diff
LLR: -0.36 (-2.94,2.94) [-1.50,4.50]
Total: 1486 W: 273 L: 282 D: 931
sprt @ 15+0.05 th 1 Assign king big piece val
11-01-14 jo KBBKN_fix diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 11770 W: 2114 L: 2188 D: 7468
sprt @ 15+0.05 th 1 Fixes a bug in KBBKN. Also return lower score, as many wins may need more than 50 moves.
11-01-14 ur fixing_a_bug diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 20272 W: 3160 L: 3169 D: 13943
sprt @ 60+0.05 th 1 another version of the same idea and now added the real intention of the code that stopped at the first move after significant time of no fail low because I believe that the intention was to failing low in the specific iteration and not to some history of failLow(maybe the history does not give much if it was many plies earlier but I prefered not to get rid of that code.because I still feel better to play faster if there was no fail low in all iterations
11-01-14 rs blocked_storm_pawn_radi diff
LLR: 2.96 (-2.94,2.94) [0.00,6.00]
Total: 38371 W: 6165 L: 5888 D: 26318
sprt @ 60+0.05 th 1 LTC: Radical solution to get king safety eval of this position r2qk2r/ppp2p2/2npbn2/2b1p3/2P1P1P1/2NB1PPp/PPNP3K/R1BQ1R2 b kq - 0 13 about right
11-01-14 st ralphsidea_kingsafety diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 31228 W: 5769 L: 5793 D: 19666
sprt @ 15+0.05 th 1 Idea taken out from Ralph's own idea. This time with a lower bonus for king safety.
11-01-14 bi pawn_psqt diff
LLR: -1.54 (-2.94,2.94) [-1.50,4.50]
Total: 34424 W: 6396 L: 6361 D: 21667
sprt @ 15+0.05 th 1 pawn psqt take 6
11-01-14 gl storm_blocked diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 14089 W: 2578 L: 2646 D: 8865
sprt @ 15+0.05 th 1 Don't remove penalty for shelter, but blocked files don't count against king ring
11-01-14 rs blocked_storm_pawn_radi diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 6239 W: 1249 L: 1127 D: 3863
sprt @ 15+0.05 th 1 Radical solution to get king safety eval of this position r2qk2r/ppp2p2/2npbn2/2b1p3/2P1P1P1/2NB1PPp/PPNP3K/R1BQ1R2 b kq - 0 13 about right
11-01-14 ur fixing_a_bug diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 19446 W: 3696 L: 3540 D: 12210
sprt @ 15+0.05 th 1 another version of the same idea and now added the real intention of the code that stopped at the first move after significant time of no fail low because I believe that the intention was to failing low in the specific iteration and not to some history of failLow(maybe the history does not give much if it was many plies earlier but I prefered not to get rid of that code.because I still feel better to play faster if there was no fail low in all iterations
10-01-14 gl pawn_psqt diff
LLR: -0.75 (-2.94,2.94) [0.00,6.00]
Total: 25943 W: 4082 L: 3990 D: 17871
sprt @ 60+0.05 th 1 Long TC for BI: pawn psqt take 4
10-01-14 gl storm_blocked diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 9826 W: 1816 L: 1896 D: 6114
sprt @ 15+0.05 th 1 Don't count blocked files in kingRing
10-01-14 rs se_at_root3 diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 5347 W: 971 L: 1063 D: 3313
sprt @ 15+0.05 th 1 take 3 (final)
10-01-14 rs se_at_root3 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 5033 W: 896 L: 988 D: 3149
sprt @ 15+0.05 th 1 take 2
10-01-14 bi pawn_psqt diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 20580 W: 3968 L: 4019 D: 12593
sprt @ 15+0.05 th 1 pawn table 5
10-01-14 bi pawn_psqt diff
LLR: 1.54 (-2.94,2.94) [-1.50,4.50]
Total: 41006 W: 7790 L: 7627 D: 25589
sprt @ 15+0.05 th 1 pawn psqt take 3
10-01-14 gl storm_blocked diff
LLR: -2.92 (-2.94,2.94) [-1.50,4.50]
Total: 13655 W: 2596 L: 2664 D: 8395
sprt @ 15+0.05 th 1 Even less penalty when king blockading storm pawn
10-01-14 gl storm_blocked diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 17142 W: 3173 L: 3233 D: 10736
sprt @ 15+0.05 th 1 King blockading pawn storm
10-01-14 gl shelter_impeded diff
LLR: -0.35 (-2.94,2.94) [-1.50,4.50]
Total: 8004 W: 1532 L: 1523 D: 4949
sprt @ 15+0.05 th 1 No bonus for castling king safety if impeded
10-01-14 rs se_at_root3 diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 10975 W: 2011 L: 2088 D: 6876
sprt @ 15+0.05 th 1 New try for singular extensions at root nodes (with a view to using singular moves for TM decisions)
10-01-14 bi pawn_psqt diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 15932 W: 3055 L: 2908 D: 9969
sprt @ 15+0.05 th 1 pawn psqt take 4
10-01-14 ur fixing_a_bug diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 16732 W: 3107 L: 3168 D: 10457
sprt @ 15+0.05 th 1 I fix a bug by replacing a misleading name and play slower if the program had in the past a fail low in the root(there are more ideas like stopping in the first move at sometime in this case but I decided to add one piece of code at a time and not to do it immediately
09-01-14 jo defended_sq_2 diff
ELO: -1.46 +-2.9 (95%) LOS: 16.4%
Total: 20000 W: 3650 L: 3734 D: 12616
31000 @ 15+0.05 th 1 PP: Measure defended squares 2
10-01-14 lb 3eefb67da334da7203ec7bd diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 3736 W: 651 L: 747 D: 2338
sprt @ 15+0.05 th 1 KPP distance: take 3
10-01-14 lb 9bafb268c633c59aca892c7 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 17578 W: 3254 L: 3313 D: 11011
sprt @ 15+0.05 th 1 KPP distance: take 2
09-01-14 jo defended_sq_3 diff
ELO: 0.98 +-3.0 (95%) LOS: 73.9%
Total: 19497 W: 3705 L: 3650 D: 12142
30000 @ 15+0.05 th 1 PP: Measure defended squares 3
08-01-14 jo unsafe_squares_3 diff
ELO: 0.65 +-2.4 (95%) LOS: 70.2%
Total: 29845 W: 5585 L: 5529 D: 18731
30000 @ 15+0.05 th 1 PP: Measure unsafe squares 3
08-01-14 jo unsafe_squares_2 diff
ELO: -1.10 +-2.6 (95%) LOS: 20.3%
Total: 26014 W: 4826 L: 4908 D: 16280
30000 @ 15+0.05 th 1 PP: Measure unsafe squares 2
08-01-14 lb psqt^^^^^^ diff
ELO: -3.35 +-2.8 (95%) LOS: 1.0%
Total: 20000 W: 3397 L: 3590 D: 13013
30000 @ 60+0.05 th 1 LTC: Measure value of pawn PSQT
09-01-14 gl master diff
ELO: 29.85 +-2.0 (95%) LOS: 100.0%
Total: 40000 W: 8593 L: 5165 D: 26242
40000 @ 60+0.05 th 1 Regression test after time management fix
10-01-14 pe tm_simple diff
ELO: -3.65 +-2.9 (95%) LOS: 0.7%
Total: 20000 W: 3566 L: 3776 D: 12658
20000 @ 15+0.05 th 1 fixed number of games at 15+0.05 .simplification: remove couple of constants from TM. This patch is also having effect of redistributing ~27% of stable PV time in favor of unstable PV
20-12-13 tr KPST4 diff
ELO: -11.29 +-3.0 (95%) LOS: 0.0%
Total: 21081 W: 3949 L: 4634 D: 12498
30001 @ 15+0.05 th 1 KPST4: MG_KPST with rank_bonus only; file_bonus removed. EG_KPST with (rounded) original values.
09-01-14 rs pv_instability diff
ELO: -22.53 +-3.0 (95%) LOS: 0.0%
Total: 20000 W: 3241 L: 4536 D: 12223
20000 @ 15+0.05 th 1 TM: measure value of pv_instability.
10-01-14 pe tm_simple diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 11396 W: 2075 L: 2150 D: 7171
sprt @ 15+0.05 th 1 use 20% more time for stable pv at the expense of unstable pv
10-01-14 ur sometimes_faster diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 1019 W: 149 L: 252 D: 618
sprt @ 15+0.05 th 1 this time I test playing twice faster if more than 95% of the time is used for the first move(meaning that stockfish refuted relatively fast the rest of the moves)
08-01-14 jo mbonus diff
ELO: -10.22 +-3.1 (95%) LOS: 0.0%
Total: 20000 W: 3839 L: 4427 D: 11734
20000 @ 15+0.05 th 1 PP: Measure base mbonus
08-01-14 jo unsafe_squares_1 diff
ELO: -21.88 +-3.0 (95%) LOS: 0.0%
Total: 20000 W: 3284 L: 4542 D: 12174
20000 @ 15+0.05 th 1 PP: Measure unsafe squares 1
10-01-14 gl pawn_psqt diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 5400 W: 782 L: 859 D: 3759
sprt @ 60+0.05 th 1 Long TC for BI: pawn psqt EG take 2
09-01-14 sg update_stats_hist diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 51827 W: 9691 L: 9660 D: 32476
sprt @ 15+0.05 th 1 update stats for pv moves too. Additional double their bonus for history update.
10-01-14 ur sometimes_faster diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 10289 W: 1877 L: 1955 D: 6457
sprt @ 15+0.05 th 1 testing an idea to play twice faster in some moves when you spend a significant time in the first move with no fail low(I also added the simplification of marco because I do not believe that it cause a regression and the test of 20000 games did not show a significant difference)
10-01-14 lb kpp diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 2540 W: 459 L: 559 D: 1522
sprt @ 15+0.05 th 1 KPP distance: take 1
09-01-14 bi pawn_psqt diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 41528 W: 7912 L: 7697 D: 25919
sprt @ 15+0.05 th 1 pawn psqt EG take 2
09-01-14 mc f14cd1bb89d080f36a11df3 diff
ELO: -1.08 +-2.9 (95%) LOS: 23.7%
Total: 20000 W: 3716 L: 3778 D: 12506
20000 @ 15+0.05 th 1 Test Marco simplification of TM (I am going to apply the patch, but I want to be sure there is no regression)
10-01-14 pe tm_simple diff
ELO: -1.39 +-3.1 (95%) LOS: 19.1%
Total: 20000 W: 4140 L: 4220 D: 11640
20000 @ 5+0.05 th 1 simplification: remove couple of constants from TM. This patch is also having effect of redistributing ~27% of stable PV time in favor of unstable PV
09-01-14 rs pv_inst_tune diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 3825 W: 638 L: 733 D: 2454
sprt @ 15+0.05 th 1 Change pv_instability formula.
08-01-14 jo ebonus diff
ELO: -29.34 +-3.2 (95%) LOS: 0.0%
Total: 20000 W: 3497 L: 5182 D: 11321
20000 @ 15+0.05 th 1 PP: Measure base ebonus
09-01-14 ur tune_time diff
ELO: 1.15 +-2.2 (95%) LOS: 84.5%
Total: 40000 W: 8489 L: 8357 D: 23154
40000 @ 5+0.05 th 1 try to tune the patch and before deciding about sprt I try to save time by using faster time control to see what direction to try next