Stockfish Testing Queue

Finished - 37193 tests

15-04-29 lbr psqt diff
ELO: -18.59 +-7.3 (95%) LOS: 0.0%
Total: 3629 W: 655 L: 849 D: 2125
20000 @ 15+0.05 th 1 new psqt: take 2 (this time tuned in 3+0.03)
15-04-28 sni generalize_hanging diff
LLR: -3.32 (-2.94,2.94) [-1.50,4.50]
Total: 23195 W: 4376 L: 4433 D: 14386
sprt @ 15+0.05 th 1 The usual hanging penalty applies to weak pieces which have zero defenders. This patch generalizes it to weak pawns by comparing attack and defense, even if defense is not zero.
15-04-29 Fis TTSmartSave diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 11474 W: 2141 L: 2216 D: 7117
sprt @ 15+0.05 th 1 Don't overwrite any existing TT data with blanks. 2MB
15-04-26 mco null_tune diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 55092 W: 10498 L: 10461 D: 34133
sprt @ 15+0.05 th 1 Tuned null search reduction: take 2
15-04-28 Voy Tilefish3 diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 12996 W: 2486 L: 2557 D: 7953
sprt @ 15+0.05 th 1 Use depth*depth*log(depth) only for history. This version should be much stronger...
15-04-26 jos union_psqt diff
LLR: -3.79 (-2.94,2.94) [0.00,4.00]
Total: 54258 W: 10514 L: 10523 D: 33221
sprt @ 15+0.05 th 1 Union of knight and rook psqt values.
15-04-25 lbr union diff
LLR: -2.95 (-2.94,2.94) [0.00,5.00]
Total: 72364 W: 11612 L: 11455 D: 49297
sprt @ 60+0.05 th 1 That one should be strong enough to pass. If not, I give up!
15-04-27 lbr psqt diff
ELO: -14.16 +-3.1 (95%) LOS: 0.0%
Total: 20380 W: 3936 L: 4766 D: 11678
30000 @ 15+0.05 th 1 new psqt: 41 param
15-04-26 Roc RCC diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 43811 W: 8444 L: 8432 D: 26935
sprt @ 15+0.05 th 1 The full penalty gave a 50/50 result for the new rook_support contact checks, so let's try about half the penalty.
15-04-24 Roc LessSafeCheck diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 22417 W: 4313 L: 4359 D: 13745
sprt @ 15+0.05 th 1 Avoid computing some safe checks by sliders when the landing square is already a queen safe check.
15-04-26 Fis BMCTime diff
LLR: -2.96 (-2.94,2.94) [0.00,6.00]
Total: 25116 W: 3989 L: 3975 D: 17152
sprt @ 60+0.05 th 1 If a move is 100% stable from the 3rd iteration on, use only 2/3 of the available time. SPSA tuned. LTC
15-04-26 Voy Tilefish2 diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 4979 W: 916 L: 1009 D: 3054
sprt @ 15+0.05 th 1 Fixed: forgot to remove the 3* factor in movepicker. Also sync and match against passed Rockfish.
15-04-21 SC pieceValuesMP_simple2 diff
LLR: -1.91 (-2.94,2.94) [-3.00,1.00]
Total: 50948 W: 8027 L: 8204 D: 34717
sprt @ 60+0.05 th 1 As Joona pointed out: MVV/LVA also aims to define exactly the ordering in which captures are searched, which is left open by MVV only. Try a more compact implementation of MVV/LVA. LTC.
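
The sprt entries above each list an LLR value followed by a pair of stopping bounds, (-2.94,2.94). Those bounds are what Wald's sequential probability ratio test gives when both error rates are 5%. The sketch below reproduces them and classifies a listed LLR. The function names are hypothetical, and alpha = beta = 0.05 is an assumption inferred from the bounds shown, not something the page states.

```python
import math

def sprt_bounds(alpha=0.05, beta=0.05):
    """Wald SPRT stopping bounds for the log-likelihood ratio.

    alpha and beta are the type I and type II error rates
    (assumed to be 0.05 each, which yields roughly -2.94 and 2.94).
    """
    lower = math.log(beta / (1.0 - alpha))
    upper = math.log((1.0 - beta) / alpha)
    return lower, upper

def sprt_state(llr, alpha=0.05, beta=0.05):
    """Classify an LLR value against the stopping bounds."""
    lower, upper = sprt_bounds(alpha, beta)
    if llr <= lower:
        return "rejected"
    if llr >= upper:
        return "accepted"
    return "undecided"

if __name__ == "__main__":
    lower, upper = sprt_bounds()
    print(f"bounds: ({lower:.2f}, {upper:.2f})")
    for llr in (-2.95, -3.79, -1.91):
        print(llr, sprt_state(llr))
```

With these assumptions, an LLR of -2.95 falls below the lower bound, while the -1.91 listed for pieceValuesMP_simple2 lies between the two bounds.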
13-02-14 bishop_pin_clop diff
ELO: 2.57 +-4.8 (95%) LOS: 95.0%
Total: 20000 W: 4100 L: 3952 D: 11948
20000 @ 40/5+0.1 th 1
13-02-13 space diff
ELO: -10.30 +-4.8 (95%) LOS: 0.0%
Total: 20000 W: 3971 L: 4564 D: 11465
20000 @ 40/5+0.1 th 1
13-02-13 passed_pawn_support diff
ELO: -11.49 +-5.2 (95%) LOS: 0.0%
Total: 17386 W: 3360 L: 3935 D: 10091
20000 @ 40/5+0.1 th 1
13-02-13 bishop_pawns diff
ELO: -4.81 +-4.8 (95%) LOS: 0.1%
Total: 20000 W: 3828 L: 4105 D: 12067
20000 @ 40/5+0.1 th 1
13-02-13 reduce_tt_depth diff
ELO: -10.72 +-8.6 (95%) LOS: 0.0%
Total: 6290 W: 770 L: 964 D: 4556
16000 @ 40/10+0.1 th 1
13-02-13 king_safety_tweak diff
ELO: -0.43 +-7.6 (95%) LOS: 42.0%
Total: 8000 W: 1235 L: 1245 D: 5520
8000 @ 40/60+1 th 1
13-02-13 qsearch_pruning diff
ELO: -1.43 +-4.4 (95%) LOS: 16.1%
Total: 24000 W: 4931 L: 5030 D: 14039
24000 @ 40/3+0.1 th 1
13-02-13 simplify_eval diff
ELO: 2.29 +-4.8 (95%) LOS: 93.0%
Total: 20000 W: 4075 L: 3943 D: 11982
20000 @ 40/5+0.1 th 1
13-02-13 bishop_pin_clop diff
ELO: 4.73 +-4.8 (95%) LOS: 99.9%
Total: 20000 W: 4166 L: 3894 D: 11940
20000 @ 40/3+0.1 th 1
13-02-15 eval_scale diff
ELO: -1.00 +-7.6 (95%) LOS: 34.3%
Total: 8000 W: 1602 L: 1625 D: 4773
8000 @ 40/3+0.1 th 1
13-02-15 skip_null diff
ELO: -4.05 +-4.8 (95%) LOS: 0.4%
Total: 20000 W: 3630 L: 3863 D: 12507
20000 @ 40/8+0.1 th 1
13-02-15 all_cut_squash diff
ELO: -4.00 +-7.6 (95%) LOS: 3.3%
Total: 8000 W: 1203 L: 1295 D: 5502
8000 @ 40/30+0.5 th 1
13-02-16 test294 diff
ELO: -0.74 +-7.6 (95%) LOS: 38.3%
Total: 8000 W: 1632 L: 1649 D: 4719
8000 @ 15+0.05 th 1
13-02-17 singular_tweak diff
ELO: -1.46 +-4.8 (95%) LOS: 16.5%
Total: 20000 W: 3667 L: 3751 D: 12582
20000 @ 15+0.05 th 1 Simpler version of all_cut_squash
13-02-17 singular_tweak4 diff
ELO: 2.99 +-4.8 (95%) LOS: 97.7%
Total: 20000 W: 3776 L: 3604 D: 12620
20000 @ 15+0.05 th 1
13-02-17 bishop_pin_clop diff
ELO: -1.18 +-5.6 (95%) LOS: 24.4%
Total: 15011 W: 2680 L: 2731 D: 9600
16000 @ 15+0.05 th 1
13-02-18 lazy_eval diff
ELO: 2.19 +-5.6 (95%) LOS: 88.3%
Total: 14943 W: 3177 L: 3083 D: 8683
16000 @ 5+0.05 th 1
13-02-18 remove_space_eval diff
ELO: 9.06 +-5.4 (95%) LOS: 100.0%
Total: 16000 W: 3306 L: 2889 D: 9805
16000 @ 15+0.05 th 1
13-02-18 scale_with_gameplay diff
ELO: -22.82 +-6.8 (95%) LOS: 0.0%
Total: 10000 W: 1407 L: 2063 D: 6530
10000 @ 20+0.05 th 1 Scale down score with game ply
13-02-18 singular_tweak5 diff
ELO: 1.53 +-4.8 (95%) LOS: 84.4%
Total: 19998 W: 3838 L: 3750 D: 12410
20000 @ 15+0.05 th 1
13-02-19 bishop_pin_clop diff
ELO: 12.30 +-4.4 (95%) LOS: 100.0%
Total: 24000 W: 4931 L: 4082 D: 14987
24000 @ 15+0.05 th 1 Remove previous pin code, add bishop pin
13-02-19 scale_with_gameplay diff
ELO: 3.65 +-6.8 (95%) LOS: 96.2%
Total: 10000 W: 1813 L: 1708 D: 6479
26000 @ 20+0.05 th 1
13-02-20 move_ordering diff
ELO: -1.34 +-3.9 (95%) LOS: 13.3%
Total: 31013 W: 5753 L: 5873 D: 19387
32000 @ 15+0.05 th 1
13-02-20 outpost diff
ELO: -2.37 +-5.4 (95%) LOS: 7.7%
Total: 16000 W: 2870 L: 2979 D: 10151
16000 @ 15+0.05 th 1
13-02-20 pinned_null diff
ELO: -9.14 +-5.4 (95%) LOS: 0.0%
Total: 16000 W: 2689 L: 3110 D: 10201
16000 @ 15+0.05 th 1
13-02-20 master diff
ELO: 4.76 +-6.8 (95%) LOS: 99.2%
Total: 10000 W: 1715 L: 1578 D: 6707
10000 @ 60+0.05 th 1 Regression test vs sf_2.3.1 (Take 2)
13-02-20 master diff
ELO: 4.50 +-4.8 (95%) LOS: 99.9%
Total: 20000 W: 3507 L: 3248 D: 13245
20000 @ 60+0.05 th 1 Another regression test at long TC but with "bishop pin" patch applied
13-02-20 scale_with_gameplay diff
ELO: 2.43 +-6.8 (95%) LOS: 89.3%
Total: 10000 W: 1618 L: 1548 D: 6834
10000 @ 60+0.05 th 1 Retest game ply scaling at longer TC
13-02-21 remove_space_eval diff
ELO: 0.54 +-5.4 (95%) LOS: 63.2%
Total: 16000 W: 2753 L: 2728 D: 10519
16000 @ 45+0.05 th 1 Test at longer TC
13-02-21 rook_pin diff
ELO: -0.93 +-5.4 (95%) LOS: 28.8%
Total: 16000 W: 2936 L: 2979 D: 10085
16000 @ 15+0.05 th 1 Re-run due to failure
13-02-22 scale_with_gameplay diff
ELO: -1.91 +-5.4 (95%) LOS: 13.1%
Total: 16000 W: 3045 L: 3133 D: 9822
16000 @ 20+0.05 th 1 Increase game ply scaling to 2% every 10 plies
13-02-22 lucas_evasion_prunable diff
ELO: 1.57 +-3.8 (95%) LOS: 90.4%
Total: 32000 W: 6259 L: 6114 D: 19627
32000 @ 15+0.05 th 1
13-02-22 lucas_see_pv diff
ELO: 7.69 +-5.4 (95%) LOS: 100.0%
Total: 16000 W: 3219 L: 2865 D: 9916
16000 @ 15+0.05 th 1
13-02-22 rook_pin diff
ELO: 0.22 +-5.4 (95%) LOS: 55.1%
Total: 16000 W: 2988 L: 2978 D: 10034
16000 @ 15+0.05 th 1 Exclude pawn pins
13-02-22 pinned_null diff
ELO: 3.11 +-3.8 (95%) LOS: 99.5%
Total: 32000 W: 6294 L: 6008 D: 19698
32000 @ 15+0.05 th 1 Only check for pins on full null moves
13-02-23 lucas_see_pv diff
ELO: 4.31 +-6.8 (95%) LOS: 98.3%
Total: 10000 W: 1765 L: 1641 D: 6594
10000 @ 60+0.05 th 1 Re-test at longer TC
13-02-24 scale_with_gameplay diff
ELO: -6.08 +-6.8 (95%) LOS: 0.2%
Total: 10000 W: 1752 L: 1927 D: 6321
10000 @ 20+0.05 th 1 Another try at 1%, this time scaling just endgame score
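
The ELO and LOS figures in the fixed-length entries can be recomputed from the W/L/D totals. The sketch below uses the standard logistic Elo formula and a normal approximation for the error margin and the likelihood of superiority. The function names are hypothetical, and that the page used exactly these formulas is an assumption.

```python
import math

def elo_from_score(score):
    """Logistic Elo difference implied by a mean score in (0, 1)."""
    return -400.0 * math.log10(1.0 / score - 1.0)

def summarize(wins, losses, draws):
    """Return (elo, 95% margin, LOS) from win/loss/draw counts."""
    n = wins + losses + draws
    w, l, d = wins / n, losses / n, draws / n
    mu = w + d / 2.0  # mean score per game
    # Per-game variance of the score, treating each game as 1, 0.5 or 0.
    var = w * (1.0 - mu) ** 2 + l * (0.0 - mu) ** 2 + d * (0.5 - mu) ** 2
    stdev = math.sqrt(var / n)  # standard error of the mean score
    z = 1.959964  # two-sided 95% normal quantile
    elo = elo_from_score(mu)
    margin = (elo_from_score(mu + z * stdev) - elo_from_score(mu - z * stdev)) / 2.0
    # LOS: probability that the true mean score exceeds 0.5.
    los = 0.5 * (1.0 + math.erf((mu - 0.5) / (stdev * math.sqrt(2.0))))
    return elo, margin, los

if __name__ == "__main__":
    for name, wld in (("15-04-29 psqt", (655, 849, 2125)),
                      ("13-02-14 bishop_pin_clop", (4100, 3952, 11948))):
        elo, margin, los = summarize(*wld)
        print(f"{name}: ELO {elo:.2f} +-{margin:.1f} LOS {los:.1%}")
```

For the 15-04-29 psqt totals this lands close to the listed -18.59 +-7.3, and for the 13-02-14 bishop_pin_clop totals close to the listed 2.57 with LOS 95.0%. The 2013 entries list wider margins than this formula produces, so their margins were presumably computed differently.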