Stockfish Testing Queue

Finished - 40699 tests

15-02-20 Roc CenterWedge diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 60597 W: 12047 L: 11988 D: 36562
sprt @ 15+0.05 th 1 1st one was S(5,0). Try with S(10, 0)
15-02-20 jos passed_defdef diff
19222/20000 iterations
38760/40000 games played
40000 @ 60+0.05 th 1 Idea looks promising at STC. Try to tune at LTC.
15-02-20 lan end_double_penalty diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 17917 W: 3492 L: 3550 D: 10875
sprt @ 15+0.05 th 1 Increase penalty for doubled pawns only on H file. An idea of Lyudmil Tsvetkov.
15-02-20 sni probabilistic diff
LLR: -1.04 (-2.94,2.94) [-1.50,4.50]
Total: 9075 W: 1541 L: 1555 D: 5979
sprt @ 15+0.05 th 4 Experimental run : use the probability of a cut to amend the YBWC strategy
15-02-21 ren remove_is_ok diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 55714 W: 11020 L: 10974 D: 33720
sprt @ 15+0.05 th 1 Remove the calls to the method is_ok(Move). Seems to increase elo in local tests.
15-02-21 sg queen_contact_check diff
LLR: -2.94 (-2.94,2.94) [-1.50,4.50]
Total: 19077 W: 3747 L: 3801 D: 11529
sprt @ 15+0.05 th 1 if no queen contact checks exists give bonus to queen moves which threats such a check
15-02-21 jos passed_defdef diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 17331 W: 3514 L: 3360 D: 10457
sprt @ 15+0.05 th 1 Take 2 with tuned values.
15-02-21 mco late_join_full diff
ELO: 3.54 +-4.4 (95%) LOS: 94.4%
Total: 8153 W: 1411 L: 1328 D: 5414
10000 @ 15+0.05 th 16 Don't stop at first failed join attempt: on the bench this pacthes reduced missed joins attempts after locking from 75% to about 10%
15-02-21 sni probabilistic2 diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 13639 W: 2243 L: 2313 D: 9083
sprt @ 15+0.05 th 7 Experimental run 2 : use the probability of a cut to amend the YBWC strategy. This version seemed reasonable in local testing with 3 threads after 400 games (score of stockfish vs base: 83 - 48 - 269 [0.544] 400).
15-02-21 jos passed_defdef diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 15378 W: 2530 L: 2559 D: 10289
sprt @ 60+0.05 th 1 LTC: Take 2 with tuned values.
15-02-21 sni probcut_tweak2 diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 45966 W: 9107 L: 8876 D: 27983
sprt @ 15+0.05 th 1 Probcut tweak
15-02-21 ren remove_is_ok diff
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 12817 W: 2559 L: 2423 D: 7835
sprt @ 15+0.05 th 1 Remove the calls to the method is_ok(Move). Seems to increase elo in local tests.
15-02-21 Roc CenterWedge diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 50774 W: 10198 L: 9952 D: 30624
sprt @ 15+0.05 th 1 S(15, 0)
15-02-21 Roc WedgeNoBind diff
LLR: 2.95 (-2.94,2.94) [-1.50,4.50]
Total: 6358 W: 1296 L: 1173 D: 3889
sprt @ 15+0.05 th 1 Retake on Vincent idea about Wedges. Removed overlap with CenterBind cases,
15-02-21 sni probcut_tweak2 diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 13998 W: 2276 L: 2312 D: 9410
sprt @ 60+0.05 th 1 LTC: Probcut tweak
15-02-22 Roc WedgeNoBind diff
LLR: -2.95 (-2.94,2.94) [0.00,6.00]
Total: 17424 W: 2896 L: 2915 D: 11613
sprt @ 60+0.05 th 1 Retake on Vincent idea about Wedges. Removed overlap with CenterBind cases,
15-02-22 ren remove_is_ok diff
LLR: 2.94 (-2.94,2.94) [-3.00,1.00]
Total: 23418 W: 3916 L: 3800 D: 15702
sprt @ 60+0.05 th 1 Remove the calls to the method is_ok(Move). Seems to increase elo in local tests. Passed STC as simplification. LTC as simplification.
15-02-22 Roc SemiOpenRook diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 12362 W: 2385 L: 2458 D: 7519
sprt @ 15+0.05 th 1 SemiOpenFile bonus also when Rook is in front of our pawn.
15-02-22 Roc CenterWedge diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 4879 W: 905 L: 998 D: 2976
sprt @ 15+0.05 th 1 S(20,0) (I lowered the priority of the S(15,0) test)
15-02-22 vin pawn_tweak_5th diff
LLR: -2.96 (-2.94,2.94) [0.00,4.00]
Total: 21204 W: 4130 L: 4213 D: 12861
sprt @ 15+0.05 th 1 We've had a bunch of near misses all of which reward pawns on the 5th rank. Perhaps we simply need to tweak the psq table. Parameter tweak patch.
15-02-22 vin true_wedge diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 52007 W: 10319 L: 10283 D: 31405
sprt @ 15+0.05 th 1 Try detecting only 'true' wedges involving two or more pawns. Annoyingly none of the bench positions yield these so the signature is the same; which might also mean these are too rare to matter. But let's see.
15-02-22 mco measure_level diff
ELO: -12.86 +-13.3 (95%) LOS: 2.9%
Total: 1000 W: 172 L: 209 D: 619
10000 @ 15+0.05 th 16 Measure 'level' metric alone
15-02-22 vin true_wedge diff
LLR: -3.11 (-2.94,2.94) [-1.50,4.50]
Total: 9889 W: 1917 L: 2002 D: 5970
sprt @ 15+0.05 th 1 True wedge take 2, with less generous bonus and pointless maths removed. Also merged in latest master.
15-02-22 Roc CenterWedge diff
LLR: -2.94 (-2.94,2.94) [0.00,6.00]
Total: 17925 W: 3000 L: 3016 D: 11909
sprt @ 60+0.05 th 1 Double protected center pawn on rank 5 and above. S(15, 0) @ LTC
15-02-22 mco measure_level diff
ELO: -12.95 +-3.7 (95%) LOS: 0.0%
Total: 10926 W: 1612 L: 2019 D: 7295
20000 @ 15+0.05 th 16 Measure how much value there is in late join
15-02-22 zar doubled_a diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 5295 W: 1028 L: 1121 D: 3146
sprt @ 15+0.05 th 1 Don't penalize doubled if lever or supporter
15-02-22 jos space_threshold diff
ELO: 0.05 +-2.5 (95%) LOS: 51.5%
Total: 30000 W: 5935 L: 5931 D: 18134
30000 @ 15+0.05 th 1 Probably this threshold is not elo-sensitive enough, so first do a quick measure before pushing a sprt test. Last try.
15-02-22 Roc WedgeNoBind diff
ELO: -4.48 +-3.0 (95%) LOS: 0.2%
Total: 20000 W: 3851 L: 4109 D: 12040
20000 @ 15+0.05 th 1 Final try, with S(20,20) value.
15-02-23 Roc Apex diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 61287 W: 12291 L: 12229 D: 36767
sprt @ 15+0.05 th 1 Apex: pawns which are supported twice, Give a bonus for any such pawn based on 1/3 of Connected[opposed][0][rank]
15-02-23 sni sorting_moves diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 12250 W: 2415 L: 2488 D: 7347
sprt @ 15+0.05 th 1 Use the evaluation function to help sorting quiet moves if depth >= 8. Does the better move ordering compensate for the time we spend there?
15-02-23 sni sorting_moves diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 3603 W: 661 L: 758 D: 2184
sprt @ 15+0.05 th 1 Take 2: give more importance to the eval.
15-02-23 mco parentsCount diff
ELO: -12.13 +-13.0 (95%) LOS: 3.4%
Total: 974 W: 157 L: 191 D: 626
10000 @ 15+0.05 th 16 Try harder to late join
15-02-23 Roc Apex_V2 diff
LLR: 0.08 (-2.94,2.94) [-1.50,4.50]
Total: 33477 W: 6708 L: 6613 D: 20156
sprt @ 15+0.05 th 1 Last one was 1/4. This is with 1/3, and merged with latest master (which has new Connected values)
15-02-23 Roc Apex_V2 diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 6549 W: 1333 L: 1209 D: 4007
sprt @ 15+0.05 th 1 1/2
15-02-23 vin blocked_centre diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 59126 W: 11710 L: 11655 D: 35761
sprt @ 15+0.05 th 1 Try increasing the weight of pawn storm against you when your centre is congested, as the blocked centre allows more time for the pawns to roll.
15-02-23 fau Extra_Penalty diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 63744 W: 12511 L: 12439 D: 38794
sprt @ 15+0.05 th 1 Extra penalty for doubled pawns in "a" and "h" columns.
15-02-23 Roc Apex_V2 diff
LLR: 2.96 (-2.94,2.94) [-1.50,4.50]
Total: 17479 W: 3503 L: 3349 D: 10627
sprt @ 15+0.05 th 1 5/8 ?
15-02-23 gli c++11 diff
ELO: -1.82 +-3.0 (95%) LOS: 11.9%
Total: 20000 W: 3916 L: 4021 D: 12063
20000 @ 15+0.05 th 1 Quick regression test, and test windows binaries for c++11
15-02-23 Fis PvInTT diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 32766 W: 6479 L: 6496 D: 19791
sprt @ 15+0.05 th 1 Optimize RootMove::insert_pv_in_tt() to make a copy of the position and skip calls to undo_move() if the PV is longer than 2. This is a non functional change but changes bench for reasons explained here https://groups.google.com/forum/?fromgroups=#!topic/fishcooking/4Phpr8VOmTU Pri -1
15-02-23 sni knight diff
LLR: -1.33 (-2.94,2.94) [-1.50,4.50]
Total: 5431 W: 1014 L: 1047 D: 3370
sprt @ 15+0.05 th 1 Raise penalty for knight attacked by pawn
15-02-23 sni sorting_moves diff
LLR: -2.97 (-2.94,2.94) [-1.50,4.50]
Total: 5855 W: 1101 L: 1192 D: 3562
sprt @ 15+0.05 th 1 sort with 4 * history + eval
15-02-23 sni sorting_moves diff
LLR: -2.96 (-2.94,2.94) [-1.50,4.50]
Total: 3755 W: 678 L: 774 D: 2303
sprt @ 15+0.05 th 1 sort with 8 * history + eval
15-02-24 Fis phaseDivorceTune diff
67746/50000 iterations
84336/100000 games played
100000 @ 15+0.05 th 1 See what happens if we divorce piece phase values from material values and tune. Pri -1
15-02-24 Roc Apex_V2 diff
LLR: 2.95 (-2.94,2.94) [0.00,6.00]
Total: 18002 W: 3037 L: 2850 D: 12115
sprt @ 60+0.05 th 1 1/2 @ LTC
15-02-24 vin blocked_centre diff
LLR: -2.95 (-2.94,2.94) [-1.50,4.50]
Total: 2896 W: 563 L: 663 D: 1670
sprt @ 15+0.05 th 1 Take 2. differentiate between a merely occupied and a blocked centre. Include shelter weakness 'worry' for open centre.
15-02-24 lbr space diff
LLR: -2.95 (-2.94,2.94) [-3.50,0.50]
Total: 6836 W: 1355 L: 1537 D: 3944
sprt @ 15+0.05 th 1 simplified space eval. take 1.
15-02-24 jos rooks diff
LLR: -2.95 (-2.94,2.94) [0.00,4.00]
Total: 71231 W: 14073 L: 13973 D: 43185
sprt @ 15+0.05 th 1 Test some new values for rooks on open or semi-open files. 50k iterations.
15-02-24 lbr space diff
LLR: -2.95 (-2.94,2.94) [-3.50,0.50]
Total: 6143 W: 1202 L: 1382 D: 3559
sprt @ 15+0.05 th 1 simplified space eval. take 2.
15-02-24 vin blocked_centre diff
LLR: -1.47 (-2.94,2.94) [-1.50,4.50]
Total: 3657 W: 690 L: 733 D: 2234
sprt @ 15+0.05 th 1 Memo to self - don't test two things at once. Disable the new shelter weakness code so as to test the impact of the take two centre block code alone.
15-02-24 vin blocked_centre_spsa diff
25322/25000 iterations
50000/50000 games played
50000 @ 15+0.05 th 1 (Re-post after incorrect branch) Since the weighting of ShelterWeakness in the open centre case had a big Elo impact, try tuning this parameter first.