Tweak fwdOut Multipliers
Description
Use different fwdOut multipliers for each subnetwork. These are SPSA tuned alongside L1 biases.
To apply the tuned parameters, I just scanned bytes in NNUE files and replaced matching patterns with them, because it was too tedious for me to calculate offsets.
Branches
Tests
SPSA #1
Test #1
Failed
Parameters obtained after 15k iterations.
Test #2
Failed
Parameters obtained after 33k iterations.
SPSA #2
The first SPSA tune session was not good, presumably due to too high values. Following linrock and Viren's suggestion, the second SPSA test is launched with much lower values (128).
Test #3
Failed
Parameters obtained after 15,445 iterations.
Test #4
Failed
Parameters obtained after 31,369 iterations.
Test #5
Passed
Failed
Parameters obtained after 150,000 iterations. In some games it double kills master, but the overall strength seems equal.
Test #6
Failed
Same L1 bias values as Test #5 but average fwdOutMultiplier
value (584) is applied.
Test #7
Passed
Failed
Same L1 bias values as Test #5 but keep the original fwdOutMultiplier
value. It also yields more double kills but not as effective as Test #5.
Last updated