I used two benchmarks to evaluate Egaroucid. The first is the FFO endgame test suite, which measures the speed of complete endgame search. The second is a set of matches against older versions of Egaroucid and against Edax 4.4. To test the strength of the evaluation function itself, these matches use no opening book and take their starting positions from XOT.
The endgame search is evaluated on three features:
The most important feature for users is the search time. It is reported as the actual time, in seconds, taken to solve problems #40 to #59 of the FFO endgame test suite. Lower is better.
To shorten the search time, we can do two things: reduce the number of nodes visited, and increase the number of nodes visited per unit time.
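The relationship between these quantities can be sketched in a few lines. This is a minimal illustration, assuming search time is simply the node count divided by the nodes-per-second rate; the function name and all figures are hypothetical and are not measured Egaroucid results.

```python
def search_time_seconds(nodes: int, nps: float) -> float:
    """Search time implied by a node count and a nodes-per-second rate."""
    if nps <= 0:
        raise ValueError("nps must be positive")
    return nodes / nps


# Hypothetical figures, for illustration only (not benchmark results).
baseline = search_time_seconds(nodes=1_000_000_000, nps=50_000_000)
# Halving the node count (e.g. through better pruning) halves the time.
fewer_nodes = search_time_seconds(nodes=500_000_000, nps=50_000_000)
# Doubling the nodes visited per second halves the time as well.
faster_nps = search_time_seconds(nodes=1_000_000_000, nps=100_000_000)

print(baseline, fewer_nodes, faster_nps)
```

Under this simple model the two improvements are independent, so gains from each multiply when both are applied.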
The graphs show the results of the FFO endgame test suite, run on a Core i9 13900K.
The best way to evaluate the strength of an Othello AI is to play it against other engines. The results of matches between each version of Egaroucid and Edax 4.4 are below.
To avoid repeating the same lines of play, I used XOT positions as the starting boards. Each match is played at level 1 (a lookahead depth of 1 in the midgame and 2 in the endgame).