Egaroucid Technology

Benchmarks

I used two benchmarks to evaluate Egaroucid. The first is the FFO endgame test suite, which measures the speed of complete endgame search. The second is a set of matches against older versions of Egaroucid and Edax 4.4. To test the strength of the evaluation function, I used no book and used XOT for the starting positions.

The FFO endgame test suite

The endgame search is evaluated by 3 features:

The most important feature for users is the search time. It is reported as the actual time in seconds taken to solve the FFO endgame test suite #40 to #59. Lower is better.

To shorten the search time, we can do two things: decrease the number of nodes visited, and increase the number of nodes visited per unit time (nodes per second).
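The relationship between these quantities can be sketched in a few lines of Python. This is only an illustration: the function name search_time and the node counts and speeds are hypothetical values chosen for the example, not measurements from Egaroucid.

```python
def search_time(nodes: int, nodes_per_second: float) -> float:
    """Return the search time in seconds for a given node count and speed."""
    if nodes_per_second <= 0:
        raise ValueError("nodes_per_second must be positive")
    return nodes / nodes_per_second


# Hypothetical baseline: 1e9 nodes visited at 5e8 nodes per second.
baseline = search_time(1_000_000_000, 500_000_000)

# Option 1: visit fewer nodes (for example, through better pruning).
fewer_nodes = search_time(500_000_000, 500_000_000)

# Option 2: visit the same nodes faster (for example, through a faster move generator).
faster_visits = search_time(1_000_000_000, 1_000_000_000)

print(baseline, fewer_nodes, faster_visits)
```

In this sketch, halving the node count and doubling the node rate each cut the search time in half, which is why both are tracked alongside the time itself.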

Here are some graphs of the results of the FFO endgame test suite on a Core i9 13900K.

Battles with XOT

The best way to evaluate the strength of an Othello AI is to have it play battles against other engines. The results of battles among the versions of Egaroucid and Edax 4.4 are below.

To avoid playing the same lines repeatedly, I used XOT for the starting positions. Each battle is played at level 1 (lookahead depth 1 in the midgame, 2 in the endgame).

Name     Winning Rate
7.0.0    0.5643
6.5.0    0.5648
6.4.0    0.4980
6.3.0    0.4598
6.1.0    0.5113
6.0.0    0.4592
Edax     0.4425
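As a rough illustration of how a winning rate like those above could be computed from raw results, here is a minimal Python sketch. It assumes a draw counts as half a win, which is a common convention but is not stated on this page; the function name winning_rate and the game counts are hypothetical.

```python
def winning_rate(wins: int, draws: int, losses: int) -> float:
    """Winning rate over all games, counting each draw as half a win (assumption)."""
    games = wins + draws + losses
    if games == 0:
        raise ValueError("at least one game is required")
    return (wins + 0.5 * draws) / games


# Hypothetical record: 5 wins, 2 draws, 3 losses out of 10 games.
rate = winning_rate(5, 2, 3)
print(f"{rate:.4f}")
```

Under this convention, a rate above 0.5 means the engine scored better than its opponents on average.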

A more detailed log is available here.

Egaroucid 6.2.0 is omitted because it has the same evaluation function as 6.3.0.

Details

Detailed benchmarks are available for each version, including older ones.

Version        Date
7.0.0          2024/04/17
6.5.0          2023/10/25
6.4.0          2023/09/01
6.3.0          2023/07/09
6.2.0          2023/03/15
6.1.0          2022/12/23
6.0.0          2022/10/10
5.10.0         2022/06/08
5.9.0          2022/06/07
5.8.0          2022/05/13
5.7.0          2022/03/26
5.5.0/5.6.0    2022/03/16
5.4.1          2022/03/02

Technology Explanation

I wrote the Technology Explanation only in Japanese. Please translate it yourself.

Download Transcript

A huge dataset of games played by Egaroucid is available. Please see the Download Transcript page.