EUDML

Evolving small-board Go players using coevolutionary temporal difference learning with archives

Krzysztof Krawiec; Wojciech Jaśkowski; Marcin Szubert — 2011

International Journal of Applied Mathematics and Computer Science

We apply Coevolutionary Temporal Difference Learning (CTDL) to learn small-board Go strategies represented as weighted piece counters. CTDL is a randomized learning technique which interweaves two search processes that operate in the intra-game and inter-game mode. Intra-game learning is driven by gradient-descent Temporal Difference Learning (TDL), a reinforcement learning method that updates the board evaluation function according to differences observed between its values for consecutively visited...

The performance profile: A multi-criteria performance evaluation method for test-based problems

Wojciech Jaśkowski; Paweł Liskowski; Marcin Szubert; Krzysztof Krawiec — 2016

International Journal of Applied Mathematics and Computer Science

In test-based problems, solutions produced by search algorithms are typically assessed using average outcomes of interactions with multiple tests. This aggregation leads to information loss, which can render different solutions apparently indifferent and hinder comparison of search algorithms. In this paper we introduce the performance profile, a generic, domain-independent, multi-criteria performance evaluation method that mitigates this problem by characterizing the performance of a solution by...

Advanced Search

Formula preview

Currently displaying 1 – 2 of 2

Evolving small-board Go players using coevolutionary temporal difference learning with archives

The performance profile: A multi-criteria performance evaluation method for test-based problems