Library Open Repository

A Comparison of Neural Network Architectures in Reinforcement Learning in the Game of Othello

Downloads

Downloads per month over past year

Kamenetsky, D (2005) A Comparison of Neural Network Architectures in Reinforcement Learning in the Game of Othello. Honours thesis, University of Tasmania. (Unpublished)

[img]
Preview
PDF
dkThesis_Final_4.pdf | Download (1MB)
Available under University of Tasmania Standard License.

Abstract

Over the past two decades, Reinforcement Learning has emerged as a promising Machine Learning technique that is capable of solving complex dynamic problems. The benefit of this technique lies in the fact that the agent learns from its experience rather than being told directly. For problems with large state-spaces, Reinforcement Learning algorithms are combined with function approximation techniques, such as neural networks. The architecture of the neural networks plays a significant role in the agent's learning. Past research has demonstrated that networks with a constructive architecture outperform those with a fixed architecture on some benchmark problems. This study compares the performance of these two architectures in Othello - a complex deterministic board game. Three networks are used in the comparison: two with constructive architecture - Cascade and Resource Allocating Network, and one with fixed architecture - Multilayer Perceptron. Investigation is also made with respect to input representation, number of hidden nodes and other parameters used by the networks. Training is performed with both on-policy (Sarsa) and off-policy (Q-Learning) algorithms. Results show that agents were able to learn the positional strategy (novice strategy in Othello) and could beat each of the three built-in opponents. Agents trained with Multilayer Perceptron perform better, but converge slower than those trained with Cascade.

Item Type: Other
Keywords: neural networks, reinforcement learning, Othello
Publisher: Honours thesis, University of Tasmania
Date Deposited: 07 Jan 2006
Last Modified: 18 Nov 2014 03:10
URI: http://eprints.utas.edu.au/id/eprint/252
Item Statistics: View statistics for this item

Repository Staff Only (login required)

Item Control Page Item Control Page