01whole.pdf (1.84 MB)
Download file

Optimal movement through a maze by an application of Q learning

Download (1.84 MB)
thesis
posted on 28.03.2022, 11:46 by Lu Han
Machine learning has become a very important aspect of the development of artificial intelligence (AI) in the world of computing. As a part of the machine learning techniques, reinforcement learning enables an agent to explore and study the unknown environment and through trial and error, the agent learns how to perform desired tasks in an optimum way. In this project, one of the methods to solve reinforcement problems, namely Q-learning, is studied and analysed. A maze problem is set up to count the learning time of an agent to travel from a fixed starting point A to destination B using Q-learning algorithm on Matlab platform. The learning time will be studied and then compared with different sets of maze of different size and complexity. Finally, a thorough discussion on the differences in the learning time and methods to further optimise the Q-learning algorithm will be provided. This document reports the outcome of this project and highlights the problems and challenges encountered and solutions devised.

History

Table of Contents

1. Introduction -- 2. Background theory -- 3. Q-learning algorithm -- 4. Experiment set up -- 5. Results and analysis -- 6. Conclusions and future work -- 7. Abbreviations -- Appendices -- Bibliography.

Notes

Bibliography: pages 91-93 Empirical thesis.

Awarding Institution

Macquarie University

Degree Type

Thesis bachelor honours

Degree

MRes, Macquarie University, Faculty of Science and Engineering, School of Engineering

Department, Centre or School

School of Engineering

Year of Award

2016

Principal Supervisor

Sam Reisenfeld

Rights

Copyright Lu Han 2016. Copyright disclaimer: http://mq.edu.au/library/copyright

Language

English

Extent

1 online resource (xvi, 93 pages diagrams, graphs, tables)

Former Identifiers

mq:70293 http://hdl.handle.net/1959.14/1262262

Usage metrics

Keywords

Exports