posted on 2025-07-04, 04:42authored byGuowei Zhang
<p dir="ltr">In recent years, the application of big data in various fields has promoted the rapid development of artificial intelligence, and anomaly detection has always attracted much attention as a hot topic in data analysis and machine learning. There are many popular methods currently used for anomaly detection, such as the K-Means, support vector machine, isolation forest, and deep neural network. However, when faced with some large-scale and high-dimensional complex data, some traditional machine learning or clustering methods cannot detect outliers very well, and many deep neural networks also use random representation to process the data so that the interpretability of the model is reduced. In this thesis, we propose a novel model called VAEiForest, which is based on the isolation forest and adds a VAE module to extract features from the data and generate new data. When the model faces high-dimensional data, it can process low-dimensional data through VAE’s dimensional reduction, and the generation of new data can play a role in data enhancement for the model. Extensive experiments also showed that under the same data set and parameters, our model has a good improvement compared to previous similar models, and the ablation experiment was conducted to verify the effectiveness of each sub-module in our model.</p>
History
Table of Contents
1 Introduction -- 2 Background -- 3 Literature review -- 4 Research problems & aims -- 5 Methodology -- 6 Experiments -- 7 Conclusions and future research -- Bibliography
Awarding Institution
Macquarie University
Degree Type
Thesis MRes
Degree
Master of Research
Department, Centre or School
School of Computing
Year of Award
2024
Principal Supervisor
Xuyun Zhang
Additional Supervisor 1
Amin Beheshti
Rights
Copyright: The Author
Copyright disclaimer: https://www.mq.edu.au/copyright-disclaimer