Query-oriented single-document summarization using unsupervised deep learning

Yousefiazar, Mahmood

doi:10.25949/19431539.v1

thesis

posted on 2022-03-28, 10:35 authored by Mahmood Yousefiazar

Over the past half century, machine-based text summarization has been addressed from many different perspectives in a variety of application domains. Deep neural networks have recently shown promising results for text summarization, and this thesis explores this application domain. In this study, a deep auto-encoder is used to rank sentences based on their most salient information. More precisely, a deep neural network is used for extractive query-oriented single-document summarization. The use of an Ensemble Noisy Auto-Encoder (ENAE) for this task is also evaluated. ENAE is a stochastic version of an auto-encoder that adds noise to the input text and selects the top sentences from an ensemble of runs. Our experiments show that, although a deep auto-encoder can be an effective summarizer, results improve when the auto-encoder is trained with stochastic noise in the input and run multiple times with different noise. The architecture of ENAE changes the application of the auto-encoder from a deterministic feed-forward network to a stochastic model. To cover a wide range of topics and structures, we perform experiments on two different publicly available email corpora that are specifically designed for text summarization.
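
The ensemble idea in the abstract (add noise to the input, rank sentences, repeat, then keep the sentences that are selected most often) can be illustrated with a minimal sketch. This is not the thesis implementation. Several pieces are assumptions made for illustration only: the `encode` function is a single fixed tanh layer standing in for a trained deep auto-encoder, sentences are ranked by cosine similarity to the encoded query, the noise is Gaussian, and the runs are combined by simple vote counting. All function and parameter names are hypothetical.

```python
import numpy as np

def encode(x, weights):
    """Placeholder encoder (one tanh layer); a trained deep auto-encoder would go here."""
    return np.tanh(x @ weights)

def cosine(a, b):
    """Cosine similarity, returning 0.0 when either vector has zero norm."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

def rank_once(sentences, query, weights, noise_std, rng):
    """One run: add Gaussian noise to the inputs, encode, rank by similarity to the query."""
    noisy_query = query + rng.normal(0.0, noise_std, size=query.shape)
    query_code = encode(noisy_query, weights)
    scores = []
    for s in sentences:
        noisy_s = s + rng.normal(0.0, noise_std, size=s.shape)
        scores.append(cosine(encode(noisy_s, weights), query_code))
    return np.argsort(scores)[::-1]  # sentence indices, best first

def ensemble_select(sentences, query, weights, k=2, runs=10, noise_std=0.05, seed=0):
    """Assumed aggregation rule: each run votes for its top-k sentences."""
    rng = np.random.default_rng(seed)
    votes = np.zeros(len(sentences), dtype=int)
    for _ in range(runs):
        top = rank_once(sentences, query, weights, noise_std, rng)[:k]
        votes[top] += 1
    # Most-voted sentences first; a stable sort keeps document order on ties.
    order = np.argsort(-votes, kind="stable")
    return sorted(order[:k].tolist())  # selected indices in document order

# Toy input: 3-dimensional term vectors for four sentences and one query.
sentences = [
    np.array([1.0, 0.0, 0.0]),
    np.array([0.0, 1.0, 0.0]),
    np.array([0.9, 0.1, 0.0]),
    np.array([0.0, 0.0, 1.0]),
]
query = np.array([1.0, 0.0, 0.0])
weights = np.eye(3)  # identity weights keep the toy example easy to follow

summary_indices = ensemble_select(sentences, query, weights, k=2)
```

With `noise_std=0.0` every run produces the same ranking, so the sketch reduces to a deterministic ranker; a non-zero value makes each run differ, which is the stochastic behaviour the abstract describes.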

History

Notes

Empirical thesis. Bibliography: pages 53-59

Awarding Institution

Macquarie University

Degree Type

Thesis MRes

Degree

MRes, Macquarie University, Faculty of Science and Engineering, Department of Computing

Department, Centre or School

Department of Computing

Year of Award

2015

Principal Supervisor

Leonard G. C. Hamey

Additional Supervisor 1

Mark Dras

Rights

Copyright Mahmood Yousefiazar 2015. Copyright disclaimer: http://www.copyright.mq.edu.au

Language

English

Extent

1 online resource (viii, 64 pages) diagrams, graphs, tables

Former Identifiers

mq:47061 http://hdl.handle.net/1959.14/1089590

Keywords

auto-encoder, text summarization, Abstracts, deep learning, Neural networks (Computer science), unsupervised learning, Computational linguistics

Licence

In Copyright
