Macquarie University
Browse
01whole.pdf (955.52 kB)

The estimation of semiparametric generalized linear models

Download (955.52 kB)
thesis
posted on 2022-03-29, 01:59 authored by Busayasachee Puang-Ngern
In this thesis, a novel method for fitting the semiparametric generalized linear model (SPGLM) is developed and tested. We demonstrate that this provides an effective model fitting algorithm to the SP-GLM, particularly, when dealing with very large data sets. We also propose another special SP-GLM and discuss how to fit this special model. This special SP-GLM assumes the canonical link function, which simplifies the algorithm to fit this model. GLMs are widely used for data analysis. However, in some applications, GLMs do not perform well in model fitting when the selected distribution for the response data is inaccurate.The SP-GLM with a nonparametric reference density extends the conventional GLMs. The SP-GLM offers flexibility in regression modelling by relaxing the requirement of a known response distribution in GLMs to only require that the response variable has a distribution from some exponential family. However, a limitation has been observed in the application of the existing SP-GLM method (Huang, 2014) on large data sets, presumably due to the significant increase in the number of constraints for the SP-GLM for large sample sizes. The proposed new SP-GLM methods in this thesis will enable to fit SP-GLM to very large data sets. In this research, the focus is on the regression coefficients estimations and inferences. An iterative algorithm is developed for estimation of the regression coefficients and the reference density simultaneously. The asymptotic properties of the estimators subject to active constraints are also provided. Performance of the proposed methods are tested through simulation studies and real data applications. The simulation results have indicated effectiveness for the methods proposed in this research, with accurate estimation of the regression coefficients, as well as inference. The conclusion reached in this research is that the proposed model fitting methods enhance the capacity of the SP-GLM to handle very large data sets with fast convergence.

History

Table of Contents

1. Introduction -- 2. Literature review -- 3. The semiparametric generalized linear model -- 4. The semiparametric generalized linear model with canonical link -- 5. Application to real data sets -- 6. Conclusions and future work -- Appendix -- References.

Notes

Bibliography: pages 137-141 Empirical thesis.

Awarding Institution

Macquarie University

Degree Type

Thesis PhD

Degree

PhD, Macquarie University, Faculty of Science and Engineering, Department of Mathematics and Statistics

Department, Centre or School

Department of Mathematics and Statistics

Year of Award

2018

Principal Supervisor

Jun Ma

Additional Supervisor 1

Ayse Bilgin

Additional Supervisor 2

Timothy Kyng

Rights

Copyright Busayasachee Puang-Ngern 2018. Copyright disclaimer: http://mq.edu.au/library/copyright

Language

English

Extent

1 online resource (xviii, 141 pages) graphs, tables

Former Identifiers

mq:71189 http://hdl.handle.net/1959.14/1271780

Usage metrics

    Macquarie University Theses

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC