Show simple item record

dc.contributor.advisorNewell, John
dc.contributor.authorAbedin, Md Jaynal
dc.date.accessioned2020-09-25T08:26:51Z
dc.date.issued2020-09-24
dc.identifier.urihttp://hdl.handle.net/10379/16188
dc.description.abstractData Science is the new and exciting interdisciplinary response that has emerged as a consequence of the staggering amounts of data generated in many new forms from digital images to audio to text. It is an interdisciplinary field involving Statistics, Computer Science and Mathematics. It involves the study of data, how they are collected, stored, accessed, visualised, modelled and ultimately used to inform decision making by turning data into intelligence. Despite this ’data revolution’ and the development of Data Science as a consequence, the aim of any data analysis is still the same, to make inference about unknown population parameters using sample statistics. One fundamental challenge in inference is the identification of outliers. Such oddities, or atypical observations, could be indicative of poor data management or biased sampling. In this situation the presence of such outliers are considered a negative aspect and efforts are needed to account for them (e.g. correct data entry errors) accordingly to avoid introducing bias in parameter estimation. On the other hand, finding an outlier may be the key focus of the exercise as an outlier may represent something new and novel. Many statistical methods have been developed to identify outlying data points and robust methods developed to account for outliers in statistical models. A central property of all such methods is that an observation is classified as an outlier or not (i.e. a binary decision); being able to quantify an observations ’outlyingness’ is clearly an attractive alternative. In this thesis, a novel method is presented for outlier detection in multivariate data based on the idea of a statistical depth function. The proposed approach enables outlier detection in multivariate data while taking into consideration the local geometry of the underlying probability distribution.en_IE
dc.publisherNUI Galway
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Ireland
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/3.0/ie/
dc.subjectData-depthen_IE
dc.subjectoutlier detectionen_IE
dc.subjecttopic modellingen_IE
dc.subjectEngineeringen_IE
dc.subjectData scienceen_IE
dc.subjectMathematics, Statistics and Applied Mathematicsen_IE
dc.subjectComputer Scienceen_IE
dc.subjectStatistiscsen_IE
dc.subjectMathematicsen_IE
dc.titleA modified depth function for outlier detection in multivariate data with applicationsen_IE
dc.typeThesisen
dc.local.noteThis PhD in Data Science presents a novel approach for uncovering evolving research themes in literature reviews in any domain using topic modelling and a new method to identify and visualise outliers in multivariate data using a modified statistical depth functionen_IE
dc.description.embargo2021-09-14
dc.local.finalYesen_IE
nui.item.downloads106


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Ireland
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland