Show simple item record

dc.contributor.advisorNewell, John
dc.contributor.authorWall, Deirdre
dc.date.accessioned2014-01-22T11:54:22Z
dc.date.available2014-01-22T11:54:22Z
dc.date.issued2014-01-08
dc.identifier.urihttp://hdl.handle.net/10379/3989
dc.description.abstractThe main aim of my PhD is to create a prognostic model for invasive breast cancer patients for disease recurrence and death. The data were collected retrospectively and are comprised of 647 invasive breast cancer patients with patient characteristics and genetic markers measured. An additional complexity exists due to the presence of missing data. A complete case analysis with both clinical and pathological biomarkers reduces the number of cases to 103 patients. A major challenge is how best to build a prognostic model for breast cancer in the presence of missing data. The Kaplan Meier estimate of the survival function is the most commonly used method for the representation of the distribution of survival times. Extensions to graphical comparisons of these survival estimates were developed. Classical approaches to modelling survival data using complete case analysis are examined and then an empirical simulation study is used to examine the effect of missing data on variable selection and to compare the performance of variable selection techniques in imputed data. The final model identified Bilateral, Lymph Node status, Mitotic Count, Metastasis and UICC staging as being good predictors of Disease Free Survival and a subset of these for Overall Survival (Mitotic Count, Metastasis and UICC staging). These models have good concordance and were calibrated both internally and externally. Classification and Regression Trees (CART) are a non-parametric approach to regression modelling. The main feature of CART is the data are recursively partitioned into groups and a simple prediction model fitted to each partition. A novel approach using surrogate splits to create alternative competing trees with comparable prediction power are introduced. This helps identify underlying structure in the data.en_US
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Ireland
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/3.0/ie/
dc.subjectClassification and regression treesen_US
dc.subjectSurvival analysisen_US
dc.subjectStatisticsen_US
dc.subjectSurrogate splitsen_US
dc.subjectBreast canceren_US
dc.subjectMathematicsen_US
dc.subjectPrognostic modelen_US
dc.subjectMathematics, Statistics and Applied Mathematicsen_US
dc.titleIntegration of Genetic Biomarkers in Prognostic Models for Breast Cancer Survivalen_US
dc.typeThesisen_US
dc.contributor.funderNational Breast Cancer Research Instituteen_US
dc.local.noteIn this thesis, a prognostic model for breast cancer patients for disease recurrence and death is created. An on-line calculator was created to make the prognostic model easier for clinicians and patients to interpret. A novel approach to identifying structure in Classification and Regression Trees is also introduced.en_US
dc.local.finalYesen_US
nui.item.downloads1979


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Ireland
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland