dc.contributor.advisor | Newell, John | |
dc.contributor.author | Wall, Deirdre | |
dc.date.accessioned | 2014-01-22T11:54:22Z | |
dc.date.available | 2014-01-22T11:54:22Z | |
dc.date.issued | 2014-01-08 | |
dc.identifier.uri | http://hdl.handle.net/10379/3989 | |
dc.description.abstract | The main aim of my PhD is to create a prognostic model for invasive breast cancer patients for disease recurrence and death. The data were collected retrospectively
and are comprised of 647 invasive breast cancer patients with patient characteristics and genetic markers measured. An additional complexity exists due to the presence of missing data. A complete case analysis with both clinical and pathological biomarkers reduces the number of cases to 103 patients. A major challenge is how best to build a prognostic model for breast cancer in the presence of missing data.
The Kaplan Meier estimate of the survival function is the most commonly used method for the representation of the distribution of survival times. Extensions to graphical comparisons of these survival estimates were developed. Classical approaches to modelling survival data using complete case analysis are examined and then an empirical simulation study is used to examine the effect of missing data on variable selection and to compare the performance of variable selection techniques in imputed data.
The final model identified Bilateral, Lymph Node status, Mitotic Count, Metastasis and UICC staging as being good predictors of Disease Free Survival and a subset of these for Overall Survival (Mitotic Count, Metastasis and UICC staging). These models have good concordance and were calibrated both internally and externally.
Classification and Regression Trees (CART) are a non-parametric approach to regression modelling. The main feature of CART is the data are recursively partitioned into groups and a simple prediction model fitted to each partition. A novel approach using surrogate splits to create alternative competing trees with comparable prediction power are introduced. This helps identify underlying structure in the data. | en_US |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Ireland | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/3.0/ie/ | |
dc.subject | Classification and regression trees | en_US |
dc.subject | Survival analysis | en_US |
dc.subject | Statistics | en_US |
dc.subject | Surrogate splits | en_US |
dc.subject | Breast cancer | en_US |
dc.subject | Mathematics | en_US |
dc.subject | Prognostic model | en_US |
dc.subject | Mathematics, Statistics and Applied Mathematics | en_US |
dc.title | Integration of Genetic Biomarkers in Prognostic Models for Breast Cancer Survival | en_US |
dc.type | Thesis | en_US |
dc.contributor.funder | National Breast Cancer Research Institute | en_US |
dc.local.note | In this thesis, a prognostic model for breast cancer patients for disease recurrence and death is created. An on-line calculator was created to make the prognostic model easier for clinicians and patients to interpret. A novel approach to identifying structure in Classification and Regression Trees is also introduced. | en_US |
dc.local.final | Yes | en_US |
nui.item.downloads | 1979 | |