ARAN - Access to Research at NUI Galway

The Effect of Principal Component Analysis on Machine Learning Accuracy with High Dimensional Spectral Data

dc.contributor.author Ryder, Alan G. en
dc.contributor.author O Connell, Marie-Louise en
dc.contributor.author Madden, Michael G. en
dc.contributor.author Howley, Tom en
dc.date.accessioned 2009-05-15T10:47:16Z en
dc.date.available 2009-05-15T10:47:16Z en
dc.date.issued 2005 en
dc.identifier.citation "The Effect of Principal Component Analysis on Machine Learning Accuracy with High Dimensional Spectral Data", Tom Howley, Michael G. Madden, Marie-Louise O Connell and Alan G. Ryder. Proceedings of AI-2005, 25th International Conference on Innovative Techniques and Applications of Artificial Intelligence, Cambridge, Dec 2005. en
dc.identifier.uri http://hdl.handle.net/10379/194 en
dc.description.abstract The classification of high dimensional data, such as images, gene-expression data and spectral data, poses an interesting challenge to machine learning, as the presence of high numbers of redundant or highly correlated attributes can seriously degrade classification accuracy. This paper investigates the use of Principal Component Analysis (PCA) to reduce the dimensionality of high dimensional data and to improve the predictive performance of some well-known machine learning methods. Experiments are carried out on a high dimensional spectral dataset, in which the task is to identify a target material within a mixture. These experiments employ the NIPALS (Non-linear Iterative Partial Least Squares) PCA method, which has been used in the field of chemometrics for spectral classification and is a more efficient alternative to the widely used eigenvector decomposition approach. The experiments show that the use of this PCA method can improve the performance of machine learning in the classification of high dimensional data. en
dc.language.iso en en
dc.subject Image processing en
dc.subject Chemometrics en
dc.subject Machine learning en
dc.subject High dimensional spectral data en
dc.subject.lcsh Image processing en
dc.subject.lcsh Chemometrics en
dc.subject.lcsh Machine learning en
dc.title The Effect of Principal Component Analysis on Machine Learning Accuracy with High Dimensional Spectral Data en
dc.type Conference Paper en
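
The abstract names NIPALS as the PCA method used in the experiments. As a rough illustration only, the general NIPALS iteration can be sketched as below. This is not the authors' implementation: the function name `nipals_pca`, its parameters (`n_components`, `tol`, `max_iter`), the starting column heuristic and the convergence test are all assumptions made for this sketch, and it uses plain Python lists rather than any particular numerical library.

```python
import math


def nipals_pca(rows, n_components, tol=1e-10, max_iter=1000):
    """Sketch of NIPALS PCA (hypothetical helper, not from the paper).

    rows: list of equal-length lists of floats (samples x attributes).
    Returns (scores, loadings), one list per extracted component.
    """
    n, m = len(rows), len(rows[0])
    # Mean-centre each attribute; PCA is defined on centred data.
    means = [sum(r[j] for r in rows) / n for j in range(m)]
    X = [[r[j] - means[j] for j in range(m)] for r in rows]

    scores, loadings = [], []
    for _ in range(n_components):
        # Assumed heuristic: start from the column with the largest sum of squares.
        col = max(range(m), key=lambda j: sum(X[i][j] ** 2 for i in range(n)))
        t = [X[i][col] for i in range(n)]
        p = [0.0] * m
        for _ in range(max_iter):
            tt = sum(v * v for v in t)
            if tt == 0.0:
                break  # no variance left to extract
            # Loading vector: p = X^T t / (t^T t), scaled to unit length.
            p = [sum(X[i][j] * t[i] for i in range(n)) / tt for j in range(m)]
            norm = math.sqrt(sum(v * v for v in p))
            p = [v / norm for v in p]
            # Score vector: t_new = X p.
            t_new = [sum(X[i][j] * p[j] for j in range(m)) for i in range(n)]
            diff = sum((a - b) ** 2 for a, b in zip(t_new, t))
            t = t_new
            # Stop once the scores no longer change (relative tolerance).
            if diff < tol * sum(v * v for v in t_new):
                break
        # Deflate: remove the extracted component before finding the next one.
        X = [[X[i][j] - t[i] * p[j] for j in range(m)] for i in range(n)]
        scores.append(t)
        loadings.append(p)
    return scores, loadings
```

Because components are extracted one at a time, only as many as are needed have to be computed, which is the usual reason NIPALS is considered cheaper than a full eigenvector decomposition when the number of attributes is large. A call such as `nipals_pca(spectra, 10)`, where `spectra` is a hypothetical list of spectra, would return the first ten score and loading vectors.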
