Explorative data analysis techniques and unsupervised clustering methods to support clinical assessment of Chronic Obstructive Pulmonary Disease (COPD) phenotypes |
| |
Authors: | Matteo Paoletti Gianna Camiciottoli Eleonora Meoni Francesca Bigazzi Lucia Cestelli Massimo Pistolesi Carlo Marchesi |
| |
Affiliation: | aDepartment of Systems and Computer Science, University of Florence, Florence, Italy;bDepartment of Internal Medicine, Section of Respiratory Medicine, University of Florence, Florence, Italy |
| |
Abstract: | Chronic Obstructive Pulmonary Disease (COPD) is the fourth leading cause of death worldwide and represents one of the major causes of chronic morbidity. Cigarette smoking is the most important risk factor for COPD. In these patients, the airflow limitation is caused by a mixture of small airways disease and parenchyma destruction, the relative contribution of which varies from person to person. The twofold nature of the pathology has been studied in the past and according to some authors each patient should be classified as presenting a predominantly bronchial or emphysematous phenotype. In this study we applied various explorative analysis techniques (PCA, MCA, MDS) and recent unsupervised clustering methods (KHM) to study a large dataset, acquired from 415 COPD patients, to assess the presence of hidden structures in data corresponding to the different COPD phenotypes observed in clinical practice. In order to validate our methods, we compared the results obtained from a training set of 415 patients with lung density data acquired in a test set of 93 patients who underwent HRCT (High Resolution Computerized Tomography). |
| |
Keywords: | Explorative biomedical data analysis Homogeneity analysis Biomedical data clustering Biomedical clustering Multiple Correspondence Analysis Principal Component Analysis K-Harmonic Means COPD Chronic Obstructive Pulmonary Disease Biomedical data mining |
本文献已被 ScienceDirect 等数据库收录! |
|