首页 | 本学科首页   官方微博 | 高级检索  
检索        


Epigenetic age estimation in saliva and in buccal cells
Institution:1. Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain;2. CITMAga (Center for Mathematical Research and Technology of Galicia), University of Santiago de Compostela, Spain;3. Faculty of Mathematics, University of Santiago de Compostela, Spain
Abstract:Age estimation based on epigenetic markers is a DNA intelligence tool with the potential to provide relevant information for criminal investigations, as well as to improve the inference of age-dependent physical characteristics such as male pattern baldness or hair color. Age prediction models have been developed based on different tissues, including saliva and buccal cells, which show different methylation patterns as they are composed of different cell populations. On many occasions in a criminal investigation, the origin of a sample or the proportion of tissues is not known with certainty, for example the provenance of cigarette butts, so use of combined models can provide lower prediction errors.In the present study, two tissue-specific and seven age-correlated CpG sites were selected from publicly available data from the Illumina HumanMethylation 450 BeadChip and bibliographic searches, to help build a tissue-dependent, and an age-prediction model, respectively. For the development of both models, a total of 184 samples (N = 91 saliva and N = 93 buccal cells) ranging from 21 to 86 years old were used. Validation of the models was performed using either k-fold cross-validation and an additional set of 184 samples (N = 93 saliva and N = 91 buccal cells, 21–86 years old).The tissue prediction model was developed using two CpG sites (HUNK and RUNX1) based on logistic regression that produced a correct classification rate for saliva and buccal swab samples of 88.59 % for the training set, and 83.69 % for the testing set. Despite these high success rates, a combined age prediction model was developed covering both saliva and buccal cells, using seven CpG sites (cg10501210, LHFPL4, ELOVL2, PDE4C, HOXC4, OTUD7A and EDARADD) based on multivariate quantile regression giving a median absolute error (MAE): ± 3.54 years and a correct classification rate ( %CP±PI) of 76.08 % for the training set, and an MAE of ± 3.66 years and a %CP±PI of 71.19 % for the testing set. The addition of tissue-of origin as a co-variate to the model was assessed, but no improvement was detected in age predictions. Finally, considering the limitations usually faced by forensic DNA analyses, the robustness of the model and the minimum recommended amount of input DNA for bisulfite conversion were evaluated, considering up to 10 ng of genomic DNA for reproducible results. The final multivariate quantile regression age predictor based on the models we developed has been placed in the open-access Snipper forensic classification website.
Keywords:DNA methylation  Forensic age estimation  Logistic regression  Quantile regression  SNaPshot  Saliva  Buccal swab  Buccal cells
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号