首页 | 本学科首页   官方微博 | 高级检索  
检索        


Optimized variant calling for estimating kinship
Institution:1. Center for Human Identification, University of North Texas Health Science Center, Fort Worth, TX, USA;2. Department of Microbiology, Immunology and Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA;3. Department of Forensic Medicine, University of Helsinki;1. University Ulm, Institute of Legal Medicine, Albert-Einstein-Allee 23, Ulm 89081, Germany;2. Ludwig-Maximilians University Munich, Geschwister-Scholl-Platz 1, München 80539, Germany;1. Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, Ministry of Justice, PR China, Shanghai 200063, China;2. Department of Forensic Medicine, Inner Mongolia Medical University, Hohhot 010110, China;1. Department of Legal Medicine, Shinshu University School of Medicine, Matsumoto, Nagano, Japan;2. Department of Genetic Identification, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands;1. Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain;2. Institute of Anthropology and Ethnology, Adam Mickiewicz University in Poznań, Poland
Abstract:One of the fundamental goals of forensic genetics is sample attribution, i.e., whether an item of evidence can be associated with some person or persons. The most common scenario involves a direct comparison, e.g., between DNA profiles from an evidentiary item and a sample collected from a person of interest. Less common is an indirect comparison in which kinship is used to potentially identify the source of the evidence. Because of the sheer amount of information lost in the hereditary process for comparison purposes, sampling a limited set of loci may not provide enough resolution to accurately resolve a relationship. Instead, whole genome techniques can sample the entirety of the genome or a sufficiently large portion of the genome and as such they may effect better relationship determinations. While relatively common in other areas of study, whole genome techniques have only begun to be explored in the forensic sciences. As such, bioinformatic pipelines are introduced for estimating kinship by massively parallel sequencing of whole genomes using approaches adapted from the medical and population genomic literature. The pipelines are designed to characterize a person’s entire genome, not just some set of targeted markers. Two different variant callers are considered, contrasting a classical variant calling algorithm (BCFtools) to a more modern deep convolution neural network (DeepVariant). Two different bioinformatic pipelines specific to each variant caller are introduced and evaluated in a titration series. Filters and thresholds are then optimized specifically for the purposes of estimating kinship as determined by the KING-robust algorithm. With the appropriate filtering and thresholds in place both tools perform similarly, with DeepVariant tending to produce more accurate genotypes, though the resultant types of inaccuracies tended to produce slightly less accurate overall estimates of relatedness
Keywords:Kinship  Genetic genealogy  Genomics  Massively parallel sequencing  Whole genome sequencing
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号