首页 | 本学科首页   官方微博 | 高级检索  
     

“去重”在肺癌NGS数据分析中的重要作用
引用本文:陈宇,张绪超,郭伟浜,颜文青,谢至,吕志异,卢丹霞,黄迎. “去重”在肺癌NGS数据分析中的重要作用[J]. 肿瘤防治研究, 2018, 45(1): 1-4. DOI: 10.3971/j.issn.1000-8578.2018.17.0954
作者姓名:陈宇  张绪超  郭伟浜  颜文青  谢至  吕志异  卢丹霞  黄迎
作者单位:510080 广州,广东省人民医院,广东省医学科学院,广东省肺癌研究所
基金项目:广州市科技计划科学研究专项一般项目(201607010391);广东省科技计划公益研究与能力建设专项(2014A020212225);广东省科技计划应用型科技研发专项资金项目(2016B020237006);广东省人民医院院内临床研究专项(2014zh006)
摘    要:目的 探讨“不去重”和“去重”两种方法分析后各NGS相关指标间的差异,研究“去重”在靶向捕获NGS数据分析中的重要作用。方法 通过对58例肺癌患者的DNA样本进行靶向286基因探针杂交方法建库并进行NGS检测,每例NGS检测数据分别进行“去重”和“不去重”两种方法的生物信息学分析,比较两种方法分析得到的NGS相关指标Mapped Reads(可比对上reads比例)、On Target(靶向区域reads比例)、Mean Depth(平均测序深度)以及Uniformity(均一性)等数据之间的差异。结果 “不去重”和“去重”两种方法分析得到平均Mapped Reads、On Target、Mean Depth和Uniformity值,各组指标间差异均有统计学意义(P<0.001)。“去重”方法分析时,Mapped Reads、On Target和MeanDepth 3个指标均呈现出血浆样本与其他3种类型样本间(FFPE、穿刺和手术)差异有统计学意义,在Uniformity指标上则呈现出4种类型样本间差异均无统计学意义。而“不去重”方法分析后结果正好相反。结论 去除PCR扩增所致重复序列的“去重”步骤在NGS数据分析中具有重要的作用,能够改善结果均一性,真实反映所测样本的DNA模板数量及等位基因频率(AF值,allele frequency)。“去重”后结果更有助于临床决策。

关 键 词:NGS  数据分析  去重  肺癌  
收稿时间:2017-08-07

Important Role of Deduplication Method in Next Generation Sequencing Data Analysis of Non-small Cell Lung Cancer
CHEN Yu,ZHANG Xuchao,GUO Weibang,YAN Wenqing,XIE Zhi,LYU Zhiyi,LU Danxia,HUANG Ying. Important Role of Deduplication Method in Next Generation Sequencing Data Analysis of Non-small Cell Lung Cancer[J]. Cancer Research on Prevention and Treatment, 2018, 45(1): 1-4. DOI: 10.3971/j.issn.1000-8578.2018.17.0954
Authors:CHEN Yu  ZHANG Xuchao  GUO Weibang  YAN Wenqing  XIE Zhi  LYU Zhiyi  LU Danxia  HUANG Ying
Affiliation:Guangdong Lung Cancer Institute, Guangdong General Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510080, China
Abstract:Objective To investigate the important role of deduplication method in next generation sequencing(NGS) data analysis by comparing the difference of indicators obtained from deduplication and duplication methods. Methods Tumor samples of a cohort of 58 NSCLC patients were collected. A panel of 286 genes was tested by NGS. The NGS data were analyzed by de-duplication method and common duplication method, respectively. Indicators of “Mapped Reads, On Target, Mean Depth and Uniformity” were compared. Results The differences of Mapped Reads, On Target, Mean Depth and Uniformity were statistically significant between two methods, respectively(P<0.001). Mapped Reads and On Target and Mean Depth analyzed by de-duplication method were found significantly different between plasma sample and other three types of samples, ie., formalin fixed and paraffin embedded(FFPE) sample and puncture biopsy and surgical tissue sample; while Uniformity was generated without significant difference between the four types of samples. The results by duplication analysis were opposite. Conclusion Deduplication step plays an important role in NGS data analysis, which could improve the Uniformity and reflect the real DNA template amount and allele frequency of genomic alterations. Deduplication result is helpful for clinical decision.
Keywords:Next generation sequencing(NGS)  Data analysis  De-duplication  Lung cancer  
点击此处可从《肿瘤防治研究》浏览原始摘要信息
点击此处可从《肿瘤防治研究》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号