首页 | 本学科首页   官方微博 | 高级检索  
     

基因位点预测的一种特征选择优化算法
引用本文:李骜,冯焕清,王涛,王明会. 基因位点预测的一种特征选择优化算法[J]. 北京生物医学工程, 2005, 24(2): 84-88
作者姓名:李骜  冯焕清  王涛  王明会
作者单位:中国科学技术大学电子科技系,合肥,230026
基金项目:中国科技大学校科研和教改项目
摘    要:目的剪接位点是真核细胞生物基因序列中外显子和内含子的相邻区域,如果能准确预测基因序列中的剪接位点,就能将基因中的表达区域和非表达区域分开.方法从机器学习的角度出发,提出了一种有效的特征选择算法用于剪接位点的建模和预测.该算法首先将初始链模型中每一对父子节点作为特征量提取,然后通过遗传算法和最大后验分类器进行特征选择.结果及结论对剪接位点数据的预测结果显示,这种新算法能够有效地优化链模型的结构,提高对剪接位点的预测能力.同时,经过优化的模型也有助于了解真核细胞中基因转录和表达的过程.

关 键 词:剪接位点  特征选择  遗传算法  生物信息学
文章编号:1002-3208(2005)02-0084-05
修稿时间:2003-11-28

An Effective Feature Selection Optimization Algorithm For Gene Splice Junction Sites Prediction
Li Ao,Feng Huanqing,WANG Tao,Wang Minghui. An Effective Feature Selection Optimization Algorithm For Gene Splice Junction Sites Prediction[J]. Beijing Biomedical Engineering, 2005, 24(2): 84-88
Authors:Li Ao  Feng Huanqing  WANG Tao  Wang Minghui
Abstract:Objective Splice junction sites are the boundaries between exons and introns in eukaryotic gene sequences. If the splice junction sites in a gene sequence could be predict correctly, the coding regions in a gene would be separated from the non-coding regions. Method This paper proposes a machine learning approach for the modeling and prediction of splice junction sites by an effective feature selection algorithm. This algorithm selects every pair of parent and child node in original chain models for splice junction sites as features, and then uses genetic algorithm and a MAP(Maximum A Posteriori) classifier to select the features. Results and Conclusion The experiment result shows that the new algorithm can optimize the chain models and improve the prediction accuracy of splice junction sites. Besides the architecture of the optimized chain models can also help to understand the procedures of gene translation and expression in eukaryotic cells.
Keywords:splice junction sites feature selection genetic algorithm bioinformatics
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号