首页 | 本学科首页   官方微博 | 高级检索  
     


A constrained-syntax genetic programming system for discovering classification rules: application to medical data sets
Authors:Bojarczuk Celia C  Lopes Heitor S  Freitas Alex A  Michalkiewicz Edson L
Affiliation:Laboratório de Bioinformática/CPGEI, Centro Federal de Educa??o Tecnológica do Paraná, CEFET-PR, Av. 7 de Setembro 3165, 80230-901 (PR), Curitiba, Brazil. celiacri@cefetpr.br
Abstract:This paper proposes a new constrained-syntax genetic programming (GP) algorithm for discovering classification rules in medical data sets. The proposed GP contains several syntactic constraints to be enforced by the system using a disjunctive normal form representation, so that individuals represent valid rule sets that are easy to interpret. The GP is compared with C4.5, a well-known decision-tree-building algorithm, and with another GP that uses Boolean inputs (BGP), in five medical data sets: chest pain, Ljubljana breast cancer, dermatology, Wisconsin breast cancer, and pediatric adrenocortical tumor. For this last data set a new preprocessing step was devised for survival prediction. Computational experiments show that, overall, the GP algorithm obtained good results with respect to predictive accuracy and rule comprehensibility, by comparison with C4.5 and BGP.
Keywords:
本文献已被 PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号