Sequential patterns mining and gene sequence visualization to discover novelty from microarray data |
| |
Authors: | Sallaberry A Pecheur N Bringay S Roche M Teisseire M |
| |
Institution: | aLaBRI, INRIA Bordeaux Sud-Ouest, Pikko, 351, cours de la Libération, 33405 Talence Cedex, France;bLIRMM, Univ. Montpellier 2 - CNRS, 161 rue Ada 34095 Montpellier Cedex 5 France;cMIAp Department, Univ. Montpellier 3, route de Mende 34199 Montpellier cedex 5, France;dCemagref, UMR TETIS, Maison de la teledetection, 500 rue Jean-François Breton, 34093 Montpellier, France |
| |
Abstract: | Data mining allow users to discover novelty in huge amounts of data. Frequent pattern methods have proved to be efficient, but the extracted patterns are often too numerous and thus difficult to analyze by end users. In this paper, we focus on sequential pattern mining and propose a new visualization system to help end users analyze the extracted knowledge and to highlight novelty according to databases of referenced biological documents. Our system is based on three visualization techniques: clouds, solar systems, and treemaps. We show that these techniques are very helpful for identifying associations and hierarchical relationships between patterns among related documents. Sequential patterns extracted from gene data using our system were successfully evaluated by two biology laboratories working on Alzheimer’s disease and cancer. |
| |
Keywords: | Visualization Data mining Bioinformatics Sequential patterns Microarray data Gene data |
本文献已被 ScienceDirect PubMed 等数据库收录! |
|