A technique for identifying three diagnostic findings using association analysis |
| |
Authors: | Tomoaki Imamura Shinya Matsumoto Yoshiyuki Kanagawa Bunichi Tajima Shiro Matsuya Masutaka Furue Hiroshi Oyama |
| |
Affiliation: | (1) Department of Planning Information and Management, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8655, Japan;(2) Teradata Division, NCR JAPAN Ltd, Tokyo, Japan;(3) Division of Clinical Bioinformatics, Engineering Department of Clinical Bioinformatics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan;(4) Department of Dermatology, Graduate School of Medical Sciences, Kyushu University, Kyushu, Japan |
| |
Abstract: | In diagnosing diseases in clinical practice, a combination of three clinical findings is often used to represent each disease. This is largely because it is often difficult or impractical to assess for all possible combinations of symptoms and abnormal exam findings that occur in any particular disease. For most diseases, diagnostic triads are based on empirical observations. In this study, we determined diagnostic triads for chronic diseases using data mining procedures. We also verified the combinations’ validity as well as our procedure for determining them. We used symptoms and examination findings from 477 patients with chronic diseases, collected as part of a 35-year longitudinal study begun in 1968. For each patient there were 295 items from examinations in internal medicine, dermatology, ophthalmology, dentistry and blood tests. We judged each item to be either normal or abnormal, and restricted the analysis to the abnormal findings. To analyze such an exhaustive assortment, we used the data mining technique of association analysis. The analysis generated three clinical findings for each disease. Diseases were defined based on blood tests. Searching through all 295 items to find the three most useful clinical findings would be impractical on a commodity PC. However, by excluding normal items, we were able to sufficiently reduce the total number of combinations so as to make combinatorial analysis on a PC feasible. In addition to more accurate diagnoses, we believe our technique can identify those diagnostic data that are more cost effective in terms of time and other resources required for their collection. |
| |
Keywords: | Data mining Association analysis Medical data Three clinical findings Diagnostic triad |
本文献已被 PubMed SpringerLink 等数据库收录! |
|