a Department of Computer Science, Hong Kong University of Science and Technology, Hong Kong, PR China
b Department of Computer Science, Aalborg University, Aalborg, Denmark
Abstract:
The naive Bayes model makes the often unrealistic assumption that the feature variables are mutually independent given the class variable. We interpret a violation of this assumption as an indication of the presence of latent variables, and we show how latent variables can be detected. Latent variable discovery is interesting, especially for medical applications, because it can lead to a better understanding of application domains. It can also improve classification accuracy and boost user confidence in classification models.