Learning Bayesian networks from datasets joining continuous and discrete variables |
| |
Affiliation: | Institute of Informatics, University of Warsaw, Warsaw, Poland |
| |
Abstract: | The current paper addresses two problems observed in structure learning applications to computational biology.The first one is dealing with mixed data. Most optimization criteria for learning algorithms are applicable to either discrete or continuous data. Mixed datasets are usually handled by discretization of continuous data, which often leads to the loss of information. In order to address this problem, we adapted discrete scoring functions to continuous data. Consequently, the same score is used to both types of variables, and the network structure may be learned from mixed data directly.The second problem is the control of the type I error level. Usually, learning algorithms output a network that is the best according to some optimization criteria, but the reliability of particular relationships represented by this network is unknown. We address this problem by allowing the user to specify the expected error level and adjusting the parameters of the scoring criteria to this level. |
| |
Keywords: | Bayesian networks Learning from data |
本文献已被 ScienceDirect 等数据库收录! |
|