International Journal of Computer Science Issues

Interestingness Measures for Classification Rule Mining: Model Selection Ability

Pannapa Changpetch and Dennis K. J. Lin

This study analyzes the ability of interestingness measures to capture the correct predictive model through a new simulation design. The simulation is designed to be general enough to allow fair conclusions to be drawn without depending on subjective opinions. We found that the relative success of interestingness measures in capturing the correct model depends on two major factors: (1) the characteristics of the model and (2) the weight of each of three components—confidence, support, and the probability of result or class—in the measures formula. No measure was found to perform best in all scenarios. However, we found that two groups of measures work well with different scenarios and that these groups of measures complement each other. That is, in any given scenario, the measures in one or the other of these two groups would perform the best. Therefore, in actual practice, when the characteristics of data are unknown, we propose using representatives of these two groups—confidence and lift—to capture the predictive models as either of these will capture the predictive model that best fits the given scenario.

Keywords: Classification rule mining, Interestingness measures, Predictive model

Download Full-Text

ABOUT THE AUTHORS

Pannapa Changpetch
Bentley University

Dennis K. J. Lin
Penn State University

International Journal of Computer Science Issues More than a traditional journal...

Interestingness Measures for Classification Rule Mining: Model Selection Ability

International Journal of Computer Science Issues

More than a traditional journal...