Saher Esmeir and Shaul Markovitch. Anytime Learning of Anycost Classifiers. Machine Learning, 82:445-473 2011.
The classification of new cases using a predictive model incurs two types of costs--testing costs and misclassification costs. Recent research efforts have resulted in several novel algorithms that attempt to produce learners that simultaneously minimize both types. In many real life scenarios, however, we cannot afford to conduct all the tests required by the predictive model. For example, a medical center might have a fixed predetermined budget for diagnosing each patient. For cost bounded classification, decision trees are considered attractive as they measure only the tests along a single path. In this work we present an anytime framework for producing decision-tree based classifiers that can make accurate decisions within a strict bound on testing costs. These bounds can be known to the learner, known to the classifier but not to the learner, or not predetermined. Extensive experiments with a variety of datasets show that our proposed framework produces trees with lower misclassification costs along a wide range of testing cost bounds.
@article{Esmeir:2011:ALA,
Author = {Saher Esmeir and Shaul Markovitch},
Title = {Anytime Learning of Anycost Classifiers},
Year = {2011},
Journal = {Machine Learning},
Volume = {82},
Number = {3},
Month = {},
Pages = {445--473},
Url = {http://www.cs.technion.ac.il/~shaulm/papers/pdf/Esmeir-Markovitch-MLJ2011.pdf},
Keywords = {Anytime Algorithms, Anytime Learning, Decision Tree Induction, Cost-Sensitive Learning},
Abstract = {
The classification of new cases using a predictive model incurs
two types of costs--testing costs and misclassification costs.
Recent research efforts have resulted in several novel algorithms
that attempt to produce learners that simultaneously minimize both
types. In many real life scenarios, however, we cannot afford to
conduct all the tests required by the predictive model. For
example, a medical center might have a fixed predetermined budget
for diagnosing each patient. For cost bounded classification,
decision trees are considered attractive as they measure only the
tests along a single path. In this work we present an anytime
framework for producing decision-tree based classifiers that can
make accurate decisions within a strict bound on testing costs.
These bounds can be known to the learner, known to the classifier
but not to the learner, or not predetermined. Extensive
experiments with a variety of datasets show that our proposed
framework produces trees with lower misclassification costs along
a wide range of testing cost bounds.
}
}