Home | Publications | CS Home

Information Filtering: Selection Mechanisms in Learning Systems


Shaul Markovitch. Information Filtering: Selection Mechanisms in Learning Systems. PhD Thesis, EECS Department, University of Michigan, 1989.


Abstract

Traditionally knowledge was considered as beneficial to the performance of problem solvers. Recent studies indicate that knowledge acquisition is not necessarily a monotonic process, and that sometimes additional knowledge leads to deterioration in system performance. This thesis studies the problem of harmful knowledge in learning systems, defines a unifying framework to deal with the problem, and tests the framework in the context of a useful learning system. Knowledge is harmful if the problem solver's performance would be improved by removing it. The information filtering model is proposed as a unifying framework for reducing or eliminating harmfulness of knowledge. The framework identifies five logical types of selection processes (filters) that can be incorporated into learning systems in order to reduce harmful knowledge: selective experience, attention, acquisition, retention and utilization. The framework is implemented and tested within the context of LASSY, a learning system that improves the performance of a PROLOG interpreter by utilizing acquired domain specific knowledge. LASSY incorporates two primary learning mechanisms: one deductive and the other inductive. The deductive mechanism is a lemma learner which uses theorems proved in the past as shortcuts for future proving. The inductive mechanism is a module that acquires average costs and number of solutions for various calling patterns that are used for reordering subgoals in rule bodies. LASSY incorporates all five types of selection mechanisms. The two novel ones are an experience filter which uses a model of the task domain to generate training problems that are likely to generate relevant knowledge and utilization filter which is used for filtering out the usage of lemmas in branches of the search tree which have high probability of failing. The system's performance was improved by a factor of two when using the lemma learner and its filters. The subgoal reordering improved the system performance by a factor of 23. When combined, the system's performance was improved by a factor of 36. It is concluded that it is not beneficial to be greedy, even for knowledge, and that being selective by inserting information filters can reduce the harmfulness of knowledge in learning systems.


Keywords: Utility Problem, Selective Learning, Active Learning, Lemma Learning
Secondary Keywords:
Online version:
Bibtex entry:
 @phdthesis{Markovitch:1989:IFS,
  Author = {Shaul Markovitch},
  Title = {Information Filtering: Selection Mechanisms in Learning Systems},
  Year = {1989},
  School = {EECS Department, University of Michigan},
  Url = {http://www.cs.technion.ac.il/~shaulm/papers/pdf/Markovitch-thesis.pdf},
  Keywords = {Utility Problem, Selective Learning, Active Learning, Lemma Learning},
  Secondary-keywords = {Learning to Order, Information Filtering, Forgetting, Selective Utilization, Deductive Learning, Explanation-Based Learning},
  Abstract = {
    Traditionally knowledge was considered as beneficial to the
    performance of problem solvers. Recent studies indicate that
    knowledge acquisition is not necessarily a monotonic process, and
    that sometimes additional knowledge leads to deterioration in
    system performance. This thesis studies the problem of harmful
    knowledge in learning systems, defines a unifying framework to
    deal with the problem, and tests the framework in the context of a
    useful learning system. Knowledge is harmful if the problem
    solver's performance would be improved by removing it. The
    information filtering model is proposed as a unifying framework
    for reducing or eliminating harmfulness of knowledge. The
    framework identifies five logical types of selection processes
    (filters) that can be incorporated into learning systems in order
    to reduce harmful knowledge: selective experience, attention,
    acquisition, retention and utilization. The framework is
    implemented and tested within the context of LASSY, a learning
    system that improves the performance of a PROLOG interpreter by
    utilizing acquired domain specific knowledge. LASSY incorporates
    two primary learning mechanisms: one deductive and the other
    inductive. The deductive mechanism is a lemma learner which uses
    theorems proved in the past as shortcuts for future proving. The
    inductive mechanism is a module that acquires average costs and
    number of solutions for various calling patterns that are used for
    reordering subgoals in rule bodies. LASSY incorporates all five
    types of selection mechanisms. The two novel ones are an
    experience filter which uses a model of the task domain to
    generate training problems that are likely to generate relevant
    knowledge and utilization filter which is used for filtering out
    the usage of lemmas in branches of the search tree which have high
    probability of failing. The system's performance was improved by a
    factor of two when using the lemma learner and its filters. The
    subgoal reordering improved the system performance by a factor of
    23. When combined, the system's performance was improved by a
    factor of 36. It is concluded that it is not beneficial to be
    greedy, even for knowledge, and that being selective by inserting
    information filters can reduce the harmfulness of knowledge in
    learning systems.
  }

  }