|
The
curly braces are used in this figure to enclose the elements
of a set. The 5 horizontal levels along which the sets are organized
reflects the partial order structure of this set of sets. The
top set includes all four of the possible concept instances.
The next level consists of four sets each of which contain a
different combination of three of the four possible concept instances.
Similarly, the next level consists of six sets each of which
contain a different combination of two of the four possible concept
instances. The fourth level consists of 4 sets each containing
one of the four possible concept instances. And, finally, the
bottom level consists of the empty set which contains none of
the possible concept instances.
The
lines in the figure connect a set at level n - 1 with
a set at level n if the set at level n - 1 is
a subset of the set at level n. Now in order to see
how the lattice property proves useful to hypothesis revision,
let us choose one of the sets to represent our current hypothesis
about the concept. Choose the set at the middle level at the
far left. According to this hypothesis the concept is any instance
that is black. And, we assume that this hypothesis is consistent
with the training instances that have been seen to this point.
Now
assume that we encounter an instance that consists of a large
white square. Further, suppose that we are told that this instance
is a positive instance of the concept. Consequently, our hypothesis
must be generalized. Note that there are 5 hypotheses in this
space that are more general than the current hypothesis. How
should we choose among this candidate set of revisions?
Note
first that the current hypothesis is a subset of only 3 of the
5. Thus, 2 of 5, the two at the far right of the figure, can
be eliminated since adopting either of these would result in
an hypothesis that was inconsistent with the training sequence
that has been observed to this point.
Consider
next the generalizations of the current instance. There are seven
generalizations that include this instance. However, only two
of these seven generalizations include both the current instance
and the current hypotheses. And one of these two is the most
general hypothesis at the top level of the lattice. One other
generalization of the hypothesis remains. That, is the set at
the far left of the lattice at level 2. This hypothesis is preferred
since it is the minimal generalization required to bring the
hypothesis in lines with the training sequence.
A
similar line of argument could be illustrated if a negative instance
were presented, such as a small black square. In this case the
specialization of the hypothesis would result in the identification
of the set at the far left of the lattice at level 4.
|