A method is presented that causes A* to return high quality solutions while solving a set of problems using a non-admissible heuristic. The heuristic guiding the search changes as new information is learned during the search, and it converges to an admissible heuristic which 'contains the insight' of the original nonadmissible one. After a finite number of problems, A* returns only optimal solutions.
Experiments on sliding tile problems suggest that learning occurs very fast. Beginning with hundreds of randomly generated problems and an overestimating heuristic, the system learned sufficiently fast that only the first problem was solved non-optimally. As an application we show how one may construct heuristics for finding high quality solutions at lower cost than those returned by A* using available admissible heuristics.
[1]
Manfred K. Warmuth,et al.
Finding a Shortest Solution for the N × N Extension of the 15-PUZZLE Is Intractable
,
1986,
AAAI.
[2]
Henry W. Davis,et al.
Towards Finding Optimal Solutions with Non-Admissible Heuristics: A New Technique
,
1989,
IJCAI.
[3]
George Politowski.
On the construction of heuristic functions
,
1986
.
[4]
TWO-WEEK Loan COpy,et al.
University of California
,
1886,
The American journal of dental science.
[5]
Nils J. Nilsson,et al.
Principles of Artificial Intelligence
,
1980,
IEEE Transactions on Pattern Analysis and Machine Intelligence.
[6]
Denis Johnson.
In Palo Alto
,
1982
.