When dealing with high dimensional problems, very often it is a good idea to solve a lower dimensional approximation of the problem and use it as an heuristic.
The design of heuristics is critical in heuristic search-based planning.
PRM "discretizes" a continuous space by sampling points in the space.
RRT searches the space by sampling towards the goal and to random points building a connected tree as found needed.
Replanning is needed when execution fails.
Given a plan that failed on a step, the two alternatives for replanning
are: to plan from scratch to the goal from the new state; and to plan
from the new state to try to reach one of the steps of the past plan. Both
options are not ideal.
ERRT with a search bias to the past plan is the perfect solution
for replanning:
ERRT expands the search tree to the goal with probability p, to a past found waypoint for reuse, with probability q, and a random target with probability 1-p-q.
Realtime Heuristic Search: plan a few steps towards the goal and re-plan later.
May lead to smaller total planning time, but the result may also be highly sub-optimal.
Freespace assumption: assume the path is clear unless known otherwise.
Generating "correct" explanations is hard, as it may require additional knowledge about the domain for the generalization part.
Control learning "explains" a search episode and outputs an explanation that can be used by the planner "to speed up" its search for solutions to problems.
Explanation Based Generalization: Assumes the world provides all relevant features. Has the purpose
HAMLET: learns control rules without proving that they are correct by generalizing from a single example, but it inductively revises its set of rules from multiple incremental examples.
Human-Level Control Through Deep Reinforcement Learning, V. Mnih, K. Kavukcuoglu, D. Silver, A. Rusu, J. Veness, M. Bellemare, A. Graves,
M. Riedmiller, A. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou,
H. King, D. Kumaran, D. Wierstra, S. Legg and D. Hassabis
In Nature 518, 529-533, 2015
Wednesday, November 7: (L) Learning in Planning: learning a cost function