Professional Documents
Culture Documents
HPLAN-P
Heuristic planning with TEGs, SPs and TEPs
Incremental search algorithm
Outline
PDDL3 problem/domain
Planning problem with TEGs and TEPs
Preprocessing Phase Adapting existing heuristic search techniques to achieve SPs and solve the compiled problem HPLAN-P algorithm
exploiting the adapted heuristics to incrementally find better plans
PDDL3 Overview
TEGs/TEPs Simple Preferences (SPs) Precondition Preferences (PPs) Metric Function (M)
Metric Function
Defines the plan (numeric) quality over:
Preference violation weigth/count
Preference internal/external quantification
Preprocessing PDDL3
Simpler planning problem containing only SPs
augmented planning domain/problem
Preprocessing PPs
(preference p )
is-violated-p counter
Initiliazed to 0
(TEG or TEP) ,
a new domain predicate P P is TRUE is satisfied by the plan
f Automaton A
No BA Transitions labeled with FO (PDDL) predicates A states monitor the satisfaction of
A Planning Domain
Only valid/preferred plans simulate automata
acceptance predicate acceptance state
Sometime
Secondo livello
Terzo livello
Always
Secondo livello
Terzo livello
Preprocessing Automaton
A two new predicates (eventually parameterized) (state-A ?s ?x) and (accepting-A ?x) Automaton state updates with CE (eventually quantified)
one step behind Augmenting each original action + finish action
Adding start/finish actions initialization/goal specification Mutex and exhaustive Multiple parallel updates of different automata
PNFA
Secondo livello
Terzo livello
PNFA
All different paths to the goal
PNFA
Pseudo-action updates
No augmenting action domain
Non-Compilable TEGs/TEPs
Constraints that require infinite plans
Temporal Domain
CE added at both start/end points of each action TIL (exogenous events)
within, hold-after, hold-during
(always-within t )
Timed Automaton reset action
Heuristics Design
Active search Priority to achieving HG Desirability VS Ease of Achieving preferences
If M is non-increasing in the number of achieved preferences, O is a lower bound (for M) on the best plan extending s
HPLAN-P
Forward search
Best First Search
Heuristic
Different from TLPLAN
Incremental (episodic)
Each episode ends as soon as a better plan is found
Optimal
HPLAN-P Algorithm
Secondo livello
Terzo livello
Sound Pruning
If MetricBoundFN(s) is a lower bound on M of any plan extending s pruning is sound With sound pruning optimal plans are never pruned
1. MetricBoundFN(s) bestMetric
2. s is pruned
3. MetricBoundFN(s) M(ss)
4. ss never reached
5. M(ss) bestMetric
6. sound pruning
Optimality
If HPLAN-P stops and sound pruning is used the last plan return is optimal Proof
Each planning episode has returned a better plan It stops only when final episode has rejected all possible plans Sound pruning never prunes optimal plans No better plan than the last one returned exists
Termination
HPLAN-P termination conditions:
bestMetricintial finite MetricBoundFN(s) bestMetricintial finite
M cannot improve as the number of violated PPs increases
References
A Heuristic Search Approach to Planning with Temporally Extended Prefere ,Baier, J.andBacchus, F.andMcIlraith, S., 2007 Proceedings of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI-07), pp. 1808-1815, January , Hyderabad, India
Planning with First-Order Temporally Extended Goals Using Heuristic Searc ,Jorge A. BaierandSheila McIlraith, Proceedings of the 21st National Conference on Artificial Intelligence (AAAI-06), pp. 788-795, July 2006, Boston, MA.
Alfonso E. Gerevini, Derek Long, Patrik Haslum, Alessandro Saetti, Yannis Dimopoulos, " Deterministic Planning in the Fifth International Planning Competition: PDD ,Artificial Intelligence, vol 173 (2009), pp. 619-668.