On-line learning of macro planning operators using probabilistic estimations of cause-effects

Agostini, Alejandro Gabriel; Wörgötter, Florentin; Celaya Llover, Enric|||0000-0001-8480-7706; Torras, Carme|||0000-0002-2933-398X

On-line learning of macro planning operators using probabilistic estimations of cause-effects

In this work we propose an on-line learning method for learning action rules for planning. The system uses a probabilistic approach of a constructive induction method that combines a beam search with an example-based search over candidate rules to find those that more concisely describe the world dy...

Descripción completa

Detalles Bibliográficos
Autores:	Agostini, Alejandro Gabriel, Wörgötter, Florentin, Celaya Llover, Enric\|\|\|0000-0001-8480-7706, Torras, Carme\|\|\|0000-0002-2933-398X
Tipo de recurso:	informe técnico
Fecha de publicación:	2008
País:	España
Institución:	Universitat Politècnica de Catalunya (UPC)
Repositorio:	UPCommons. Portal del coneixement obert de la UPC
Idioma:	inglés
OAI Identifier:	oai:upcommons.upc.edu:2117/2694
Acceso en línea:	https://hdl.handle.net/2117/2694
Access Level:	acceso abierto
Palabra clave:	Intelligent robots and autonomous agents Machine learning Planning / Artificial intelligence Robots, Industrial online learning macro planning operator constructive learning Robots Sistemes de control intel·ligents Aprenentatge automàtic Planificació / Intel·ligència artificial Robots industrials Classificació INSPEC::Automation::Robots::Intelligent robots Classificació INSPEC::Cybernetics::Artificial intelligence::Learning (artificial intelligence) Classificació INSPEC::Cybernetics::Artificial intelligence::Planning (artificial intelligence) Classificació INSPEC::Automation::Robots::Service robots Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Representació del coneixement Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial artificial Àrees temàtiques de la UPC::Informàtica::Robòtica

Descripción
Sumario:	In this work we propose an on-line learning method for learning action rules for planning. The system uses a probabilistic approach of a constructive induction method that combines a beam search with an example-based search over candidate rules to find those that more concisely describe the world dynamics. The approach permits a rapid integration of the knowledge acquired from experience. Exploration of the world dynamics is guided by the planner, and – if the planner fails because of incomplete knowledge – by a teacher through action instructions.

On-line learning of macro planning operators using probabilistic estimations of cause-effects

Similares en LA Referencia