Study of Different Adaptive Combinations of Behaviors

Description

This article focuses on the automated synthesis of agents in an uncertain environment, working in the setting of Reinforcement Learning and, more precisely, of Partially Observable Markov Decision Processes. The agents (with no model of their environment and no short-term memory) face multiple motivations/goals simultaneously, a problem related to the field of Action Selection. We propose and evaluate various Action Selection architectures. They all combine already known basic behaviors in an adaptive manner, by learning the tuning of the combination, so as to maximize the agent's payoff. The logical continuation of this work is to automate the selection and design of the basic behaviors themselves.