Multiobjective Optimization Model in Multiagent Systems with Reinforcement Learning
Multi-agent systems, reinforcement learning, multi-objective optimization, decision process.
This work proposes a multi-agent multi-objective reinforcement learning algorithm for scenarios where multiple autonomous agents operate in a shared environment and must simultaneously optimize possibly conflicting objectives. The research aims to overcome the limitations of current approaches by providing adaptive, scalable and effective solutions to complex problems such as logistics planning and energy distribution. The objective is to explore approaches for the development of multi-agent reinforcement learning algorithms, proposing methodologies for learning in different interaction scenarios between agents, such as the use of multiple agents to accelerate learning, joint action learning, individual learning in environments with shared observation and individual learning in environments with individual observations. The relevance of the topic is due to the ability of reinforcement learning algorithms to adapt to changes in the environment, making them suitable for multi-objective optimization problems in real distributed decision situations. The methodology includes the development of a multi-agent reinforcement learning algorithm, the evaluation of the impact of different reward modeling techniques, such as Difference Reward and PBRS (Potential-Based Reward Shaping), and the comparison of the proposed algorithm with other solutions from the literature, considering aspects such as definition of a priori preferences between objectives, restrictions regarding the format of the Pareto Frontier, characteristics of communication between agents and the agents' level of knowledge about the environment. This research is expected to expand the state of the art in multi-agent reinforcement learning and multi-objective optimization, contributing to more effective and adaptive solutions to complex problems.