Summarization: In this article, we explore the computation of joint policies for autonomous agents to resolve congestions problems in the air traffic management (ATM) domain. Agents, representing flights, have limited information about others’ payoffs and preferences, and need to coordinate to achieve their tasks while adhering to operational constraints. We formalize the problem as a multiagent Markov decision process (MDP) towards deciding flight delays to resolve demand and capacity balance (DCB) problems in ATM. To this end, we present multiagent reinforcement learning methods that allow agents to interact and form own policies in coordination with others. Experimental study on real-world cases, confirms the effectiveness of our approach in resolving the demand-capacity balance problem. Παρουσιάστηκε στο: 10th Hellenic Conference on Artificial Intelligence
The different versions of the original document can be found in:
Published on 01/01/2018
Volume 2018, 2018
DOI: 10.1145/3200947.3201010
Licence: Other
Are you one of the authors of this document?