<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>https://www.scipedia.com/wd/index.php?action=history&amp;feed=atom&amp;title=Liang_et_al_2019a</id>
		<title>Liang et al 2019a - Revision history</title>
		<link rel="self" type="application/atom+xml" href="https://www.scipedia.com/wd/index.php?action=history&amp;feed=atom&amp;title=Liang_et_al_2019a"/>
		<link rel="alternate" type="text/html" href="https://www.scipedia.com/wd/index.php?title=Liang_et_al_2019a&amp;action=history"/>
		<updated>2026-04-21T17:48:14Z</updated>
		<subtitle>Revision history for this page on the wiki</subtitle>
		<generator>MediaWiki 1.27.0-wmf.10</generator>

	<entry>
		<id>https://www.scipedia.com/wd/index.php?title=Liang_et_al_2019a&amp;diff=194124&amp;oldid=prev</id>
		<title>Scipediacontent: Scipediacontent moved page Draft Content 514190107 to Liang et al 2019a</title>
		<link rel="alternate" type="text/html" href="https://www.scipedia.com/wd/index.php?title=Liang_et_al_2019a&amp;diff=194124&amp;oldid=prev"/>
				<updated>2021-01-28T20:27:03Z</updated>
		
		<summary type="html">&lt;p&gt;Scipediacontent moved page &lt;a href=&quot;/public/Draft_Content_514190107&quot; class=&quot;mw-redirect&quot; title=&quot;Draft Content 514190107&quot;&gt;Draft Content 514190107&lt;/a&gt; to &lt;a href=&quot;/public/Liang_et_al_2019a&quot; title=&quot;Liang et al 2019a&quot;&gt;Liang et al 2019a&lt;/a&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;tr style='vertical-align: top;' lang='en'&gt;
				&lt;td colspan='1' style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan='1' style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;Revision as of 20:27, 28 January 2021&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan='2' style='text-align: center;' lang='en'&gt;&lt;div class=&quot;mw-diff-empty&quot;&gt;(No difference)&lt;/div&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;</summary>
		<author><name>Scipediacontent</name></author>	</entry>

	<entry>
		<id>https://www.scipedia.com/wd/index.php?title=Liang_et_al_2019a&amp;diff=194123&amp;oldid=prev</id>
		<title>Scipediacontent: Created page with &quot; == Abstract ==   Traffic Management (ATM) is a complex decision-making process. Air traffic controllers’ decision on aircraft trajectory control actions directly leads to t...&quot;</title>
		<link rel="alternate" type="text/html" href="https://www.scipedia.com/wd/index.php?title=Liang_et_al_2019a&amp;diff=194123&amp;oldid=prev"/>
				<updated>2021-01-28T20:26:57Z</updated>
		
		<summary type="html">&lt;p&gt;Created page with &amp;quot; == Abstract ==   Traffic Management (ATM) is a complex decision-making process. Air traffic controllers’ decision on aircraft trajectory control actions directly leads to t...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&lt;br /&gt;
== Abstract ==&lt;br /&gt;
&lt;br /&gt;
 Air Traffic Management (ATM) is a complex decision-making process. Air traffic controllers’ decisions on aircraft trajectory control actions directly determine the efficiency of traffic flow management. The Automated Point Merge Trajectory Planning (APMTP) problem aims to realize automated routine trajectory management in the Terminal Manoeuvring Area (TMA) with an intelligent decision-making agent. An Artificial Intelligence-based approach, chiefly a Reinforcement Learning (RL) algorithm, is applied to adaptively and intelligently integrate four types of de-confliction actions, resolving conflicts with fewer delays in the environment. In this paper, we mainly discuss policy optimization in APMTP, focusing on improving the agent’s learning quality and exploration efficiency. First, the application of RL to adaptive trajectory planning is presented. The APMTP problem is adaptively divided into several sub-problems. For each sub-problem, an online policy π guides the simulation and optimization modules to find a conflict-free, low-delay solution. The online policy π is a weight distribution for choosing desirable actions. It follows the rule of roulette-wheel selection with weighted probability: the most desirable decision variable has the largest share of the roulette wheel, while the least desirable decision variable has the smallest share. An RL direct policy optimization algorithm is designed to update the online policy π. Finally, experiments are set up to validate the proposed policy optimization algorithm for intelligent decision-making in APMTP. The results in the test environment show that learning agents with different exploration and exploitation abilities yield different system performance in conflict resolution and delay.&lt;br /&gt;
&lt;br /&gt;
Refereed/Peer-reviewed&lt;br /&gt;
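The roulette-wheel selection rule described in the abstract can be sketched as follows (a minimal illustration only; the function name `roulette_select` and the example weights are hypothetical, not taken from the paper):

```python
import random

def roulette_select(weights, rng=random):
    """Pick an index with probability proportional to its weight
    (roulette-wheel selection with weighted probability)."""
    total = sum(weights)
    r = rng.uniform(0, total)
    c = 0.0
    for i, w in enumerate(weights):
        c += w
        if c >= r:
            return i
    return len(weights) - 1  # guard against floating-point round-off

# The most desirable action (largest weight) gets the largest share of
# the wheel, the least desirable the smallest; sample to confirm.
counts = [0, 0, 0, 0]
rng = random.Random(42)
for _ in range(10_000):
    counts[roulette_select([4.0, 3.0, 2.0, 1.0], rng)] += 1
```

With 10,000 samples the counts should roughly track the 4:3:2:1 weight ratio, which is the behaviour the policy π relies on to favour high-weight actions while still occasionally exploring low-weight ones.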
&lt;br /&gt;
&lt;br /&gt;
== Original document ==&lt;br /&gt;
&lt;br /&gt;
The different versions of the original document can be found in:&lt;br /&gt;
&lt;br /&gt;
* [http://dx.doi.org/10.13140/rg.2.2.30535.85926 http://dx.doi.org/10.13140/rg.2.2.30535.85926]&lt;br /&gt;
&lt;br /&gt;
* [https://hal-enac.archives-ouvertes.fr/hal-02267452/file/conference_041818.pdf https://hal-enac.archives-ouvertes.fr/hal-02267452/file/conference_041818.pdf]&lt;br /&gt;
&lt;br /&gt;
* [https://hdl.handle.net/11541.2/138969 https://hdl.handle.net/11541.2/138969]&lt;br /&gt;
&lt;br /&gt;
* [http://xplorestaging.ieee.org/ielx7/9069808/9081608/09081789.pdf?arnumber=9081789 http://xplorestaging.ieee.org/ielx7/9069808/9081608/09081789.pdf?arnumber=9081789],&lt;br /&gt;
: [http://dx.doi.org/10.1109/dasc43569.2019.9081789 http://dx.doi.org/10.1109/dasc43569.2019.9081789]&lt;br /&gt;
&lt;br /&gt;
* [https://hal-enac.archives-ouvertes.fr/hal-02267452 https://hal-enac.archives-ouvertes.fr/hal-02267452],&lt;br /&gt;
: [https://hal-enac.archives-ouvertes.fr/hal-02267452/document https://hal-enac.archives-ouvertes.fr/hal-02267452/document],&lt;br /&gt;
: [https://hal-enac.archives-ouvertes.fr/hal-02267452/file/conference_041818.pdf https://hal-enac.archives-ouvertes.fr/hal-02267452/file/conference_041818.pdf]&lt;br /&gt;
&lt;br /&gt;
* [https://hal-enac.archives-ouvertes.fr/hal-02267452/document https://hal-enac.archives-ouvertes.fr/hal-02267452/document],&lt;br /&gt;
: [https://hal-enac.archives-ouvertes.fr/hal-02267452 https://hal-enac.archives-ouvertes.fr/hal-02267452],&lt;br /&gt;
: [https://academic.microsoft.com/#/detail/2979872972 https://academic.microsoft.com/#/detail/2979872972]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
DOIs: 10.1109/dasc43569.2019.9081789, 10.13140/rg.2.2.30535.85926.&lt;/div&gt;</summary>
		<author><name>Scipediacontent</name></author>	</entry>

	</feed>