Deep Reinforcement Learning Aided Packet-Routing for Aeronautical Ad-Hoc Networks Formed by Passenger Planes
Authors: Liu, D., Cui, J., Zhang, J., Yang, C. and Hanzo, L.
Journal: IEEE Transactions on Vehicular Technology
Volume: 70
Issue: 5
Pages: 5166-5171
eISSN: 1939-9359
ISSN: 0018-9545
DOI: 10.1109/TVT.2021.3074015
https://eprints.bournemouth.ac.uk/35262/
Sources: Web of Science (Lite), Manual, BURO EPrints
Abstract: Data packet routing in aeronautical ad-hoc networks (AANETs) is challenging due to their high-dynamic topology. In this paper, we invoke deep reinforcement learning for routing in AANETs aiming at minimizing the end-to-end (E2E) delay. Specifically, a deep Q-network (DQN) is conceived for capturing the relationship between the optimal routing decision and the local geographic information observed by the forwarding node. The DQN is trained in an offline manner based on historical flight data and then stored by each airplane for assisting their routing decisions during flight. To boost the learning efficiency and the online adaptability of the proposed DQN-routing, we further exploit the knowledge concerning the system’s dynamics by using a deep value network (DVN) conceived with a feedback mechanism. Our simulation results show that both DQN-routing and DVN-routing achieve lower E2E delay than the benchmark protocol, and DVN-routing performs similarly to the optimal routing that relies on perfect global information.
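
The abstract describes per-hop forwarding in which a DQN, trained offline on historical flight data, scores candidate next hops from locally observed geographic information. The sketch below is only an illustration of that decision step under assumptions of my own: the state layout (own, candidate, and destination positions), the network size, the feature scaling, and the placeholder weights are not taken from the paper, which does not publish these details in the abstract.

```python
# Minimal illustrative sketch (NOT the authors' implementation): greedy next-hop
# selection with a small Q-network over local geographic information.
# The placeholder random weights stand in for a DQN trained offline on
# historical flight data; state design and architecture are assumptions.
import numpy as np

rng = np.random.default_rng(0)

# Two hidden layers of 32 units; input is a 9-dim state (3 positions x 3 coords).
W1, b1 = rng.normal(size=(9, 32)), np.zeros(32)
W2, b2 = rng.normal(size=(32, 32)), np.zeros(32)
W3, b3 = rng.normal(size=(32, 1)), np.zeros(1)

def q_value(state):
    """Approximate Q(s, a) for forwarding the packet to one candidate neighbour."""
    h = np.maximum(0.0, state @ W1 + b1)  # ReLU layer 1
    h = np.maximum(0.0, h @ W2 + b2)      # ReLU layer 2
    return (h @ W3 + b3).item()

def choose_next_hop(own_pos, dest_pos, neighbours):
    """Pick the in-range neighbour whose state yields the highest Q-value.

    own_pos, dest_pos: 3-D position vectors (e.g. ECEF coordinates, km).
    neighbours: dict mapping aircraft id -> 3-D position vector.
    """
    best_id, best_q = None, -np.inf
    for node_id, pos in neighbours.items():
        # State observed locally by the forwarding node.
        state = np.concatenate([own_pos, pos, dest_pos]) / 1e4  # crude scaling
        q = q_value(state)
        if q > best_q:
            best_id, best_q = node_id, q
    return best_id

# Example: three aircraft in radio range; forward to the highest-scoring one.
own = np.array([6371.0, 0.0, 10.0])
dest = np.array([6371.0, 800.0, 10.0])
nbrs = {"AC1": np.array([6371.0, 250.0, 10.5]),
        "AC2": np.array([6371.0, -100.0, 10.2]),
        "AC3": np.array([6371.0, 400.0, 10.8])}
print(choose_next_hop(own, dest, nbrs))
```

In the paper's setting the Q-values would come from offline training so that the greedy choice minimizes the expected E2E delay; the DVN variant additionally uses a feedback mechanism to adapt online.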