LSTM-characterized Deep Reinforcement Learning for Continuous Flight Control and Resource Allocation in UAV-assisted Sensor Network

Li, Kai; Ni, Wei; Dressler, Falko

Publication

LSTM-characterized Deep Reinforcement Learning for Continuous Flight Control and Resource Allocation in UAV-assisted Sensor Network

2021-08-05Journal article

dc.contributor.author	Li, Kai
dc.contributor.author	Ni, Wei
dc.contributor.author	Dressler, Falko
dc.date.accessioned	2021-09-10T12:32:29Z
dc.date.embargo	2100
dc.date.issued	2021-08-05
dc.description.abstract	Unmanned aerial vehicles (UAVs) can be employed to collect sensory data in remote wireless sensor networks (WSN). Due to UAV's maneuvering, scheduling a sensor device to transmit data can overflow data buffers of the unscheduled ground devices. Moreover, lossy airborne channels can result in packet reception errors at the scheduled sensor. This paper proposes a new deep reinforcement learning based flight resource allocation framework (DeFRA) to minimize the overall data packet loss in a continuous action space. DeFRA is based on Deep Deterministic Policy Gradient (DDPG), optimally controls instantaneous headings and speeds of the UAV, and selects the ground device for data collection. Furthermore, a state characterization layer, leveraging long short-term memory (LSTM), is developed to predict network dynamics, resulting from time-varying airborne channels and energy arrivals at the ground devices. To validate the effectiveness of DeFRA, experimental data collected from a real-world UAV testbed and energy harvesting WSN are utilized to train the actions of the UAV. Numerical results demonstrate that the proposed DeFRA achieves a fast convergence while reducing the packet loss by over 15%, as compared to existing deep reinforcement learning solutions.	pt_PT
dc.description.sponsorship	This work was partially supported by National Funds through FCT/MCTES (Portuguese Foundation for Science and Technology), within the CISTER Research Unit (UIDP/UIDB/04234/2020); also by national funds through the FCT, under CMU Portugal partnership, within project CMU/TIC/0022/2019 (CRUAV).	pt_PT
dc.description.version	info:eu-repo/semantics/publishedVersion	pt_PT
dc.identifier.doi	10.1109/JIOT.2021.3102831	pt_PT
dc.identifier.uri	http://hdl.handle.net/10400.22/18346
dc.language.iso	eng	pt_PT
dc.publisher	IEEE	pt_PT
dc.relation	UIDP/UIDB/04234/2020	pt_PT
dc.relation.publisherversion	https://ieeexplore.ieee.org/document/9507550	pt_PT
dc.subject	Unmanned aerial vehicles	pt_PT
dc.subject	Flight trajectory	pt_PT
dc.subject	Resource allocation	pt_PT
dc.subject	Deep deterministic policy gradient	pt_PT
dc.subject	Long short-term memory	pt_PT
dc.subject	Experimental datasets	pt_PT
dc.title	LSTM-characterized Deep Reinforcement Learning for Continuous Flight Control and Resource Allocation in UAV-assisted Sensor Network	pt_PT
dc.title.alternative	210802	pt_PT
dc.type	journal article
dspace.entity.type	Publication
oaire.awardURI	info:eu-repo/grantAgreement/FCT/3599-PPCDT/156761/PT
oaire.citation.endPage	11	pt_PT
oaire.citation.startPage	1	pt_PT
oaire.citation.title	IEEE Internet of Things Journal	pt_PT
oaire.fundingStream	3599-PPCDT
person.familyName	Li
person.givenName	Kai
person.identifier.ciencia-id	EE10-B822-16ED
person.identifier.orcid	0000-0002-0517-2392
project.funder.identifier	http://doi.org/10.13039/501100001871
project.funder.name	Fundação para a Ciência e a Tecnologia
rcaap.rights	closedAccess	pt_PT
rcaap.type	article	pt_PT
relation.isAuthorOfPublication	21f3fb85-19c2-4c89-afcd-3acb27cedc5e
relation.isAuthorOfPublication.latestForDiscovery	21f3fb85-19c2-4c89-afcd-3acb27cedc5e
relation.isProjectOfPublication	35de90fc-8621-4acb-a2b4-0ced71747cd3
relation.isProjectOfPublication.latestForDiscovery	35de90fc-8621-4acb-a2b4-0ced71747cd3

Collections

ISEP – CISTER – Artigos

LSTM-characterized Deep Reinforcement Learning for Continuous Flight Control and Resource Allocation in UAV-assisted Sensor Network

Files

Collections