基于深度强化学习的空中无人机基站资源分配与公平性研究 Deep reinforcement learning-based resource allocation and fairness of aerial UAV base stations期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于深度强化学习的空中无人机基站资源分配与公平性研究

引用本文：	郭少雄,宋志群,李勇.基于深度强化学习的空中无人机基站资源分配与公平性研究[J].河北科技大学学报,2024,45(1):44-51.

作者姓名：	郭少雄宋志群李勇

作者单位：	通信网信息传输与分发技术重点实验室;通信网信息传输与分发技术重点实验室；中国电子科技集团公司第五十四研究所

基金项目：	国家自然科学基金（FFX23641X003）

摘要：	为了提高无人机基站(unmanned aerial vehicle base stations, UAV-BS)为地面多用户服务时的数据速率，提出一种基于决斗深度神经网络(dueling deep Q-network, Dueling-DQN)的深度强化学习(deep reinforcement learning, DRL)算法。采用决斗网络(dueling network, DN)结构以克服动态环境的部分可观测问题，联合优化了UAV-BS的位置和下行链路功率分配，在更符合实际的空地概率信道模型中检验了Dueling-DQN算法的性能。结果表明，相较于对比算法，所提出的Dueling-DQN算法可以提供更高的数据速率和服务公平性，且随着地面用户数量的增大，算法的优势更加明显。Dueling-DQN算法可有效解决复杂非凸性问题，为UAV-BS的资源分配问题提供理论参考。
关键词：	无线通信技术 UAV 空中基站深度强化学习资源分配公平性
收稿时间：	2023/9/23 0:00:00
修稿时间：	2023/12/15 0:00:00
Deep reinforcement learning-based resource allocation and fairness of aerial UAV base stations

GUO Shaoxiong,SONG Zhiqun,LI Yong.Deep reinforcement learning-based resource allocation and fairness of aerial UAV base stations[J].Journal of Hebei University of Science and Technology,2024,45(1):44-51.

Authors:	GUO Shaoxiong SONG Zhiqun LI Yong

Abstract:	In order to improve the data rate of unmanned aerial vehicle base stations (UAV-BS) when serving multiple users on the ground, a deep reinforcement learning (DRL) algorithm was proposed based on dueling deep Q-network (Dueling-DQN). A dueling network (DN) structure was employed to overcome the partially observable problem of the dynamic environment, and the position of the UAV-BS and the power allocation of the downlink were jointly optimized to satisfy the quality of service (QoS) of the ground users. The performance of the algorithm was examined in a more realistic air-ground probabilistic channel model. The results show that compared with the baseline algorithm, the proposed Dueling-DQN algorithm can provide higher data rate and service fairness, and the advantages are more obvious with the increase in the number of ground users. The Dueling-DQN algorithm is effective to solve the complex non-convexity problem, which provides some theoretical reference for the resource allocation problem of UAV-BS.

Keywords:	wireless communication technology UAV aerial base stations deep reinforcement learning resource allocation fairness

	点击此处可从《河北科技大学学报》浏览原始摘要信息
	点击此处可从《河北科技大学学报》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏