TY - JOUR
T1 - Throughput Maximization of HARQ-IR for ISAC
AU - Zou, Jianpeng
AU - Shi, Zheng
AU - Zhou, Binggui
AU - Fu, Yaru
AU - Wang, Hong
AU - Tan, Weiqiang
N1 - Publisher Copyright:
© 2025 IEEE. All rights reserved.
PY - 2025
Y1 - 2025
N2 - In this letter, hybrid automatic retransmission request with incremental redundancy (HARQ-IR) is applied to assist integrated sensing and communication (ISAC). The long term average throughput (LTAT) of HARQ-IR-assisted ISAC is maximized via power allocation while ensuring both communication and sensing reliability as well as total average power budget. Since the LTAT maximization is a non-convex problem, the asymptotic outage approximation is leveraged for problem relaxation. Subsequently, successive convex approximation (SCA) is exploited to convert it into a successive geometric programming (GP) problem. However, the GP-based solution underestimates the LTAT due to the large approximation error of the asymptotic outage probability at a low signal-to-noise ratio (SNR). To address this issue, the original problem is transformed to a Markov decision process, which can be solved with deep reinforcement learning (DRL), e.g., deep deterministic policy gradient. Numerical results show that the DRL-based method delivers comparable performance to the GP-based one at high SNR while performing much better than the GP-based method at low SNR, albeit at the cost of higher complexity.
AB - In this letter, hybrid automatic retransmission request with incremental redundancy (HARQ-IR) is applied to assist integrated sensing and communication (ISAC). The long term average throughput (LTAT) of HARQ-IR-assisted ISAC is maximized via power allocation while ensuring both communication and sensing reliability as well as total average power budget. Since the LTAT maximization is a non-convex problem, the asymptotic outage approximation is leveraged for problem relaxation. Subsequently, successive convex approximation (SCA) is exploited to convert it into a successive geometric programming (GP) problem. However, the GP-based solution underestimates the LTAT due to the large approximation error of the asymptotic outage probability at a low signal-to-noise ratio (SNR). To address this issue, the original problem is transformed to a Markov decision process, which can be solved with deep reinforcement learning (DRL), e.g., deep deterministic policy gradient. Numerical results show that the DRL-based method delivers comparable performance to the GP-based one at high SNR while performing much better than the GP-based method at low SNR, albeit at the cost of higher complexity.
KW - Deep reinforcement learning (DRL)
KW - hybrid automatic retransmission request with incremental redundancy (HARQ-IR)
KW - integrated sensing and communication (ISAC)
UR - http://www.scopus.com/inward/record.url?scp=86000724248&partnerID=8YFLogxK
U2 - 10.1109/LCOMM.2025.3526865
DO - 10.1109/LCOMM.2025.3526865
M3 - Article
AN - SCOPUS:86000724248
SN - 1089-7798
VL - 29
SP - 492
EP - 496
JO - IEEE Communications Letters
JF - IEEE Communications Letters
IS - 3
ER -