TY - JOUR
T1 - Intelligent resource allocation in mobile blockchain for privacy and security transactions
T2 - a deep reinforcement learning based approach
AU - Ning, Zhaolong
AU - Sun, Shouming
AU - Wang, Xiaojie
AU - Guo, Lei
AU - Wang, Guoyin
AU - Gao, Xinbo
AU - Kwok, Ricky Y.K.
N1 - Publisher Copyright:
© 2021, Science China Press and Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2021/6
Y1 - 2021/6
N2 - In order to protect the privacy and data security of mobile devices during the transactions in the industrial Internet of Things (IIoT), we propose a mobile edge computing (MEC)-based mobile blockchain framework by considering the limited bandwidth and computing power of small base stations (SBSs). First, we formulate a joint bandwidth and computing resource allocation problem to maximize the long-term utility of all mobile devices, and take into account the mobility of devices as well as the blockchain throughput. We decompose the formulated problem into two subproblems to decrease the dimension of action space. Then, we propose a deep reinforcement learning additional particle swarm optimization (DRPO) algorithm to solve the two subproblems, in which a particle swarm optimization algorithm is leveraged to avoid the unnecessary search of a deep deterministic policy gradient approach. Simulation results demonstrate the effectiveness of our method from various aspects.
AB - In order to protect the privacy and data security of mobile devices during the transactions in the industrial Internet of Things (IIoT), we propose a mobile edge computing (MEC)-based mobile blockchain framework by considering the limited bandwidth and computing power of small base stations (SBSs). First, we formulate a joint bandwidth and computing resource allocation problem to maximize the long-term utility of all mobile devices, and take into account the mobility of devices as well as the blockchain throughput. We decompose the formulated problem into two subproblems to decrease the dimension of action space. Then, we propose a deep reinforcement learning additional particle swarm optimization (DRPO) algorithm to solve the two subproblems, in which a particle swarm optimization algorithm is leveraged to avoid the unnecessary search of a deep deterministic policy gradient approach. Simulation results demonstrate the effectiveness of our method from various aspects.
KW - bandwidth allocation
KW - deep reinforcement learning
KW - mobile blockchain
KW - mobile edge computing
KW - power allocation
UR - http://www.scopus.com/inward/record.url?scp=85105016273&partnerID=8YFLogxK
U2 - 10.1007/s11432-020-3125-y
DO - 10.1007/s11432-020-3125-y
M3 - Article
AN - SCOPUS:85105016273
SN - 1674-733X
VL - 64
JO - Science China Information Sciences
JF - Science China Information Sciences
IS - 6
M1 - 162303
ER -