Avoiding jammers: A reinforcement learning approach

Ak, S.; Brüggenwirth, S.

doi:10.1109/RADAR42522.2020.9114797

2020

Conference Paper

Abstract

This paper investigates the anti-jamming performance of a cognitive radar under a partially observable Markov decision process (POMDP) model. First, we obtain an explicit expression for the uncertainty of jammer dynamics, which enables us to discover new insights into the performance metric of the probability of being jammed for the radar beyond a conventional signal-to-noise ratio (SNR) based analysis. Considering two frequency hopping strategies developed in the framework of reinforcement learning (RL), this performance metric is analyzed with deep Q-network (DQN) and long short term memory (LSTM) networks under various uncertainty values. Finally, the requirement of the target network in the RL algorithm for both network architectures is replaced with a softmax operator. Simulation results show that this operator improves upon the performance of the traditional target network.

Author(s)

Ak, S.

Brüggenwirth, S.

Mainwork

IEEE International Radar Conference, RADAR 2020

Conference

International Conference on Radar 2020

Options

Avoiding jammers: A reinforcement learning approach