Phased-array Radar Beam Position Partitioning for Intercepted Pulse Sequences: An Expert Knowledge-based Hybrid Reinforcement Learning Framework

YU Zhuang; LING Qing; YAN Wenjun; DONG Shipeng; ZHANG Limin; SHANG Yunli

doi:10.12000/JR25283

Article Contents

Article Navigation > Journal of Radars > 2026 > Online First

YU Zhuang, LING Qing, YAN Wenjun, et al. Phased-array radar beam position partitioning for intercepted pulse sequences: An expert knowledge-based hybrid reinforcement learning framework[J]. Journal of Radars, in press. doi: 10.12000/JR25283

Citation:

YU Zhuang, LING Qing, YAN Wenjun, et al. Phased-array radar beam position partitioning for intercepted pulse sequences: An expert knowledge-based hybrid reinforcement learning framework[J]. Journal of Radars, in press. doi: 10.12000/JR25283

Citation:

PDF( 3896 KB)

Phased-array Radar Beam Position Partitioning for Intercepted Pulse Sequences: An Expert Knowledge-based Hybrid Reinforcement Learning Framework

DOI: 10.12000/JR25283 CSTR: 32380.14.JR25283

YU Zhuang^{1, 2},
LING Qing^{1, 2
,
,},
YAN Wenjun^{1, 2
,
,},
DONG Shipeng³,
ZHANG Limin^{1, 2},
SHANG Yunli⁴

1.
Institute of Information Fusion, Naval Aviation University, Yantai 264001, China
2.
Key Laboratory of Sea-Air information Perception and Processing Technology of Shandong Province, Yantai 264001, China
3.
National Key Laboratory of Radar Signal Processing, Xidian University, Xi’an 710126, China
4.
Unit 92038, People’s Liberation Army, Qingdao 266071, China

Funds: The National Natural Science Foundation of China (62371465), The Taishan Scholars Project Special Fund (ts201511020), Youth Innovation Teams in Shandong Province Fund (2022KJ084)

More Information

Corresponding author: LING Qing, linqing19870522@163.com; YAN Wenjun, wj_yan@foxmail.com
Received Date: 2025-12-31
Rev Recd Date: 2026-03-23

Available Online: 2026-03-28

Abstract

Abstract

The characteristics of phased-array radars—including flexible beam scanning, rapid multimode switching, and parameter agility—pose challenges to traditional radar signal analysis methods based on parameter clustering, causing feature parameter instability and parameter space overlap. To address these issues, this paper analyzes phased-array radar signals from the perspective of beam position partitioning. In particular, we reconstruct pulse subsequences corresponding to distinct beam positions from mixed pulse streams and an innovative expert-knowledge and hybrid-reinforcement-learning framework is proposed. This framework first performs preliminary partitioning using dynamic pulse amplitude thresholds. It subsequently feeds the preliminary results into a human-in-the-loop reinforcement-learning environment by integrating expert knowledge guidance with confidence assessment to ultimately achieve fine-grained beam position partitioning. Experimental results obtained using simulated datasets demonstrate that the proposed framework achieves a partitioning precision of 92.7%, indicating excellent calibration of the confidence assessment model. This work provides an effective technical pathway for human–machine collaboration in solving complex electromagnetic signal processing problems.
- Phased array radar,
- Beam position partitioning,
- Expert knowledge,
- Reinforcement learning,
- Confidence

FullText(HTML)

References(36)

References

[1]	王雪松, 王占领, 庞晨, 等. 极化相控阵雷达技术研究综述[J]. 雷达科学与技术, 2021, 19(4): 349–370. doi: 10.3969/j.issn.1672-2337.2021.04.001. WANG Xuesong, WANG Zhanling, PANG Chen, et al. Review on polarimetric phased array radar technologies[J]. Radar Science and Technology, 2021, 19(4): 349–370. doi: 10.3969/j.issn.1672-2337.2021.04.001.
[2]	GURBUZ S Z, GRIFFITHS H D, CHARLISH A, et al. An overview of cognitive radar: Past, present, and future[J]. IEEE Aerospace and Electronic Systems Magazine, 2019, 34(12): 6–18. doi: 10.1109/MAES.2019.2953762.
[3]	GOK G, ALP Y K, and ARIKAN O. A new method for specific emitter identification with results on real radar measurements[J]. IEEE Transactions on Information Forensics and Security, 2020, 15: 3335–3346. doi: 10.1109/TIFS.2020.2988558.
[4]	王颖, 郭睿, 梁毅. 实测双极化雷达压制干扰的特性分析与抑制[J]. 海军航空大学学报, 2025, 40(1): 163–170,196. doi: 10.7682/j.issn.2097-1427.2025.01.007. WANG Ying, GUO Rui, and LIANG Yi. Characteristics analysis and suppression of measured dual-polarization radar blanketing jamming[J]. Journal of Naval Aviation University, 2025, 40(1): 163–170,196. doi: 10.7682/j.issn.2097-1427.2025.01.007.
[5]	CHENG Wenhai, ZHANG Qunying, DONG Jiaming, et al. An enhanced algorithm for deinterleaving mixed radar signals[J]. IEEE Transactions on Aerospace and Electronic Systems, 2021, 57(6): 3927–3940. doi: 10.1109/TAES.2021.3087832.
[6]	ZHANG Peng, YAN Junkun, PU Wenqiang, et al. Multi-dimensional resource management scheme for multiple target tracking under dynamic electromagnetic environment[J]. IEEE Transactions on Signal Processing, 2024, 72: 2377–2393. doi: 10.1109/TSP.2024.3390119.
[7]	CHEN Baoxin, CHEN Xiaolong, HUANG Yong, et al. Transmit beampattern synthesis for the FDA radar[J]. IEEE Antennas and Wireless Propagation Letters, 2018, 17(1): 98–101. doi: 10.1109/LAWP.2017.2776957.
[8]	闫文君, 刘康晟, 凌青, 等. 跨场景辐射源个体识别技术综述[J]. 雷达学报(中英文), 待出版. doi: 10.12000/JR25166. YAN Wenjun, LIU Kangsheng, LING Qing, et al. Survey of cross-scenario specific emitter identification technology[J]. Journal of Radars, in press. doi: 10.12000/JR25166.
[9]	KRISHNAMURTHY V, PATTANAYAK K, GOGINENI S, et al. Adversarial radar inference: Inverse tracking, identifying cognition, and designing smart interference[J]. IEEE Transactions on Aerospace and Electronic Systems, 2021, 57(4): 2067–2081. doi: 10.1109/TAES.2021.3090901.
[10]	刘松涛, 赵帅, 汪慧阳. 雷达辐射源识别技术新进展[J]. 中国电子科学研究院学报, 2022, 17(6): 523–533. doi: 10.3969/j.issn.1673-5692.2022.06.002. LIU Songtao, ZHAO Shuai, and WANG Huiyang. New development on the technology of radar emitter identification[J]. Journal of China Academy of Electronics and Information Technology, 2022, 17(6): 523–533. doi: 10.3969/j.issn.1673-5692.2022.06.002.
[11]	RECHT B. A tour of reinforcement learning: The view from continuous control[J]. Annual Review of Control, Robotics, and Autonomous Systems, 2019, 2: 253–279. doi: 10.1146/annurev-control-053018-023825.
[12]	AL KASSIR H, ZAHARIS Z D, LAZARIDIS P I, et al. A review of the state of the art and future challenges of deep learning-based beamforming[J]. IEEE Access, 2022, 10: 80869–80882. doi: 10.1109/ACCESS.2022.3195299.
[13]	石荣, 吴聪. 基于PRI信息的雷达脉冲信号分选技术研究综述[J]. 电讯技术, 2020, 60(1): 112–120. doi: 10.3969/j.issn.1001-893x.2020.01.019. SHI Rong and WU Cong. Review on technology research about radar pulse signal deinterleaving based on PRI information[J]. Telecommunication Engineering, 2020, 60(1): 112–120. doi: 10.3969/j.issn.1001-893x.2020.01.019.
[14]	YAO Yu, LIU Haitao, MIAO Pu, et al. MIMO radar design for extended target detection in a spectrally crowded environment[J]. IEEE Transactions on Intelligent Transportation Systems, 2022, 23(9): 14389–14398. doi: 10.1109/TITS.2021.3127727.
[15]	张嘉翔, 张凯翔, 梁振楠, 等. 一种基于深度强化学习的频率捷变雷达智能频点决策方法[J]. 雷达学报(中英文), 2024, 13(1): 227–239. doi: 10.12000/JR23197. ZHANG Jiaxiang, ZHANG Kaixiang, LIANG Zhennan, et al. An intelligent frequency decision method for a frequency agile radar based on deep reinforcement learning[J]. Journal of Radars, 2024, 13(1): 227–239. doi: 10.12000/JR23197.
[16]	FAWAZ H I, FORESTIER G, WEBER J, et al. Deep learning for time series classification: A review[J]. Data Mining and Knowledge Discovery, 2019, 33(4): 917–963. doi: 10.1007/s10618-019-00619-1.
[17]	TAN Kaiwen, YAN Wenjun, ZHANG Limin, et al. Semi-supervised specific emitter identification based on bispectrum feature extraction CGAN in multiple communication scenarios[J]. IEEE Transactions on Aerospace and Electronic Systems, 2023, 59(1): 292–310. doi: 10.1109/TAES.2022.3184619.
[18]	PAPA L, RUSSO P, AMERINI I, et al. A survey on efficient vision transformers: Algorithms, techniques, and performance benchmarking[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46(12): 7682–7700. doi: 10.1109/TPAMI.2024.3392941.
[19]	TU Ya, LIN Yun, ZHA Haoran, et al. Large-scale real-world radio signal recognition with deep learning[J]. Chinese Journal of Aeronautics, 2022, 35(9): 35–48. doi: 10.1016/j.cja.2021.08.016.
[20]	YAN Wenjun, LING Qing, YU Keyuan, et al. A pseudolabel method with semantic drift for specific emitter identification[J]. IEEE Transactions on Aerospace and Electronic Systems, 2025, 61(3): 6217–6235. doi: 10.1109/TAES.2025.3527960.
[21]	WANG Yu, GUI Guan, LIN Yun, et al. Few-shot specific emitter identification via deep metric ensemble learning[J]. IEEE Internet of Things Journal, 2022, 9(24): 24980–24994. doi: 10.1109/JIOT.2022.3194967.
[22]	MOSQUEIRA-REY E, HERNÁNDEZ-PEREIRA E, ALONSO-RÍOS D, et al. Human-in-the-loop machine learning: A state of the art[J]. Artificial Intelligence Review, 2023, 56(4): 3005–3054. doi: 10.1007/s10462-022-10246-w.
[23]	TANG Chen, ABBATEMATTEO B, HU Jiaheng, et al. Deep reinforcement learning for robotics: A survey of real-world successes[J]. Annual Review of Control, Robotics, and Autonomous Systems, 2025, 8: 153–188. doi: 10.1146/annurev-control-030323-022510.
[24]	WU Jingda, HUANG Zhiyu, HU Zhongxu, et al. Toward human-in-the-loop AI: Enhancing deep reinforcement learning via real-time human guidance for autonomous driving[J]. Engineering, 2023, 21: 75–91. doi: 10.1016/j.eng.2022.05.017.
[25]	WURMAN P R, BARRETT S, KAWAMOTO K, et al. Outracing champion Gran Turismo drivers with deep reinforcement learning[J]. Nature, 2022, 602(7896): 223–228. doi: 10.1038/s41586-021-04357-7.
[26]	宋新超, 吴连慧, 王星宇. 基于侦察幅度信息的雷达行为及特征分析[J]. 舰船电子对抗, 2019, 42(3): 48–51. doi: 10.16426/j.cnki.jcdzdk.2019.03.011. SONG Xinchao, WU Lianhui, and WANG Xingyu. Radar behavior and feature analysis based on reconnaissance amplitude information[J]. Shipboard Electronic Countermeasure, 2019, 42(3): 48–51. doi: 10.16426/j.cnki.jcdzdk.2019.03.011.
[27]	RAO Jinjun, XU Xiaoqiang, BIAN Haoran, et al. A modified random network distillation algorithm and its application in USVs naval battle simulation[J]. Ocean Engineering, 2022, 261: 112147. doi: 10.1016/j.oceaneng.2022.112147.
[28]	ARULKUMARAN K, DEISENROTH M P, BRUNDAGE M, et al. Deep reinforcement learning: A brief survey[J]. IEEE Signal Processing Magazine, 2017, 34(6): 26–38. doi: 10.1109/MSP.2017.2743240.
[29]	TAO Jin and ZHANG Xindong. Radar emitter signal recognition method based on improved collaborative semi-supervised learning[J]. Journal of Systems Engineering and Electronics, 2023, 34(5): 1182–1190. doi: 10.23919/JSEE.2023.000126.
[30]	KARNIADAKIS G E, KEVREKIDIS I G, LU Lu, et al. Physics-informed machine learning[J]. Nature Reviews Physics, 2021, 3(6): 422–440. doi: 10.1038/s42254-021-00314-5.
[31]	MANNION P, DEVLIN S, MASON K, et al. Policy invariance under reward transformations for multi-objective reinforcement learning[J]. Neurocomputing, 2017, 263: 60–73. doi: 10.1016/j.neucom.2017.05.090.
[32]	BOUDT K, TODOROV V, and WANG Wenjing. Robust distribution-based winsorization in composite indicators construction[J]. Social Indicators Research, 2020, 149(2): 375–397. doi: 10.1007/s11205-019-02259-w.
[33]	LOMBARDI O, HOLIK F, and VANNI L. What is Shannon information?[J]. Synthese, 2016, 193(7): 1983–2012. doi: 10.1007/s11229-015-0824-z.
[34]	ZHANG Donglin, WU Xiaojun, XU Tianyang, et al. DAH: Discrete asymmetric hashing for efficient cross-media retrieval[J]. IEEE Transactions on Knowledge and Data Engineering, 2023, 35(2): 1365–1378. doi: 10.1109/TKDE.2021.3099125.
[35]	SCHULMAN J, WOLSKI F, DHARIWAL P, et al. Proximal policy optimization algorithms[EB/OL]. https://arxiv.org/abs/1707.06347, 2017.
[36]	CHRISTODOULOU P. Soft actor-critic for discrete action settings[EB/OL]. https://arxiv.org/abs/1910.07207, 2019.

Relative Articles

Supplements(0)

Cited By

Proportional views

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Get Citation

PDF

XML

Article views(480) PDF downloads(181)

Phased-array Radar Beam Position Partitioning for Intercepted Pulse Sequences: An Expert Knowledge-based Hybrid Reinforcement Learning Framework

DOI: 10.12000/JR25283 CSTR: 32380.14.JR25283

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Proportional views

Related

About Journal

Contacts Us

Phased-array Radar Beam Position Partitioning for Intercepted Pulse Sequences: An Expert Knowledge-based Hybrid Reinforcement Learning Framework

DOI: 10.12000/JR25283 CSTR: 32380.14.JR25283

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Proportional views

Related

About Journal

Contacts Us

Export File

Citation

Format

Content