Most Downloaded

1
Deep learning is primarily used for target detection in Synthetic Aperture Radar (SAR) images; however, its performance heavily relies on large-scale labeled datasets. The detection performance of deep learning models degrades when applied to SAR data with varying distributions, hindering their real-world applicability. In addition, manual labeling of SAR data is costly. Hence, cross-domain learning strategies based on multisource information are being explored to address these challenges. These strategies can assist detection models in realizing cross-domain knowledge migration by integrating prior information from optical remote sensing images or heterogeneous SAR images acquired from different sensors. This paper focuses on cross-domain learning technologies within the deep learning framework. In addition, it provides a systematic overview of the latest research progress in this field and analyzes the core issues, advantages, and applicable scenarios of existing technologies from a methodological perspective. It outlines future research directions based on the law of technological evolution, aiming to offer theoretical support and methodological references to enhance the generalizability of target detection in SAR images. Deep learning is primarily used for target detection in Synthetic Aperture Radar (SAR) images; however, its performance heavily relies on large-scale labeled datasets. The detection performance of deep learning models degrades when applied to SAR data with varying distributions, hindering their real-world applicability. In addition, manual labeling of SAR data is costly. Hence, cross-domain learning strategies based on multisource information are being explored to address these challenges. These strategies can assist detection models in realizing cross-domain knowledge migration by integrating prior information from optical remote sensing images or heterogeneous SAR images acquired from different sensors. This paper focuses on cross-domain learning technologies within the deep learning framework. In addition, it provides a systematic overview of the latest research progress in this field and analyzes the core issues, advantages, and applicable scenarios of existing technologies from a methodological perspective. It outlines future research directions based on the law of technological evolution, aiming to offer theoretical support and methodological references to enhance the generalizability of target detection in SAR images.
2
Marine target detection and recognition depend on the characteristics of marine targets and sea clutter. Therefore, understanding the essential features of marine targets based on the measured data is crucial for advancing target detection and recognition technology. To address the issue of insufficient data on the scattering characteristics of marine targets, the Sea-Detecting Radar Data-Sharing Program (SDRDSP) was upgraded to obtain data on marine targets and their environment under different polarizations and sea states. This upgrade expanded the physical dimension of radar target observation and improved radar and auxiliary data acquisition capabilities. Furthermore, a dual-polarized multistate scattering characteristic dataset of marine targets was constructed, and the statistical distribution characteristics, time and space correlation, and Doppler spectrum were analyzed, supporting the data usage. In the future, the types and quantities of maritime targets will continue to accumulate, providing data support for improving marine target detection and recognition performance and intelligence. Marine target detection and recognition depend on the characteristics of marine targets and sea clutter. Therefore, understanding the essential features of marine targets based on the measured data is crucial for advancing target detection and recognition technology. To address the issue of insufficient data on the scattering characteristics of marine targets, the Sea-Detecting Radar Data-Sharing Program (SDRDSP) was upgraded to obtain data on marine targets and their environment under different polarizations and sea states. This upgrade expanded the physical dimension of radar target observation and improved radar and auxiliary data acquisition capabilities. Furthermore, a dual-polarized multistate scattering characteristic dataset of marine targets was constructed, and the statistical distribution characteristics, time and space correlation, and Doppler spectrum were analyzed, supporting the data usage. In the future, the types and quantities of maritime targets will continue to accumulate, providing data support for improving marine target detection and recognition performance and intelligence.
3
This study addresses the issue of fine-grained feature extraction and classification for Low-Slow-Small (LSS) targets, such as birds and drones, by proposing a multi-band multi-angle feature fusion classification method. First, data from five types of rotorcraft drones and bird models were collected at multiple angles using K-band and L-band frequency-modulated continuous-wave radars, forming a dataset for LSS target detection. Second, to capture the periodic vibration characteristics of the L-band target signals, empirical mode decomposition was applied to extract high-frequency features and reduce noise interference. For the K-band echo signals, short-time Fourier transform was applied to obtain high-resolution micro-Doppler features from various angles. Based on these features, a Multi-band Multi-angle Feature Fusion Network (MMFFNet) was designed, incorporating an improved convolutional long short-term memory network for temporal feature extraction, along with an attention fusion module and a multiscale feature fusion module. The proposed architecture improves target classification accuracy by integrating features from both bands and angles. Validation using a real-world dataset showed that compared with methods relying on single radar features, the proposed approach improved the classification accuracy for seven types of LSS targets by 3.1% under a high Signal-to-Noise Ratio (SNR) of 5 dB and by 12.3% under a low SNR of −3 dB. This study addresses the issue of fine-grained feature extraction and classification for Low-Slow-Small (LSS) targets, such as birds and drones, by proposing a multi-band multi-angle feature fusion classification method. First, data from five types of rotorcraft drones and bird models were collected at multiple angles using K-band and L-band frequency-modulated continuous-wave radars, forming a dataset for LSS target detection. Second, to capture the periodic vibration characteristics of the L-band target signals, empirical mode decomposition was applied to extract high-frequency features and reduce noise interference. For the K-band echo signals, short-time Fourier transform was applied to obtain high-resolution micro-Doppler features from various angles. Based on these features, a Multi-band Multi-angle Feature Fusion Network (MMFFNet) was designed, incorporating an improved convolutional long short-term memory network for temporal feature extraction, along with an attention fusion module and a multiscale feature fusion module. The proposed architecture improves target classification accuracy by integrating features from both bands and angles. Validation using a real-world dataset showed that compared with methods relying on single radar features, the proposed approach improved the classification accuracy for seven types of LSS targets by 3.1% under a high Signal-to-Noise Ratio (SNR) of 5 dB and by 12.3% under a low SNR of −3 dB.
4
With the increasing demands on imaging accuracy, efficiency, and robustness in modern three-Dimensional (3D) Synthetic Aperture Radar (SAR) imaging systems, the performance of traditional 3D imaging methods, such as matched filtering and compressed sensing, has become limited in these aspects. In recent years, the rapid development of Deep Learning (DL) technology has provided new theoretical solutions for SAR 3D imaging by enabling the integration of neural networks with physical radar imaging models, leading to the emergence of a learning-based imaging paradigm that combines data-driven and model-driven approaches. This paper systematically reviews recent research progress in DL-based SAR 3D imaging. Focusing on two core issues, namely super-resolution imaging and enhanced imaging, this paper discusses current research advances and hotspots in SAR 3D imaging. These include super-resolution 3D imaging methods based on feedforward neural networks and deep unfolding networks, as well as 3D enhancement techniques such as multichannel data preprocessing and point cloud post-processing. This paper also summarizes publicly available datasets for SAR 3D imaging. In addition, this paper explores current research challenges in DL SAR 3D imaging, including high-generalization and high-precision DL SAR super-resolution 3D imaging technology, DL SAR elevation dimension disambiguation technology, integrated study of DL SAR 3D imaging and image enhancement, and the construction of DL SAR 3D imaging datasets. This paper provides an outlook on future development trends, aiming to offer research references and technical guidance for scholars in related fields. With the increasing demands on imaging accuracy, efficiency, and robustness in modern three-Dimensional (3D) Synthetic Aperture Radar (SAR) imaging systems, the performance of traditional 3D imaging methods, such as matched filtering and compressed sensing, has become limited in these aspects. In recent years, the rapid development of Deep Learning (DL) technology has provided new theoretical solutions for SAR 3D imaging by enabling the integration of neural networks with physical radar imaging models, leading to the emergence of a learning-based imaging paradigm that combines data-driven and model-driven approaches. This paper systematically reviews recent research progress in DL-based SAR 3D imaging. Focusing on two core issues, namely super-resolution imaging and enhanced imaging, this paper discusses current research advances and hotspots in SAR 3D imaging. These include super-resolution 3D imaging methods based on feedforward neural networks and deep unfolding networks, as well as 3D enhancement techniques such as multichannel data preprocessing and point cloud post-processing. This paper also summarizes publicly available datasets for SAR 3D imaging. In addition, this paper explores current research challenges in DL SAR 3D imaging, including high-generalization and high-precision DL SAR super-resolution 3D imaging technology, DL SAR elevation dimension disambiguation technology, integrated study of DL SAR 3D imaging and image enhancement, and the construction of DL SAR 3D imaging datasets. This paper provides an outlook on future development trends, aiming to offer research references and technical guidance for scholars in related fields.
5
This study proposes a Synthetic Aperture Radar (SAR) aircraft detection and recognition method combined with scattering perception to address the problem of target discreteness and false alarms caused by strong background interference in SAR images. The global information is enhanced through a context-guided feature pyramid module, which suppresses strong disturbances in complex images and improves the accuracy of detection and recognition. Additionally, scatter key points are used to locate targets, and a scatter-aware detection module is designed to realize the fine correction of the regression boxes to improve target localization accuracy. This study generates and presents a high-resolution SAR-AIRcraft-1.0 dataset to verify the effectiveness of the proposed method and promote the research on SAR aircraft detection and recognition. The images in this dataset are obtained from the satellite Gaofen-3, which contains 4,368 images and 16,463 aircraft instances, covering seven aircraft categories, namely A220, A320/321, A330, ARJ21, Boeing737, Boeing787, and other. We apply the proposed method and common deep learning algorithms to the constructed dataset. The experimental results demonstrate the excellent effectiveness of our method combined with scattering perception. Furthermore, we establish benchmarks for the performance indicators of the dataset in different tasks such as SAR aircraft detection, recognition, and integrated detection and recognition. This study proposes a Synthetic Aperture Radar (SAR) aircraft detection and recognition method combined with scattering perception to address the problem of target discreteness and false alarms caused by strong background interference in SAR images. The global information is enhanced through a context-guided feature pyramid module, which suppresses strong disturbances in complex images and improves the accuracy of detection and recognition. Additionally, scatter key points are used to locate targets, and a scatter-aware detection module is designed to realize the fine correction of the regression boxes to improve target localization accuracy. This study generates and presents a high-resolution SAR-AIRcraft-1.0 dataset to verify the effectiveness of the proposed method and promote the research on SAR aircraft detection and recognition. The images in this dataset are obtained from the satellite Gaofen-3, which contains 4,368 images and 16,463 aircraft instances, covering seven aircraft categories, namely A220, A320/321, A330, ARJ21, Boeing737, Boeing787, and other. We apply the proposed method and common deep learning algorithms to the constructed dataset. The experimental results demonstrate the excellent effectiveness of our method combined with scattering perception. Furthermore, we establish benchmarks for the performance indicators of the dataset in different tasks such as SAR aircraft detection, recognition, and integrated detection and recognition.
6
Passive radar plays an important role in early warning detection and Low Slow Small (LSS) target detection. Due to the uncontrollable source of passive radar signal radiations, target characteristics are more complex, which makes target detection and identification extremely difficult. In this paper, a passive radar LSS detection dataset (LSS-PR-1.0) is constructed, which contains the radar echo signals of four typical sea and air targets, namely helicopters, unmanned aerial vehicles, speedboats, and passenger ships, as well as sea clutter data at low and high sea states. It provides data support for radar research. In terms of target feature extraction and analysis, the singular-value-decomposition sea-clutter-suppression method is first adopted to remove the influence of the strong Bragg peak of sea clutter on target echo. On this basis, four categories of ten multi-domain feature extraction and analysis methods are proposed, including time-domain features (relative average amplitude), frequency-domain features (spectral features, Doppler waterfall plot, and range Doppler features), time-frequency-domain features, and motion features (heading difference, trajectory parameters, speed variation interval, speed variation coefficient, and acceleration). Based on the actual measurement data, a comparative analysis is conducted on the characteristics of four types of sea and air targets, summarizing the patterns of various target characteristics and laying the foundation for subsequent target recognition. Passive radar plays an important role in early warning detection and Low Slow Small (LSS) target detection. Due to the uncontrollable source of passive radar signal radiations, target characteristics are more complex, which makes target detection and identification extremely difficult. In this paper, a passive radar LSS detection dataset (LSS-PR-1.0) is constructed, which contains the radar echo signals of four typical sea and air targets, namely helicopters, unmanned aerial vehicles, speedboats, and passenger ships, as well as sea clutter data at low and high sea states. It provides data support for radar research. In terms of target feature extraction and analysis, the singular-value-decomposition sea-clutter-suppression method is first adopted to remove the influence of the strong Bragg peak of sea clutter on target echo. On this basis, four categories of ten multi-domain feature extraction and analysis methods are proposed, including time-domain features (relative average amplitude), frequency-domain features (spectral features, Doppler waterfall plot, and range Doppler features), time-frequency-domain features, and motion features (heading difference, trajectory parameters, speed variation interval, speed variation coefficient, and acceleration). Based on the actual measurement data, a comparative analysis is conducted on the characteristics of four types of sea and air targets, summarizing the patterns of various target characteristics and laying the foundation for subsequent target recognition.
7
Detection of small, slow-moving targets, such as drones using Unmanned Aerial Vehicles (UAVs) poses considerable challenges to radar target detection and recognition technology. There is an urgent need to establish relevant datasets to support the development and application of techniques for detecting small, slow-moving targets. This paper presents a dataset for detecting low-speed and small-size targets using a multiband Frequency Modulated Continuous Wave (FMCW) radar. The dataset utilizes Ku-band and L-band FMCW radar to collect echo data from six UAV types and exhibits diverse temporal and frequency domain resolutions and measurement capabilities by modulating radar cycles and bandwidth, generating an LSS-FMCWR-1.0 dataset (Low Slow Small, LSS). To further enhance the capability for extracting micro-Doppler features from UAVs, this paper proposes a method for UAV micro-Doppler extraction and parameter estimation based on the local maximum synchroextracting transform. Based on the Short Time Fourier Transform (STFT), this method extracts values at the maximum energy point in the time-frequency domain to retain useful signals and refine the time-frequency energy representation. Validation and analysis using the LSS-FMCWR-1.0 dataset demonstrate that this approach reduces entropy on an average by 5.3 dB and decreases estimation errors in rotor blade length by 27.7% compared with traditional time-frequency methods. Moreover, the proposed method provides the foundation for subsequent target recognition efforts because it balances high time-frequency resolution and parameter estimation capabilities. Detection of small, slow-moving targets, such as drones using Unmanned Aerial Vehicles (UAVs) poses considerable challenges to radar target detection and recognition technology. There is an urgent need to establish relevant datasets to support the development and application of techniques for detecting small, slow-moving targets. This paper presents a dataset for detecting low-speed and small-size targets using a multiband Frequency Modulated Continuous Wave (FMCW) radar. The dataset utilizes Ku-band and L-band FMCW radar to collect echo data from six UAV types and exhibits diverse temporal and frequency domain resolutions and measurement capabilities by modulating radar cycles and bandwidth, generating an LSS-FMCWR-1.0 dataset (Low Slow Small, LSS). To further enhance the capability for extracting micro-Doppler features from UAVs, this paper proposes a method for UAV micro-Doppler extraction and parameter estimation based on the local maximum synchroextracting transform. Based on the Short Time Fourier Transform (STFT), this method extracts values at the maximum energy point in the time-frequency domain to retain useful signals and refine the time-frequency energy representation. Validation and analysis using the LSS-FMCWR-1.0 dataset demonstrate that this approach reduces entropy on an average by 5.3 dB and decreases estimation errors in rotor blade length by 27.7% compared with traditional time-frequency methods. Moreover, the proposed method provides the foundation for subsequent target recognition efforts because it balances high time-frequency resolution and parameter estimation capabilities.
8
China has one of the longest land borders in the world and features a diverse range of terrain types and a dense electromagnetic environment. Therefore, in practical applications, airborne radar faces complex environments. The efficacy of detecting airborne radar is severely deteriorated in regions with complex terrains and electromagnetic environments, limiting the ability to meet military operational requirements. Cognitive Space-Time Adaptive Processing (STAP) is an effective technical approach for addressing this problem. In this study, a cognitive STAP architecture is proposed, and based on this architecture, the database, algorithm library, cognitive STAP technology, and feedback control are introduced. Analysis of the simulated data reveals that compared to traditional STAP technology, cognitive space-time adaptive processing technology can significantly enhance the efficacy of detecting moving targets using airborne radar in complex environments. China has one of the longest land borders in the world and features a diverse range of terrain types and a dense electromagnetic environment. Therefore, in practical applications, airborne radar faces complex environments. The efficacy of detecting airborne radar is severely deteriorated in regions with complex terrains and electromagnetic environments, limiting the ability to meet military operational requirements. Cognitive Space-Time Adaptive Processing (STAP) is an effective technical approach for addressing this problem. In this study, a cognitive STAP architecture is proposed, and based on this architecture, the database, algorithm library, cognitive STAP technology, and feedback control are introduced. Analysis of the simulated data reveals that compared to traditional STAP technology, cognitive space-time adaptive processing technology can significantly enhance the efficacy of detecting moving targets using airborne radar in complex environments.
9
In recent years, the rapid development of Multimodal Large Language Models (MLLMs) and their applications in earth observation have garnered significant attention. Earth observation MLLMs achieve deep integration of multimodal information, including optical imagery, Synthetic Aperture Radar (SAR) imagery, and textual data, through the design of bridging mechanisms between large language models and vision models, combined with joint training strategies. This integration facilitates a paradigm shift in intelligent earth observation interpretation—from shallow semantic matching to higher-level understanding based on world knowledge. In this study, we systematically review the research progress in the applications of MLLMs in earth observation, specifically examining the development of Earth Observation MLLMs (EO-MLLMs), which provides a foundation for future research directions. Initially, we discuss the concept of EO-MLLMs and review their development in chronological order. Subsequently, we provide a detailed analysis and statistical summary of the proposed architectures, training methods, applications, and corresponding benchmark datasets, along with an introduction to Earth Observation Agents (EO-Agent). Finally, we summarize the research status of EO-MLLMs and discuss future research directions. In recent years, the rapid development of Multimodal Large Language Models (MLLMs) and their applications in earth observation have garnered significant attention. Earth observation MLLMs achieve deep integration of multimodal information, including optical imagery, Synthetic Aperture Radar (SAR) imagery, and textual data, through the design of bridging mechanisms between large language models and vision models, combined with joint training strategies. This integration facilitates a paradigm shift in intelligent earth observation interpretation—from shallow semantic matching to higher-level understanding based on world knowledge. In this study, we systematically review the research progress in the applications of MLLMs in earth observation, specifically examining the development of Earth Observation MLLMs (EO-MLLMs), which provides a foundation for future research directions. Initially, we discuss the concept of EO-MLLMs and review their development in chronological order. Subsequently, we provide a detailed analysis and statistical summary of the proposed architectures, training methods, applications, and corresponding benchmark datasets, along with an introduction to Earth Observation Agents (EO-Agent). Finally, we summarize the research status of EO-MLLMs and discuss future research directions.
10
The bistatic Synthetic Aperture Radar (SAR) system, which employs spatially separated transmitting and receiving platforms, provides high-resolution imaging of terrestrial and maritime scenes and targets in complex environments. Its advantages include flexible configuration, strong concealment capabilities, high interference resistance, and comprehensive target information acquisition, making it valuable in high-precision remote sensing mapping, covert imaging, and precision strikes. Image processing is critical for obtaining high-resolution Bistatic SAR (BiSAR) images. However, the echo model and characteristics of BiSAR substantially differ from those of traditional monostatic SAR, necessitating specialized image processing methods tailored to various operational modes and configurations. This study examines key challenges and solutions for several BiSAR configurations, including airborne BiSAR, BiSAR with high-speed and highly maneuverable platforms, spaceborne heterogeneous BiSAR, and spaceborne homogeneous BiSAR. This study also addresses motion compensation approaches and moving target imaging in BiSAR systems, reviews relevant domestic and international research advancements, and provides an outlook on future trends in BiSAR image processing. The bistatic Synthetic Aperture Radar (SAR) system, which employs spatially separated transmitting and receiving platforms, provides high-resolution imaging of terrestrial and maritime scenes and targets in complex environments. Its advantages include flexible configuration, strong concealment capabilities, high interference resistance, and comprehensive target information acquisition, making it valuable in high-precision remote sensing mapping, covert imaging, and precision strikes. Image processing is critical for obtaining high-resolution Bistatic SAR (BiSAR) images. However, the echo model and characteristics of BiSAR substantially differ from those of traditional monostatic SAR, necessitating specialized image processing methods tailored to various operational modes and configurations. This study examines key challenges and solutions for several BiSAR configurations, including airborne BiSAR, BiSAR with high-speed and highly maneuverable platforms, spaceborne heterogeneous BiSAR, and spaceborne homogeneous BiSAR. This study also addresses motion compensation approaches and moving target imaging in BiSAR systems, reviews relevant domestic and international research advancements, and provides an outlook on future trends in BiSAR image processing.
11
Millimeter-wave radar is increasingly being adopted for smart home systems, elder care, and surveillance monitoring, owing to its adaptability to environmental conditions, high resolution, and privacy-preserving capabilities. A key factor in effectively utilizing millimeter-wave radar is the analysis of point clouds, which are essential for recognizing human postures. However, the sparse nature of these point clouds poses significant challenges for accurate and efficient human action recognition. To overcome these issues, we present a 3D point cloud dataset tailored for human actions captured using millimeter-wave radar (mmWave-3DPCHM-1.0). This dataset is enhanced with advanced data processing techniques and cutting-edge human action recognition models. Data collection is conducted using Texas Instruments (TI)’s IWR1443-ISK and Vayyar’s vBlu radio imaging module, covering 12 common human actions, including walking, waving, standing, and falling. At the core of our approach is the Point EdgeConv and Transformer (PETer) network, which integrates edge convolution with transformer models. For each 3D point cloud frame, PETer constructs a locally directed neighborhood graph through edge convolution to extract spatial geometric features effectively. The network then leverages a series of Transformer encoding models to uncover temporal relationships across multiple point cloud frames. Extensive experiments reveal that the PETer network achieves exceptional recognition rates of 98.77% on the TI dataset and 99.51% on the Vayyar dataset, outperforming the traditional optimal baseline model by approximately 5%. With a compact model size of only 1.09 MB, PETer is well-suited for deployment on edge devices, providing an efficient solution for real-time human action recognition in resource-constrained environments. Millimeter-wave radar is increasingly being adopted for smart home systems, elder care, and surveillance monitoring, owing to its adaptability to environmental conditions, high resolution, and privacy-preserving capabilities. A key factor in effectively utilizing millimeter-wave radar is the analysis of point clouds, which are essential for recognizing human postures. However, the sparse nature of these point clouds poses significant challenges for accurate and efficient human action recognition. To overcome these issues, we present a 3D point cloud dataset tailored for human actions captured using millimeter-wave radar (mmWave-3DPCHM-1.0). This dataset is enhanced with advanced data processing techniques and cutting-edge human action recognition models. Data collection is conducted using Texas Instruments (TI)’s IWR1443-ISK and Vayyar’s vBlu radio imaging module, covering 12 common human actions, including walking, waving, standing, and falling. At the core of our approach is the Point EdgeConv and Transformer (PETer) network, which integrates edge convolution with transformer models. For each 3D point cloud frame, PETer constructs a locally directed neighborhood graph through edge convolution to extract spatial geometric features effectively. The network then leverages a series of Transformer encoding models to uncover temporal relationships across multiple point cloud frames. Extensive experiments reveal that the PETer network achieves exceptional recognition rates of 98.77% on the TI dataset and 99.51% on the Vayyar dataset, outperforming the traditional optimal baseline model by approximately 5%. With a compact model size of only 1.09 MB, PETer is well-suited for deployment on edge devices, providing an efficient solution for real-time human action recognition in resource-constrained environments.
12
Synthetic Aperture Radar (SAR) is widely used in military and civilian applications, with intelligent target interpretation of SAR images being a crucial component of SAR applications. Vision-Language Models (VLMs) play an important role in SAR target interpretation. By incorporating natural language understanding, VLMs effectively address the challenges posed by large intraclass variability in target characteristics and the scarcity of high-quality labeled samples, thereby advancing the field from purely visual interpretation toward semantic understanding of targets. Drawing upon our team’s extensive research experience in SAR target interpretation theory, algorithms, and applications, this paper provides a comprehensive review of intelligent SAR target interpretation based on VLMs. We provide an in-depth analysis of existing challenges and tasks, summarize the current state of research, and compile available open-source datasets. Furthermore, we systematically outline the evolution, ranging from task-specific VLMs to contrastive-, conversational-, and generative-based VLMs and foundational models. Finally, we discuss the latest challenges and future outlooks in SAR target interpretation by VLMs. Synthetic Aperture Radar (SAR) is widely used in military and civilian applications, with intelligent target interpretation of SAR images being a crucial component of SAR applications. Vision-Language Models (VLMs) play an important role in SAR target interpretation. By incorporating natural language understanding, VLMs effectively address the challenges posed by large intraclass variability in target characteristics and the scarcity of high-quality labeled samples, thereby advancing the field from purely visual interpretation toward semantic understanding of targets. Drawing upon our team’s extensive research experience in SAR target interpretation theory, algorithms, and applications, this paper provides a comprehensive review of intelligent SAR target interpretation based on VLMs. We provide an in-depth analysis of existing challenges and tasks, summarize the current state of research, and compile available open-source datasets. Furthermore, we systematically outline the evolution, ranging from task-specific VLMs to contrastive-, conversational-, and generative-based VLMs and foundational models. Finally, we discuss the latest challenges and future outlooks in SAR target interpretation by VLMs.
13
Maritime target detection and identification technology are developed using large-scale, high-quality multi-sensor measurement data. Therefore, the Sea Detection Radar Data Sharing Program (SDRDSP) was upgraded to the Maritime Target Data Sharing Program (MTDSP), integrating multiple observation modalities, such as HH-polarized radar, VV-polarized radar, electro-optical devices, and Automatic Identification System (AIS) equipment to conduct multisource observation experiments on maritime vessel targets. The program collects various data types, including radar intermediate frequency/video echo slice data, visible and infrared imagery, AIS static and dynamic messages, and meteorological and hydrological data, covering representative sea conditions and multiple vessel types. A comprehensive multisource observation dataset was constructed, enabling the matching and annotation of multimodal data for the same target. Moreover, an automated data management system was implemented to support data storage, conditional retrieval, and batch export, providing a solid foundation for the automated acquisition, long-term accumulation, and efficient use of maritime target characteristic data. Based on this system and measured data, the time/frequency domain features of the same and different vessel targets under different sea states, attitudes, polarization conditions are compared and analyzed, and the statistical conclusion of the change in target features is obtained. Maritime target detection and identification technology are developed using large-scale, high-quality multi-sensor measurement data. Therefore, the Sea Detection Radar Data Sharing Program (SDRDSP) was upgraded to the Maritime Target Data Sharing Program (MTDSP), integrating multiple observation modalities, such as HH-polarized radar, VV-polarized radar, electro-optical devices, and Automatic Identification System (AIS) equipment to conduct multisource observation experiments on maritime vessel targets. The program collects various data types, including radar intermediate frequency/video echo slice data, visible and infrared imagery, AIS static and dynamic messages, and meteorological and hydrological data, covering representative sea conditions and multiple vessel types. A comprehensive multisource observation dataset was constructed, enabling the matching and annotation of multimodal data for the same target. Moreover, an automated data management system was implemented to support data storage, conditional retrieval, and batch export, providing a solid foundation for the automated acquisition, long-term accumulation, and efficient use of maritime target characteristic data. Based on this system and measured data, the time/frequency domain features of the same and different vessel targets under different sea states, attitudes, polarization conditions are compared and analyzed, and the statistical conclusion of the change in target features is obtained.
14

Flying birds and Unmanned Aerial Vehicles (UAVs) are typical “low, slow, and small” targets with low observability. The need for effective monitoring and identification of these two targets has become urgent and must be solved to ensure the safety of air routes and urban areas. There are many types of flying birds and UAVs that are characterized by low flying heights, strong maneuverability, small radar cross-sectional areas, and complicated detection environments, which are posing great challenges in target detection worldwide. “Visible (high detection ability) and clear-cut (high recognition probability)” methods and technologies must be developed that can finely describe and recognize UAVs, flying birds, and “low-slow-small” targets. This paper reviews the recent progress in research on detection and recognition technologies for rotor UAVs and flying birds in complex scenes and discusses effective detection and recognition methods for the detection of birds and drones, including echo modeling and recognition of fretting characteristics, the enhancement and extraction of maneuvering features in ubiquitous observation mode, distributed multi-view features fusion, differences in motion trajectories, and intelligent classification via deep learning. Lastly, the problems of existing research approaches are summarized, and we consider the future development prospects of target detection and recognition technologies for flying birds and UAVs in complex scenarios.

Flying birds and Unmanned Aerial Vehicles (UAVs) are typical “low, slow, and small” targets with low observability. The need for effective monitoring and identification of these two targets has become urgent and must be solved to ensure the safety of air routes and urban areas. There are many types of flying birds and UAVs that are characterized by low flying heights, strong maneuverability, small radar cross-sectional areas, and complicated detection environments, which are posing great challenges in target detection worldwide. “Visible (high detection ability) and clear-cut (high recognition probability)” methods and technologies must be developed that can finely describe and recognize UAVs, flying birds, and “low-slow-small” targets. This paper reviews the recent progress in research on detection and recognition technologies for rotor UAVs and flying birds in complex scenes and discusses effective detection and recognition methods for the detection of birds and drones, including echo modeling and recognition of fretting characteristics, the enhancement and extraction of maneuvering features in ubiquitous observation mode, distributed multi-view features fusion, differences in motion trajectories, and intelligent classification via deep learning. Lastly, the problems of existing research approaches are summarized, and we consider the future development prospects of target detection and recognition technologies for flying birds and UAVs in complex scenarios.

15

As one of the core components of Advanced Driver Assistance Systems (ADAS), automotive millimeter-wave radar has become the focus of scholars and manufacturers at home and abroad because it has the advantages of all-day and all-weather operation, miniaturization, high integration, and key sensing capabilities. The core performance indicators of the automotive millimeter-wave radar are distance, speed, angular resolution, and field of view. Accuracy, cost, real-time and detection performance, and volume are the key issues to be considered. The increasing performance requirements pose several challenges for the signal processing of millimeter-wave radar systems. Radar signal processing technology is crucial for improving radar performance to meet more stringent requirements. Obtaining dense radar point clouds, generating accurate radar imaging results, and mitigating mutual interference among multiple radar systems are the key points and the foundation for subsequent tracking, recognition, and other applications. Therefore, this paper discusses the practical application of the automotive millimeter-wave radar system based on the key technologies of signal processing, summarizes relevant research results, and mainly discusses the topics of point cloud imaging processing, synthetic aperture radar imaging processing, and interference suppression. Finally, herein, we summarize the research status at home and abroad. Moreover, future development trends for automotive millimeter-wave radar systems are forecast with the hope of enlightening readers in related fields.

As one of the core components of Advanced Driver Assistance Systems (ADAS), automotive millimeter-wave radar has become the focus of scholars and manufacturers at home and abroad because it has the advantages of all-day and all-weather operation, miniaturization, high integration, and key sensing capabilities. The core performance indicators of the automotive millimeter-wave radar are distance, speed, angular resolution, and field of view. Accuracy, cost, real-time and detection performance, and volume are the key issues to be considered. The increasing performance requirements pose several challenges for the signal processing of millimeter-wave radar systems. Radar signal processing technology is crucial for improving radar performance to meet more stringent requirements. Obtaining dense radar point clouds, generating accurate radar imaging results, and mitigating mutual interference among multiple radar systems are the key points and the foundation for subsequent tracking, recognition, and other applications. Therefore, this paper discusses the practical application of the automotive millimeter-wave radar system based on the key technologies of signal processing, summarizes relevant research results, and mainly discusses the topics of point cloud imaging processing, synthetic aperture radar imaging processing, and interference suppression. Finally, herein, we summarize the research status at home and abroad. Moreover, future development trends for automotive millimeter-wave radar systems are forecast with the hope of enlightening readers in related fields.

16
Integrated Sensing And Communications (ISAC), a key technology for 6G networks, has attracted extensive attention from both academia and industry. Leveraging the widespread deployment of communication infrastructures, the integration of sensing functions into communication systems to achieve ISAC networks has emerged as a research focus. To this end, the signal design for communication-centric ISAC systems should be addressed first. Two main technical routes are considered for communication-centric signal design: (1) pilot-based sensing signal design and (2) data-based ISAC signal design. This paper provides an in-depth and systematic overview of signal design for the aforementioned technical routes. First, a comprehensive review of the existing literature on pilot-based signal design for sensing is presented. Then, the data-based ISAC signal design is analyzed. Finally, future research topics on the ISAC signal design are proposed. Integrated Sensing And Communications (ISAC), a key technology for 6G networks, has attracted extensive attention from both academia and industry. Leveraging the widespread deployment of communication infrastructures, the integration of sensing functions into communication systems to achieve ISAC networks has emerged as a research focus. To this end, the signal design for communication-centric ISAC systems should be addressed first. Two main technical routes are considered for communication-centric signal design: (1) pilot-based sensing signal design and (2) data-based ISAC signal design. This paper provides an in-depth and systematic overview of signal design for the aforementioned technical routes. First, a comprehensive review of the existing literature on pilot-based signal design for sensing is presented. Then, the data-based ISAC signal design is analyzed. Finally, future research topics on the ISAC signal design are proposed.
17
Spaceborne Interferometric Synthetic Aperture Radar (InSAR) enables surface elevation measurement and deformation monitoring by measuring phase differences along the radar line of sight. However, meeting the future demand for higher-precision measurements remains challenging: analytical models linking InSAR system design parameters to measurement accuracy are still limited by incomplete key parameters and insufficient or unclear physical constraints. These limitations restrict the development of next-generation InSAR technology. This study examines the complex multifactor coupling between system design parameters and measurement accuracy. It provides a detailed analysis of the imaging mechanism and theoretical constraints of spaceborne InSAR with spatial and temporal baselines and presents a spatiotemporal error model integrating multisource decorrelation. The nonlinear relationship between baseline parameters and measurement accuracy is quantitatively characterized, and a comprehensive evaluation framework is established based on key indicators such as coherence, elevation accuracy, and coherent temporal baseline-based deformation sensitivity. Built on top of these analysis, the concept and system architecture of very large baseline spaceborne InSAR are proposed, and its performance is analyzed in detail. The associated technical challenges—including orbit configuration, system design, synchronization, error correction, and phase unwrapping—are systematically discussed. Potential applications of this type of InSAR system architecture in high-precision elevation, deformation measurements, and distributed SAR systems are introduced. The proposed framework provides theoretical support for the design of next-generation high-precision, multidimensional InSAR systems and is expected to play a key role in the frontier of Earth science exploration and the safety assurance of major national engineering projects. Spaceborne Interferometric Synthetic Aperture Radar (InSAR) enables surface elevation measurement and deformation monitoring by measuring phase differences along the radar line of sight. However, meeting the future demand for higher-precision measurements remains challenging: analytical models linking InSAR system design parameters to measurement accuracy are still limited by incomplete key parameters and insufficient or unclear physical constraints. These limitations restrict the development of next-generation InSAR technology. This study examines the complex multifactor coupling between system design parameters and measurement accuracy. It provides a detailed analysis of the imaging mechanism and theoretical constraints of spaceborne InSAR with spatial and temporal baselines and presents a spatiotemporal error model integrating multisource decorrelation. The nonlinear relationship between baseline parameters and measurement accuracy is quantitatively characterized, and a comprehensive evaluation framework is established based on key indicators such as coherence, elevation accuracy, and coherent temporal baseline-based deformation sensitivity. Built on top of these analysis, the concept and system architecture of very large baseline spaceborne InSAR are proposed, and its performance is analyzed in detail. The associated technical challenges—including orbit configuration, system design, synchronization, error correction, and phase unwrapping—are systematically discussed. Potential applications of this type of InSAR system architecture in high-precision elevation, deformation measurements, and distributed SAR systems are introduced. The proposed framework provides theoretical support for the design of next-generation high-precision, multidimensional InSAR systems and is expected to play a key role in the frontier of Earth science exploration and the safety assurance of major national engineering projects.
18
Non-Line-Of-Sight (NLOS) millimeter wave radar 3D imaging leverages electromagnetic wave propagation characteristics such as reflection, diffraction, scattering, and penetration to detect, locate, and image hidden targets in occluded environments. It holds significant potential for applications in autonomous driving, disaster rescue, and urban warfare. However, uncertainties introduced by reflection surfaces and occluding objects in practical NLOS scenarios, such as phase errors, aperture shadowing, and multipath effect, lead to issues like blurred imaging and increased artifacts in radar imaging. To address these challenges, this study proposes a 3D imaging method for NLOS millimeter wave radar based on Range Migration (RM) operator learning, leveraging the adaptive optimization properties of deep unfolding networks and prior environmental perception. First, a 3D imaging model for NLOS millimeter wave radar in Looking Around Corner (LAC) scenarios is established. An RM kernel operator is introduced to enhance imaging efficiency and reduce computational complexity. Second, a high-precision NLOS 3D imaging network is constructed based on the Fast Iterative Shrinkage/Thresholding Algorithm (FISTA) framework. Utilizing features specific to NLOS scenes and designing algorithm parameters as functions of network weights, the method achieves high-precision, high-efficiency 3D reconstruction of NLOS targets. Finally, a near-field NLOS millimeter wave radar imaging platform is developed. Experimental validations are performed on targets, including metal letters “O” and “S”, an Eiffel Tower model, and an artificial satellite model, under both ideal and non-ideal reflection surface conditions. The results demonstrate that the proposed method significantly improves 3D imaging precision, achieving a two-orders-of-magnitude increase in computational speed over traditional sparse imaging algorithms. Non-Line-Of-Sight (NLOS) millimeter wave radar 3D imaging leverages electromagnetic wave propagation characteristics such as reflection, diffraction, scattering, and penetration to detect, locate, and image hidden targets in occluded environments. It holds significant potential for applications in autonomous driving, disaster rescue, and urban warfare. However, uncertainties introduced by reflection surfaces and occluding objects in practical NLOS scenarios, such as phase errors, aperture shadowing, and multipath effect, lead to issues like blurred imaging and increased artifacts in radar imaging. To address these challenges, this study proposes a 3D imaging method for NLOS millimeter wave radar based on Range Migration (RM) operator learning, leveraging the adaptive optimization properties of deep unfolding networks and prior environmental perception. First, a 3D imaging model for NLOS millimeter wave radar in Looking Around Corner (LAC) scenarios is established. An RM kernel operator is introduced to enhance imaging efficiency and reduce computational complexity. Second, a high-precision NLOS 3D imaging network is constructed based on the Fast Iterative Shrinkage/Thresholding Algorithm (FISTA) framework. Utilizing features specific to NLOS scenes and designing algorithm parameters as functions of network weights, the method achieves high-precision, high-efficiency 3D reconstruction of NLOS targets. Finally, a near-field NLOS millimeter wave radar imaging platform is developed. Experimental validations are performed on targets, including metal letters “O” and “S”, an Eiffel Tower model, and an artificial satellite model, under both ideal and non-ideal reflection surface conditions. The results demonstrate that the proposed method significantly improves 3D imaging precision, achieving a two-orders-of-magnitude increase in computational speed over traditional sparse imaging algorithms.
19
To address issues such as insufficient feature extraction, limited spatiotemporal correlation modeling, and poor classification performance in radar classification of Low, Slow, and Small (LSS) targets, this paper investigates on graph network-based feature extraction and classification methods. First, focusing on digital array ubiquitous radar, a radar detection dataset for LSS targets, named LSS-DAUR-1.0, is constructed; it contains Doppler and track data for six types of targets: Passenger ships, speedboats, helicopters, rotor drones, birds, and fixed-wing drones. Second, based on this dataset, the multidomain and multidimensional characteristics of the targets are analyzed, and the complementarity between Doppler and physical motion features is verified through correlation and cosine similarity analyses. On this basis, a Graph Convolutional Network with Dynamic Graph Construction (DG-GCN) classification method fusing dual features is proposed. An adaptive window adjustment, a hybrid attenuation function, and a dynamic threshold mechanism are designed to construct an adaptive dynamic graph based on spatiotemporal correlation. Combined with graph convolution-based feature learning and classification modules, this approach achieves refined classification of low, slow, and small targets. Validation on the LSS-DAUR-1.0 dataset shows that the DG-GCN achieves 99.66% classification accuracy, which is 6.78% and 17.97% higher than that of ResNet and Transformer models, respectively. The total processing time is only 4.98 ms, which is more than 80% lower than that of the aforementioned comparison models. Hence, the DG-GCN achieves both high accuracy and efficiency. In addition, noise environment tests show good robustness. Ablation experiments verify that the dynamic edge weight mechanism compensates for the lack of spatial feature correlation in purely temporal connections and improves the model’s generalizability. To address issues such as insufficient feature extraction, limited spatiotemporal correlation modeling, and poor classification performance in radar classification of Low, Slow, and Small (LSS) targets, this paper investigates on graph network-based feature extraction and classification methods. First, focusing on digital array ubiquitous radar, a radar detection dataset for LSS targets, named LSS-DAUR-1.0, is constructed; it contains Doppler and track data for six types of targets: Passenger ships, speedboats, helicopters, rotor drones, birds, and fixed-wing drones. Second, based on this dataset, the multidomain and multidimensional characteristics of the targets are analyzed, and the complementarity between Doppler and physical motion features is verified through correlation and cosine similarity analyses. On this basis, a Graph Convolutional Network with Dynamic Graph Construction (DG-GCN) classification method fusing dual features is proposed. An adaptive window adjustment, a hybrid attenuation function, and a dynamic threshold mechanism are designed to construct an adaptive dynamic graph based on spatiotemporal correlation. Combined with graph convolution-based feature learning and classification modules, this approach achieves refined classification of low, slow, and small targets. Validation on the LSS-DAUR-1.0 dataset shows that the DG-GCN achieves 99.66% classification accuracy, which is 6.78% and 17.97% higher than that of ResNet and Transformer models, respectively. The total processing time is only 4.98 ms, which is more than 80% lower than that of the aforementioned comparison models. Hence, the DG-GCN achieves both high accuracy and efficiency. In addition, noise environment tests show good robustness. Ablation experiments verify that the dynamic edge weight mechanism compensates for the lack of spatial feature correlation in purely temporal connections and improves the model’s generalizability.
20
As the electromagnetic spectrum becomes a key operational domain in modern warfare, radars will face a more complex, dexterous, and smarter electromagnetic interference environment in future military operations. Cognitive Intelligent Radar (CIR) has become one of the key development directions in the field of radar technology because it has the capabilities of active environmental perception, arbitrary transmit and receive design, intelligent signal processing, and resource scheduling, therefore, can adapt to the complex and changeable battlefield electromagnetic confrontation environment. In this study, the CIR is decomposed into four functional modules: cognitive transmitting, cognitive receiving, intelligent signal processing, and intelligent resource scheduling. Then, the antijamming principle of each link (i.e., interference perception, transmit design, receive design, signal processing, and resource scheduling) of CIR is elucidated. Finally, we summarize the representative literature in recent years and analyze the technological development trend in this field to provide the necessary reference and basis for future technological research. As the electromagnetic spectrum becomes a key operational domain in modern warfare, radars will face a more complex, dexterous, and smarter electromagnetic interference environment in future military operations. Cognitive Intelligent Radar (CIR) has become one of the key development directions in the field of radar technology because it has the capabilities of active environmental perception, arbitrary transmit and receive design, intelligent signal processing, and resource scheduling, therefore, can adapt to the complex and changeable battlefield electromagnetic confrontation environment. In this study, the CIR is decomposed into four functional modules: cognitive transmitting, cognitive receiving, intelligent signal processing, and intelligent resource scheduling. Then, the antijamming principle of each link (i.e., interference perception, transmit design, receive design, signal processing, and resource scheduling) of CIR is elucidated. Finally, we summarize the representative literature in recent years and analyze the technological development trend in this field to provide the necessary reference and basis for future technological research.
  • First
  • Prev
  • 1
  • 2
  • 3
  • 4
  • 5
  • Last
  • Total:5
  • To
  • Go