Most Downloaded

1
With the increasing demands on imaging accuracy, efficiency, and robustness in modern three-Dimensional (3D) Synthetic Aperture Radar (SAR) imaging systems, the performance of traditional 3D imaging methods, such as matched filtering and compressed sensing, has become limited in these aspects. In recent years, the rapid development of Deep Learning (DL) technology has provided new theoretical solutions for SAR 3D imaging by enabling the integration of neural networks with physical radar imaging models, leading to the emergence of a learning-based imaging paradigm that combines data-driven and model-driven approaches. This paper systematically reviews recent research progress in DL-based SAR 3D imaging. Focusing on two core issues, namely super-resolution imaging and enhanced imaging, this paper discusses current research advances and hotspots in SAR 3D imaging. These include super-resolution 3D imaging methods based on feedforward neural networks and deep unfolding networks, as well as 3D enhancement techniques such as multichannel data preprocessing and point cloud post-processing. This paper also summarizes publicly available datasets for SAR 3D imaging. In addition, this paper explores current research challenges in DL SAR 3D imaging, including high-generalization and high-precision DL SAR super-resolution 3D imaging technology, DL SAR elevation dimension disambiguation technology, integrated study of DL SAR 3D imaging and image enhancement, and the construction of DL SAR 3D imaging datasets. This paper provides an outlook on future development trends, aiming to offer research references and technical guidance for scholars in related fields. With the increasing demands on imaging accuracy, efficiency, and robustness in modern three-Dimensional (3D) Synthetic Aperture Radar (SAR) imaging systems, the performance of traditional 3D imaging methods, such as matched filtering and compressed sensing, has become limited in these aspects. In recent years, the rapid development of Deep Learning (DL) technology has provided new theoretical solutions for SAR 3D imaging by enabling the integration of neural networks with physical radar imaging models, leading to the emergence of a learning-based imaging paradigm that combines data-driven and model-driven approaches. This paper systematically reviews recent research progress in DL-based SAR 3D imaging. Focusing on two core issues, namely super-resolution imaging and enhanced imaging, this paper discusses current research advances and hotspots in SAR 3D imaging. These include super-resolution 3D imaging methods based on feedforward neural networks and deep unfolding networks, as well as 3D enhancement techniques such as multichannel data preprocessing and point cloud post-processing. This paper also summarizes publicly available datasets for SAR 3D imaging. In addition, this paper explores current research challenges in DL SAR 3D imaging, including high-generalization and high-precision DL SAR super-resolution 3D imaging technology, DL SAR elevation dimension disambiguation technology, integrated study of DL SAR 3D imaging and image enhancement, and the construction of DL SAR 3D imaging datasets. This paper provides an outlook on future development trends, aiming to offer research references and technical guidance for scholars in related fields.
2
This study addresses the issue of fine-grained feature extraction and classification for Low-Slow-Small (LSS) targets, such as birds and drones, by proposing a multi-band multi-angle feature fusion classification method. First, data from five types of rotorcraft drones and bird models were collected at multiple angles using K-band and L-band frequency-modulated continuous-wave radars, forming a dataset for LSS target detection. Second, to capture the periodic vibration characteristics of the L-band target signals, empirical mode decomposition was applied to extract high-frequency features and reduce noise interference. For the K-band echo signals, short-time Fourier transform was applied to obtain high-resolution micro-Doppler features from various angles. Based on these features, a Multi-band Multi-angle Feature Fusion Network (MMFFNet) was designed, incorporating an improved convolutional long short-term memory network for temporal feature extraction, along with an attention fusion module and a multiscale feature fusion module. The proposed architecture improves target classification accuracy by integrating features from both bands and angles. Validation using a real-world dataset showed that compared with methods relying on single radar features, the proposed approach improved the classification accuracy for seven types of LSS targets by 3.1% under a high Signal-to-Noise Ratio (SNR) of 5 dB and by 12.3% under a low SNR of −3 dB. This study addresses the issue of fine-grained feature extraction and classification for Low-Slow-Small (LSS) targets, such as birds and drones, by proposing a multi-band multi-angle feature fusion classification method. First, data from five types of rotorcraft drones and bird models were collected at multiple angles using K-band and L-band frequency-modulated continuous-wave radars, forming a dataset for LSS target detection. Second, to capture the periodic vibration characteristics of the L-band target signals, empirical mode decomposition was applied to extract high-frequency features and reduce noise interference. For the K-band echo signals, short-time Fourier transform was applied to obtain high-resolution micro-Doppler features from various angles. Based on these features, a Multi-band Multi-angle Feature Fusion Network (MMFFNet) was designed, incorporating an improved convolutional long short-term memory network for temporal feature extraction, along with an attention fusion module and a multiscale feature fusion module. The proposed architecture improves target classification accuracy by integrating features from both bands and angles. Validation using a real-world dataset showed that compared with methods relying on single radar features, the proposed approach improved the classification accuracy for seven types of LSS targets by 3.1% under a high Signal-to-Noise Ratio (SNR) of 5 dB and by 12.3% under a low SNR of −3 dB.
3
Deep learning is primarily used for target detection in Synthetic Aperture Radar (SAR) images; however, its performance heavily relies on large-scale labeled datasets. The detection performance of deep learning models degrades when applied to SAR data with varying distributions, hindering their real-world applicability. In addition, manual labeling of SAR data is costly. Hence, cross-domain learning strategies based on multisource information are being explored to address these challenges. These strategies can assist detection models in realizing cross-domain knowledge migration by integrating prior information from optical remote sensing images or heterogeneous SAR images acquired from different sensors. This paper focuses on cross-domain learning technologies within the deep learning framework. In addition, it provides a systematic overview of the latest research progress in this field and analyzes the core issues, advantages, and applicable scenarios of existing technologies from a methodological perspective. It outlines future research directions based on the law of technological evolution, aiming to offer theoretical support and methodological references to enhance the generalizability of target detection in SAR images. Deep learning is primarily used for target detection in Synthetic Aperture Radar (SAR) images; however, its performance heavily relies on large-scale labeled datasets. The detection performance of deep learning models degrades when applied to SAR data with varying distributions, hindering their real-world applicability. In addition, manual labeling of SAR data is costly. Hence, cross-domain learning strategies based on multisource information are being explored to address these challenges. These strategies can assist detection models in realizing cross-domain knowledge migration by integrating prior information from optical remote sensing images or heterogeneous SAR images acquired from different sensors. This paper focuses on cross-domain learning technologies within the deep learning framework. In addition, it provides a systematic overview of the latest research progress in this field and analyzes the core issues, advantages, and applicable scenarios of existing technologies from a methodological perspective. It outlines future research directions based on the law of technological evolution, aiming to offer theoretical support and methodological references to enhance the generalizability of target detection in SAR images.
4
Marine target detection and recognition depend on the characteristics of marine targets and sea clutter. Therefore, understanding the essential features of marine targets based on the measured data is crucial for advancing target detection and recognition technology. To address the issue of insufficient data on the scattering characteristics of marine targets, the Sea-Detecting Radar Data-Sharing Program (SDRDSP) was upgraded to obtain data on marine targets and their environment under different polarizations and sea states. This upgrade expanded the physical dimension of radar target observation and improved radar and auxiliary data acquisition capabilities. Furthermore, a dual-polarized multistate scattering characteristic dataset of marine targets was constructed, and the statistical distribution characteristics, time and space correlation, and Doppler spectrum were analyzed, supporting the data usage. In the future, the types and quantities of maritime targets will continue to accumulate, providing data support for improving marine target detection and recognition performance and intelligence. Marine target detection and recognition depend on the characteristics of marine targets and sea clutter. Therefore, understanding the essential features of marine targets based on the measured data is crucial for advancing target detection and recognition technology. To address the issue of insufficient data on the scattering characteristics of marine targets, the Sea-Detecting Radar Data-Sharing Program (SDRDSP) was upgraded to obtain data on marine targets and their environment under different polarizations and sea states. This upgrade expanded the physical dimension of radar target observation and improved radar and auxiliary data acquisition capabilities. Furthermore, a dual-polarized multistate scattering characteristic dataset of marine targets was constructed, and the statistical distribution characteristics, time and space correlation, and Doppler spectrum were analyzed, supporting the data usage. In the future, the types and quantities of maritime targets will continue to accumulate, providing data support for improving marine target detection and recognition performance and intelligence.
5
This study proposes a Synthetic Aperture Radar (SAR) aircraft detection and recognition method combined with scattering perception to address the problem of target discreteness and false alarms caused by strong background interference in SAR images. The global information is enhanced through a context-guided feature pyramid module, which suppresses strong disturbances in complex images and improves the accuracy of detection and recognition. Additionally, scatter key points are used to locate targets, and a scatter-aware detection module is designed to realize the fine correction of the regression boxes to improve target localization accuracy. This study generates and presents a high-resolution SAR-AIRcraft-1.0 dataset to verify the effectiveness of the proposed method and promote the research on SAR aircraft detection and recognition. The images in this dataset are obtained from the satellite Gaofen-3, which contains 4,368 images and 16,463 aircraft instances, covering seven aircraft categories, namely A220, A320/321, A330, ARJ21, Boeing737, Boeing787, and other. We apply the proposed method and common deep learning algorithms to the constructed dataset. The experimental results demonstrate the excellent effectiveness of our method combined with scattering perception. Furthermore, we establish benchmarks for the performance indicators of the dataset in different tasks such as SAR aircraft detection, recognition, and integrated detection and recognition. This study proposes a Synthetic Aperture Radar (SAR) aircraft detection and recognition method combined with scattering perception to address the problem of target discreteness and false alarms caused by strong background interference in SAR images. The global information is enhanced through a context-guided feature pyramid module, which suppresses strong disturbances in complex images and improves the accuracy of detection and recognition. Additionally, scatter key points are used to locate targets, and a scatter-aware detection module is designed to realize the fine correction of the regression boxes to improve target localization accuracy. This study generates and presents a high-resolution SAR-AIRcraft-1.0 dataset to verify the effectiveness of the proposed method and promote the research on SAR aircraft detection and recognition. The images in this dataset are obtained from the satellite Gaofen-3, which contains 4,368 images and 16,463 aircraft instances, covering seven aircraft categories, namely A220, A320/321, A330, ARJ21, Boeing737, Boeing787, and other. We apply the proposed method and common deep learning algorithms to the constructed dataset. The experimental results demonstrate the excellent effectiveness of our method combined with scattering perception. Furthermore, we establish benchmarks for the performance indicators of the dataset in different tasks such as SAR aircraft detection, recognition, and integrated detection and recognition.
6
Detection of small, slow-moving targets, such as drones using Unmanned Aerial Vehicles (UAVs) poses considerable challenges to radar target detection and recognition technology. There is an urgent need to establish relevant datasets to support the development and application of techniques for detecting small, slow-moving targets. This paper presents a dataset for detecting low-speed and small-size targets using a multiband Frequency Modulated Continuous Wave (FMCW) radar. The dataset utilizes Ku-band and L-band FMCW radar to collect echo data from six UAV types and exhibits diverse temporal and frequency domain resolutions and measurement capabilities by modulating radar cycles and bandwidth, generating an LSS-FMCWR-1.0 dataset (Low Slow Small, LSS). To further enhance the capability for extracting micro-Doppler features from UAVs, this paper proposes a method for UAV micro-Doppler extraction and parameter estimation based on the local maximum synchroextracting transform. Based on the Short Time Fourier Transform (STFT), this method extracts values at the maximum energy point in the time-frequency domain to retain useful signals and refine the time-frequency energy representation. Validation and analysis using the LSS-FMCWR-1.0 dataset demonstrate that this approach reduces entropy on an average by 5.3 dB and decreases estimation errors in rotor blade length by 27.7% compared with traditional time-frequency methods. Moreover, the proposed method provides the foundation for subsequent target recognition efforts because it balances high time-frequency resolution and parameter estimation capabilities. Detection of small, slow-moving targets, such as drones using Unmanned Aerial Vehicles (UAVs) poses considerable challenges to radar target detection and recognition technology. There is an urgent need to establish relevant datasets to support the development and application of techniques for detecting small, slow-moving targets. This paper presents a dataset for detecting low-speed and small-size targets using a multiband Frequency Modulated Continuous Wave (FMCW) radar. The dataset utilizes Ku-band and L-band FMCW radar to collect echo data from six UAV types and exhibits diverse temporal and frequency domain resolutions and measurement capabilities by modulating radar cycles and bandwidth, generating an LSS-FMCWR-1.0 dataset (Low Slow Small, LSS). To further enhance the capability for extracting micro-Doppler features from UAVs, this paper proposes a method for UAV micro-Doppler extraction and parameter estimation based on the local maximum synchroextracting transform. Based on the Short Time Fourier Transform (STFT), this method extracts values at the maximum energy point in the time-frequency domain to retain useful signals and refine the time-frequency energy representation. Validation and analysis using the LSS-FMCWR-1.0 dataset demonstrate that this approach reduces entropy on an average by 5.3 dB and decreases estimation errors in rotor blade length by 27.7% compared with traditional time-frequency methods. Moreover, the proposed method provides the foundation for subsequent target recognition efforts because it balances high time-frequency resolution and parameter estimation capabilities.
7
To address issues such as insufficient feature extraction, limited spatiotemporal correlation modeling, and poor classification performance in radar classification of Low, Slow, and Small (LSS) targets, this paper investigates on graph network-based feature extraction and classification methods. First, focusing on digital array ubiquitous radar, a radar detection dataset for LSS targets, named LSS-DAUR-1.0, is constructed; it contains Doppler and track data for six types of targets: Passenger ships, speedboats, helicopters, rotor drones, birds, and fixed-wing drones. Second, based on this dataset, the multidomain and multidimensional characteristics of the targets are analyzed, and the complementarity between Doppler and physical motion features is verified through correlation and cosine similarity analyses. On this basis, a Graph Convolutional Network with Dynamic Graph Construction (DG-GCN) classification method fusing dual features is proposed. An adaptive window adjustment, a hybrid attenuation function, and a dynamic threshold mechanism are designed to construct an adaptive dynamic graph based on spatiotemporal correlation. Combined with graph convolution-based feature learning and classification modules, this approach achieves refined classification of low, slow, and small targets. Validation on the LSS-DAUR-1.0 dataset shows that the DG-GCN achieves 99.66% classification accuracy, which is 6.78% and 17.97% higher than that of ResNet and Transformer models, respectively. The total processing time is only 4.98 ms, which is more than 80% lower than that of the aforementioned comparison models. Hence, the DG-GCN achieves both high accuracy and efficiency. In addition, noise environment tests show good robustness. Ablation experiments verify that the dynamic edge weight mechanism compensates for the lack of spatial feature correlation in purely temporal connections and improves the model’s generalizability. To address issues such as insufficient feature extraction, limited spatiotemporal correlation modeling, and poor classification performance in radar classification of Low, Slow, and Small (LSS) targets, this paper investigates on graph network-based feature extraction and classification methods. First, focusing on digital array ubiquitous radar, a radar detection dataset for LSS targets, named LSS-DAUR-1.0, is constructed; it contains Doppler and track data for six types of targets: Passenger ships, speedboats, helicopters, rotor drones, birds, and fixed-wing drones. Second, based on this dataset, the multidomain and multidimensional characteristics of the targets are analyzed, and the complementarity between Doppler and physical motion features is verified through correlation and cosine similarity analyses. On this basis, a Graph Convolutional Network with Dynamic Graph Construction (DG-GCN) classification method fusing dual features is proposed. An adaptive window adjustment, a hybrid attenuation function, and a dynamic threshold mechanism are designed to construct an adaptive dynamic graph based on spatiotemporal correlation. Combined with graph convolution-based feature learning and classification modules, this approach achieves refined classification of low, slow, and small targets. Validation on the LSS-DAUR-1.0 dataset shows that the DG-GCN achieves 99.66% classification accuracy, which is 6.78% and 17.97% higher than that of ResNet and Transformer models, respectively. The total processing time is only 4.98 ms, which is more than 80% lower than that of the aforementioned comparison models. Hence, the DG-GCN achieves both high accuracy and efficiency. In addition, noise environment tests show good robustness. Ablation experiments verify that the dynamic edge weight mechanism compensates for the lack of spatial feature correlation in purely temporal connections and improves the model’s generalizability.
8
The bistatic Synthetic Aperture Radar (SAR) system, which employs spatially separated transmitting and receiving platforms, provides high-resolution imaging of terrestrial and maritime scenes and targets in complex environments. Its advantages include flexible configuration, strong concealment capabilities, high interference resistance, and comprehensive target information acquisition, making it valuable in high-precision remote sensing mapping, covert imaging, and precision strikes. Image processing is critical for obtaining high-resolution Bistatic SAR (BiSAR) images. However, the echo model and characteristics of BiSAR substantially differ from those of traditional monostatic SAR, necessitating specialized image processing methods tailored to various operational modes and configurations. This study examines key challenges and solutions for several BiSAR configurations, including airborne BiSAR, BiSAR with high-speed and highly maneuverable platforms, spaceborne heterogeneous BiSAR, and spaceborne homogeneous BiSAR. This study also addresses motion compensation approaches and moving target imaging in BiSAR systems, reviews relevant domestic and international research advancements, and provides an outlook on future trends in BiSAR image processing. The bistatic Synthetic Aperture Radar (SAR) system, which employs spatially separated transmitting and receiving platforms, provides high-resolution imaging of terrestrial and maritime scenes and targets in complex environments. Its advantages include flexible configuration, strong concealment capabilities, high interference resistance, and comprehensive target information acquisition, making it valuable in high-precision remote sensing mapping, covert imaging, and precision strikes. Image processing is critical for obtaining high-resolution Bistatic SAR (BiSAR) images. However, the echo model and characteristics of BiSAR substantially differ from those of traditional monostatic SAR, necessitating specialized image processing methods tailored to various operational modes and configurations. This study examines key challenges and solutions for several BiSAR configurations, including airborne BiSAR, BiSAR with high-speed and highly maneuverable platforms, spaceborne heterogeneous BiSAR, and spaceborne homogeneous BiSAR. This study also addresses motion compensation approaches and moving target imaging in BiSAR systems, reviews relevant domestic and international research advancements, and provides an outlook on future trends in BiSAR image processing.
9
Synthetic Aperture Radar (SAR) is widely used in military and civilian applications, with intelligent target interpretation of SAR images being a crucial component of SAR applications. Vision-Language Models (VLMs) play an important role in SAR target interpretation. By incorporating natural language understanding, VLMs effectively address the challenges posed by large intraclass variability in target characteristics and the scarcity of high-quality labeled samples, thereby advancing the field from purely visual interpretation toward semantic understanding of targets. Drawing upon our team’s extensive research experience in SAR target interpretation theory, algorithms, and applications, this paper provides a comprehensive review of intelligent SAR target interpretation based on VLMs. We provide an in-depth analysis of existing challenges and tasks, summarize the current state of research, and compile available open-source datasets. Furthermore, we systematically outline the evolution, ranging from task-specific VLMs to contrastive-, conversational-, and generative-based VLMs and foundational models. Finally, we discuss the latest challenges and future outlooks in SAR target interpretation by VLMs. Synthetic Aperture Radar (SAR) is widely used in military and civilian applications, with intelligent target interpretation of SAR images being a crucial component of SAR applications. Vision-Language Models (VLMs) play an important role in SAR target interpretation. By incorporating natural language understanding, VLMs effectively address the challenges posed by large intraclass variability in target characteristics and the scarcity of high-quality labeled samples, thereby advancing the field from purely visual interpretation toward semantic understanding of targets. Drawing upon our team’s extensive research experience in SAR target interpretation theory, algorithms, and applications, this paper provides a comprehensive review of intelligent SAR target interpretation based on VLMs. We provide an in-depth analysis of existing challenges and tasks, summarize the current state of research, and compile available open-source datasets. Furthermore, we systematically outline the evolution, ranging from task-specific VLMs to contrastive-, conversational-, and generative-based VLMs and foundational models. Finally, we discuss the latest challenges and future outlooks in SAR target interpretation by VLMs.
10
In recent years, the rapid development of Multimodal Large Language Models (MLLMs) and their applications in earth observation have garnered significant attention. Earth observation MLLMs achieve deep integration of multimodal information, including optical imagery, Synthetic Aperture Radar (SAR) imagery, and textual data, through the design of bridging mechanisms between large language models and vision models, combined with joint training strategies. This integration facilitates a paradigm shift in intelligent earth observation interpretation—from shallow semantic matching to higher-level understanding based on world knowledge. In this study, we systematically review the research progress in the applications of MLLMs in earth observation, specifically examining the development of Earth Observation MLLMs (EO-MLLMs), which provides a foundation for future research directions. Initially, we discuss the concept of EO-MLLMs and review their development in chronological order. Subsequently, we provide a detailed analysis and statistical summary of the proposed architectures, training methods, applications, and corresponding benchmark datasets, along with an introduction to Earth Observation Agents (EO-Agent). Finally, we summarize the research status of EO-MLLMs and discuss future research directions. In recent years, the rapid development of Multimodal Large Language Models (MLLMs) and their applications in earth observation have garnered significant attention. Earth observation MLLMs achieve deep integration of multimodal information, including optical imagery, Synthetic Aperture Radar (SAR) imagery, and textual data, through the design of bridging mechanisms between large language models and vision models, combined with joint training strategies. This integration facilitates a paradigm shift in intelligent earth observation interpretation—from shallow semantic matching to higher-level understanding based on world knowledge. In this study, we systematically review the research progress in the applications of MLLMs in earth observation, specifically examining the development of Earth Observation MLLMs (EO-MLLMs), which provides a foundation for future research directions. Initially, we discuss the concept of EO-MLLMs and review their development in chronological order. Subsequently, we provide a detailed analysis and statistical summary of the proposed architectures, training methods, applications, and corresponding benchmark datasets, along with an introduction to Earth Observation Agents (EO-Agent). Finally, we summarize the research status of EO-MLLMs and discuss future research directions.
11
Non-Line-Of-Sight (NLOS) millimeter wave radar 3D imaging leverages electromagnetic wave propagation characteristics such as reflection, diffraction, scattering, and penetration to detect, locate, and image hidden targets in occluded environments. It holds significant potential for applications in autonomous driving, disaster rescue, and urban warfare. However, uncertainties introduced by reflection surfaces and occluding objects in practical NLOS scenarios, such as phase errors, aperture shadowing, and multipath effect, lead to issues like blurred imaging and increased artifacts in radar imaging. To address these challenges, this study proposes a 3D imaging method for NLOS millimeter wave radar based on Range Migration (RM) operator learning, leveraging the adaptive optimization properties of deep unfolding networks and prior environmental perception. First, a 3D imaging model for NLOS millimeter wave radar in Looking Around Corner (LAC) scenarios is established. An RM kernel operator is introduced to enhance imaging efficiency and reduce computational complexity. Second, a high-precision NLOS 3D imaging network is constructed based on the Fast Iterative Shrinkage/Thresholding Algorithm (FISTA) framework. Utilizing features specific to NLOS scenes and designing algorithm parameters as functions of network weights, the method achieves high-precision, high-efficiency 3D reconstruction of NLOS targets. Finally, a near-field NLOS millimeter wave radar imaging platform is developed. Experimental validations are performed on targets, including metal letters “O” and “S”, an Eiffel Tower model, and an artificial satellite model, under both ideal and non-ideal reflection surface conditions. The results demonstrate that the proposed method significantly improves 3D imaging precision, achieving a two-orders-of-magnitude increase in computational speed over traditional sparse imaging algorithms. Non-Line-Of-Sight (NLOS) millimeter wave radar 3D imaging leverages electromagnetic wave propagation characteristics such as reflection, diffraction, scattering, and penetration to detect, locate, and image hidden targets in occluded environments. It holds significant potential for applications in autonomous driving, disaster rescue, and urban warfare. However, uncertainties introduced by reflection surfaces and occluding objects in practical NLOS scenarios, such as phase errors, aperture shadowing, and multipath effect, lead to issues like blurred imaging and increased artifacts in radar imaging. To address these challenges, this study proposes a 3D imaging method for NLOS millimeter wave radar based on Range Migration (RM) operator learning, leveraging the adaptive optimization properties of deep unfolding networks and prior environmental perception. First, a 3D imaging model for NLOS millimeter wave radar in Looking Around Corner (LAC) scenarios is established. An RM kernel operator is introduced to enhance imaging efficiency and reduce computational complexity. Second, a high-precision NLOS 3D imaging network is constructed based on the Fast Iterative Shrinkage/Thresholding Algorithm (FISTA) framework. Utilizing features specific to NLOS scenes and designing algorithm parameters as functions of network weights, the method achieves high-precision, high-efficiency 3D reconstruction of NLOS targets. Finally, a near-field NLOS millimeter wave radar imaging platform is developed. Experimental validations are performed on targets, including metal letters “O” and “S”, an Eiffel Tower model, and an artificial satellite model, under both ideal and non-ideal reflection surface conditions. The results demonstrate that the proposed method significantly improves 3D imaging precision, achieving a two-orders-of-magnitude increase in computational speed over traditional sparse imaging algorithms.
12
Passive radar plays an important role in early warning detection and Low Slow Small (LSS) target detection. Due to the uncontrollable source of passive radar signal radiations, target characteristics are more complex, which makes target detection and identification extremely difficult. In this paper, a passive radar LSS detection dataset (LSS-PR-1.0) is constructed, which contains the radar echo signals of four typical sea and air targets, namely helicopters, unmanned aerial vehicles, speedboats, and passenger ships, as well as sea clutter data at low and high sea states. It provides data support for radar research. In terms of target feature extraction and analysis, the singular-value-decomposition sea-clutter-suppression method is first adopted to remove the influence of the strong Bragg peak of sea clutter on target echo. On this basis, four categories of ten multi-domain feature extraction and analysis methods are proposed, including time-domain features (relative average amplitude), frequency-domain features (spectral features, Doppler waterfall plot, and range Doppler features), time-frequency-domain features, and motion features (heading difference, trajectory parameters, speed variation interval, speed variation coefficient, and acceleration). Based on the actual measurement data, a comparative analysis is conducted on the characteristics of four types of sea and air targets, summarizing the patterns of various target characteristics and laying the foundation for subsequent target recognition. Passive radar plays an important role in early warning detection and Low Slow Small (LSS) target detection. Due to the uncontrollable source of passive radar signal radiations, target characteristics are more complex, which makes target detection and identification extremely difficult. In this paper, a passive radar LSS detection dataset (LSS-PR-1.0) is constructed, which contains the radar echo signals of four typical sea and air targets, namely helicopters, unmanned aerial vehicles, speedboats, and passenger ships, as well as sea clutter data at low and high sea states. It provides data support for radar research. In terms of target feature extraction and analysis, the singular-value-decomposition sea-clutter-suppression method is first adopted to remove the influence of the strong Bragg peak of sea clutter on target echo. On this basis, four categories of ten multi-domain feature extraction and analysis methods are proposed, including time-domain features (relative average amplitude), frequency-domain features (spectral features, Doppler waterfall plot, and range Doppler features), time-frequency-domain features, and motion features (heading difference, trajectory parameters, speed variation interval, speed variation coefficient, and acceleration). Based on the actual measurement data, a comparative analysis is conducted on the characteristics of four types of sea and air targets, summarizing the patterns of various target characteristics and laying the foundation for subsequent target recognition.
13
Millimeter-wave radar is increasingly being adopted for smart home systems, elder care, and surveillance monitoring, owing to its adaptability to environmental conditions, high resolution, and privacy-preserving capabilities. A key factor in effectively utilizing millimeter-wave radar is the analysis of point clouds, which are essential for recognizing human postures. However, the sparse nature of these point clouds poses significant challenges for accurate and efficient human action recognition. To overcome these issues, we present a 3D point cloud dataset tailored for human actions captured using millimeter-wave radar (mmWave-3DPCHM-1.0). This dataset is enhanced with advanced data processing techniques and cutting-edge human action recognition models. Data collection is conducted using Texas Instruments (TI)’s IWR1443-ISK and Vayyar’s vBlu radio imaging module, covering 12 common human actions, including walking, waving, standing, and falling. At the core of our approach is the Point EdgeConv and Transformer (PETer) network, which integrates edge convolution with transformer models. For each 3D point cloud frame, PETer constructs a locally directed neighborhood graph through edge convolution to extract spatial geometric features effectively. The network then leverages a series of Transformer encoding models to uncover temporal relationships across multiple point cloud frames. Extensive experiments reveal that the PETer network achieves exceptional recognition rates of 98.77% on the TI dataset and 99.51% on the Vayyar dataset, outperforming the traditional optimal baseline model by approximately 5%. With a compact model size of only 1.09 MB, PETer is well-suited for deployment on edge devices, providing an efficient solution for real-time human action recognition in resource-constrained environments. Millimeter-wave radar is increasingly being adopted for smart home systems, elder care, and surveillance monitoring, owing to its adaptability to environmental conditions, high resolution, and privacy-preserving capabilities. A key factor in effectively utilizing millimeter-wave radar is the analysis of point clouds, which are essential for recognizing human postures. However, the sparse nature of these point clouds poses significant challenges for accurate and efficient human action recognition. To overcome these issues, we present a 3D point cloud dataset tailored for human actions captured using millimeter-wave radar (mmWave-3DPCHM-1.0). This dataset is enhanced with advanced data processing techniques and cutting-edge human action recognition models. Data collection is conducted using Texas Instruments (TI)’s IWR1443-ISK and Vayyar’s vBlu radio imaging module, covering 12 common human actions, including walking, waving, standing, and falling. At the core of our approach is the Point EdgeConv and Transformer (PETer) network, which integrates edge convolution with transformer models. For each 3D point cloud frame, PETer constructs a locally directed neighborhood graph through edge convolution to extract spatial geometric features effectively. The network then leverages a series of Transformer encoding models to uncover temporal relationships across multiple point cloud frames. Extensive experiments reveal that the PETer network achieves exceptional recognition rates of 98.77% on the TI dataset and 99.51% on the Vayyar dataset, outperforming the traditional optimal baseline model by approximately 5%. With a compact model size of only 1.09 MB, PETer is well-suited for deployment on edge devices, providing an efficient solution for real-time human action recognition in resource-constrained environments.
14
Direction of Arrival (DOA) estimation for low-elevation angle targets is a critical challenge in meter-wave and holographic staring radar systems, as its accuracy directly affects target height measurement performance. Traditional beamspace methods reduce computational complexity by projecting high-dimensional element-space data onto a low-dimensional beamspace using a beamformer. However, this lossy mapping leads to partial information loss, resulting in degraded elevation-angle estimation accuracy compared to that of element-space methods. To address this issue, this study proposes a high-accuracy beamspace DOA estimation method for low-elevation angle targets. First, the Cramér-Rao Bound (CRB) for both element-space and beamspace DOA estimation is derived, and the conditions under which these bounds are equal are analyzed. Since these conditions are difficult to satisfy in practical scenarios, an approximate-condition-based beamformer design strategy is developed to reduce data dimensionality while preserving effective target information. Finally, precise elevation-angle estimation is achieved using the maximum likelihood criterion. Simulation and experimental results show that the proposed method significantly reduces data dimensionality while maintaining estimation accuracy comparable to that of element-space methods at low-elevation angles, clearly outperforming existing beamspace algorithms. Direction of Arrival (DOA) estimation for low-elevation angle targets is a critical challenge in meter-wave and holographic staring radar systems, as its accuracy directly affects target height measurement performance. Traditional beamspace methods reduce computational complexity by projecting high-dimensional element-space data onto a low-dimensional beamspace using a beamformer. However, this lossy mapping leads to partial information loss, resulting in degraded elevation-angle estimation accuracy compared to that of element-space methods. To address this issue, this study proposes a high-accuracy beamspace DOA estimation method for low-elevation angle targets. First, the Cramér-Rao Bound (CRB) for both element-space and beamspace DOA estimation is derived, and the conditions under which these bounds are equal are analyzed. Since these conditions are difficult to satisfy in practical scenarios, an approximate-condition-based beamformer design strategy is developed to reduce data dimensionality while preserving effective target information. Finally, precise elevation-angle estimation is achieved using the maximum likelihood criterion. Simulation and experimental results show that the proposed method significantly reduces data dimensionality while maintaining estimation accuracy comparable to that of element-space methods at low-elevation angles, clearly outperforming existing beamspace algorithms.
15
A large-scale Vision-Language Model (VLM) pre-trained on massive image-text datasets performs well when processing natural images. However, there are two major challenges in applying it to Synthetic Aperture Radar (SAR) images: (1) The high cost of high-quality text annotation limits the construction of SAR image-text paired datasets, and (2) The considerable differences in image features between SAR images and optical natural images increase the difficulty of cross-domain knowledge transfer. To address these problems, this study developed a knowledge transfer method for VLM tailored to SAR images. First, this study leveraged paired SAR and optical remote sensing images and employed a generative VLM to automatically produce textual descriptions of the optical images, thereby indirectly constructing a low-cost SAR-text paired dataset. Second, a two-stage transfer strategy was designed to address the large domain discrepancy between natural and SAR images, reducing the difficulty of each transfer stage. Finally, experimental validation was conducted through the zero-shot scene classification, image retrieval, and object recognition of SAR images. The results demonstrated that the proposed method enables effective knowledge transfer from a large-scale VLM to the SAR image domain. A large-scale Vision-Language Model (VLM) pre-trained on massive image-text datasets performs well when processing natural images. However, there are two major challenges in applying it to Synthetic Aperture Radar (SAR) images: (1) The high cost of high-quality text annotation limits the construction of SAR image-text paired datasets, and (2) The considerable differences in image features between SAR images and optical natural images increase the difficulty of cross-domain knowledge transfer. To address these problems, this study developed a knowledge transfer method for VLM tailored to SAR images. First, this study leveraged paired SAR and optical remote sensing images and employed a generative VLM to automatically produce textual descriptions of the optical images, thereby indirectly constructing a low-cost SAR-text paired dataset. Second, a two-stage transfer strategy was designed to address the large domain discrepancy between natural and SAR images, reducing the difficulty of each transfer stage. Finally, experimental validation was conducted through the zero-shot scene classification, image retrieval, and object recognition of SAR images. The results demonstrated that the proposed method enables effective knowledge transfer from a large-scale VLM to the SAR image domain.
16
Research on target recognition using radar High-Resolution Range Profiles (HRRPs) is extensive and diverse in methodology. In particular, the application and development of deep learning to radar HRRP target recognition have enabled efficient, precise target perception directly from radar echoes. However, deep learning-based recognition networks rely on large amounts of training data. For non-cooperative targets, due to limited radar system parameters and rapid target attitude variations, acquiring adequate HRRP training samples that comprehensively cover target attitudes in advance is difficult in practice. Consequently, deep recognition networks are prone to overfitting and exhibit considerably degraded generalization capability. To address these issues, and given the ease of obtaining full-attitude electromagnetic simulation data for the target, this paper leverages simulated data as auxiliary information to mitigate the small-sample-size problem through data augmentation and cross-domain knowledge-transfer learning. For data augmentation, based on the analysis of differences in mean and variance between simulated and measured HRRPs within a given attitude-angle range, a linear transformation is applied to a set of simulated HRRPs spanning the same angular domain as a small set of measured HRRPs. This adjustment ensures that the simulated data’s mean and variance match the characteristics of the measured HRRPs, thereby achieving data augmentation that approximates the true distributional properties of HRRPs. Meanwhile, for cross-domain knowledge transfer learning, the proposed method introduces a domain alignment strategy based on generative adversarial constraints and a class alignment strategy based on contrastive learning constraints. These approaches draw the domain features of full-attitude simulation—strong discriminability and generalizability—closer to the measured domain features on a class-by-class basis, thereby further aiding learning from the measured domain data and leading to substantial improvements in few-shot recognition performance. Experimental results based on electromagnetic simulated and measured HRRP data for three and ten types of aircraft and ground vehicle targets, respectively, demonstrate that the proposed method yields superior recognition robustness compared with existing few-shot recognition methods. Research on target recognition using radar High-Resolution Range Profiles (HRRPs) is extensive and diverse in methodology. In particular, the application and development of deep learning to radar HRRP target recognition have enabled efficient, precise target perception directly from radar echoes. However, deep learning-based recognition networks rely on large amounts of training data. For non-cooperative targets, due to limited radar system parameters and rapid target attitude variations, acquiring adequate HRRP training samples that comprehensively cover target attitudes in advance is difficult in practice. Consequently, deep recognition networks are prone to overfitting and exhibit considerably degraded generalization capability. To address these issues, and given the ease of obtaining full-attitude electromagnetic simulation data for the target, this paper leverages simulated data as auxiliary information to mitigate the small-sample-size problem through data augmentation and cross-domain knowledge-transfer learning. For data augmentation, based on the analysis of differences in mean and variance between simulated and measured HRRPs within a given attitude-angle range, a linear transformation is applied to a set of simulated HRRPs spanning the same angular domain as a small set of measured HRRPs. This adjustment ensures that the simulated data’s mean and variance match the characteristics of the measured HRRPs, thereby achieving data augmentation that approximates the true distributional properties of HRRPs. Meanwhile, for cross-domain knowledge transfer learning, the proposed method introduces a domain alignment strategy based on generative adversarial constraints and a class alignment strategy based on contrastive learning constraints. These approaches draw the domain features of full-attitude simulation—strong discriminability and generalizability—closer to the measured domain features on a class-by-class basis, thereby further aiding learning from the measured domain data and leading to substantial improvements in few-shot recognition performance. Experimental results based on electromagnetic simulated and measured HRRP data for three and ten types of aircraft and ground vehicle targets, respectively, demonstrate that the proposed method yields superior recognition robustness compared with existing few-shot recognition methods.
17

Flying birds and Unmanned Aerial Vehicles (UAVs) are typical “low, slow, and small” targets with low observability. The need for effective monitoring and identification of these two targets has become urgent and must be solved to ensure the safety of air routes and urban areas. There are many types of flying birds and UAVs that are characterized by low flying heights, strong maneuverability, small radar cross-sectional areas, and complicated detection environments, which are posing great challenges in target detection worldwide. “Visible (high detection ability) and clear-cut (high recognition probability)” methods and technologies must be developed that can finely describe and recognize UAVs, flying birds, and “low-slow-small” targets. This paper reviews the recent progress in research on detection and recognition technologies for rotor UAVs and flying birds in complex scenes and discusses effective detection and recognition methods for the detection of birds and drones, including echo modeling and recognition of fretting characteristics, the enhancement and extraction of maneuvering features in ubiquitous observation mode, distributed multi-view features fusion, differences in motion trajectories, and intelligent classification via deep learning. Lastly, the problems of existing research approaches are summarized, and we consider the future development prospects of target detection and recognition technologies for flying birds and UAVs in complex scenarios.

Flying birds and Unmanned Aerial Vehicles (UAVs) are typical “low, slow, and small” targets with low observability. The need for effective monitoring and identification of these two targets has become urgent and must be solved to ensure the safety of air routes and urban areas. There are many types of flying birds and UAVs that are characterized by low flying heights, strong maneuverability, small radar cross-sectional areas, and complicated detection environments, which are posing great challenges in target detection worldwide. “Visible (high detection ability) and clear-cut (high recognition probability)” methods and technologies must be developed that can finely describe and recognize UAVs, flying birds, and “low-slow-small” targets. This paper reviews the recent progress in research on detection and recognition technologies for rotor UAVs and flying birds in complex scenes and discusses effective detection and recognition methods for the detection of birds and drones, including echo modeling and recognition of fretting characteristics, the enhancement and extraction of maneuvering features in ubiquitous observation mode, distributed multi-view features fusion, differences in motion trajectories, and intelligent classification via deep learning. Lastly, the problems of existing research approaches are summarized, and we consider the future development prospects of target detection and recognition technologies for flying birds and UAVs in complex scenarios.

18

As one of the core components of Advanced Driver Assistance Systems (ADAS), automotive millimeter-wave radar has become the focus of scholars and manufacturers at home and abroad because it has the advantages of all-day and all-weather operation, miniaturization, high integration, and key sensing capabilities. The core performance indicators of the automotive millimeter-wave radar are distance, speed, angular resolution, and field of view. Accuracy, cost, real-time and detection performance, and volume are the key issues to be considered. The increasing performance requirements pose several challenges for the signal processing of millimeter-wave radar systems. Radar signal processing technology is crucial for improving radar performance to meet more stringent requirements. Obtaining dense radar point clouds, generating accurate radar imaging results, and mitigating mutual interference among multiple radar systems are the key points and the foundation for subsequent tracking, recognition, and other applications. Therefore, this paper discusses the practical application of the automotive millimeter-wave radar system based on the key technologies of signal processing, summarizes relevant research results, and mainly discusses the topics of point cloud imaging processing, synthetic aperture radar imaging processing, and interference suppression. Finally, herein, we summarize the research status at home and abroad. Moreover, future development trends for automotive millimeter-wave radar systems are forecast with the hope of enlightening readers in related fields.

As one of the core components of Advanced Driver Assistance Systems (ADAS), automotive millimeter-wave radar has become the focus of scholars and manufacturers at home and abroad because it has the advantages of all-day and all-weather operation, miniaturization, high integration, and key sensing capabilities. The core performance indicators of the automotive millimeter-wave radar are distance, speed, angular resolution, and field of view. Accuracy, cost, real-time and detection performance, and volume are the key issues to be considered. The increasing performance requirements pose several challenges for the signal processing of millimeter-wave radar systems. Radar signal processing technology is crucial for improving radar performance to meet more stringent requirements. Obtaining dense radar point clouds, generating accurate radar imaging results, and mitigating mutual interference among multiple radar systems are the key points and the foundation for subsequent tracking, recognition, and other applications. Therefore, this paper discusses the practical application of the automotive millimeter-wave radar system based on the key technologies of signal processing, summarizes relevant research results, and mainly discusses the topics of point cloud imaging processing, synthetic aperture radar imaging processing, and interference suppression. Finally, herein, we summarize the research status at home and abroad. Moreover, future development trends for automotive millimeter-wave radar systems are forecast with the hope of enlightening readers in related fields.

19
In recent years, with the increasing diversification of mission requirements, radar imaging has expanded from conventional side-looking and squint-looking modes to the forward-looking mode. In this regard, the monopulse imaging method offers several advantages, including its forward-looking imaging capability, real-time processing ability, and effective anti-jamming performance. These features can help efficiently overcome the problems faced by conventional imaging methods, i.e., low azimuth resolution and Doppler ambiguity in the forward-looking region. Hence, this method has emerged as a key solution to these challenges. This study first explores the distinction between monopulse tracking and monopulse imaging, followed by a systematic review of the existing technical approaches and evaluation metrics for monopulse imaging. Subsequently, the performance of different methods is analyzed, and specific applications of monopulse imaging technology in various scenarios are introduced, including three-dimensional imaging, moving target localization and imaging, and multi-view image fusion. The paper ends with a discussion of the development trends of monopulse imaging technology and an analysis of future research directions, such as imaging quality improvement and the expansion of the application scope. In recent years, with the increasing diversification of mission requirements, radar imaging has expanded from conventional side-looking and squint-looking modes to the forward-looking mode. In this regard, the monopulse imaging method offers several advantages, including its forward-looking imaging capability, real-time processing ability, and effective anti-jamming performance. These features can help efficiently overcome the problems faced by conventional imaging methods, i.e., low azimuth resolution and Doppler ambiguity in the forward-looking region. Hence, this method has emerged as a key solution to these challenges. This study first explores the distinction between monopulse tracking and monopulse imaging, followed by a systematic review of the existing technical approaches and evaluation metrics for monopulse imaging. Subsequently, the performance of different methods is analyzed, and specific applications of monopulse imaging technology in various scenarios are introduced, including three-dimensional imaging, moving target localization and imaging, and multi-view image fusion. The paper ends with a discussion of the development trends of monopulse imaging technology and an analysis of future research directions, such as imaging quality improvement and the expansion of the application scope.
20
Detecting targets despite sea clutter is crucial in military and civilian applications. In complex marine environments, sea clutter exhibits target-like spikes and inherently broad-spectrum characteristics, posing a significant challenge for marine radars in detecting Low-Slow-Small (LSS) targets and leading to high false alarm rates. In this study, an S-band holographic staring radar with high-Doppler and high-range-resolution capabilities (i.e., “dual-high” capability) was utilized in sea detection experiments. We obtained sea clutter data, LSS target data (over the sea surface and in the air), ground truth data on target positions and trajectories, as well as wind and wave data. Using these data, we constructed an S-band holographic staring radar dataset for low-observable targets at sea. The time-domain, frequency-domain, and time-Doppler characteristics of the dataset were analyzed, and the results served as a reference for data utilization. Future work will involve continuing experiments to expand the maritime experimental environment (e.g., sea state and region) and target types toward enhancing data diversity. This open dataset will support the enhancement of new radar systems for detecting low-observable targets at sea and improving maritime target detection and recognition performance. Detecting targets despite sea clutter is crucial in military and civilian applications. In complex marine environments, sea clutter exhibits target-like spikes and inherently broad-spectrum characteristics, posing a significant challenge for marine radars in detecting Low-Slow-Small (LSS) targets and leading to high false alarm rates. In this study, an S-band holographic staring radar with high-Doppler and high-range-resolution capabilities (i.e., “dual-high” capability) was utilized in sea detection experiments. We obtained sea clutter data, LSS target data (over the sea surface and in the air), ground truth data on target positions and trajectories, as well as wind and wave data. Using these data, we constructed an S-band holographic staring radar dataset for low-observable targets at sea. The time-domain, frequency-domain, and time-Doppler characteristics of the dataset were analyzed, and the results served as a reference for data utilization. Future work will involve continuing experiments to expand the maritime experimental environment (e.g., sea state and region) and target types toward enhancing data diversity. This open dataset will support the enhancement of new radar systems for detecting low-observable targets at sea and improving maritime target detection and recognition performance.
  • First
  • Prev
  • 1
  • 2
  • 3
  • 4
  • 5
  • Last
  • Total:5
  • To
  • Go