Area Efficient Implementation of MTI Processing Module on a Reconfigurable Platform

This paper presents an area efficient Field Programmable Gate Array (FPGA) based digital design of a processing module for MTI radar. Signal contaminated with noise and clutter is modelled to test the efficacy of the design algorithms. For flexibility of design and to achieve optimized results, we have combined the high-level utility of MATLAB with the flexibility and optimization on FPGA for this implementation. Twoand three-pulse cancellers are chosen for design due to its simplicity in both concept and implementation. The results obtained are efficient in terms of enhanced throughput per Slice (TPA) of 1.146, that is, occupying fewer area resources on hardware while achieving optimized speed. The outcomes show that this design of MTI radar processor has many advantages, such as high processing precision, strong processing ability, real time, and low cost. All these advantages greatly contribute to the design requirements and make it appropriate for the application of high-speed signal processing.


Introduction
In radar systems, it is often desirable to be able to differentiate between moving targets and those that are stationary. This information can greatly serve operators when the surveillance targets of interest, such as vehicles or aircraft, are located in environments with a high density of traffic, clutter, stationary, and slow moving targets. Fortunately, through the use of digital signal processing, it is possible to extract this information from the system in real time using a Moving Target Indicator (MTI) filter [1]. Modern radar systems have evolving requirements, both in how the systems are designed and how the end user uses the data. This results in design change in electronic systems affecting both the military and commercial design communities, that is, the need for smaller, energyefficient systems with high processing-power requirements. This makes low power consumption key drivers in most designs of high speed systems. More digital logic also allows designers to make early decisions on actionable intelligence and to meta-tag sensor data earlier for more efficient analysis. These and other emerging techniques will allow for the creation of better radar systems, but each requires additional signal-processing resources. One of these resources is the emerging class of high-performance FPGAs.
Moving Target Indicator (MTI) radars are developed and used extensively to detect and follow specific moving targets and eliminate clutters. Processing system in the radar is massive and complex, since it is required to perform at high processing speed. Improvement in FPGAs component density and performance capability has resulted in more signalprocessing functions being incorporated and migrated to the front end of the radar system containing the exciter/receiver [2]. In past systems, conventional digital signal processors were used to perform many of these algorithms. However, Field-Programmable Gate Arrays (FPGA) deliver an order of magnitude higher performance than traditional DSPs. Several DSP functions are needed for radar signal processing in the receiver. Each function should be closely examined to determine whether the application will show substantial speed and performance improvements through implementation in an FPGA. These functions can be efficiently implemented using an FPGA embedded processor, even for highly complex and adaptive operations. Our target will be to combine the capabilities of hardware and software to implement the original MTI receiver to generate a true bit accurate model and validate it using Simulink. The contribution factor of this paper is the design strategy that invests more time in thorough simulation based verification of design concepts to save time rather than on postsynthesis issues. This focus has helped us simplify the design to more optimal levels on reconfigurable hardware, that is, FPGA. This enables us to minimize the hardware delivery time by translating the concept into a fully functional, implementation ready design optimized for target embedded platform as per requirement.
The rest of the paper is arranged as follows. In Section 2 a detailed literature review has been given that discusses the contribution factor of this work as compared to previous implementations. Section 3 describes the basic characteristics of MTI processing. Section 4 covers design architecture on FPGA. This also includes the integration of software and hardware platforms to achieve optimized results. Section 5 includes results of design implementation. Section 6 pertains to the conclusions that are arrived at, after combining the software and hardware computations.

A Literature Review
High-speed, real time, and accuracy have always been the significant requirements of radar signal processing in its application area. Using FPGA cannot only meet these needs, but also improve reliability of system and strengthen universality of unit, cut cycle, and cost of the system development, and so forth. This paper has completed the design of an MTI radar module in which a FPGA is the core, primarily elucidating the principles of radar applications, filtering amplification of radar signal processing, / sampling, algorithm and FPGA realization. The literature survey does not reveal any application that discusses a completely FPGA based implementation of an MTI processor. However, some implementations present FPGA's work alongside DSPs that is a semiparallel architecture to perform MTI processing, implemented by Ali et al. [3]. An architecture utilizing a DSP-FPGA combination for clutter processing has been proposed by Xu et al. [4], which involves the FPGA writing the successive returns into an SRAM chip and DSP performing the computations. Different FPGA implementations for FIR filters have been proposed in [5,6]; both of these implementations focus primarily on area reduction while trying to achieve reasonable speeds. Stapleton et al. [7] have examined the possibilities of enhancing radar signal processing algorithms using FPGAs. Li and Wang [8] have given radar signal processor design on FPGA that focuses on filtering amplification, / sampling algorithm, and realization of FFT. Reference [9] has proposed a timeshared approach requiring about 1.2% slices of an XC2V6000 for implementation of a dynamically reconfigurable Pulse Doppler radar in a mixed system comprising a DSP and FPGA to perform linear frequency modulation. Reference [10] has given implementation of an IIR filter based moving target indication (MTI) processor but there is a downside to IIR filters in that they do not guarantee a linear phase response. Moreover, stability issues have to be looked after. Bin Khalid et al. [11] have proposed an FPGA based real-time signal processor for Pulse-Doppler radar. Hence it is observed that the radar processors have been implemented either entirely on hardware or entirely on software. The results of this research work done contribute to integrate the individual capabilities of both hardware and software to achieve optimized results. The conclusions show that this design is advantageous in terms of speed and efficient device utilization.

MTI Processing
Doppler processing can be divided into two major classes: Pulse Doppler processing and Moving Target Indication (MTI). When only target detection is of concern, the MTI filter is usually adequate. MTI filters even a first or second order FIR high-pass filter such as 2-or 3-pulse MTI cancellers can be used to filter out the stationary clutter. Figure 1 shows a typical MTI Pulse canceller. Higher order filter types are used as they only provide modest performance improvements over the pulse cancellers [12]. The synchronous detection in Figure  2 shows the complex envelope during the coherent local oscillator demodulation stage in the receiver. Quadrature demodulation avoids loss of signal due to blind phase effects of a single demodulator as well as resolving the phase function of the complex envelope. Once the synchronous detection of the complex envelope is complete, it is processed in two stages: high-pass FIR filters with sampling at the pulse repetition rate and a matched correlator for return code (target) detection with video / sampling at the bit duration interval. The resulting complex signal will be filtered in both the in-phase and quadrature-phase components as defined in:  A steadily moving target moving at will Doppler shift by the carrier frequency , thereby altering the complex envelope of the radar waveform. This is defined in (2), where is the speed of light. The Doppler frequency is simply the time derivative of the complex envelope's phase function, thus emphasizing constant receiver phase offset. This Doppler shift is found by sampling the quadrature phase angle at the pulse repetition frequency. It is this angle , defined in (2) and illustrated in Figure 3 that is measured using consecutive samples at the pulse repetition interval: The set of relations as defined in [1] best describe the phenomena. The reference signal and the target echo signal are mixed in the mixer stage of the receiver. Only the difference frequency voltage component from the mixer is of interest: The fundamental MTI filter component is the high-pass FIR feed-forward pulse canceller. These filters are guaranteed to be stable, that is, all zeroes and no poles in the transfer function. They are easy to implement in direct form using a tapped delay line. A two-pulse canceller as shown in Figure  1 is used if the clutter component (assuming DC only) remains constant in a given range bin and can be eliminated by subtracting the output from two successive pulses. The transfer function of two-pulse canceller is equal to 1 − 1/ and is equivalent to FIR digital filter with magnitude response sin( /2). In practice, the clutter has a power spectrum that covers frequencies above DC. The two-pulse canceller will attenuate low frequency components but may not totally reject clutter. A three-pulse canceller with its transform function equivalent to FIR filter is given in (7). This decreases further the components near DC [13]. Figure 4 shows the unity gain frequency response of the filter: Signal-to-Clutter Ratio (SCR) instead of Signal-to-Noise Ratio (SNR) determines the RADAR's capability to detect targets in a high clutter background. Normally, clutter signal level is much higher than the receiver noise level because clutter echoes are random and have thermal noise-like characteristics. Grazing angle, surface roughness, and the RADAR wavelengths mainly affect the amount of clutter in the RADAR operation [14]. Differentiating target returns from clutter echoes is based on the target RCS and the estimated clutter RCS. Clutter RCS can be defined as the equivalent radar cross section endorsed to reflections from a clutter area, . The average clutter is given by where 0 (dB/m 2 ) is the clutter scattering coefficient. The clutter area and SCR are defined as Clutter-to-Noise (CNR) is defined as where is the peak transmitted power, is the antenna gain, is the wavelength, is Boltzmann's constant, 0 is the effective noise temperature, is the radar operating bandwidth, is the receiver noise figure, and is the total radar losses.
For Gaussian clutter, the clutter and noise can be combined and RADAR measurement can be derived from the signal-to-clutter + noise ratio as given by The optimal goal is to concentrate on clutter rejection; hence, the desired SNR as shown in the top of the graph is quite high. It is also clear that the SCR is well below the SNR level.

Design Implementation
The workflow demonstrated by Figure 6 starts by reading the target information from an array stored in the workspace which in this case is the speed of the target. A MATLABbased mathematical model of the system has been developed and investigated. Part of the data, received as a result from these investigations, has been used for the simulation of the circuit [15] and synthesized in FPGA to verify if the system meets the real-time constraints. The analysis of the results shows their complete coincidence with those received from the mathematical simulation. The gateways are used to do conversion from floating point data type to fixed point data type and vice versa.
The main advantage of proposed structure, that is, the hybrid of software and FPGA hardware, is the capacity whereupon we can easily change the operating factors of any block of the system. If it is required in some applications to increase the precision of the overall system, then the increase in number of bits can be realized in each block hence giving us the flexibility to adjust the parameters of the system and therefore better solutions. The parameters of the design have been chosen carefully so that the radar system is operating  in the band of the electromagnetic spectrum. Table 1 shows process parameter assessed and refined according to design approach. The core of this design relies on the fact that MTI systems contain coherent pulse train signals. The output of the phase detector for moving target varies in amplitude from pulse to pulse as the phase changes between the transmitted and received signals. However there is no change in phase for fixed objects. By making use of this approach signal processing technique like delay line cancellers are used to implement MTI systems. A stationary target at the range of 60 km has been placed to simulate the effect of clutter rejection filter. Also, additive white Gaussian noise (AWGN) has been added to account for receiver noise introduced in actual systems. The simulated radar parameters yield an unambiguous range of approximately 100 Nm and detection of targets with maximum radial velocity of no less than 310 m/s. The computational block for the radar signal processor is implemented on Xilinx FPGA Spartan-III E Xc3s500e using Verilog HDL (Hardware Descriptive Language). Fixed point implementation has been done. Each memory location was 16 bit wide; lower 8 bits to store the imaginary part and upper 8 bits to store the real part. The block diagram of radar system implemented on FPGA is shown in Figure 7. The user defined precision has been set for the addition blocks with output type as signed 2's complement 16 bit number with 14 binary points. The implementation field is set by selecting fabric as core parameter. The solver used for simulation is fixed step discrete solver and simulation step size is 1.25e-007 for this design.
The received echo is fed to the phase detector for / channel processing. The / demodulation stage is most Chinese Journal of Engineering commonly digitally implemented as either a Hilbert transform or an in-phase and quadrature mixing operation. The mixing operations are simply digital multiplications by sine and cosine [16]. These are further illustrated by pseudocodes in the following listings.
(A) Pseudocode where is the number of pulses per burst and is the number of range bins per pulse. IF The processing requirements for the two-pulse canceller are thus × × 2 memory reads, subtractions, and memory writes, while the 3-pulse canceller requires an additional × × 2 multiplications and additions [16].
where corresponds to the present pulse.  Our design module 1 implements MTI filter using threepulse canceller realizations as shown in Figure 8. The threepulse canceller is a popular and simple MTI filter. It is an all-zero FIR filter with filter coefficients 1, −2, and 1. Design module 2 implements MTI filtering using single delay line canceller also known as two-pulse canceller as shown in Figure 9. The system throughput is one sample per clock cycle. According to minimum period of 9.787 ns, the maximum frequency is 144.8 MHz.

Results
We have successfully implemented our design module on FPGA. The design takes the target information and approximated Doppler speeds within acceptable threshold. On implementation of designs, in Design module 1 the clock rate achieved is higher while Design module 2 turns out to be an efficient approach in terms of achieving effective area utilization on FPGA. Also the results obtained are resourceful in terms of enhanced throughput per slice, that is, occupying fewer area resources on hardware while, achieving optimized speed, TPA of 1.146 for our design module 2 is much better than other implementations [3], which is considered as a significant achievement. Bin Khalid et al. 's [11] implementation on Xilinx Virtex-II Pro (ML300) took less than 90 s hence resulting in slower performance of overall system. These premises give us a subjective idea of the performance since there are not similar architectures reported. Table 2 gives a comparative analysis of the device utilizations and clock details of our designs with other works [3,9,11]. Results indicate that our design gives more area optimization. The filtered signal by the proposed architecture is shown in Figure  10 with clutter being suppressed.   [3] Virtex-4 SX35 177.70 1382 0.128 Other work [11] Virtex-II Pro (ML-300) 0.11 1468 -Other work [9] XC2V6000 102.17 811 -Bold values emphasize superiority of results.

Conclusions
Our research emphasizes on designing digitized MTI processing modules that have been efficiently tested and verified on FPGA. This research contribution has integrated the capabilities of both hardware and software to achieve optimized results both in terms of device utilization and maximum achievable frequency. In the future, this work can be optimized further by carrying out all the phases of design on Verilog HDL. The Verilog HDL code can be optimized to achieve the desired results in lesser time and fewer storage elements. Also, the newer FPGA families being introduced in the market offer better performance and more storage elements to enhance performance and yield better results.