Source Geolocation in Urban Environments Using Multipath Fingerprinting

A method for determining the location of Global Systems for Mobile Communications (GSM) mobile transmitters is proposed. Our approach estimates the location of a source without the use of multilateration or Line-of-Sight (LOS) techniques. A Multipath Characteristic Database (MCD) containing the multipath feature vectors, for each possible transmitter location within an area of interest, is populated via ray-tracing software simulations. The multipath characteristics of interest are angle-of-arrival (AOA) (azimuth) and time-of-arrival (TOA). By minimizing the “distance” between estimated and simulated multipath feature vectors, an estimate for the actual source location can be obtained. The development of the estimation method is presented, followed by a detailed analysis of its estimation accuracy. Since the proposed method utilizes a simulated multipath signature database based upon the knowledge of the environment and the terrain, the need for a priori soundings from the area of interest is eliminated, thus making this location estimation system suitable for application in denied territories. Location accuracies compare favorably with the requirements for the location of wireless 9-1-1 callers as recommended by the Federal Communications Commission (FCC).


Introduction
Spatial localization of cellular emitters in dense urban environments can provide a valuable tool for a variety of users such as emergency services, law enforcement, and military personnel.Navigation, social media, and location-dependent searching would also benefit from advances in localization techniques.The ability to use Non-Line-of-Sight (NLOS) and nonmultilateration techniques would allow the end user to perform the localization from an arbitrary position anywhere within a given range of the target.
The concept of using the multipath fingerprint composed of time-and angle-of-arrival data for wireless location finding in urban environments was developed using electromagnetic ray-tracing techniques and validated using computer-aided design (CAD) models of a real city [1].The fundamental premise of this approach is to extract the features of the multipath signals to create a unique fingerprint, such as angle-of-arrival, time delay, and signal strength, which is then compared to a database of known fingerprints, each corresponding to a known location.A matching fingerprint found in the database provides an estimate for the correct location of the transmitter.A fingerprinting technique using the channel's impulse response information combined with an artificial neural network was developed for geolocation in mines or other confined environments with rough sidewall surfaces with a location accuracy of 2 m [2].A fingerprinting technique exploiting the spatial-temporal characteristics of the multipath signals received by the base station antenna array was proposed in [3].The spatial-temporal fingerprint was based on a lower dimensional subspace of the spatialtemporal covariance matrix capturing the AOAs and the differential delays of the dominant multipath reflections.Localization accuracies of about 1 m were achieved in typical indoor environments.Similar fingerprinting techniques were also proposed and developed for emergency location services [4], Global Navigation Satellite System (GNSS) indoor positioning [5], indoor geolocation for ultrawideband (UWB) systems [6], and channel estimation in multipath-rich mobile communication scenarios [7].

International Journal of Antennas and Propagation
The method discussed here compares simulated and measured signal characteristics based only upon multipath propagation to provide an estimate for the location of a GSM emitter.Our approach is different from the one adopted in [1] in that while their base station is located solely within their area of interest, ours may be located both inside and outside of the area of interest, the latter generally occurring in denied environments.Having the base station in the center yields higher variability in the AOA.For base stations located within the area of interest, AOAs are distributed over a 360-degree spread.However, for base stations located outside the area of interest, AOAs are mostly restricted to be within a 180-degree spread.
The Federal Communications Commission (FCC) recently issued its Third Further Notice of Proposed Rulemaking and proposed the following specific measures in their E911 location accuracy rules to ensure accurate indoor location information [8].According to their guidelines, location accuracies must be within 100 meters for 67 percent of calls and 300 meters for 90 percent of calls for networkbased technologies and within 50 meters for 67 percent of calls and 150 meters for 90 percent of calls for handset-based technologies.Network-based technologies have less stringent requirements compared to handset-based technologies.Our approach addresses the network-based case and, therefore, aims to satisfy the first set of requirements stated above.In addition, our approach avoids the use of received signal strength (RSS) as these are dependent on various unknown factors, such as transmit power, reflectivity of walls, and propagation characteristics, which do not largely impact TOA and AOA.
Through the use of ray-tracing software, a MCD is populated via simulations, which contains a feature vector for possible transmitter locations within an area of interest.The MCD is then used to find possible transmit locations that have similar multipath characteristics to that of the estimated parameters.The multipath characteristic estimators used here are joint angle delay estimation-multiple signal classification (JADE-MUSIC) and JADE Estimation of Signal Parameters via Rotational Invariance Techniques (JADE-ESPRIT).A -Weighted Nearest Neighbor (KWNN) distance metric between the estimated and simulated feature vectors is used to determine the final geolocation estimate.An analysis on the geolocation algorithm's performance is presented which shows the effect of SNR, receiver location, oversampling rate, and number of antenna elements.The computational complexity of the JADE-MUSIC and JADE-ESPRIT is also analyzed.
Although GSM telephony was used in the analysis presented in subsequent sections, our proposed estimation techniques can be applied to all narrowband Time Division Multiple Access (TDMA) systems (which operate in multipathrich environments).This paper discusses the results of an investigation of our proposed localization technique and extends the work previously reported by us in [9].New results reported here include geolocation estimates derived from the JADE-ESPRIT algorithm, as well a comparison between estimates using a receiver located inside and outside the area of interest.A summary of the implemented algorithms is included, as well as result comparisons for various SNRs, oversampling rates, and number of antenna elements used.
This paper is organized as follows.Section 2 provides the linear approximation of GSM signals used in the estimation portion of the localization technique.Section 3 provides an introduction to the JADE estimation techniques.Section 4 discusses the creation of the MCD via ray-tracing software.Section 5 discusses the geolocation fingerprint matching technique.An analysis of the performance of the geolocation estimator is given in Section 6.Finally, Section 7 contains a summary and future work for the development of the proposed method.
A method for representing digital phase modulations via superposition of amplitude modulated pulses was introduced by Laurent in [11]; this method was further investigated for signals with modulation index 1/2 in [12] and for GMSK in [13].Wiesler et al. show that a GMSK signal with the GSM parameters (BT = 0.3,  = 4) can be decomposed into linear and nonlinear parts  lin () +  nlin () [13]: where () is defined as and   is the NRZ data stream.Since   and  , (for  ≥ 1) are only found in the  nlin , we have omitted their definitions here.General expressions for   and  , are given in equations 11 and 13 of [11], respectively.Wiesler et al. show that approximating () solely by its linear term results in a BER performance as good as or (for good SNR) even better than the exact GMSK representation.We further simplify our representation of the GMSK modulated signal by setting the exponential term in  lin of (1) equal to   .Therefore, our linear approximation yields where  0 can be considered to represent the pulse shaping modulation function of a linear modulation scheme.This result is used in JADE techniques presented in subsequent sections.

Joint Angle Delay Estimation (JADE)
The coherence time of the (Rayleigh) fading in the mobile channel is roughly given by  coh = /(V ⋅   ) [14].This yields a coherence time of 160 ms for a mobile moving at a walking speed and 5.6 ms for a mobile moving at a highway speed.
The fading can, therefore, be modeled as time-invariant for a single GSM frame at highway speeds and for up to 30 frames for walking speeds.Subsequently, for a transmitter moving at highway speeds or lower, the wireless channel has path fadings that are constant over multiple GSM time slots.An adaptation to the JADE techniques discussed here is presented in [15], which considers channels with path fadings varying within the duration of a time slot.The angles and delays of the received multipaths vary much slower than the mobile channel fading; therefore, we model these as timeinvariant over many GSM frames [14].
Vanderveen et al. have developed JADE techniques based on the above assumptions.The -multipath channel model for a -element antenna array takes the following form (see [14,16] for a complete analysis): . . .
where a(  ) is the steering vector of the th multipath and A() is the  ×  steering matrix.The diagonal matrix of  values denotes the complex attenuations associated with each multipath, g(  ) denotes the th delayed pulse shaping modulation function, as shown in (2), and G  () is a  ×  matrix containing the  time-delayed pulse shaping functions. defines the length of the channel impulse response in symbol periods, and  is the oversampling factor.Unlike GSM's modulation scheme which can be decomposed into both linear and nonlinear parts, the pulse-shaping modulation function is associated with an entirely linear modulation scheme.The aforementioned linear approximation of the GSM signal was used to accommodate this requirement.

JADE-MUSIC Algorithm. Vanderveen et al. eventually
arrive at a vectorized noisy channel estimate of the following form [16]: = U (, )  () + k () ,  = 1, . . ., , where ∘ denotes a column-wise Kronecker product, k () denotes the estimate noise, and the superscript ⋅ () denotes the th channel estimate.Here U(, ) is time-invariant over  channel estimates, where  is determined based on the stationarity of the AOAs and TOAs for a given multipath scenario.The channel estimation rate is determined by the temporal coherence of the channel path fadings.The channel should be estimated once over the coherence time of the channel path fadings (i.e., the channel can be estimated on the order of a frame for a mobile moving at highway speeds and once every 30 frames for a mobile moving at a walking speed).After estimating  channels, ( 6) can be represented in matrix form as where B = [ (1) ⋅ ⋅ ⋅  () ] and Y and V have similar form.A 2-dimensional MUSIC algorithm can now be applied to find  and .The MUSIC algorithm will result in a joint delay and angle estimation under the following restrictions.
(2) The number of channel estimates used must be greater than the number of multipaths.
(3) The collection time must be longer than the coherence time (for fadings) of the mobile channel.
A significant consequence of these restrictions is that the number of antenna elements needed can be reduced to less than the number of multipaths, if an adequate number of samples are obtained.

JADE-ESPRIT Algorithm.
The JADE-ESPRIT algorithm manipulates the form of the channel estimate matrix so that a 2D ESPRIT-like algorithm can be applied.First, the discrete Fourier transform is applied to the H in (5).The pulse shaping function is then deconvolved from the Fourier-transformed channel estimate (see [16] for details), whereby the channel now satisfies the following model: . . . where Applying a similar vectorization and stacking operation as in the JADE-MUSIC algorithm, the vectorized noisy channel estimate becomes where Ȗ(, ) = F()∘A().Next, a basis of the column span of U(, ) is estimated via the  left singular vectors which correspond to the largest values of covariance matrix, R Y.A 2D ESPRIT algorithm is now applied and estimates for  and  can be obtained.
The restrictions of the JADE-ESPRIT algorithm are slightly stricter than those of the JADE-MUSIC algorithm, in addition to the aforementioned JADE-MUSIC restrictions.The number of multipaths must be less than min(( − 1), ( − 1)), where  is a factor used in the DFT (taken as 1 here).

Multipath Characteristic Database Analysis
Remcom's Wireless InSite ray-tracing software was used to populate the MCD, which contains location-based multipath feature vectors to be used as a basis for a fingerprint matching algorithm.A grid of transmitters was placed over the area of interest and then various multipath characteristics of the received signal are collected for each transmitter location.For the analysis here, only the AOA (azimuth) and TOA were considered for utmost simplicity and for minimizing processing and time requirements; however, the multipath feature vector could be expanded to contain (a) number of received multipaths; (b) AOA (azimuth and elevation) of each multipath; (c) TOA of each multipath; (d) received signal strength (RSS) of each multipath; (e) delay spread amongst rays; and (f) mean and standard deviation of all aforementioned multipath characteristics.Although adding additional features may doubtlessly increase the location accuracy, this would entail higher latency owing to excessive processing requirements.
A model of downtown State College, Pennsylvania, was used in the ray-tracing simulation.The model consisted of concrete buildings placed on top of the appropriate elevation (i.e., landscape).Transmitters were spaced in 3.5meter increments along the and -axes at a height of 2 meters (with respect to the ground).The grid covered a 1 km 2 area and consisted of 81,796 transmitters.The receiver was placed at two different locations represented by Scenario A located outside the area of interest and Scenario B located within the area of interest.For Scenario A, the receiver was placed on top of the Applied Research Laboratory building of The Pennsylvania State University, at a height of 10 meters (approximately 2 meters above the building height).For Scenario B, the receiver was placed on top of the HUB-Robeson Center of The Pennsylvania State University, at a height of 12 meters (approximately 2 meters above the building height).Figure 1 shows the 2D representation of the Wireless InSite model overlaid on a satellite image of State College, PA.The transmit antennas were dipoles aligned perpendicular to the ground, and isotropic antennas were used in the receiver antenna array.A 3D ray-tracing model was used in the creation of the MCD.The ray-tracing model considered building reflections, diffractions, and ground interactions.The model was capable of analyzing LOS, singly diffracted, and doubly diffracted paths.The building faces were chosen to be one-sided, which only allow for transmission from inside the building to outside.This choice was made because the number of transmissions greatly increases the computation time, and multipaths which underwent attenuation through 2 wall transmissions would be significantly weaker than multipaths which did not undergo wall transmissions.The maximum number of reflections allowed was 6, and the maximum number of diffractions allowed was 2; this allows for rays between the receiver and transmitters which are blocked by the height of a building.A simulated GSM signal with center frequency,   , of 1.91 GHz was used.
Figures 2-4 display the diversity of the MCD data obtained via our simulations for Scenario A. Figure 2 shows the TOA of the dominant multipath for each transmitter location.The TOA generally increases as the distance between the receiver and the transmitters increases.Some outliers exist, which represent multipaths which underwent multiple  reflections.Figure 3 shows the AOA in the azimuth plane for the dominant multipath of each transmitter.The AOA (azimuth) is influenced more by the surrounding environment of each transmitter than the distance between transmitter and receiver.Figure 4 displays the received power of the dominant multipath for each transmitter location.Figure 4 shows a strong (negative) correlation between the received signal strength and the distance between the transmitter and receiver, as expected.The properties for each multipath characteristic can be exploited to produce optimal clustering schemes.
Figures 5-7 show similar plots for Scenario B. Figure 5 shows the TOA, Figure 6 the AOA, and Figure 7 the received power.The main differences between these plots and those for Scenario A are the increase in the AOA variation over short distances, and the decrease in the spread of TOA values.Since the transmitter is placed inside the area of interest,  the dominant multipaths from adjacent transmitters are less likely to follow similar paths, due to a lower number of obstructions between the transmitter and receiver.This is evident when comparing Figure 3 with Figure 6, wherein the former contains large areas where the dominant paths (from neighboring transmitters) arrive from similar directions, while the latter has greater variation in AOA over short distances.Scenario A is better suited for our geolocation algorithm, as an error in the AOA estimate may result in a geolocation estimate of a transmitter that is close to the actual transmit location.Furthermore, the TOA variability between neighboring transmitters also appears to be lower for Scenario A than Scenario B. Although it is not always possible, Scenario A is preferred.
Depending on the size of the area of interest and spatial density of the transmitters, the MCD can contain an enormous amount of information.One way to reduce search redundancy and thus improve the location estimation speed is to perform a clustering of the MCD.Owing to the variability in each multipath characteristic over the area of interest, the clustering algorithm should be able to detect clusters of arbitrary shape.If the receiver location is known in advance, the application of clustering techniques on the MCD can be performed prior to taking field measurements.Also, clustering schemes may be more useful than others in different scenarios.For example, if an approximate location prediction is urgently required, then a clustering scheme containing a large amount of data points in each cluster may be more useful.For these reasons, a partitioning clustering algorithm appears to be a good choice to process the MCD, because it can provide 1 to  (number of data points in MCD) different clustering configurations, even though it has the highest computational complexity among the predominant clustering techniques.A more thorough analysis of the clustering of the MCD is given in [17], but it has not been applied to the final geolocation algorithm yet.

Geolocation Algorithm
The method presented here is a fingerprint matching technique, in which a comparison between measured multipath feature vectors and the feature vectors in the reference database is used to determine the best estimate for the location of a transmitter.The fingerprinting localization technique can be decomposed into two phases described below [18].
(1) Calibration.Measured (or simulated) reference multipath feature vectors are collected for possible transmit locations within the area of interest.Each feature vector is stored, along with its corresponding location.
(2) Run-Time.The multipath features for a transmitter at an unknown location are estimated via the aforementioned estimation techniques.A distance metric is then calculated between the unknown transmitter's feature vector and the reference feature vectors found in the calibration phase.The minimum values of the distance metric can then be used to provide estimates for the transmitter's location.
Traditionally, fingerprinting techniques only considered RSS features and real-world measurements were taken to construct the reference database [19].Such an approach was limited for use in indoor or in small areas, because obtaining the reference multipath measurements would be impractical for a large area.Since the method presented here uses ray-tracing software to generate its reference database (the MCD), a larger area of operation is possible.Furthermore, our method allows for the localization of a mobile emitter found in a denied territory or other areas where reference measurements cannot be obtained.
Since the earlier versions of the technique only relied on RSS, multiple stationary access points (or base stations) were needed [18,19].Usually the access points used were preexisting in a wireless network.The RSS from transmitters positioned in reference locations were collected for each access point.A test transmitter then emits from an unknown location, and the RSS is once again collected for each access point.A distance metric is then calculated; the minimization of this metric yields the location estimate.
Reference [18] introduced the use of ray-tracing software and propagation models to construct the RSS reference database, which induced a slight increase in location estimate errors compared to the same approach with measured RSS reference data.This greatly reduced the complexity of the calibration phase and allowed the technique to be used in larger areas.
References [20,21] implemented fingerprinting techniques which consider more multipath characteristics than those found in [18,19].By using the channel impulse response (CIR), these techniques essentially utilize all multipath features, except for the phase of the complex attenuation, which varies greatly over small distances due to small-scale fading.These techniques are similar to the one presented here, except that our technique performs a distance metric on each of the estimated multipath parameters.

Create Received Signals Based on Multipath and Antenna Array Parameters
(i) Create antenna array matrix.
(iv) Compute channel estimates based on JADE-MUSIC and JADE-ESPRIT algorithms.
(v) Add white Gaussian noise with power calculated based on signal power and current SNR.

Perform MUSIC and ESPRIT Algorithms
(i) The MUSIC and ESPRIT Algorithms are described in Section 3.

Calculate the Distance Metric
(i) The distance-metric is described in Section 5.2.

Record Performance Results for Each SNR and Iteration
(i) Determine radius between each estimate and actual location.
(ii) Record number of "hits" for radii of interest.

Distance Metric. Multiple distance metrics have been used in fingerprinting localization techniques. Each has
advantages in different multipath scenarios.Utilization of these metrics is known as instance-based localization as they do not rely on previous location estimates.Intuitively, one may assume that previous location estimates may aid in the estimation of a transmitter in close proximity to a previous estimate.However, Qiu and Kennedy found that implementing a Support Vector Machine (SVM), which is not an instance-based technique, but rather a feed-forward network, did not improve location estimation accuracy [21].
The reason that implementation of the SVM, which is known for its aptitude in solving pattern classification problems, did not improve performance is due to the "small-scale" distribution of multipath characteristics; that is, the wireless channel impulse response fluctuates rapidly over short distances.The distance metric used here was -Weighted Nearest Neighbors (KWNN) given as [21] where In the above equations,  = 2 (i.e., the Euclidean distance was measured),  and  represent the "coordinates" of the possible transmit locations, q is the number of multipaths, and  is the number of multipath characteristics considered.f , denotes the estimated feature vector which contains AOA and TOA for the analysis presented here.MCD (,,,) denotes the feature vector obtained via the ray-tracing software.  is a weight given to each multipath characteristic; since the range of values differs greatly between different multipath characteristics, a weight is needed to ensure that each characteristic's contribution to the final location estimate is appropriate.  is user-defined and can be modified based on the geolocation scenario, that is, the expected range of values for each multipath.The weights used to produce the results found in this paper were 10.14 nS for the TOA parameter and 180 ∘ for the AOA parameter.Δ  and Δ  denote the spatial resolution in the and -directions used when constructing the MCD, respectively.The nearest neighbor metric is the special case of KWNN where  = 1 and   = 1, and -nearest neighbor metric corresponds to   = 1.Various values of  were used in the subsequent analysis, and the   's, which denote the weights, were varied between 1 (NN and KNN) and the inverse of the distance between measured/simulated and reference feature vectors (KWNN).The subscripted  in (Δ    , Δ    ) represents the th calculated coordinates if the  − 1 coordinates were removed from the MCD.

Geolocation Algorithm Analysis
Several different estimation scenarios are shown in this section, followed by statistics of the geolocation algorithm.Sections 6.1-6.4 provide insight in the geolocation algorithm's performance when varying the following attributes: SNR, receiver location, oversampling rate, and number of receiver antennas.

Effect of SNR.
Tables 1-3 display the number of times a location estimate was within a certain radius of the actual transmit location.An oversampling rate 2 and 2 uniform linear arrays (angled 90 ∘ apart) with 8 antennas (spaced a half wavelength apart) were used.The number of trials performed was 100.We see that the ESPRIT algorithm's overall performance is worse than that of the MUSIC algorithm; this is expected since the JADE-ESPRIT algorithm has stricter requirements compared to the JADE-MUSIC algorithm.The results show that the nearest neighbor estimate performs best, but KWNN may be beneficial in real-world experiments since the exact transmitter location may not be included in the MCD, and modeling errors may be present.Overall, these results show that the geolocation technique is conceptually sound.Although a 85% sucess rate (at SNR = 15 dB) at a 100-meter radius may not meet the requirements of practical systems, with improvements in the technique a reasonable performance level may be achievable.4-6 show results with the same simulation parameters as Tables 1-3, except the receiver was placed on top of the HUB-Robeson Center (Scenario B).It is apparent that moving the receiver location inside the area of interest has diminished the performance of the geolocation algorithm; this is most likely a consequence of a reduction in spatial coherence in multipath parameters when the receiver is placed inside the area of interest.The AOA and TOA are constant over larger areas of transmit locations for Scenario A. Therefore, an error in AOA and TOA estimation in Scenario A may still allow for a geolocation estimate that is nearby to the actual transmit location.The success rate was 77% (at SNR = 15 dB) at a 100-meter radius for Scenario B. 7 shows the geolocation performance when the oversampling rate is set to  12.6% accuracy at a radius of 10 meters for NN-MUSIC and an increase of 6.67% accuracy for NN-ESPRIT.Thus, the small increase in estimation accuracy may not justify the increase in computation time for most applications.

Effect of Number of Antenna Elements in ULA.
Table 8 shows the geolocation performance when the number of antennas used in each ULA is set to 12, compared to Table 2.An SNR of 15 dB was used; the oversampling rate used was 2, and only Scenario A is considered.The geolocation accuracy generally increased with an increase in the number of antennas used.It is clear that the ESPRIT algorithm is more affected than the MUSIC algorithm by an increase in the number of antennas used.The MUSIC-and ESPRITbased geolocation algorithms only increased in accuracy by 8.0% and 3.3% at a radius of 10 meters when the number of antennas used in each ULA was increased from 8 to 12, respectively.Again, as stated in Section 6.3, the slight increase in performance may not warrant the increase in computational complexity.
6.5.Analysis of Computation Time.Table 9 displays the time (in seconds) required to run the geolocation algorithm as a function of oversampling factor and the number of antennas used in each ULA.The channel was sampled 30 times and the path fadings are assumed to be nonstationary over the sample time.The computational complexity of the JADE-MUSIC algorithm is ((⋅⋅) 3 ) for the eigendecomposition of the covariance matrix, plus an additional (( ⋅  ⋅ ) 2 ) for each point in the 2D search (or 3D for AOA in elevation as well).
Whereas JADE-ESPRIT has a much lower computational complexity ((⋅⋅) 3 ), at the cost of reduced performance [15], the JADE-MUSIC algorithm's computational complexity is highly dependent on its 2D search.For the entirety of the results presented in this paper, a temporal resolution of 2 (i.e., a temporal resolution equal to twice the product of the oversampling rate and bit rate) and an angular resolution of 1 ∘ were used in the JADE-MUSIC search.These numbers could be reduced to decrease the computation time of the MUSICbased geolocation algorithm, but the multipath characteristic estimations would suffer.

Summary and Further Considerations
A localization scheme that estimates location based on a comparison between estimated and simulated multipath characteristics was proposed.The main advantage of our proposed approach is that it uses only the AOA and TOA data as inputs, thereby reducing the processing overload, compared to other approaches which use additional data inputs.Although the extensive results and conclusions presented here are based solely on simulations and relate to a single scenario, they provide an understanding of the trade-offs and a good starting point for future studies.It should be emphasized that construction activity or other events that alter the multipath landscape will necessitate a resurvey of the region and/or recalculation of the entire multipath fingerprint database [4].Furthermore, the inclusion of unexpected scatters (i.e., motor vehicles, people, animals, etc.) will need to be analyzed experimentally to determine the effect on the estimation technique and which countermeasures would be appropriate for such situations.With an SNR of 15 dB, the location estimate was within a 25-meter radius of the actual transmit location more than 70% of the time in both Scenarios A and B when the oversampling factor was set to 2 and the ULA's had 8 antenna elements.Increases in the number of antenna elements and oversampling rate resulted in a slight improvement in performance, which leads the authors to believe that when the restrictions in Section 6.3 are satisfied, the estimation technique's performance is limited by the diversity in multipath parameters amongst each transmit location.With the inclusion of more multipath characteristics, the location estimate should improve thus making this a valuable technique for source geolocation estimation.When used in outdoor scenarios, the location estimation technique presented here has an advantage over other techniques which only include RSS comparisons, because they are more susceptible to error due to weather and other changes in the environment.Furthermore, only one base station is needed to obtain a geolocation estimate since our technique does not rely on multilateration.
Although -weighted distance metrics performed worse than the nearest neighbor estimates, they have been included here for completeness.The -weighted approach may be more beneficial when the algorithm is implemented in realworld experiments and the multipath model is no longer exact.The -weighted approach should also be beneficial as the distance between simulated transmitters is reduced.A slight estimation error may move the "max" geolocation estimate to a nearby transmitter, and the average of nearest neighbor estimates may yield a better estimate to the actual transmitter location.
Further work with this technique include (but are not limited to) (a) inclusion of more multipath characteristics in the MCD, (b) utilization of clustering techniques to increase efficiency, (c) development and assessment of other distance metrics, (d) application of the technique within other rich-multipath environments, and (e) implementation of experimental validation of the simulated results.

Figure 1 :
Figure 1: 2D representation of the Wireless InSite model overlaid on satellite image of State College, PA.The transmitter grid covers the blue overlay.Rx1 corresponds to the receiver location of Scenario A, and Rx2 corresponds to the receiver location of Scenario B, © 2013 Google Earth.

Figure 2 :
Figure 2: Simulated TOA (s) data from the MCD for downtown State College, PA, for Scenario A. Data plotted for the dominant multipath.

Figure 3 :
Figure 3: Simulated AOA in azimuth plane ( ∘ ) data from the MCD for downtown State College, PA, for Scenario A. Data plotted for the dominant multipath.

Figure 4 :
Figure 4: Simulated received power (dBm) data from the MCD for downtown State College, PA, for Scenario A. Data plotted for the dominant multipath.

Figure 5 :
Figure 5: Simulated TOA (s) data from the MCD for downtown State College, PA, for Scenario B. Data plotted for the dominant multipath.

Figure 6 :
Figure 6: Simulated AOA in azimuth plane ( ∘ ) data from the MCD for downtown State College, PA, for Scenario B. Data plotted for the dominant multipath.

Figure 7 :
Figure 7: Simulated received power (dBm) data from the MCD for downtown State College, PA, for Scenario B. Data plotted for the dominant multipath.
(i) Set SNR values to implement.(ii) Set oversampling rate.(iii) Set distance-threshold.Load MCD (i) Load 4-dimensional MCD.(ii) Determine size of MCD dimensions.(iii) Determine random location indices based on number of iterations.(iv) Determine weights to normalize TOA and AOA calculations.(a) The weights allow for appropriate comparison of TOA and AOA distances, as their respective domains vary over different values.(b) The TOA weight is the maximum TOA in the MCD.(c) The AOA weight is 180 ∘ .For SNR Values For  Iterations (i) Select random location of iteration .(ii) Extract AOAs and TOAs from th location: (a) if no multipaths exist, select another (unused) random location.Ignore Multipaths That Are Too Closely Spaced in the AOA/ TOA Domain (i) Ignore only one of the pairs that are closely spaced.(a) The distance-threshold is a user-defined parameter which can be set to determine which multipaths should be ignored.(b) These could be an artifact of the ray-tracing technique which would most likely produce a single signal.(c) They diminish the performance of the JADE-MUSIC and JADE-ESPRIT algorithms.

Table 1 :
Number of "hits" within given radius to actual location for each distance metric out of 100 trials.SNR = 30 dB; number of antennas = 8; oversampling factor = 2; distance-threshold = 0.2, Scenario A.

Table 2 :
Number of "hits" within given radius to actual location for each distance metric out of 100 trials.SNR = 15 dB; number of

Table 3 :
Number of "hits" within given radius to actual location for each distance metric out of 100 trials.SNR = 0 dB; number of antennas = 8; oversampling factor = 2; distance-threshold = 0.2, Scenario A.

Table 5 :
Number of "hits" within given radius to actual location for each distance metric out of 100 trials.SNR = 15 dB; number of antennas = 8; oversampling factor = 2; distance-threshold = 0.2, Scenario B.
20, compared to Table2.An SNR of 15 dB was used; the number of antennas in each ULA was 8, and only Scenario A is considered.The geolocation accuracy generally increases with an increase in oversampling rate.However, a 300% increase in oversampling rate results only in an increase of

Table 8 :
Number of "hits" within given radius to actual location for each distance metric out of 100 trials.SNR = 15 dB; number of antennas = 12; oversampling factor = 2; distance-threshold = 0.2, Scenario A.

Table 9 :
Number of seconds to compute a geolocation estimate as a function of oversampling rate and number of antennas used in each ULA.