An Improved Indoor Positioning Technique Based on Receiving Signal’s Strength

Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fujian University of Technology, Fuzhou 350118, China Guangxi Key Laboratory of Automatic Detecting Technology and Instruments, Guilin University of Electronic Technology, Guilin 541004, China Intelligent Information Processing Research Center, Fujian University of Technology, Fuzhou 350118, China Fujian Key Lab for Automotive Electronics and Electric Drive, Fujian University of Technology, Fuzhou 350118, China Institute of Artificial Intelligence, Fujian University of Technology, Fuzhou 350118, China College of Information Science and Engineering, Fujian University of Technology, Fuzhou 350118, China Anktion (Fujian) Technology Company Limited, Fuzhou 350003, China


Introduction
With the advancement of information technology and the development of society, Location-Based Service (LBS) has become a basic service requirement for people's daily work and life [1]. e positioning technique can be divided into two types: outdoor positioning technique and indoor positioning technique. In an outdoor environment, Global Positioning System (GPS), BeiDou Navigation Satellite System (BDS), and other Global Navigation Satellite System (GNSS) provide users with meter-level location services, solved the problem of accurate positioning in outdoor space basically [2]. However, in an indoor environment that occupies 80% of human daily life, the GNSS signal strength is drastically reduced due to the blocking effect of buildings on wireless signals. As a result, the positioning accuracy is greatly reduced and cannot meet the needs of indoor location services. is problem is particularly prominent in large shopping malls, integrated transportation hubs, underground mines, etc., in large and complex indoor environments. erefore, how to improve the accuracy of indoor positioning technique in a large and complex indoor environment is the current research focus in the field of positioning technique [3]. Indoor positioning technique based on receiving signal's strength (RSS) has become the mainstream indoor positioning technique because it has the characteristics that it can directly use the existing widely deployed WLAN equipment and the low cost of hardware is easy to deploy. e indoor positioning technique based on RSS can be mainly divided into the positioning technique based on ranging and the positioning technique based on location fingerprint [4]. e location method based on location fingerprint is to detect the wireless signal characteristics of a specific location and build a location fingerprint database based on this. During the positioning stage, the wireless signal characteristics are detected and a specific matching algorithm is used to traverse the location fingerprint database through calculation to estimate the location information of the target node. e location fingerprint positioning method does not require the specific information of the reference point and has unique advantages in specific occasions where it is impossible or inconvenient to calibrate the reference point. However, the location fingerprint positioning method requires considerable workload to establish and maintain the location fingerprint database. e range-based positioning method uses the attenuation characteristics of the wireless signal during transmission to determine the distance between the target node and multiple reference points whose positions are known. On this basis, the location information of the target node is obtained by further calculation. e positioning method based on distance measurement is easy to deploy, low in cost, low in maintenance, and easy to promote. However, due to the complexity and diversity of the propagation path of wireless signals in an indoor environment, a single wireless signal transmission model cannot describe the propagation characteristics of wireless signals at different points indoors, resulting in large errors and lower positioning accuracy. In order to accurately determine the area of the point to be measured and further improve the positioning accuracy, this paper designs and implements a judgment method based on regional confidence. is method uses the error vector of the equation and the degree of compatibility to filter out the irrelevant combination of regions; thus, making the determination position of the point to be measured is more accurate.

Indoor Positioning Technique
With the vigorous development of information technology and electronic technology, indoor positioning methods continue to emerge and have been applied. e widely used indoor positioning methods include proximity detection, triangulation, polar point method, dead reckoning, finger printing, and multilateration [5]. e proximity detection method is to detect the coverage area of the launch point where the target is located to determine the approximate range where the target is located, with low accuracy. e triangular positioning method and pole method mainly determine the coordinates of the target by observing the angle relationship between the target and the known reference point. Dead reckoning is positioning by acquiring the movement speed, direction, and time of the measured target. It is convenient to use but has accumulated errors. When it is used in an indoor environment, it has a pedestrian dead reckoning (PDR) algorithm for pedestrian positioning scenarios. Fingerprint positioning is a method of establishing a fingerprint map database in advance and matching the collected signal fingerprint data with database data records during positioning to determine the target location. Fingerprint positioning does not require reference point position information and is very suitable for special occasions where reference point position information cannot be calibrated. However, the workload of establishing and maintaining the fingerprint database is very large, and achieving efficient data matching has always been a difficult problem, which has attracted the attention of many scholars. Some scholars have made other progress in the field of ontology research and made some progress [6][7][8][9][10][11][12][13][14][15]. e multilateral positioning method is to calibrate the positions of several reference points in advance and then determine the position of the point to be measured by measuring the distance between the point to be measured and the reference point. In practical applications, three reference points are generally used for positioning, which is also called three-sided positioning. Based on these indoor positioning methods, various indoor positioning technologies have been derived and developed [16], which are compared in Table 1.
e technologies for trilateral positioning based on RSS mainly include Wifi, Bluetooth, RFID, ZigBee, and so on. Because of its convenient data collection, a small amount of calculation, and extensive network deployment, it has a good application prospect.

Wireless Signal Transmission Model and Trilateral Positioning Algorithm
Calculating the distance between the point to be measured and the reference point using the distance measurement method and then using the trilateration algorithm to calculate the position of the point to be measured is a typical method of RSS-based positioning technique, which is simple to calculate and has good universality.

Wireless Signal Transmission Model.
e wireless signal transmission model is a mathematical model in which wireless signals propagate through a medium in a certain medium environment.
rough the wireless signal transmission model, a quantitative relationship between the degree of signal attenuation caused by the transmission process of the wireless signal in the medium space and the transmission distance can be obtained, and a reasonable selection of the signal transmission model and transmission model parameters suitable for the current environment is satisfactory indoor positioning performance is crucial. e propagation model of wireless signals in free space suggests that the receiving signal's strength P r can be expressed by the following formula [16]: where P t is the signal power at the transmitter, λ is the wavelength of the wireless signal, G t is the antenna gain at the transmitter, G t is the antenna gain at the receiver, d is the separation distance between the transmitter and receiver, and L is the system loss factor. It is troublesome to directly use the free space propagation model for ranging and positioning, so the logarithmic distance loss model is commonly used for indoor positioning [17]: e above formula P(d) is the signal strength received by the receiving end whose distance source (transmitter) is d, that is, the RSS value. P(d 0 ) is the signal strength received by the receiver when the distance from the source is d 0 . n is the path loss index, which is usually obtained by actual measurement. Generally, the more the obstacles on the propagation path, the larger the value of n, so that the wireless signal decreases more in the unit propagation distance [18]. d 0 is the reference distance, which depends on the actual situation on-site. For the convenience of calculation and measurement, the value is usually 1. X is a Gaussian random variable in dBm with a mean value of 0 and a variance ranging from 4 to 10 [19].
In engineering applications, the following simplified form of the logarithmic distance loss model is commonly used [20]: e argument d is the distance between the source and the receiver, R is the signal strength received by the receiver, A is the signal strength received 1 m from the source, and n is the path loss. During application, it is necessary to select several sample points in different positions in advance, then determine A and n parameters of the model through regression analysis, and thus establish the quantitative relationship between RSS and d detected by the receiving end. In the positioning stage, the distance d between the point to be measured and the source can be determined based on the RSS value received by the point to be measured.

Trilateral Positioning Algorithm.
e currently widely used positioning method is the trilateral positioning algorithm [21]. On a two-dimensional plane, if the distance between the point to be measured and three reference points (i.e., sources) with known positions can be determined, the three reference points are used as the center of the circle to measure the distance between the source and the reference point. A circle is made for the radius. Figure 1 shows an example of three-sided positioning technique.
In the three-dimensional space, four reference points are needed for spherical intersection for positioning calculation. Traditional RSS indoor positioning technique considers the entire indoor environment as a whole during the entire positioning process and only uses a single wireless signal transmission model and corresponding model parameters for distance calculation. In complex indoors with many barriers, the environment will inevitably produce a large error, and due to the influence of the ranging error, the three-sided positioning algorithm is likely to not intersect the three circles at a point, but to a common area. In extreme cases, they are also may be completely disjointed, resulting in a large error in the calculation results of positioning. Even if some methods such as the centroid method are used to modify and approximate the position, the results are not satisfactory. is paper improves this and proposes a new method, which uses the indoor space regional preprocessing method and regional confidence discrimination method to solve the above problems.

Improved RSS Indoor Positioning Technique
e main reason for the low accuracy of the traditional RSS indoor positioning method based on three-sided positioning is that the indoor environment is complicated. e situation of the obstructions on the propagation path of the wireless signal from the source to the indoor points is different. In the wireless signal strength of each propagation path, the decay rate is very different. e single set of propagation model parameters used in this method ignores these differences and naturally introduces large errors. To this end, this paper first conducts regionalization processing and divides the indoor environment into several areas according to the wireless signal transmission characteristics. Each point in the same area selects the same transmission model parameters, and different areas correspond to different transmission model parameters. In order to simplify the calculation, the transmission model of each region is set as the logarithmic distance loss model. In the positioning stage, the region where the point to be measured is located is determined by the region confidence judgment method, and the model parameters corresponding to the region are selected for accurate positioning calculation. Since the difference in the wireless signal transmission path from each point in the room to the source under the spatial layout of the indoor environment is fully considered, the positioning accuracy can be greatly improved.

Regionalization of Interior Space.
e path loss factor n of the logarithmic distance loss model characterizes the attenuation rate of wireless signals on the unit transmission path and fixed hard barriers such as walls affect the value of the parameter A. According to this, the indoor space can be divided into several areas according to the number and size of obstructions on the wireless signal transmission path.
As shown in Figure 2, in a specific indoor space, three sources of AP1, AP2, andAP3 are deployed. ere are two walls in the room, namely, wall fh and wall ek. To simplify the analysis and highlight the problem elements, assume the wall the thickness is 0. It can be seen from the figure that the wireless signal sent from the source AP1 passes through two representative paths, namely, path aj and path ai. Obviously, the difference between the two paths is significant, the data of the barriers (walls) on each path are different, and the wireless signal transmission characteristics on each path are also different; that is, the corresponding parameters in each path change accordingly. According to the obstruction on the wireless signal transmission path, the indoor space can be divided into regions according to the source AP1, as shown in Figure 3.
As can be seen from Figure 3, the indoor space is divided into three areas as z 1,1 , z 1,2 , and z 1,3 according to the source AP1, and the parameter values of the logarithmic distance loss model corresponding to each area (that is, the values of A and n) are different and determined by the actual situation. ese areas constitute the indoor space that is divided into regions based on the source AP1 and Z 1 are recorded as their union, which is defined as follows: Obviously, it is known that Z 1 and the actual size of the indoor space are in one-to-one correspondence and completely coincide. is article defines it as the plane corresponding to the source AP1. As can be seen from the foregoing, the parameters of the wireless signal transmission model corresponding to each area included in the plane Z 1 are generally different, and c 1,1 � A 11 , tn 11 } is a set of transmission parameters corresponding to the area z 1,1 . Among them, A 11 and n 11 are the estimated values of parameters A and n in the area z 1,1 in formula (3). Similarly, it can be constructed as c 1,1 , c 1,2 , and c 1,3 . Let C 1 be the parameter set corresponding to the plane Z 1 , then 3 � c 1,1 , c 1,2 , c 1,3 . Similarly, based on the source AP2 and source AP3, the plane Z 2 and plane Z 3 and corresponding parameter sets C 2 and C 3 can be constructed. As can be seen from Figure 3, z 1,1 is a polygon composed of multiple line segments. In general, the region z 1,1 is defined as follows: where f i (x, y) is an analytical formula of straight lines that constitute the outline of a polygon and r is 5 in the area z 1,1 .
‖p 1 − p 2 ‖ is recorded as the normal distance between points p 1 and p 2 , the point p(x, y) is any point on z 1,1 , and the point a(x a , y a ) is the location of a source AP1. en, from equations (3) and (5), the RSS minimum value R a1 based on the source AP1 that can be obtained by all points located in the area z 1,1 is as follows: Obviously, equation (7) is a linear programming problem, and the minimum value of R a1 can be solved by the simplex method, which is recorded as R a1 min . Similarly, the maximum value R a1 max of R a1 can be obtained. Supposing the RSS value is calculated based on the source AP1 corresponding to any point p(x, y) on z 1,1 be R p . Considering the connectivity of the area z 1,1 and the monotonicity of formula (3), there are obviously In this way, the area z 1,1 has a corresponding relationship with the R a1 min and R a1 max values. In this paper, the range determined by the set of values corresponding to the area is defined as the decision area of the area. For example, the above decision area of z 1,1 is [R a1 min , R a1 max ], which is denoted as j 1,1 . Similarly, the decision domain j 1,2 and j 1,3 corresponding to the remaining regions z 1,2 and z 1,3 of the plane Z 1 can be established. Remember that the set of decision domains in the region contained in the plane Z 1 is J 1 , and then J 1 � j 1,1 , j 1,2 , j 1,3 . With reference to the processing procedure of the plane Z 1 , the same processing can also be performed on the planes Z 2 and Z 3 to construct the boundary ranges, parameter sets, and decision domains of the regions in the Z 2 and Z 3 planes. So far, the regionalization of indoor space has been completed.

Positioning.
e actual positioning process can be carried out only after the process of indoor space regionalization is completed. For ease of explanation, this article defines the former as the indoor positioning training stage and the latter as the positioning stage.
Assuming that the preprocessing of indoor regionalization has been completed during the training phase, the positions of the three sources AP1, AP2, and AP3 are determined, and the area where the point p(x, y) to be measured is located is z p , and the values of the RSS of the three sources detected at p(x, y) are R 1 , R 2 , and R 3 , respectively, and the plane Z, parameter set C, and decision domain set J corresponding to each source are constructed. In the positioning stage, according to different situations in which R 1 , R 2 , and R 3 detected at the position of the point p(x, y) to be measured fall within the decision domains corresponding to the respective regions of the planes Z 1 , Z 2 and Z 3 which will be separately explained below.

Ideal Positioning.
First, consider the simplest case, that is, R 1 , R 2 , and R 3 ,only falls within the corresponding one decision domain on the three planes of Z 1 , Z 2 , and Z 3 , namely, Among them, j 1,1 , j 2,1 , and j 3,1 are the corresponding judgment domain in the area z 1,1 , z 2,1 , and z 3,1 , and it is obvious that

Mobile Information Systems
erefore, the transmission parameters relative to the three sources can be obtained by z p as c 1,1 , c 1,2 , and c 1,3 . From equation (3), combined with known conditions, the following equations can be obtained: is is the system of equations about x and y, where x and y are the position coordinate of the point p(x, y) to be measured, and (x 1 , y 1 ), (x 2 , y 2 ), and (x 3 , y 3 ) are the position coordinates of the source AP1, source AP2, and source AP3, respectively, which are known constants. d 1 , d 2 , and d n are the distances between the point to be measured p(x, y) and each source, which is calculated by using the transmission parameters c 1,1 , c 2,1 , and c 3,1 of z p relative to each source and combining formula (3). According to the title, there are only 3 sources deployed in the indoor environment, so the value of n in equation (11) is 3.
Execute the linear transformation on equation (11) with r (k+1)k (−1), k � 1, 2, . . . , n − 1 eliminate the quadratic term and sort out, and then the result can be expressed as follows: Here, G, W, and b are defined as follows: where W is the position of the point to be measured that is expected to be obtained. When the number of sources is greater than 3 (n > 3), the form of equation (11) remains unchanged. Due to hardware limitations and measurement accuracy, if equation (11) is used to solve the position of the point p(x, y) to be measured in engineering applications, there may be no solution. e following uses the maximum likelihood method to convert equation (11).
Let g ij , w j , and b i be the elements of the matrix, respectively, where i � 1, 2, . . . , n and j � 1, 2, then equation (11) can be expressed as Since the measurement of the RSS values of each source at the point p(x, y) to be measured is independent of each other, en, we get Finally, the joint density of b i is When the maximum value of L w is obtained, the corresponding is the estimated position value of the point to be measured, and set Taking the logarithm of both sides of L w and taking into account the sign and constant terms, it is obvious that the maximum value obtained by L w is equivalent to the minimum value obtained by Q w . For the convenience of subsequent calculations, transform Q w into matrix form: Expand and organize the right term of formula (18) to get en, derivate the two sides of equation (19): Let equation (20) be 0, and after shifting the terms, we get Equation (21) is the result of equation (11) converted by the maximum likelihood method.
From equation (21), Let r be the rank of the matrix G, namely, 6 Mobile Information Systems r � rank(G).

(22)
en, there must be At the same time, the augmented matrix of equation (21) can be expressed as because rank G T (G, b) ≤ rank G T � r.
In addition, (G T G, G T b) has more columns than (G T G); obviously, So, there must be erefore, it can be seen that equation (21) must not appear without solution.
e position of the point to be measured can be obtained by solving W from equation (21).

General Positioning
Processing. If the RSS value of a certain source detected at the position of the point p(x, y) to be measured does not fall into any one of the decision domains of the corresponding decision domain set J through traversal retrieval, this situation indicates that the RSS measurement error is too large, resulting in an error and requires remeasurement, or it may be that the parameter calibration of the transmission model in the training phase has caused an error, and it needs to be carefully checked and re-executed. e following discussion focuses on the more general case; that is, the RSS value falls into multiple decision domains of the corresponding decision domain set of each source.
As can be seen from the above formula, the area z p to which the point p(x, y) to be measured belongs is in the plane Z 1 , m areas meet the conditions, n areas in the plane Z 2 meet the conditions, and k areas in the plane Z 3 meet the conditions. erefore, the number of equations to be solved is Obviously, there are mnk solutions for W according to equation (21). Simply, you can consider using the centroid method to synthesize all the calculation results to estimate the position of the point p(x, y) to be measured: where W i is the solution of the i-th equation system among the mnk equation systems determined by equation (29). However, considering the known conditions of the problem set, a more precise screening can be made to further improve the positioning accuracy. According to known conditions, in an ideal situation, equation (11) must be solvable, and z p can only be in one of the regions of Z a , Z b , and Z c ; that is, the situation of equation (9) must occur. However, now z p occupies multiple areas in Z 1 , Z 2 , and Z 3 . at is, the situation of formula (29) appears, so it is reasonable to believe that due to the influence of a certain factor, the possible solution of formula (11) is deviated, and thus appears the result of equation (29). Examining the various parameters of equation (3) can only be caused by errors introduced by the R-value measurement. It can be seen from common sense that the specific position (coordinate value) of the point p(x, y) to be measured has one and only one at a certain time, so the solution of only one equation system among the mnk equation systems determined by equation (30) is the closest to the objective reality. e degree of incompatibility of equation (11) is used as a measure of the degree to which the solution of the equation is close to the objective reality. e equation with the lowest degree of incompatibility is the best choice among mnk equations, and the solution is the optimal solution. Define the regional confidence ρ as follows: where c is a measure of the degree to which the solution of the equation system in the region corresponding to ρ is close to the objective degree and c i is the c value of the equation system in the i-th region. Obviously, the value of ρ is between [0, 1], and the value of the parameter c is determined below. According to the above error analysis, the error vector m ε is introduced so that the following formula holds: From equation (21), we can see that the estimated value of equation (11) is Substitute into (33), and sort out Mobile Information Systems due to According to the topic, the positions of the sources AP1, AP2, and AP3 are independent of each other, so the vector group formed G is linearly independent, and it is obvious that det G T ≠ 0 and det(G) ≠ 0 ⟹ det G T G ≠ 0.
(37) erefore, the term (G T G) −1 on the right side of equation (36) must exist. Make Substituting formula (32) finally obtains where m ε,i is the error vector of m ε corresponding to the i-th equation group. For the mnk combinations determined by equation (29), ρ is sorted according to equations (36) and (39) and sorted, the equation group with the largest ρ value is the best choice, and then the W calculated according to equation (21) is the pending optimal solution for the measuring point p(x, y).

Experiment
e experimental environment selects an office area on the third floor of an office building. e office area is an approximately rectangular area with a length and width of approximately 50 meters × 25 meters, as shown in Figure 4. e upper left corner of the figure is used as the origin of the coordinates, the horizontal direction to the right is the horizontal axis, and the vertical direction is the vertical axis to establish the coordinate system. For comparison, the experiment was divided into two groups. e first group of experiments uses the traditional three-sided positioning method for testing. e entire indoor environment is regarded as a whole. In the test environment, 50 test points are randomly selected for position calibration as shown in Figure 5.
en, further estimate and calibrate the values of the transmission model parameters A and n expressed by equation (3). en, select 30 random positions for positioning test and record the actual coordinates and positioning results of the random positions. In the second set of experiments, the improved method proposed in this paper is used to carry out regional preprocessing of indoor space, in turn constructing the area set, parameter set, and decision domain set corresponding to each plane, and applying the regional confidence method to the distance measurement and positioning in the positioning stage. e solution results are constrained and filtered, and the positioning results are recorded. In order to reduce the interference factors, the two sets of experiments used the same set of test points in the estimation and calibration of the model parameters. For the same reason, the positioning test point data of the two sets of experiments are also the same.
Suppose the actual position of the point to be measured is W 0 , the positioning result solved by the traditional threesided positioning method in the first group of experiments is W 1 , and the positioning result solved by the improved method in the second group of experiments is W 2 , e i � W i − W 0 , i � 1, 2. at is, e 1 is the positioning error of the traditional three-sided positioning method obtained by experiments, and e 2 is the positioning error of the improved method. Process the test data of the two sets of experiments according to the above formula, calculate and plot the positioning error corresponding to each test point, and set the horizontal axis as the test point number and the vertical axis as the positioning error, as shown in Figure 6.
It can be seen from the figure that the positioning error of the traditional three-sided positioning method is relatively large, the positioning error of individual test points even exceeds 10 m, and the fluctuation of the error is relatively large, which indicates that the positioning error of the traditional three-sided positioning method is relatively discrete. e positioning error range using the improved method in this paper is basically controlled within about 2 m.
Set the horizontal axis as the positioning error and the vertical axis as the cumulative error probability. Plot the distribution diagram of cumulative error probability accordingly, as shown in Figure 7. F 1 and F 2 are the distribution of cumulative error probability of the first and second sets of experimental data, respectively. It can be clearly seen  from the figure that the distribution curve of cumulative error probability of the second group that used the improved positioning method reaches the peak soon after the horizontal axis crosses 2 m, which indicates that the positioning error is basically distributed within 2.2 m, while the distribution curve of cumulative error probability in the first group which used traditional positioning is relatively smooth, and as the distance increases, the error convergence is slower. Relatively speaking, the positioning effect of the second group that used the improved method is obviously better.

Conclusion and Future Work
e propagation of wireless signals in an indoor environment is a fairly complicated process. e RSS value of a wireless signal at a specific indoor location is restricted by many factors. e traditional indoor positioning technique based on RSS considers the indoor environment as a unified whole. Since it ignores the differences in the wireless signal transmission paths of various points in the indoor environment, the positioning effect is not satisfactory. e method proposed in this paper takes sufficient account of these differences, and it takes advantage of the prior knowledge of the layout algorithm of the known indoor environment, by the way of the localization of the indoor environment and the approaching positioning to solve this problem. It is worth noting that in the application, the first problem needs to be solved is the acquisition of indoor environment data of the building in the training phase. In the current field of building and construction, the building information model (BIM) has become popular, and many buildings are carried on by BIM in the design phase. erefore, the BIM can be used to obtain data on the indoor environment, saving a lot of complicated work in the early phase. In addition, in the indoor environment, due to the unpredictable shielding effect of indoor pedestrians on the wireless signal transmission path and the multipath effect caused by the reflection of the wireless signal by the wall plate in the building, these factors will reduce the positioning accuracy. erefore, adding Gaussian filtering and Kalman filtering to the RSS data acquisition module of the RSS indoor positioning equipment can weaken this effect to a certain extent, and this follow-up work needs to be further developed in the future.
Data Availability e data are available on requesting the corresponding author Xingsi Xue, whose e-mail is jack8375@gmail.com.

Conflicts of Interest
e authors declare that there are no conflicts of interest regarding the publication of this paper.