Solving the Location Selection Problem of Self-Service Stores from the Perspectives of Sustainability and Uncertainty

This paper presents an improved method for selecting a specific location in the development of convenience stores in municipal areas. This method solves the problem of self-service store location from the perspective of sustainability and uncertainty and adequately considers the characteristics of individual locations with the proposition of an improved grey wolf optimization algorithm. The example presented shows that the improved algorithm has obvious advantages in facilitating the selection of convenience stores with respect to the search precision, stability, and convergence. Based on the macroenvironment, income, and cost, this work establishes a relatively complex, complete, and targeted mathematical model. Finally, taking the Xiaonanzhuang area of Suzhou Street in Beijing as an example, the scientificity, feasibility, and sustainability of the location model are verified.


Introduction
The self-service model has been studied since 2010 [1][2][3]. Buell et al. [4] investigated the impact of self-service technology usage on customer satisfaction and retention. Selfservice terminals are used as part of a customer's checkout process in retail operations. Li et al. [5] proposed that intercustomer interactions are important for the operation of self-services in retail settings. Weretecki et al. [6] studied the impact of intercustomer interactions at retail selfservice terminals on service quality perceptions and repeat purchase intentions at retail stores. Self-service stores are retail outlets in which machines replace humans in providing customers with specific services to reduce the cost of delivery and purchasing through service standardization [7,8]. Compared with traditional retail stores, self-service stores have numerous benefits, including the use of small retail space, low inventory, the provision of convenience, and reduced operation costs [9,10]. As a result, selfservice stores have developed rapidly. The rapid development of self-service stores requires the adequate evaluation and selection of store locations in specific situations. The store location is one of the most important factors in the operation of unattended convenience stores and is directly related to their profitability. Consequently, evaluating and selecting the best store location is a critical problem that needs to be adequately addressed.
Much research has benefited from the development of specific methods for solving the site selection problem under various circumstances. Huang et al. [11] reported an analytical approach to select expansion locations for retailers selling add-on products. Moreover, the authors built predictive models for understanding the derived demand of the add-on product and established an optimization framework for automating expansion decisions to maximize expected sales. Zhao et al. [12] presented a data-driven approach to allocating 'taxi canteens' throughout a city and proposed a constrained optimization model to select locations for these services. Mohammad and Morteza [13] presented a fast, constructive heuristic algorithm based on priority rules to determine the optimal location for storage facilities.
Rao et al. [14] proposed a fuzzy multiattribute group decision-making (FMAGDM) technique based on linguistic tuples to evaluate potential alternative CLC locations. Birol [15] proposed the Fuzzy Preference Ranking Organization method for enrichment evaluation to evaluate the potential location of logistics centers. Such methods in general can be classified as multiple criteria decision-making (MCDM) methods, mathematical methods, and intelligent methods. MCDM methods always involve multiple objectives, there are contradictions and incommensurability among objectives, and the qualitative and quantitative indicators are mixed [16][17][18]. The multicriteria decision siting model often contains some specific constraints, can satisfy multiple objectives, and focuses on comprehensively considering multiple criteria for nuclear site selection [19]. Rouyendegh and Savalan [20] combine multiple-criteria decisionmaking with an analytic hierarchy process and fuzzy sets to establish a hybrid model to provide decision support for the selection of sustainable solutions to agricultural problems. Ghosh et al. [21] studied the application of the hexagonal fuzzy multicriteria decision-making method in the location of charging stations for electric vehicles. Erol et al. [22] proposed a new decision-making framework based on fuzzy MCDM to sort the site selection of alternative nuclear power plants in Turkey. As a result of the existence of qualitative indicators, this method is greatly influenced by subjective factors. In actual decision-making, these qualitative indicators must be treated with fuzzy quantification.
Mathematical methods are used to solve location selection problems by quantifying factors such as costs and expenses, defining definite assumptions, setting parameters and variables, and establishing a mathematical model for optimization [23,24]. The entropy weight method and TOPSIS model are used to determine distribution center locations in cold-chain logistics. From the perspective of quantitative analysis, how comprehensive factors influence the site selection problem is difficult to consider by establishing a mathematical model for solving it, such as topography, land appreciation, urban development, environment, traffic, pollution, and the impact on the user, which usually cannot be quantified and ignored. The use of mathematical methods to solve practical problems with large amounts of data is often difficult. Mathematical models are always ideal and precise.
The emergence of intelligent algorithms has greatly enriched modern optimization technology and provided practical solutions for combinatorial optimization problems that are difficult to solve using traditional optimization techniques, such as genetic algorithms, particle swarm algorithms, and differential evolution algorithms [25,26]. These methods share many of the same characteristics: (1) all indeterminate algorithms; (2) global optimization algorithms; (3) excellent distributed computing mechanisms; (4) self-organization and evolution; and (5) strong robustness; however, some defects exist, such as (1) the accuracy of the solution results is often sacrificed for the efficiency of the solution; (2) it easily falls into the local optimal solution; and (3) the selection of algorithm parameters is based mainly on experience.
As a new type of offline retail business, there are few research papers on unmanned convenience stores, and few researchers participate in the study of the operation and management of unmanned convenience stores. At present, the existing research articles on unmanned convenience store mainly focus on analyzing its market positioning and its own operating characteristics and summarize the current development of the industry. Based on this, a self-service store location model is proposed, and an improved grey wolf optimization (GWO) algorithm is proposed to solve the model. Aiming at the improvement strategy of global optimization algorithm, an endogenous improvement method is proposed to optimize the application of global optimization algorithm in linear programming computational problems. The factors affecting the location of self-service stores are analyzed. Seven representative influencing factors are proposed, and two hypotheses are put forward: store rent is the revenue factor, and daily flow is the cost factor. Horizontal competitors in the surrounding area assist in the operation of self-service stores. These hypotheses are verified by practical cases, and their rationality and practical significance are demonstrated.
The remainder of the paper is divided into four sections. The next section describes the location problem and the associated factors. The improved grey wolf algorithm is introduced to solve the mathematical model of the location problem and runs tests using the class functions. A location model of convenience stores is established against this improved algorithm backdrop. This step is followed by selecting the location of two convenience stores in Beijing, and the results of the location model are analyzed. Finally, the concluding section details some practical suggestions for improving future site planning.

Formulating the Selection Problem
After analyzing many references and conducting field research in Beijing, the following factors influencing the location selection of self-service stores were summarized to establish the location selection model: 2.1. Environmental Factors. Environmental factors should include the specific location of the reserved location, specific area, degree of policy support, degree of macroregional development, and degree of development of social law and morality [27]. These factors are generally predetermined.

Revenue
Factor. The revenue factor is an important factor that can, directly and indirectly, influence the operating profit of self-service stores. Profit is how much revenue exceeds the cost [28]. The profit of the enterprise comes directly from consumers' consumption in the store.

Cost Factors.
The cost factors generally include the site cost, goods loss cost, maintenance and replacement cost, depreciation cost, and manpower cost [29,30]. The model of this paper should include the daily flow of people, distance from dangerous areas, maintenance distance, and several peripheral competitors. Self-service stores prefer small numbers of customers with high net worth and long-term stable demand. Therefore, too much traffic does not have a positive effect on these stores. In contrast, due to its limited space, limited reception capacity, and the consumption of hardware facilities and products, too much traffic will have 2 Journal of Environmental and Public Health a highly negative impact. Therefore, the people flow is a cost factor with a large changed weight. Location in a potentially dangerous area is a serious cost factor. Such an area can attract people who do not intend to shop at the convenience store, create discomfort, and feature low-quality facilities and the potential harm to the store or customers. This factor should therefore be heavily weighted. Surrounding competitors represent the competition factor. Peripheral competitors will inevitably affect the store's income level, but there are no convenience stores with general offline retail stores. Sales are not repelled but increased around a certain number of traditional offline retail stores, as the presence of competitors increases the main demand of passenger flow and disperses them; therefore, this cost factor has a smaller weight. Maintenance distance is the distance from the warehouse to the location of the staff responsible for maintaining the self-service store. The space affects warehouse sales and self-service stores and is affected by the total number of warehouses. Then, the model should consider all preexisting warehouses and no convenience store randomness. The characteristics of the high-frequency replenishment and exchange of goods mean that the distance should be as low as possible. Thus, this factor is a cost factor.

Correlation Factors.
Correlation factors refer to the interactions between factors. The main correlation factors are as follows; 2.4.1. Associated Effects of the Master Stream Band. When the site location is closer to the mainstream of people, away from residential and office areas, the income-weighted rent cost for peripheral small village land is low, and the weight of the cost factor of daytime traffic is large. Conversely, when the location is away from the mainstream of people and close to residential and office areas, the income-weighted rent cost for peripheral districts is larger, with a smaller value for the weight of the cost factor of daytime traffic.

Associated Effects of Risky Areas.
When a location is too close to a dangerous area, the weight of daytime human flow and the distance to the dangerous area will correspondingly increase. In contrast, when the location's proximity to a dangerous area meets the specified limit, the weight of the daytime human flow and the distance to the dangerous area will correspondingly decrease.

Associated Effects of Distance to the Maintenance Area.
When the location is too far from the maintenance area, the weight of the daytime human flow and distance to the maintenance area will correspondingly increase. In contrast, when the location's distance to a dangerous area meets the specified limit, the weight of daytime human flow and distance to the maintenance area will decrease.

Formulating the Site Selection Problem
The objective function of the optimal location is expressed as follows: where P i is the optimal location searched by the algorithm in each iteration, which can be expressed as (1) LP i is the total weight value of the land price in the neighbourhood of the community at position i, which should be expressed as where LP w is the default weight of the land price, LPc i is the normalized value of the practical land price at position i, and LPk i is the dependency coefficient of the actual location on the land price, which can be expressed as where ω 1 and θ 1 are different constants. dd is the distance from the location to the nearest master stream belt, and DD is the default distance limit. DPa i is the influence coefficient of the closed area on the surrounding land price, while α 1 and β 1 are different constants, pp i is the location of the prelocation, and DP is the closed area.
(2) R i is the total weighted value of the store's rent in location i, which can be expressed as where R w is the default weight of the store's rent, Rc i is the normalized value of the actual rent, and Rk i is the dependency coefficient of the actual location on the rent, which can be represented as where ω 2 and θ 2 are different constants. DPb i is the influence coefficient of the closed area on the surrounding land price, while α 2 and β 2 are different constants.
(3) DF i is the total weighted value of the daytime flow of people at position i, which can be expressed as 3 Journal of Environmental and Public Health where DF w is the fixed weight of the daytime flow, DFc i is the normalized value of actual daytime flow at position i, DFk i is the dependence coefficient of the actual location on the flow of people at position i, and CO i is the influence coefficient of the number of competitors on the daytime flow, which can be described as where ω 3 and θ 3 represent different constants. CO i is the influence coefficient of the number of competitors on the daytime flow. 1 < μ 1 < μ 2 , and they are all constants. cn ic is the number of competitors in the default distance dcc i of position i. DPc i is the influence coefficient of the closed area on the daytime flow, and α 3 and β 3 are different constants.
(4) NF i is the total weighted value of the night flows at position i, which should be expressed as where NF w is the default weight of the flow of people at night, and NFc i is the normalized value of the actual night flow at position i. Because the stream of people at night is relatively lower than that during the day and the service objects are mostly local residents, the cost of loss is lower. In this context, because of the sudden demand of customers and the irreplaceability of self-service stores for general stores, the store earnings rise as the number of customers increases.
(5) DZ i is the total weighted value of dangerous areas at position i, which can be expressed as where Rw is the default weight of the store's rent, Rc i is the normalized value of the actual rent, and DZk i is the dependency coefficient of the actual location on the dangerous area, which can be represented as where ω 4 and θ 4 represent different constants. ddz i is the distance between location i and the nearest dangerous area, and DDZ is the default distance limit.
(6) CN i is the total weighted value of regional competitors in position i, which can be expressed as where CN w is the default regional competitor weight and CNc i is the normalized value of the actual number of regional competitors in position i.
(7) MD i is the total weighted value of the maintenance distance at location i, which can be expressed as where MD w is the weight of the default maintenance distance and MDc i is the normalized value of the actual maintenance distance at position i. MDk i is the dependence coefficient of the actual location on the distance to the maintenance area, which can be represented as where ω 5 and θ 5 represent different constants. ddm i is the distance between location i and the nearest dangerous area, and DDM is the default distance limit.

A New Method
Grey wolves are highly intelligent social animals and have long been considered a dangerous top predator, which means they are at the top of the food chain. Most grey wolves like to live in a small cluster.
The leader of the herd can be a male wolf or a female wolf, which we call the alpha (α) wolf. Alpha (α) wolves are mainly responsible for making decisions about where the population hunts, where they sleep, when they wake up, and so on. The decision of the alpha (α) wolf depends on individual thinking and decision-making behaviour [31]. However, biologists have also observed some kind of democratic behaviour; that is, alpha (α) wolves sometimes follow other subordinate wolves. In the group, the whole population recognizes the alpha (α) by raising its tail. The alpha (α) wolf is also called the dominant wolf because his or her orders should be obeyed by the whole pack. Alpha (α) wolves are only allowed to mate and reproduce in their own population. It is worth noting that alpha (α) wolves are not necessarily the strongest and most ferocious members of the pack, but they are the best at managing the pack 4 Journal of Environmental and Public Health [32]. This shows that for the first wolf, that is, alpha (α) wolf, organizational ability and discipline are much more important than their individual strength in status evaluation. The second level of the grey wolf class is the beta (β) wolf. Beta (β) wolves are subordinate wolves to help alpha (α) wolves make decisions or leaders of other ethnic activities. Beta (β) wolves can be either male or female, and he or she may be the best candidate for alpha (α) wolves to prevent the death or aging of alpha wolves. Beta (β) wolves should respect the alpha (α), but also command and command other lower-level wolves. It plays the role of consultant and trainer. Beta (β) wolves play a role in strengthening the alpha (α) command and providing feedback to the alpha (α) in the whole population [33,34].
The lowest ranking grey wolf is the omega (ω) wolf. Omega (ω) plays the most passive role of scapegoat. Omega wolves always have to obey all other higher-level dominant wolves. They are the last wolves allowed to hunt and feed. Omega (ω) wolves may not seem like a very important hierarchy, but biologists have observed the problems caused by internal fighting and the loss of omega (ω) wolves in the entire population [35]. This is because omega (ω) wolves mainly perform the function of violence and destruction at all levels of the pack. This helps to meet the needs of the whole ethnic group and maintain the dominant structure. In some cases, omega (ω) is also a maintainer in the community.
If the selected grey wolf is not an alpha (α) wolf, a beta (β) wolf, or an omega (ω) wolf, then he or she is called a delta (δ) wolf. Delta (δ) wolves must obey the alpha (α) wolves and the beta (β) wolves, but they will dominate the omega (ω) wolves. Scouts, sentinels, elders, hunters, and guards belong to the role of this type of wolf. The scouts are responsible for monitoring the boundaries of the territory and warning the guard in the event of danger. The sentry protects and ensures the safety of the packing [36]. Elders are experienced wolves who used to be alpha (α) wolves or beta (β) wolves. Hunters help the alpha (α) wolves and the beta (β) wolves when hunting prey and providing food. Finally, the guards are responsible for taking care of the weak, sick, and injured members of the pack. In the GWO algorithm, the hunter-gatherer (optimization) is always led by α, β, and δ. The rest of the wolves follow these three wolves. According to the description of Muro et al. [37] the main stages of grey wolf hunting are as follows: (1) Tracking, chasing, and approaching prey (2) Pursuing, surrounding, and harassing the prey until it stops moving (3) Attacking the prey Grey wolves surround their prey during hunting. To simulate surrounding behaviour mathematically, the following equation is proposed: where t represents the current iteration, A ! and C ! are coefficient vectors, X ! p ðtÞ is the position vector of prey, and X ! indicates the position vector of grey wolves.
In the iterative process, the convergence factor decreases linearly from 2 to 0, and the vector r is the random number between 0 and 1. Grey wolves can update the position in the surrounding space of their prey at any random position using equations (16) and (17).
We save the first three best solutions so far and ask the other search agents (including the omega wolf) to update their location X α , X β , and X δ according to the position of the optimal search agent: where X d i ðt + 1Þ represents the fitness value of the individual grey wolf i in generation t + 1.
To solve the problem that the basic GWO easily falls into a local optimum, this paper improves two aspects:

Convergence Factor Strategy Based on Nonlinear Decline.
In GWO, the default convergence factor decreases linearly from 2 to 0 as the number of iterations increases; however, in using the actual algorithm and solving the function, the algorithm convergence trend is not linear. Therefore, the linear decreasing strategy of convergent factor in the basic algorithm is not completely suitable for the actual algorithm when searching and calculating the optimal value of the function. Therefore, a new nonlinear convergence method is proposed in this paper: where l is the current iteration number; n is the default maximum number of iterations; and a is a nonlinear convergence factor.
The nonlinear convergence of convergence factor a is shown in figure 1.
When the maximum number of iterations is 500, the blue function curve represents the speed of basic GWO convergence, and the red function curve represents the speed of optimized algorithm convergence. The improved convergence factor a presents the curve convergence with the 5 Journal of Environmental and Public Health iterations; in the short term, a slow convergence speed can interact with the GWO to determine a better global optimal solution, and in the medium term, the convergence of the algorithm is accelerated. The convergence speed of a accelerates at the later stage, which helps the algorithm find the local optimal solution quickly.

Fixed Penalty Function Method for Multisegment
Association Mapping. Because GWO is an optimization algorithm search technology based on an unconstrained function, it is necessary to combine an appropriate constraint treatment when solving the optimization problem of the constrained function with GWO. The penalty function method is most commonly used to handle a series of unconstrained optimization algorithm technologies.
As shown in equation (20), the constraint optimization problem is as follows: where f ðxÞ is the target function, g j ðxÞ is the constraint condition of the inequality, h j ðxÞ is the constraint condition of the equation, and l i and u i are the upper and lower bounds of the variable x i , respectively. This equation can be translated into the following form: where ε is called the tolerance value, which is generally taken as a small positive number. The general form of the constructed generalized objective function is as follows: where FðxÞ is the original target function, partial ∂ðtÞH ðxÞ is called the punishment term, partial ∂ðtÞ is expressed as the punishment intensity, and HðxÞ is called the punishment factor.
During the whole process of solving the optimization problem of constraints, if the partial ðtÞ in equation (22) is fixed, it is called the fixed penalty function method; otherwise, it is called the nonfixed penalty function method. The fixed penalty function method will be determined by a set of preset parameters in accordance with the actual situation or theoretical situation and will not change due to changes in the number of iterations. Multisegment mapping is when the function value or variable value changes, and the default constraint function will have different constraint effects on the original function, which is expressed as: where H 1 ðxÞ is the target punishment function; AðxÞ is the punishment intensity; BðxÞ is the punishment degree; A 1 , A 2 , and A 3 are the punishment intensity set; and r 1 , r 2 , and r 3 are the upper and lower bounds of different sections.   Journal of Environmental and Public Health The correlational penalty function can be expressed as According to the above ideas and formulas, a new grey wolf optimization algorithm is proposed in this paper, and its process can be summarized as follows. The flow chart is shown in Figure 2.
Step 1. Initialize the grey wolf population, that is, randomly generate the position of n intelligent individuals; initialize a, A, and C; initialize the value of X α , X β , and X δ .
Step 2. The transboundary intelligent individual is processed, and the fitness value of each intelligent individual is calculated.
Step 3. Compare the fitness of each individual of intelligence, and determine the optimal solution, optimal solution, suboptimal solution, and X α , X β , and X δ in the current iteration.
Step 4. For each intelligent individual, the strategy is adjusted nonlinearly according to the control parameters, and the parameters a are obtained by optimization, and then A and C are obtained.
Step 5. According to the optimized position determination method and the nonlinear penalty function, the position of the intelligent individual is redetermined.
Step 6. Then, determine the fitness of the new generation of individuals and update the population level.
Step 7. If the end condition (maximum number of iterations) is reached, the end is finished and the optimal solution is output, otherwise go to Step 2.

Simulation Study
To test the feasibility and practical reliability of the IGWO, we used classic test functions, including unimodal functions and multipeak functions. Each function was used to extract three sample tests and was compared with GWO and PSO. The details are shown in Tables 1 and 2: Each test function was tested at least 50 times, and the representative image was taken for display. The details are shown in Figures 3-8.
As shown, compared to PSO, IGWO has obvious advantages in terms of speed and precision and does not easily fall into the local optimum. IGWO shows clear superiority of the functional testing results shown in Table 3. Table 3 shows the following aspects in detail: ,100] 0 [-100,100] 0 7 Journal of Environmental and Public Health   Figure 4: Journal of Environmental and Public Health

Evaluation of exploitation capability (functions F 1 -F 3 ).
Functions F 1 -F 3 are unimodal with only one global optimum, which can evaluate the exploitation capability of the investigated metaheuristic algorithms. According to the results in the table, the present algorithm provides very good exploitation.

Evaluation of exploration capability (functions F 4 -F 6 ).
To evaluate the exploration capability of an optimization algorithm, we used multimodal functions. These functions include many local optima whose number increases exponentially with the problem size (the number of design variables).
In terms of quantitative analysis, on the premise of keeping the original search time unchanged, the improved grey wolf optimization algorithm improves the search accuracy by 100%. The search accuracy of all test functions of PSO, PSO, and GWO can only be maintained to about e −28 , while the improved grey wolf optimization algorithm can stably improve the search accuracy to about the maximum e −60 , and performs well in several test functions, significantly leading the accuracy of GWO and PSO. In fact, because the algorithm slows down the convergence speed of the convergence factor in the early stage, the algorithm actually searches more possible solutions and traverses more possibility intervals, so the actual operation speed of the  Figure 8: algorithm is accelerated. It shows its extremely high precision and not easy to fall into the local optimal performance. In terms of qualitative analysis, the improved grey wolf optimization algorithm further improves the convergence speed and the smooth degree of convergence in the second half of the algorithm on the premise of keeping the original convergence curve from deterioration. IGWO further maintains and improves its own characteristics of fast iteration speed, fast convergence speed, and fast search speed in the process of comparing PSO and GWO.
6. Examples 6.1. Case Analysis. The case area is zoned from the auxiliary road of Wanquanhe Road in the north to Suzhou Street in the south and from western Sanyimiao Community in the east to Wanliu East Road in the west. The area is 1.2 km long from east to west, the longest distance from north to south is 900 m, and the shortest is 650 m. The case area contains a total of eight residential areas, including Guangda Garden, Xiaonanzhuang Community, and Xinjiyuan Space, two primary schools, two secondary schools, and a floating population area. An illustration of the area is shown as Figure 9.
To build the model, the area was transformed into a rectangle with a length of 120 and a width of 80, and each optional location in the region was marked. The dangerous area is marked in blue, the maintenance area is marked in green, and surrounding competitors are marked in red. The approximation is as follows Figure 10.
Through the measurement of the actual model described in the previous chapter, we obtained seven kinds of data for seven areas, as shown in Table 4, then, we normalized the data.
We then introduced a linear normalization formula to normalize the data used in this paper: where P i is the i-th value of the P class data. P max refers to the maximum value of the P class data, P min refers to the minimum value of the P class data, and P * i is the normalized value of the i-th value of the P class data. Formula (25) is used when P i is smaller and closer to the expected situation. Formula (26) is used when P i is larger and closer to the expected situation.
In practical application, taking area 1 (Xiaomai) as an example when calculating the land price factor, this factor increases as it becomes closer to the expected value, which can be written as However, when normalizing the value of the distance from the danger area, this factor decreases as it becomes closer to the expected value, which can be written as

Journal of Environmental and Public Health
This process is similar to the remaining data.
6.2. Analysis of the Model Calculation Results. MATLAB software was used for 200 tests (as in the previous procedure) to form the scatter diagram of location selection shown in Figure 11; that is, the optimal solution to the location selection problem. The optimal locations are mainly concentrated in the Youth Apartment area (enclosed residential area) in area 4 and the office building in area 5. In these areas, the floating population is low, which guarantees that the customer source is active and stable; however, the land price of the neighborhood around area 7 is high, as the area features mainly residents with higher incomes.
The area is close to the planned maintenance location for urban and rural storage, which helps to reduce maintenance costs. Conforming with the self-service retail industry in Beijing before the existing location mode emphasizes the main principles to "avoid crowds, implement closed-end management, serve select customer groups, implement low flow operations, and be close to maintenance areas;" this approach can ensure the long-term stable development of self-service stores.

Conclusion
In this paper, IGWO is proposed. This approach features high stability, high speed, accuracy, and global scope, can solve the optimal search problem of the related functions involved in the general swarm intelligence optimization algorithm, and has good adaptability and reliability for common functions. This paper also tries to solve the problem of convenience store location from the perspective of sustainability and uncertainty, and explores the development history of the unmanned retail industry in China, focuses on the development process of the self-service store industry in Beijing, and analyses the development of several typical self-service stores in Beijing and the location of these sites. Based on research and analysis, three categories of dimensions are summarized:    Through these dimensions, a relatively complex, sustainable, complete, and targeted mathematical model has been established. Finally, we selected Suzhou Street in the Xiaonanzhuang area as a case area and verified the scientific feasibility of the location model. 7.1. Managerial Insights. The problem of determining the location of convenience stores has often been mentioned by previous researchers but without a scientific, systematic, visual, and quantifiable description or calculation method to address it. In response, an effective, targeted, systematic, and practical location method is proposed in this paper. In terms of optimization and improvement strategies for the grey wolf optimization algorithm, some endogenous improvement methods are provided, and the calculation of the linear programming problem is optimized. These advancements offer innovation, pertinence, and practicability. For the establishment of the location model for convenience stores, seven representative important influencing factors are put forward, and two hypotheses are proposed: store rent is a factor of income, and the flow of people in the daytime is a factor of cost, and competitors in the surrounding industry are helpful for the operation of the convenience store. Using an actual case to verify these assumptions, we proved their rationality and practical significance.

Opportunities for Future
Research. This paper focuses on improving the internal mechanism of the GWO algorithm. The algorithm's optimization speed is not optimal and thus room for improvement remains to avoid falling into local optimization. At present, the most popular method is to use two or more hybrid algorithms, such as hybrid difference algorithms and genetic algorithms, to improve this optimization. Future research can integrate multiple algorithms to overcome the problem of easily falling into local optimization.
For convenience, this paper focuses mainly on factors that can be simply quantified and have fixed values. For some content, it is difficult to investigate and collect information, including the impact of the population age ratio on consumption in the convenience store, the consumption tendency of customers at the convenience store, and the subjective attraction of the location to the crowd. These factors have not been investigated, analyzed, and evaluated in depth, which makes the evaluation ability of the mathematical model in the subjective direction insufficient. Future researchers can conduct more in-depth analyses of these aspects.

Data Availability
The labeled data set used to support the findings of this study is available from the corresponding author upon request.

Conflicts of Interest
The author declares that there are no conflicts of interest.