Socially Responsible Investment Portfolio Construction with a Double-Screening Mechanism considering Machine Learning Prediction

Although socially responsible investment (SRI) has developed into an important investment style, only a small number of studies discuss SRI portfolio construction. In view of the overwhelming breakthrough of machine learning in prediction, this paper proposes SRI portfolio construction models by combining a double-screening mechanism considering machine learning prediction and an extended global minimum variance (GMV)model (or extendedmaximum Sharpe ratio (MSPR)model), which are, respectively, named double-screening socially responsible investment (DSSRI) portfolio models I and II. )e proposed models consist of two stages, i.e., stock screening and asset allocation. First, this paper develops a novel double-screening mechanism incorporating environmental, social, and corporate governance (ESG) and return potential criteria to ensure that high-quality stocks with good ESG performance and high-return potential are input into the optimal portfolio. Specifically, to obtain accurate stock return predictions, an extreme learning machine model optimized by the genetic algorithm is employed to predict stock prices. Next, to trade off the financial and ESG objectives of SRI investors, an extended GMV model (or extended MSPR model) considering the ESG factor is introduced to determine the capital allocation proportion of the stocks. We take the A-share market of China as the sample to verify the effectiveness of the proposed models. )e empirical results demonstrate that compared with alternative models, the proposed models can yield better annualized return and ESG score performance as well as competitive Sharpe ratio performance.


Introduction
Socially responsible investment (SRI) considers personal values and societal concerns in investment decisions [1,2]. SRI investors consider investing as an extension of their lifestyle and thus are willing to incorporate their social beliefs and values into financial activities [3]. With the increasing salience of social and environmental issues and the gradual awakening of investors' consciousness of environmental, social, and corporate governance (ESG) factors, SRI has presented rapid growth [4] and has become an important investment style. As of 2020, there were 3038 signatories in the global financial market agreeing to abide by the Principles for Responsible Investment supported by the United Nations, encompassing management assets of 103.4 trillion dollars. Compared with the levels in 2019, this number of signatories represented an increase of 20%, and the management asset volume exhibited an increase of 666 trillion dollars (see Principles for Responsible Investment at http://www.unpri.org).
In fact, SRI has attracted wide attention from academia over the past decades. Researchers have explored SRI related topics from various perspectives, including the motivation of SRI investors [5,6], socially responsible measurement [7,8], and the relationship between ESG rating and finance returns of the firm [9,10]. For more references, see [11]. However, only a small number of studies focuses on SRI portfolios. Unlike the conventional portfolio construction, SRI portfolio construction requires a stock screening process to ensure that the stocks in the portfolio are in line with SRI investors' social values. A few researchers have made valuable explorations of stock screening. For example, Auer [12] applied different screening methods to test the performance of SRI. e empirical results showed that compared with positive screening, which can damage investment performance, negative screenings with low cut-off rates are better. Kempf and Osthoff [13] compared the three SRI screening mechanisms of negative, positive, and best-in-class screening through an investment strategy of buying stocks with high social responsibility ratings and selling stocks with low social responsibility ratings. e best-in-class screening was proven to generally obtain the highest abnormal returns. Barnett and Salomon [14] evaluated the effect of the intensity of social screening on the financial performance of SRI funds. ey concluded that the financial returns of the SRI funds decrease with increasing screening intensity, while a rebound in financial returns occurs once the screening intensity reaches a maximum. It is worth mentioning that the stock screening conditions in previous studies were based solely on ESG criteria. However, as discussed by Diouf et al. [15], MacKenzie and Lewis [16], and Rosen et al. [17], SRI investors are concerned with both ESG and financial performance at the same time. ere is no conclusive evidence that stocks screened based on ESG criteria can deliver high returns [2,[18][19][20]. erefore, it is necessary to consider the return potential of candidate stocks in the screening process to ensure that the screened stocks meet both the ESG criteria and the return demand of SRI investors.
To screen the potential high-return stocks, reliable stock return prediction is critical. Machine learning methods with excellent learning capability have been widely used for stock return prediction. e representative machine learning models for stock prediction include support vector machine regression (SVR) [21,22], artificial neural networks (ANNs) [23,24], fuzzy logic system [25,26], deep neural network (DNN) [27,28], convolutional neural network [29,30], long short-term memory model (LSTM) [31][32][33], and ensemble learning [34,35]. Recently, extreme learning machine (ELM), which is a novel machine learning model, has received attention due to its powerful performance. For instance, Lie et al. [36] exploited the high accuracy and quick learning advantages of ELM to predict the stock price and then generate trading signals. Dash and Dash [37] employed ELM to train a computational efficient functional link artificial neural network and obtained better prediction effects for stock index movements than those with the backpropagation (BP) algorithm. Wei et al. [38] effectively improved the accuracy of exchange rate forecasting using a hybrid model of kernel-based ELM. Jiang et al. [39] developed two hybrid models for stock price prediction that employed ELM as the predictor. e empirical results showed that the proposed models had high prediction accuracy. Although ELM is capable of achieving good prediction accuracy, its main drawback is the mechanism of randomly generating input weights and biases of the hidden layer, which may lead to inferior solutions and thus undermine its stability and performance [40,41]. To overcome this drawback, an effective strategy is to introduce an intelligent algorithm to optimize the input weights and biases of the hidden layer of ELM [40,42,43]. e genetic algorithm (GA), a classic and excellent intelligent algorithm, has been utilized to optimize machine learning models, i.e., ELM [44][45][46] and BP [47]. In particular, ELM optimized by the GA (GA-ELM) has been successfully applied in various prediction topics, such as CO 2 emission [44], wind power [45], and gas concentration [46]. erefore, this paper introduces the GA-ELM to predict stock prices and then screens stocks with high-return potential based on the prediction results.
Once the stock screening is completed, SRI investors next need to determine the optimal investment weight of screened stocks. e mean-variance (MV) model, as the basis of modern portfolio theory, is still a common choice for SRI portfolio optimization. On this topic, only a handful of scholars have done relevant research. For example, Oikonomou et al. [48] tried to explore optimal ways to build SRI portfolios. ey applied six models (three simplistic and three sophisticated) to construct SRI portfolios and compared the performance of each. e results indicated that the SRI portfolios based on the sophisticated optimization models represented by MV had superior risk-return trade-offs relative to the simplistic models (e.g., those with naive diversification). Branch et al. [49] analysed the performance of quantitative methodologies in ESG portfolio construction. ey concluded that quantitative methods are effective at minimizing risk but could lead to unwanted exposure. Although the aforementioned studies performed meaningful and in-depth analyses, they either did not take the ESG factor into account in the portfolio optimization process or took ESG as a constraint condition instead of an optimization objective, thus failing to cater to investors' demand to maximize ESG. In [50], an extended MV was developed to construct an ESG portfolio, where minimizing the negative Sharpe ratio with the ESG score was taken as a novel variable in the objective function.
e proposed model showed considerable superiority in ESG scores over the traditional MV model. However, this research lacked an effective screening process to guarantee that high-quality stocks with both high ESG and high-return potential were input into the SRI portfolio, leading to mediocre return performance. Similarly, several modified versions of MV models incorporating ESG objectives (see, for example, Utz et al. [51], Calvo et al. [52], Stephan et al. [53] and Bilbao et al. [54]) also fail to preselect high-quality stocks before SRI portfolio construction.
Based on the above analysis, this paper proposes two SRI portfolio construction models called double-screening socially responsible investment (DSSIR) portfolio models I and II (abbreviated as DSSIR-I and DSSIR-II), which combine a double-screening mechanism and portfolio optimization models. Specifically, the novel double-screening mechanism consists of ESG and return potential screening. First, in the ESG screening, stocks with inferior ESG scores are excluded from the investment universe to ensure that the screened stocks align with the social values of investors. Subsequently, in the return potential screening, to include stocks with high-return potential into the portfolio, the GA-ELM model is introduced to predict the price of the screened stock. Finally, the stocks with high ESG and high-return potential, 2 Discrete Dynamics in Nature and Society as high-quality stocks, are input into the portfolio. After this stock screening, the portfolio optimization model is employed to weigh the screened high-quality stocks. In this work, to trade off the financial and ESG objectives of SRI investors, an extended global minimum variance (GMV) model (or extended maximum Sharpe ratio (MSPR) model) incorporating the ESG factor, which we call GMV-ESG (MSPR-ESG), is introduced as a portfolio optimization model. e main contributions of this paper are as follows: (1) Unlike the majority of studies on SRI portfolio construction, this paper incorporates stock prediction into the portfolio selection, thereby providing a new method for SRI portfolio construction. (2) is paper develops two novel SRI portfolio models called DSSRI-I and DSSRI-II, in which a doublescreening mechanism is designed to ensure highquality stocks (both high ESG and return potential) being the component of the SRI portfolio. e rest of this paper is as follows. In Section 2, we outline the related component models and introduce the proposed models. In Section 3, we describe datasets, parameter settings, and present and discuss the empirical results. In Section 4, we summarize the conclusions and prospects for future work.  [55], is an advanced algorithm for single-hidden layer feedforward neural network (SLFN), whose network topology consists of three layers, namely, the input layer, hidden layer, and output layer. e traditional algorithm for SLFN like BP algorithm determines the optimal weights through multiple iterations, while the ELM randomly generates input weights and biases of the hidden layer. e ELM can analytically calculate the output weights based on the minimum norm least squares solution. erefore, compared with the BP algorithm, the ELM is capable of obtaining good generalized performance at an extremely fast speed. Given Q sample pairs (X i , Y i ), where i � 1, 2, . . . , Q, X i � [x 1 , x 2 , . . . , x n ] T , and Y i � [y 1 , y 2 , . . . , y m ] T , the following equation can be obtained for the ELM owning L neurons in the hidden layer [55]:

Methodology
where w i � [w i1 , w i2 , . . . , w in ] T denotes the input weight, is the activation function, and b i is the bias of the ith neuron in the hidden layer. e matrix form of equation (1) is as follows [55]: where H is the hidden layer output matrix. Accordingly, the output weight of the ELM can be obtained using the minimum norm least squares solution [55] as follows: where H † represents the Moore-Penrose generalized inverse of H. e ELM can be implemented in the following four steps [55,56]: (1) determine the number of neurons in the hidden layer; (2) randomly generate the input weights and biases of the hidden layer, w i and b i ; (3) compute the hidden layer output matrix, H; and (4) estimate the output weight, β.

Genetic Algorithm.
e genetic algorithm is a classic intelligent algorithm first proposed by Holland in 1975 [57]. GA is regarded as a very efficient tool for optimization problems since it is computationally simple and powerful, as well as there are no restrictive assumptions about the solution space [58]. e core idea of the GA, inspired by the theory of 'survival of the fittest', is to imitate the biological evolution process of a population to obtain a globally optimal solution [59]. In solving a practical problem, the GA randomly generates a population of individuals (called chromosomes) as candidate solutions of a practical problem to evolve in successive generations, aiming to eliminate poor solutions within the population; thus, the remaining good solutions are further developed into better ones [60]. As a result, after finite iterations of evolution (until a predefined termination criterion is satisfied), an optimal solution from the population can be obtained. In the process of evolution, the chromosomes of each generation determine those in the next generation through three genetic operations, i.e., selection, crossover, and mutation.
is operation chooses chromosomes from the current population as parents to produce offspring input within the new population. A common selection technique is the route wheel method in which the probability of selection is proportionate to the fitness of each individual [61]. e fitness is the measure of the quality of the chromosome (solution), which is calculated by presetting the fitness function defined according to the practical problem. Given that f(C i ) is the fitness of the i-th chromosome, the total fitness of the population of G individuals can be expressed as follows [62]: e accumulated probability of each chromosome is as follows [62]: (2) Crossover Operation. A crossover operation that exchanges some parts of the two parents to produce a new pair of offspring is conducted to seek promising solutions from the search space [63]. Given a crossover probability ε, if generating a random number R with a standard uniform distribution is less than ε, a locus is chosen between two neighbouring chromosomes, i.e., parents 1 and 2. e chromosomes after this locus are exchanged, thereby forming new offsprings 1 and 2 [64]. e crossover processing can be expressed as follows: (3) Mutation Operation. e mutation operation ensures the diversity of individuals and prevents solutions from being too similar to avoid obtaining the local optimal solution [65]. Similar to the crossover operation, given a mutation probability ε, if a random number R with a standard uniform distribution is generated and is less than ε, a locus is chosen randomly on the parent, and the value of the locus is reversed, thereby producing a new offspring [64]. e mutation process can be expressed as follows:

Optimization of the ELM Model Using the GA.
Although the ELM has a fast learning speed and good generalization performance, the stability of the prediction results is weak because it randomly selects the input weights and the biases of the hidden neurons in training [40]. To improve the prediction performance of the ELM, the best input weights and the biases of the hidden layer should be determined to replace those from random selection. e most direct way to do so is by constructing a hybrid framework to combine an optimization algorithm and the ELM, thereby optimizing the input weights and the biases of the hidden layer randomly generated by the ELM [42]. Following previous studies [44][45][46], considering the sound performance in global search, GA is utilized to optimize the input weights and the biases of the hidden layer to improve the performance of the ELM in stock prediction. A detailed procedure of the GA-ELM algorithm is given in Algorithm 1.

Extended Portfolio Optimization Models.
Markowitz's mean-variance (MV) model, as a classic portfolio optimization method, has been popular in the practice of financial investment. Based on a set of different return or risk levels, MV can determine the efficient frontier, at which the portfolios can obtain the maximum return for the given risk level or achieve the minimum risk for the given return level. ere are two commonly known portfolio models on the efficient frontier, i.e., the GMV model [66] and the MSPR model [50]. e former denotes the minimum risk portfolio of the efficient frontier, and the latter denotes portfolio that maximizes the Sharpe ratio [66]. is paper incorporates the ESG factor into the objective functions of the GMV and MSPR models to form the GMV-ESG and MSPR-ESG models, respectively. e GMV-ESG model can be defined as follows: where w i is the weight of stock i, σ ij represents the covariance between stocks i and j, and σ p is the standard deviation of the portfolio. Following the advice of Auer [12], considering that investors usually have only a vague understanding of ESG, a synthetic ESG score is used to quantify the ESG performance. φ p and φ i denote the expected ESG score of portfolio and stock i, respectively. For the MSPR-ESG model, similarly to the Sharpe ratio, we trade-off ESG score maximization and risk minimization, thereby incorporating the ESG score under a unit of risk into the MSPR model. e MSPR-ESG model can be defined as follows: 4 Discrete Dynamics in Nature and Society where u i and u p separately denote the expected return of stock i and the portfolio, respectively, and r f denotes the risk-free return. In this study, the expected return u i , covariance σ ij , and ESG score φ i are obtained by the historical data.

e Proposed Models.
e input of high-quality stocks plays an important role in constructing an optimal portfolio [67].
us, it is necessary to exclude inferior stocks that cannot satisfy investor demands from the investment universe. For most SRI investors, high-quality stocks not only need to meet ESG expectations but also have high-return potential. In the proposed models, a novel double-screening mechanism considering ESG and return potential is implemented to guarantee that high-quality stocks are input into the optimal portfolio. To be specific, low-ESG stocks are first cleaned from the investment universe based on the ESG threshold given by the investor. Subsequently, the GA-ELM model is utilized to predict the price of the remaining stocks in the future period, thereby obtaining the corresponding return prediction. Finally, the predicted returns are sorted in a descending order, and stocks with top-ranked returns are regarded as the ultimate high-quality stocks obtained from double-screening.
Furthermore, as Oikonomou et al. [48] claimed, in addition to optimizing the investment universe through the screening process, SRI investors also carefully consider portfolio selection techniques. e conventional GMV and MSPR models cannot trade off the financial and ESG performance of SRI investors since they do not incorporate the ESG factor. erefore, this paper includes the ESG score within the objective functions of the GMV and MSPR models to form extended portfolio optimization models, i.e., the GMV-ESG and MSPR-ESG models, to balance the ESG Input: initialize the chromosome populations Output: the optimal input weights and biases of the hidden layer of the ELM (1) Begin (2) Set the parameters of the ELM (3) Define the fitness function (4) Evaluate the fitness of all chromosomes in population (5) C � the best chromosome (6) g � 1 (7) while g < maximum generation do (8) for i � 0 ⟶ P do (9) Generate rd1 randomly in the interval [0, 1] (10) if rd1< crossover probability then (11) Update the chromosome according to equation (7)  (12) end if (13) Generate rd2 randomly in the interval [0, 1] (14) if rd2< mutation probability then (15) Update the chromosome according to equation (8)  (16) end if (17) Evaluate the chromosome fitness (18) end for (19) Update the chromosome populations according to equation (6)  (20) Rank the chromosome populations according to fitness (21) C � the best current chromosome (22) g � g + 1 (23) end while (24) Return the optimal input weights and biases of the hidden layer of the ELM according to C (25) End ALGORITHM 1: GA-ELM algorithm.
Discrete Dynamics in Nature and Society 5 and financial objectives of SRI investors. In summary, the proposed models are divided into the two stages of stock screening and asset allocation, involving four steps. e flowchart of the proposed models is shown in Figure 1, and a more detailed description of the proposed models is given as follows: Step 1. Exclude low-ESG stocks. All sample stocks are ranked in descending order of ESG scores, i.e., S � s 1 , s 2 , . . . , s i , s j , . . . , s n |i < j, ESG(s i ) > ESG(s j ) , where s i denotes stock i. en, the ESG score threshold ESG T is set, and stocks with ESG scores lower than the threshold ESG T are excluded from S to form a new Step 2. Predict stock prices. Because SRIs are usually medium-to long-term investments, we predict stock prices over a relatively long interval period instead of the next. First, stock price data are divided into a training set and a testing set, which are transformed into supervised learning forms. Let P i (t) denote the price of s i ∈ S at time t(t � 1, 2, . . . , M), Δt denote the interval, and δ represent the time window size. us, the input matrix X i and output matrix Y i of s i can be expressed as follows: . . .
We stack X i (i � 1, 2, . . . , h) in one large feature metric V with dimension w × δ(w � h × (M − Δt − δ + 1)). Similarly, we can obtain one large target matrix O with dimension w × 1. Subsequently, the feature metric V and target matrix O are fed to the GA-ELM model for training. en, the optimal GA-ELM model obtained is applied to predict the prices of all stocks for the next Δt periods, i.e., P i (t + Δt), (i � 1, . . . , h).
Step 3. Screen high-quality stocks. Converting predicted stock prices to predicted returns, R s i (t + Δt) � (P i (t + Δt) − P i (t))/P i (t), (i � 1, . . . , h). en, the obtained predicted returns R s i (t + Δt) are sorted in descending order, and the top k stocks are considered the ultimate high-quality stocks S � s 1 , Step 4. Compute the proportion of capital allocation. e GMV-ESG and MSPR-ESG models are employed to determine the proportion of capital allocated to the high-quality stocks S, thereby obtaining the optimal SRI portfolios. For convenience in solving, a multiobjective model is often transformed into a singleobjective model. Equations (9) and (10) can be expressed as follows: 6 Discrete Dynamics in Nature and Society

Data Description.
In this paper, we use the ESG score from omson Reuters ASSET 4, one of the most prominent professional ESG rating agencies. We obtain the monthly closing prices of stocks from January 1, 2010 to December 31, 2019 in Choice Database. At present, the omson Reuters ASSET 4 database reports the ESG scores of only a few Chinese listed companies. After excluding some companies whose ESG scores are much lower or whose stock price data are notably incomplete, we select 50 companies as the study samples, as shown in Table 1. e stock closing price time series is split into training and testing sets based on the yearly rolling window approach, thereby forming three study periods. Specifically, in the first study period, we apply the stock prices from 2010 to 2016 for the training set and from 2017 for the testing set. en, we move to the next year in turn until the third study period, using the stock prices from 2012 to 2018 for the training set and the stock prices from 2019 for the testing set.

Parameter Settings.
In our experiments, the interval Δt is set to 3 and the time window size δ is set to 6; that is, the stock price from the first 6 months is used to predict the price 3 months ahead to achieve the quarterly prediction. e parameters of all relative models are presented in Table 2. e asset size of the portfolio is a critical factor to be considered by investors. Because inappropriate portfolio cardinality weakens the performance of the portfolio, it can either increase the fee or worsen the risk. Some studies provide us with valuable references on this subject; for example, Zulkifli et al. [68] argued that the inclusion of 15 stocks can diversify away the diversifiable risk. Tang [69] found that portfolios containing 20 stocks can eliminate 95% of the diversifiable risk.
Considering the above conclusions and trading off the fee and risk, we set the portfolio cardinality C � 15, 16, 17, 18, 19, and 20. To guarantee the reliability of the prediction results, the ultimate prediction results of all models are obtained by computing the average value of 20 independent runs. e empirical study is implemented using Python 3.7 and Ten-sorFlow 2.0 environments on a computer with 1.80 GHz CPU and 16.0 GB RAM.

Stock Double-Screening.
In this section, doublescreening, including ESG screening and return potential screening, is performed. First, considering the limited stock universe in this study, we censor the 20% of stocks with the lowest ESG scores to complete the ESG screening. Next, for the remaining stocks, we use the GA-ELM model to predict the stock prices, further screening out stocks based on their return potential. To demonstrate the performance of the GA-ELM in terms of stock screening, we compare it with four advanced machine learning models: ELM, SVR, DNN, and LSTM. For prediction evaluation, several statistical criteria, such as the mean square error, root mean square error, and mean absolute error, are commonly used to measure the error of the prediction model. However, considering the purpose of the quantitative investment, these statistical metrics may not truly meet the financial goals of the investor. As mentioned in [70], practitioners focus more on potential trading profitability than on the prediction accuracy of models. A straightforward fact is that the more high-return stocks a portfolio contains, the greater is its probability of yielding high returns. erefore, in this paper,   Table 3 shows the minimum, maximum, mean, and average accuracy (average accuracy � mean/N) of the screened number of high-return stocks from 2017Q1 to 2019Q4 using different models. We find that the GA-ELM outperforms the other models in nearly all cases in terms of minimum, maximum, mean, and average accuracy. Specifically, when the number of high-return stocks N � 15, 16, 17, 18, 19, and 20, the values of minimum, maximum, mean, and average accuracy obtained by the GA-ELM are almost uniformly the highest. For example, when N � 15, the GA-ELM can screen 6.25 high-return stocks on average, while the ELM screens 5.25. e average accuracy of the GA-ELM is 6.67% higher than that of the ELM. e comparison results imply that using GA to optimize ELM can effectively improve the performance of ELM. Furthermore, compared with conventional machine learning models, i.e., SVR, and deep learning methods, i.e., DNN and LSTM, the average number of high-return stocks obtained by GA-ELM separately increase by 0.58, 1.17, and 0.83, and the corresponding average accuracy also improves by 3.89%, 7.78%, and 5.56%. Accordingly, the superiority of the GA-ELM is further confirmed. When the number of high-return stocks N equals other values, i.e., 16,17,18,19, or 20, we obtain similar conclusions, which indicate that the GA-ELM model has good generalizability. erefore, the GA-ELM is a promising tool for use in the practice of stock screening.

SRI Portfolio Construction.
In this section, the screened high-quality stocks are input into the GMV-ESG and MSPR-ESG models to construct SRI portfolios. e proposed models are compared with several alternative models at different portfolio cardinalities along the dimensions of return (annualized return), risk-adjusted return (Sharpe ratio), and ESG (ESG score).

Outline of Alternative Models.
To test the performance of the proposed models, two kinds of alternative models are introduced for comparison: (1) Alternative Model I: SSRI-GMV (or SSRI-MSPR). ese models represent conventional SRI strategies that do not consider the return potential of stocks in the screening stage and do not introduce the ESG factor into the portfolio optimization model. In these models, low-ESG stocks are first excluded using ESG screening. Next, to keep the portfolio cardinality consistent with that of the proposed models, the same number of stocks as in the proposed models are selected from the screened stocks.
en, the GMV model (or MSPR model) is utilized to construct the SRI portfolio.

(2) Alternative Model II: DSSRI-I (Based on Other Machine Learning Models) (or DSSRI-II (Based on Other Machine
Learning Models)). e purpose of introducing these models is to prove that the screening results obtained by different machine learning prediction models can affect the performance of the SRI portfolio. Specifically, we first exclude some stocks with low-ESG scores from the investment universe. Subsequently, several advanced machine learning models, such as ELM, SVR, DNN, and LSTM, are employed separately to predict stock prices, thereby further screening stocks with high-return potential. Finally, as with the proposed models, the GMV-ESG model (or MSPR-ESG model) is utilized to construct the SRI portfolio. Table 4 reports the annualized return, Sharpe ratio, and ESG score of the proposed DSSRI-I and the SSRI-GMV at different portfolio cardinalities (C � 15, 16, 17, 18, 19, and 20). e performance comparison results of the two models are shown in Figure 2. It is obvious that the proposed DSSRI-I provides the highest values on the three evaluation metrics. If we take portfolio cardinality C � 15 as an example, the annualized return, Sharpe ratio, and ESG score of the SSRI-GMV are 0.0444, 0.3198, and 55.3456, respectively. Correspondingly, these metrics for the proposed DSSRI-I are 0.0982, 0.4930, and 61.9809, representing increases of 0.0538, 0.1733, and 6.6353. For other portfolio cardinalities (C � 16, 17, 18, 19, and 20), the proposed DSSRI-I still generate good performance on all evaluation metrics. Specifically, compared with the SSRI-GMV, the annualized return of the proposed DSSRI-I increases in the range of 0.0320-0.0428, the Sharpe ratio increases in the range of 0.0800-0.1510, and the ESG score increases in the range of 5.3184-6.3273. Furthermore, at the portfolio cardinality C � 17, the proposed DSSRI-I provides the highest annualized return and Sharpe ratio of 0.1113 and 0.5517, respectively. e highest ESG score is generated by the proposed model at 61.9809 at portfolio cardinality C � 15. In summary, the proposed DSSRI-I can yield better ESG and financial performance than the SSRI-GMV. Discrete Dynamics in Nature and Society

Alternative Model I Comparison (1) Comparison of the Proposed DSSRI-I and the SSRI-GMV.
(2) Comparison of the Proposed DSSRI-II and the SSRI-MSPR. e annualized return, Sharpe ratio, and ESG score of the two models are shown in Table 5. Figure 3 compares the performance of the two models. Obviously, DSSRI-II has higher evaluation metrics than the SSRI-MSPR at different portfolio cardinalities (C � 15, 16, 17, 18, 19, and 20). For example, the annualized returns of the proposed DSSRI-II are 0.0882, 0.0780, 0.1057, 0.0856, 0.0905, and 0.0887 at portfolio cardinalities C � 15 to 20, representing increases of 63.02%, 67.76%, 94.95%, 82.01%, 62.13%, and 65.61% over those of the SSRI-MSPR. For the Sharpe ratio, the DSSRI-II performs similarly. Specifically, the best Sharpe ratio (0.5339) is obtained by the DSSRI-II with portfolio cardinality C � 17. In contrast, the best Sharpe ratio of the SSRI-MSPR is 0.3750. When the portfolio cardinality C � 15, the DSSRI-II generates the highest ESG score of 62.0703, which is 11.50% higher than the best score of the SSRI-MSPR, at 55.6684. In short, the proposed DSSRI-II outperforms the SSRI-MSPR in terms of annualized return, Sharpe ratio, and ESG score.       verify the superiority of return potential screening based on the GA-ELM. e annualized return, Sharpe ratio, and ESG score of all models at different portfolio cardinalities C � 15, 16,17,18,19, and 20 are presented in Table 6. e comparison of the evaluation metrics for the different models is displayed in Figure 4. First, we compare the proposed DSSRI-I with the DSSRI-I (ELM). As shown in Table 6 and Figure 4, when the portfolio cardinality C � 15, the annualized return of the proposed DSSRI-I is 0.0708 higher than that of the DSSRI-I (ELM). e Sharpe ratio of the proposed DSSRI-I is 0.4930, which is 117.32% higher than that of the DSSRI-I (ELM). e highest ESG score (61.9809) is still obtained by the proposed DSSRI-I. For other portfolio cardinalities (C � 16, 17, 18, 19, and 20), similar conclusions are obtained. In general, the proposed model is superior to the DSSRI-I (ELM) in terms of annualized return, Sharpe ratio, and ESG score and thus proves again that GA optimization can improve the performance of the ELM. Subsequently, the proposed DSSRI-I is compared with SRI portfolio models based on double-screening considering other advanced machine learning models (SVR, DNN, and LSTM), i.e., DSSRI-I (SVR), DSSRI-I (DNN), and DSSRI-I (LSTM). From Table 6 and Figure 4, we can find that the highest annualized returns at almost all the given portfolio cardinality levels are obtained by the DSSRI-I. When the portfolio cardinalities are C � 17, 18, 19 and 20, the proposed DSSRI-I provides the highest Sharpe ratio at 0.5517, 0.4952, 0.5049, and 0.5222, respectively. e proposed DSSRI-I generates the second-highest Sharpe ratio with portfolio cardinalities C � 15 and 16, following the DSSRI-I (DNN). e DSSRI-I generates the highest ESG scores from portfolio cardinalities C � 15 to 20 with values of 61.9809, 61.8525, 61.9619, 61.5369, 61.2844, and 60.9385, respectively. In general, compared with other SRI portfolio models based on advanced machine learning models, the proposed DSSRI-I presents obvious advantages in annualized return and ESG performance and has a competitive Sharpe ratio performance. erefore, the proposed DSSRI-I based on the GA-ELM is capable of improving the ESG and financial performance of SRI portfolios.

Conclusion and Future Work
In this work, we propose two SRI portfolio construction models called DSSRI-I and DSSRI-II based on machine learning, intelligent algorithm, and portfolio theory. e models consist of two stages: stock screening and asset allocation. In the stock screening stage, a doublescreening mechanism is developed to generate the input of the SRI portfolio. Specifically, low-ESG stocks are first excluded from the investment universe to guarantee that the  assets are in line with the social values of SRI investors. en, an ELM model with GA optimization is introduced to predict the prices of the remaining stocks and thus further screen high-return potential stocks. In the asset allocation stage, the extended GMV model (or extended MSPR model) considering the ESG factor is introduced to determine the investment proportion of the screened stocks from the first stage. To test the performance, we compared the proposed models with alternative models in terms of annualized return, Sharpe ratios, and ESG scores. e conclusions show the following: (1) compared with the conventional SSRI-GMV (or SSRI-MSPR), the proposed DSSRI-I (or DSSRI-II) performs better on both ESG and financial measures. (2) e proposed DSSRI-I (or DSSRI-II) can obtain a higher annualized return and ESG score than the alternative models based on advanced machine learning models (i.e., DSSRI-I (ELM) (or DSSRI-II (ELM)), DSSRI-I (SVR) (or DSSRI-II (SVR)), DSSRI-I (DNN) (or DSSRI-II (DNN)), and DSSRI-I (LSTM) (or DSSRI-II (LSTM)). (3) e proposed models can generate a competitive Sharpe ratio performance in comparison with that of the above models. erefore, considering the superior performance of the proposed models, they can be regarded as effective tools to construct SRI portfolios.
In the future, our work may focus on the following three aspects. First, other stock markets in different political, economic, and social environments should be studied to test the generalizability of the proposed models. Second, the performance of the proposed models can be further analysed by introducing various intelligent algorithms to optimize the ELM. Finally, the proposed models can be extended to a general framework for portfolio construction based on the personal preference of investors.

Abbreviations
ANNs: Artificial neural networks BP: Back propagation DNN: Deep neural network DSSRI: Double-screening socially responsible investment ELM: Extreme learning machine ESG: Environmental, social, and corporate governance GA: Genetic algorithm GMV: Global minimum variance LSTM: Long short-term memory MSRP: Maximum Sharpe ratio MV: Mean-variance SLFN: Single-hidden layer feedforward neural network SRI: Socially responsible investment SVR: Support vector regression.
Data Availability e data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest
e authors declare that they have no conflicts of interest.