A Lipid Metabolism-Based Seven-Gene Signature Correlates with the Clinical Outcome of Lung Adenocarcinoma

Background Herein, we tried to develop a prognostic prediction model for patients with LUAD based on the expression profiles of lipid metabolism-related genes (LMRGs). Methods Molecular subtypes were identified by non-negative matrix factorization (NMF) clustering. The overall survival (OS) predictive gene signature was developed and validated internally and externally based on online data sets. Time-dependent receiver operating characteristic (ROC) curve, Kaplan–Meier curve, nomogram, restricted mean survival time (EMST), and decision curve analysis (DCA) were used to assess the performance of the gene signature. Results We identified three molecular subtypes in LUAD with distinct characteristics on immune cells infiltration and clinical outcomes. Moreover, we confirmed a seven-gene signature as an independent prognostic factor for patients with LUAD. Calibration and DCA analysis plots indicated the excellent predictive performance of the prognostic nomogram constructed based on the gene signature. In addition, the nomogram showed higher robustness and clinical usability compared with four previously reported prognostic gene signatures. Conclusions Findings in the present study shed new light on the characteristics of lipid metabolism within LUAD, and the established seven-gene signature can be utilized as a new prognostic marker for predicting survival in patients with LUAD.


Introduction
Globally, lung cancer is the leading cause of cancer death worldwide with a five-year survival rate of advanced-stage patients less than 5% [1,2]. As the major histological subtype of lung cancer (approx. 85% cases), lung adenocarcinoma (LUAD) results in the death of most patients from local recurrence or distant metastasis [2]. e prognosis of advanced LUAD is still not satisfactory, and treatment options are limited [3]. In order to anticipate the natural history of the disease in individual patients, it is mandatory that the nature of LUAD in each patient should be clearly understood since only individually tailored molecular profiles and markers could spare the patients from undergoing a potentially more harmful, aggressive chemical therapy or even leave them untreated. erefore, there is an increasing interest in the molecular characterization of LUAD allowing prognosticate overall patient survival. e interaction of tumor cells and lipid metabolism is supposed to play an important role in LUAD progression [4,5]. Although metabolism reprogramming was presumed to be one of the hallmarks of tumorigenesis, molecular characteristics of lipid metabolic disorder within the tumor microenvironment are incompletely understood. Almost nothing is known about the precise role and underlying mechanism of lipid metabolism-related genes (LMRGs) and their gene expression profiles in primary resected LUAD, not yet anything known related to the prognostic distinctions of LUAD. e current study aimed at the discovery of a new prognostic signature in patients with LUAD using Cox and the least absolute shrinkage and selection operator (LASSO) regression models to analyze the expression profile of LMRGs using e Cancer Genome Atlas (TCGA) LUAD data set. Based on LMRG expression data from public databases in TCGA, we constructed molecular subtypes with distinct different immune characteristics and clinical outcomes.
en, we developed a seven-LMRG signature for assessing the prognosis of patients with LUAD and further validated it in TCGA, GSE31210, and GSE50081 data sets.
is signature was tightly associated with patients' outcomes and could serve as an independent pathological factor.

Patients and Data Sets.
e RNA-seq data and clinical information of 594 LUAD samples were downloaded from the TCGA database (http://www.cancer.gov/about-nci/ organization/ccg/research/structural-genomics/TCGA). TCGA LUAD data set was preprocessed with the criteria as follows: (1) excluded samples without clinical data and OS < 30 days; (2) excluded data of normal esophagus tissue sample; (3) excluded genes with FPKM � 0 in 50% of samples; and (4) included the expression profile of genes related to lipid metabolism. Finally, 490 samples from TCGA LUAD data set were enrolled for subsequent analysis. Two data sets, which contains gene expression profiles of LUAD samples in the GEO database, GSE31210 (contains 226 LUAD samples) and GSE50081 (contains 129 LUAD samples), were downloaded from NCBI (http://www.ncbi.nlm.nih.gov/geo/). Both data sets were preprocessed with the criteria as follows: (1) excluded normal tissue sample data; (2) converted OS data from year or month to days; (3) transformed gene probes to the human gene SYMBOL using the bioconductor package of R, those matched to multiple genes were removed, and if multiple probes were matched one gene, the median was selected as the expression profile of this gene; and (4) microarray data was normalized from Affymetrix platform by using robust multiarray average method [6]. e clinicopathological characteristics of patients from these three data sets after preprocessing are summarized in Table 1.
For the TCGA LUAD data set, 90% of them was randomly divided into training cohorts (n � 441), and the entire data set was selected as a validation cohort. GSE31210 and GSE50081 data sets were used as external validation cohorts. For real-time quantitative PCR (RT-qPCR) and immunochemistry (IHC) analysis, 10 NSCLC patients undergoing resection were enrolled in this study from the biobank of FUSCC. Surgically excised tumors used for qRT-PCR were immediately immersed in liquid nitrogen and stored at −80°C. At the same time, formalin-fixed paraffin-embedded (FFPE) tissues were also obtained from the same patient's tissues for IHC. is study was approved by the Institutional Review Boards of Fudan University Shanghai Cancer Center (FUSCC).

Identification of Molecular Subtypes Based on Lipid
Metabolic Genes. A total of 776 genes related to lipid metabolism from six lipid metabolism-related gene sets (Supplementary Table 1) were collected from the Molecular Signature Database v7.0 (MSigDB). Among them, 3 genes were not offered in the TCGA LUAD data set, and 30 genes with FPKM � 0 in more than half of the samples were also excluded. Finally, 743 genes were enrolled for subsequent analysis. Prognostic genes were detected by univariate Cox regression survival analysis using the R package survival coxph function, and log rank P < 0.05 was selected as the threshold. e molecular subtypes were identified based on these prognostic genes using the non-negative matrix factorization (NMF) method, and the optimal number of subtypes were determined according to indicators including cophenetic correlation, residual sum of squares (RSS), dispersion, and silhouette. Cophenetic correlation range between 0 and 1, it positively reflect the stability of the cluster obtained from NMF [7]. e silhouette coefficient can be used to select the appropriate number of clusters. According to the line chart, the point with the largest coefficient change range can be found intuitively, and the point with the largest distortion range reflects the best number of clusters [8]. A residual sum of squares (RSS) is a statistical technique used to measure the amount of variance in a data set that is not explained by a regression model [9]. us, the RSS value negatively reflects the clustering performance of the model.

Analysis of Gene Expression
Profile and Immune Score among Molecular Subtypes. DESeq2 was used to calculate differentially expressed genes (DEGs) among each cluster (FDR <0.05 and |log2FC| > 1). Weighted gene correlation network analysis (WGCNA) coexpression algorithm was used for detecting coexpressed genes and modules by the R package WGCNA [10]. To improve the accuracy of network construction, the transcripts per kilobase of exon model per million mapped reads (TPM) profile of genes were extracted from the TCGA LUAD data sets, and hierarchical cluster analysis was performed on the 490 samples first to remove the outlier samples. Second, the distance between each gene was calculated using the Pearson correlation coefficient; a weighted coexpression network was constructed using the R package WGCNA; and coexpression modules were screened by setting the soft threshold power β as 7. ird, the topology overlap matrix (TOM) was then constructed from the adjacency matrix to avoid the influence of noise and spurious associations. On the basis of TOM, average-linkage hierarchical clustering using the dynamic shear tree method was subsequently conducted to define coexpression modules, and the minimum gene size of each module was set as 30. e feature vector values (eigengenes) of each module were calculated in turn to explore the relationship among modules, and then modules with highly correlated eigengenes were merged into a new module by performing cluster analysis with the following threshold: height � 0.25, DeepSplit � 2, and minModuleSize � 30. In order to identify the modules of interest, the correlation between each coexpression module and patients' clinical features as well as cluster subtypes was further evaluated. Modules with significant correlation with the lipid-metabolism subtypes of LUAD patients were defined as key modules for the subsequent selection of hub genes (Spearman correlation coefficient >0.4; P < 0.05).
Tumor immune estimation resource (TIMER) tool [11] was used to compare the enumeration of six tumor-infiltration immune cell types (including B cell, CD4 + T cell, CD8 + T cell, neutrophil, macrophage, and dendritic cell) of each LUAD sample in the tumor microenvironment (Supplementary Table 2). Next, the difference on the Immune-Score and StromalScore, which represent the relative proportion of immune cells and stromal cells in tumor tissues, among each molecular subtype was calculated by using the R package Estimation of STromal and Immune cells in MAlignant Tumours using Expression data (ESTIMATE) [12].
e ESTIMATEScore, which refers to the purity of tumor tissues, is the sum of ImmuneScore and StromalScore.

Construction of Lipid Metabolism-Related Prognostic
Gene Signature. Coexpression genes in the training set were detected by univariate Cox regression survival analysis, and log rank P < 0.01 was selected as the threshold. To narrow the gene range and maximize the accuracy, Least absolute shrinkage and selection operator (LASSO) Cox regression analysis [13], a method screening signatures with generally effective prognostication performance by performing automatic feature selection, was performed by using the glmnet package of R to identify the prognostic gene. And optimal genes were evaluated by tenfold cross-validation.
Multivariate Cox regression survival analysis was performed to construct the prognostic risk model. e risk score for each patient of the training set was calculated with the linear combinational of the signature gene expression weighted by their regression coefficients. Risk score � (expr gene1 × coefficient gene1 ) + (expr gene2 × coefficient gene2 ) + . . . + (expr genen × coefficient genen ). Receiver operating characteristics (ROC) curves, carried out by using the R package timeROC, were used to analyze the risk score of each sample, and samples were set as a high-or low-risk group by setting the threshold as 0.

Bioinformatic
Analysis. Pathway enrichment analysis of differentially coexpressed genes was performed through the R package WebGestaltR (the threshold FDR <0.05). Singlesample gene set enrichment analysis (ssGSEA) was applied for identifying the relationship between the risk scores of

Statistical Analysis.
Kaplan-Meier curves were applied to assess the difference in OS between different groups. Multivariate Cox regression analyses were performed to assess the independent prognostic factors. Decision curve analysis (DCA), which can evaluate predictive models from the perspective of clinical consequences [14], was performed in the entire cohort to test the clinical usefulness of the nomogram in comparison with the gene signature and clinicopathological parameters. All statistical analyses were using R 3.6.0 (https://mirrors.tuna.tsinghua.edu.cn/CRAN/) with default software parameters. P value <0.05 was considered significant statistically.
Immunohistochemistry Analysis. Formalin-fixed and paraffin-embedded specimens were cut into 5 mm thick sections and mounted on glass slides. IHC was carried out as previously reported [15]. CD4 antibody and CD8 antibody were purchased from Abcam. DAB reagent was purchased from Fansbio (Guangzhou, China).

RT-PCR.
Total RNA was harvested from LUAD tissue using Trizol and reverse-transcribed into cDNA using Pri-meScript RT reagent kit (with gDNA Eraser) (Takara). e cDNA was amplified using the SYBR qPCR Master Mix (Takara) on a real-time PCR system (LightCycler480). Primer sequences are shown in Table 2. All primers were synthesized by Sangon Biotech Co. Ltd. (Shanghai). RT-PCR was carried out as previously reported [15].

Identification of Molecular Subtypes in LUAD.
By univariate Cox regression survival analysis, 126 lipid metabolism-related genes were identified correlated with the overall survival (OS) of patients with LUAD in the TCGA data set (Supplementary Table 3). en, by using non-negative matrix factorization (NMF) method, three molecular subtypes (Cluster 1 (n � 82), Cluster 2 (n � 182), and cluster 3 (n � 226)) were constructed based on these prognostic genes (Figure 1(a); Supplementary Figure 1(a)). Gene expression profile and the distribution of clinicopathological parameters in each subtype were shown in Figure 1(b). We further analyzed the relationship between the molecular subtype and clinicopathological features of LUAD patients. It was observed that most of the patients at the M1 stage were divided into Cluster 1, which had worse survival. e Cluster 2 with the best outcomes inversely showed a trend of younger age and early TNM stage and exhibited wild TP53 and RP1L1 status (Supplementary Figure 1(b)). In addition, the differences in immune characteristics among the three subtypes were analyzed. Cluster 1 showed the lowest proportions of B cell, CD4 + T cell, CD8 + T cell, neutrophil, macrophage, and dendritic cell (DC) than the other two subtypes (Supplementary Figure 1(c)). e calculated ImmuneScore, Stro-malScore, and ESTIMATEScore were also remarkably lower in Cluster 1, which represented less immune and stromal cell components in tumor microenvironment (TME) and lower tumor priority for the samples in the other two subtypes (Supplementary Figure 1(d)); in contrast, Cluster 2 showed the highest ImmuneScore, which suggests that patients in Clusters 1 and 2 have distinct immune and stroma characteristics. Furthermore, Kaplan-Meier method with log-rank tests was applied to explore the difference in prognosis among the three molecular subtypes in LUAD. Although it seems that patients in Cluster 1 confer the worst OS ( Figure 1(c)), there was no significant difference between Clusters 1 and 3 ( Figure 1(d)). However, compared with Cluster 1 (P < 0.001) and Cluster 2 (P � 0.002), patients in Cluster 2 showed the longest OS time (Figure 1(d)).  Table 2: RT-qPCR primer of seven genes.

Genes
Primer LUAD data set, a total of 30 coexpression modules were obtained after module fusion (Figure 2(b); grey modules represent gene sets that could not be merged). Moreover, by analyzing the correlation of the module and genes in the module with phenotypes ( Figure 2(c)), we found that orange (contains 49 genes) module correlated with Cluster 1, magenta (contains 404 genes) module correlated with Cluster 2, and yellow (contains 1090 genes) module cor-

Construction of Prognosis Risk Model Based on Differential
Coexpression Genes. To identify novel genetic biomarkers associated with the clinical outcome of patients with LUAD, univariate Cox proportional hazard regression was applied to these 486 DEGs. And then 186 genes significantly correlated to OS (P < 0.01; Supplementary    and RGS20) were confirmed with tenfold cross-validation and the minimized error rate λ � 0.059 ( Supplementary  Figures 3(a)-3(b)). By applying Kaplan-Meier analysis, we confirmed that all these seven genes were significantly associated with the OS of patients in the training set (Supplementary Figure 3(c)). Among them, CHRDL1 (P < 0.001) showed a significant negative correlation with OS, while the other six genes were positively correlated to OS (all P < 0.001; Supplementary Figures 3(d)-3(i)). e final seven-gene signature was calculated using multivariate Cox survival analysis (Table 3), and a gene-based prognostic model was established to evaluate the survival risk of each patient as follows: RiskScore � −0.0103 * exp CHRDL1 + 0.0001 * exp GAPDH + 0.0105 * exp GNPNAT1 + 0.0039 * exp HTATIP2 + 0.0064 * exp MFI2 + 0.0085 * exp PKP2 + 0.0284 * exp RGS20 .
Based on the risk score formula and the cut-off value of normalized risk score (Z-score � 0), patients were divided into a high-or low-risk group (Figure 3(a)). And a heatmap showing the expression profile of the eight genes illustrated that as the risk score of patients increased, the expression of prognosis-risky genes (GAPDH, GNPNAT1, HTATIP2, MFI2, PKP2, and RGS20) were distinctly up-regulated; in contrast, the expression of prognosis-protective gene CHRDL1 was down-regulated. ROC curve showed that the accuracy of the prognostic seven-gene signature for 1-, 3-, and 5-year survival was 0.72, 0.70, and 0.66, respectively (Figure 3(b)). Finally, we divided the samples with Z-scorebased RiskScore greater than zero into the high-risk group and the samples with less than zero into the low-risk group.
Kaplan-Meier curve analysis revealed that the OS time of patients in the high-risk group was significantly shorter than that in the low-risk group (HR � 2.122; 95% CI: 1.554-2.899; P < 0.001; Figure 3(c)).

Validation of the Seven-Gene Signature in the Entire TCGA Data Set and Two GEO LUAD Data Sets.
e entire TCGA LUAD data set (n � 490) was used for internal validation, and the risk score of each sample was calculated, which showed that the association between the gene expression and risk score was consistent with the training set (Figure 4(a)). e ROC curve displayed that the accuracy of the prognostic eight-gene signature for 1-, 3-, and 5-year survival was 0.71, 0.68, and 0.67, respectively (Figure 4(b)). Patients in the internal validation data set were classified into high-and low-risk groups with the same cutoff as used in the training set. As expected, patients in the validation set with high risk scores had shorter OS than those with low risk scores (HR � 2.229; 95% CI: 1.655-3.000; P < 0.001; Figure 4(c)).
Subsequently, the prognostication efficiency of our seven-gene signature was also calculated in the two external validation data sets GSE31210 and GSE50081. e results showed that the association between the gene expression and risk score was consistent with that in the training and internal validation set ( Figure 5(a) and 5(b)). In the GSE31210 data set, e ROC curve displayed that the accuracy of the prognostic seven-gene signature for 1-, 3-, and 5-year survival was 0.87, 0.67, and 0.72, respectively ( Figure 5(c)).        Journal of Oncology As expected, patients in the GSE31210 data set with high risk scores had shorter OS than those with low risk scores (HR � 5.113; 95% CI: 2.121-12.327; P < 0.001; Figure 5(d)).
In the GSE50081 data set, the ROC curve displayed that the accuracy of the prognostic seven-gene signature for 1-, 3-, and 5-year survival was 0.77, 0.72, and 0.73, respectively ( Figure 5(e)). As expected, patients in the GSE50081 data set with high risk scores had shorter OS than those with low risk scores (HR � 2.687; 95% CI: 1.549-4.662; P < 0.001; Figure 5(f )). erefore, the seven-gene signature exhibited steady effective prognostic classification performance in the internal and two external validation sets.  application, univariate and multivariate Cox regression analysis was used to systematically analyze the clinical outcome of patients in the entire TCGA LUAD data set. Univariate analysis of survival revealed that the seven-gene signature (P < 0.001), age (P � 0.038), N stage (P < 0.001), M stage (P < 0.001), and TP53 mutation status (P < 0.001) were prognostic indicators of OS (Supplementary Figure 4(a)). However, multivariate Cox regression analysis showed that only seven-gene signature (P < 0.001) in addition to TNM stage (P � 0.001) were independent risk factors of OS (Supplementary Figure 4(b)). Overall, these results suggest that the seven-gene signature is a potential independent prognostic factor for LUAD.

e Performance of the Seven-Gene Signature in Comparison to Previous Signatures in TCGA LUAD Data Set.
To assess the predictive power of the seven-gene signature, four published risk models for OS of patients with LUAD: a three-gene signature developed by Shukla et al. [16], an eight-gene signature developed by Li et al. [17], a three-gene signature developed by Yue et al. [18], and a three-gene signature developed by Liu et al. [19] were enrolled for comparison. To improve the comparability of the models, the risk score of each LUAD sample in the TCGA cohort was calculated according to the corresponding genes in all four models by applying the same method being reported [20][21][22]. Kaplan-Meier curve analysis revealed that all four modules showed significant prognostic value in predicting OS (all P < 0.001; Supplementary Figures 5(a)-5(d)). e ROC of each model was evaluated, and the area under the curve (AUC) of all three models were larger than 0.6 ( Supplementary Figures 5(a)-5(d)), while only the AUC of 2-year OS in Shukla's three-gene signature and the AUC of 1-year OS in Li's eight-gene signature were larger than 0.7 ( Supplementary Figures 5(a) and 5(b)). Restricted mean survival time (RMST) was applied to calculate and compare the C-index of all signatures. Although the C-index of our seven-gene signature was only significantly higher than that of Yue's module (P � 0.024) and Liu's module (P � 0.008), our seven-gene signature showed highest C-index (0.710; Figure 6(a)). We also applied a DCA to evaluate our sevengene signature with these four signatures, in which the net benefit together with a broader range of threshold probability of our seven-gene signature ranked as the highest one ( Figure 6(b)), indicating that the seven-gene signature in the present study exhibited a best predictive performance. Taken together, these results suggest that this seven-gene signature is more suitable for predicting the prognosis of patients with LUAD in clinical practice.

GSEA Analysis of Enriched Pathway Based on Risk Score.
ssGSEA was performed to determine the potential related pathways according to patients' prognostic risk in the TCGA training data set, and pathways with Pearson correlation coefficient >0.4 were derived. As shown in Figure 7(a), a total of 23 pathways were identified, and most of the pathways were positively correlated with samples' risk score (red color). By dividing samples into high-and low-risk groups based on whether the risk score is greater than 0 and analyzing the enriched pathway in both groups by using GSEA, we found that four pathways were significantly enriched in the high-risk group: other glycan degradation, glycosaminoglycan degradation, biosynthesis of unsaturated fatty acids, and PPAR signaling pathway (P < 0.05; Figure 7(b)). us, the seven-gene signature may involve in the development and progression of LUAD by participating in these pathways.

Detection of Prognostic Gene Expression in Lung
Adenocarcinoma Tissue. We selected ten patient tissues who underwent surgical treatment in the FUSCC in 2019 for the determination of the expression level of these seven prognostic genes in LUAD. According to the results of the H&E staining of tumor tissues and the immunohistochemical staining of CD4 and CD8 cells in tumor tissues, the tissues were divided into two groups: the low-immune infiltration group and the high-immune infiltration group (Figure 8(a)). Subsequently, we applied RT-PCR to detect the expression level of GAPDH, GNPNAT1, HTATIP2, MFI2, PKP2, RGS20, and CHRDL1 in these 10 tumor tissues. As shown in Figure 8(b), the mRNA level of GAPDH, GNPNAT1, HTATIP2, MFI2, PKP2, and RGS2 showed higher expression tendency in the high-immune infiltration group than the low-immune infiltration group, whereas the expression level of CHRDL1 showed a lower expression tendency in the high-immune infiltration group than the low-immune infiltration group.

Discussion
Cumulative evidences have been yielded in the field of lipid metabolism within lung cancer cells [23]. Under abnormalities in vessels structure attributed to limited nutrient supply and hypoxia, lung cancer cells present multiple metabolic alterations in order to support cell growth, including aerobic glycolysis and de novo lipogenesis [24]. De novo lipogenesis provides intermediate materials to support the phospholipid and triglyceride synthesis, lipid modification of proteins, and fatty acid ß-oxidation in tumor cells, including fatty acid synthase (FAS), stearoyl CoA desaturase 1 (SCD1), adenosine triphosphate-binding cassette family (ABC family), ATP citrate lyase (ACLY), and fatty acids and thus augments the activation of growthpromoting pathways [23]. Lung cancer cells have accelerated lipid metabolism (mainly de novo lipogenesis) than normal lung respiratory epithelium [25]. erefore, lipid metabolism alteration may be a potential indicator of tumor malignancy and thus prognosis.
In order to better understand the biological characteristics of LUAD, we classified LUAD into three subtypes based on a comprehensive analysis of LMRG expression profiles in the TCGA LUAD data set. We demonstrated that the Cluster 1 subtype is associated with the lowest infiltration of immune cells, decreased immune scores, stromal scores, and tumor purity.
is immune-related difference in LMRG-based molecular subtype may reflect the effects of lipid metabolism on tumor immune microenvironment. It has been reported that the lipid levels in DCs can influence the ability of DCs to process antigen [26]. In addition, immune cells can interact with various classes of lipids, and altering lipid metabolism is capable of controlling the activation, differentiation, plasticity, and function of immune cells [27]. To some extent, our data indicate the potential crosstalk between lipid metabolism and immune response [28]. By "immunoediting," tumor cells interact with the human immune system in the origin and progression of tumors, and infiltrating lymphocytes in the tumor microenvironment have been found to mediate adaptive immunity [29]. erefore, the lowest infiltration of immune cells may pave the way for the worst clinical outcome of patients in Cluster 1. Patients in the Cluster 2 subtype exhibited more favorable clinical outcomes, whilst patients in the Cluster 1 subtype exhibited the worst overall survival. Functional and signaling pathway enrichment analysis further showed that overlapping DEGs among the three subtypes mainly participated in the regulation of cell cycle and in some cancerassociated pathways, indicating an interface between lipid metabolism and tumor cell uncontrolled growth [4,5]. Our study, for the first time, stratified the LUAD patients based on LMRG and provided novel insights into predicting the efficacy of patients' survival, as well as potential biomarkers for the response to immunotherapy and targets for immunotherapy.
Given that LMRGs could distinguish patients' clinical and molecular features, we further developed a seven-LMRGs signature that could stratify patients with high or low risk of poor overall survival. Among the seven biomarker genes (CHRDL1, GAPDH, GNPNAT1, HTATIP2, MFI2, PKP2, and RGS20) discovered by the present study, Chordin-like 1 (CHRDL1) is a bone morphogenetic protein (BMP) antagonist. BMPs have emerged as important modulators of cancer aggressiveness. As a tumor suppressor, CHRDL1 is down-expressed in various cancer tissues [30][31][32], and a high CHRDL1 expression level induced decreased tumor progression [31,33]. Glyceraldehyde-3phosphate dehydrogenase (GAPDH) is a pivotal enzyme, and it regulated cellular senescence phenotype in A549 cells via modulating the AMPK network [34], a key pathway implicated in cancer cell lipid metabolism. e relationship between glucosamine 6-phosphate N-acetyltransferase 1 (GNPNAT1/GNA1) and cancer was seldom reported. GNPNAT1 is a key enzyme in the biosynthesis of uridine diphosphate-N-acetylglucosamine, which is an important donor substrate for N-linked glycosylation and thus participates in the regulation of cell growth [35]. HIV-1 Tat interactive protein 2 (HTATIP2) is also a tumor suppressor; by enhancing the HIF2α-regulated β-catenin/c-Myc/MCL-1 signaling, HTATIP2's deletion increases tumor metabolic plasticity, that is, enable tumor cells to exploit alternative metabolic pathways for replenishing TCA cycle intermediates, to avoid dependence on carbon sources from glutamine and fatty acid, thus inducing LUAD cell survival and proliferation [36]. Plakophilin 2 (PKP2) encodes a plakophilin protein that belongs to the member of desmosomal proteins. PKP2 has been verified overexpressed in several types of human cancers including lung cancer; it is an unfavorable prognostic biomarker for LUAD patients but not for LUSC patients [37]. In addition, it exhibits oncogenic roles through activating EGFR signaling pathway in LUAD cells [37]. Although melanotransferrin (MELTF, MFI2) [38] and regulator of G protein signaling 20 (RGS20) [39] have been reported as regulators in human malignancies, seldom were known about the correlation of both genes with LUAD. Although some of the previous studies have identified these genes as prognostic markers in LUAD, they were limited by just a single gene detected, small sample sizes, and lack of independent validation. e use of the LASSO Cox regression model [40] and nomogram [41] allowed us to integrate multiple genes into one tool, which has significantly greater prognostic accuracy than that of a single gene alone or even some previous reported gene signatures.
Some limitations of this study should be taken into consideration. Although TCGA and two GEO data sets enrolled both Caucasian and Japanese populations, this present study may not include patients with LUAD from other areas loading distinct genetic phenotypes and clinical characteristics, making it susceptible to the inherent biases of such a study format. Clearly, our results should be further validated by a prospective study in some worldwide multicenter clinical studies. Moreover, its prognostic role in early LUAD must be further evaluated. In addition, despite growing studies that began focusing on the interaction of tumor cells and associated lipid metabolism in human malignancies, most LMRGs are not yet functionally annotated in LUAD, and the biofunctions of our seven genes have not yet been fully investigated in previous studies. Although the biological functions of the predictive genes were annotated using computational methods, additional studies should be performed to further reveal the mechanisms of the genes involved in the tumorigenesis of LUAD. Furthermore, more evidences are required to find out the biological foundation of their dysregulation in LUAD.
Cancer is a complex disease, and the interaction between the tumor and its microenvironment plays an important role in the progression of cancer disease. e current clinical stratification scheme focuses on tumor histopathology and molecular characteristics of tumor cells [42]. With further research, people have found that different immune cells are involved in different stages of tumor progression and strive to explore how to use the immune environment determinants that affect tumor development in treatment. Studies have shown that lipid metabolism alteration may be a potential indicator of tumor malignancy and thus prognosis. erefore, we established a seven-gene signature in LUAD, trying to connect the immunity and lipid metabolism in series in order to improve the accuracy of prognosis and prediction information. Our research results show that in patients with different levels of immune cell infiltration, the mRNA expression levels of 7 genes were different. Our model suggests that different expression levels indicate different survival risks. e survival risk of the high infiltration group is higher than the low infiltration group. e risk model suggests that we cannot measure the survival risk of patients solely by the infiltration level of certain types of immune cells, which reflects the value of our 7-gene signature model in risk prediction. Biomarkers can represent the status of the tumor microenvironment. For example, the high expression level of PD-1 in some cancer may represent the immunosuppressive status. Biomarkers can be used to assess the immune response status of the tumor microenvironment of different patients and find more suitable treatment options for the patients.
In summary, for the first time, we profiled the lipid metabolism phenotype in LUAD, and our study may provide a better assessment of the LMRG-based classification of LUAD. We uncovered the prognostic value of LMRG in LUAD and identified a lipid metabolism-related signature that could classify LUAD patients with high-and low-risk groups of unfavorable survival. is method might, therefore, help with patient counselling and individualized management of patients with LUAD.
Data Availability e data sets generated and analyzed during the current study are available in the TCGA repository (https://portal. gdc.cancer.gov/) and the GEO repository (https://www.ncbi. nlm.nih.gov/geo/).

Conflicts of Interest
e authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as potential conflicts of interest.

Authors' Contributions
TL, JC, and JL contributed equally to the paper. WN and MX designed the study and revised the manuscript. TL conducted the data process, modal establishment, and visualization of analysis. MX, JC, JL, and WN did the data analysis and interpretation. TL performed statistical analysis performed data analysis and wrote the manuscript. All authors have read and approved the final manuscript.