Research Status and Challenges of Data-Driven Construction Project Management in the Big Data Context


Introduction
There are two reasons why the construction industry is facing a data tsunami. Firstly, the complexity of construction projects is increasing. This is especially the case for megaprojects, which are regarded as data-driven complex systems [1][2][3]. Secondly, a large number of advanced information technologies (IT) have been introduced to the construction domain. These include building information modeling (BIM), wireless sensor networks (WSN), Internet of Things (IoTs)-related technologies, etc. [4][5][6]. With the use of these technologies, a myriad of multisource heterogeneous data are generated. For example, the design data of BIM for a typical three-story building may exceed 50 GB in size [7]. Ultimately, these developments led to the arrival of the Big Data era in the construction domain [8,9]. Therefore, it is imperative to explore the potential changes to and possible innovation of construction project management (CPM) within the context of Big Data.
There are a number of definitions for Big Data. One definition is a dataset that cannot be collected, managed, and processed with regular software tools within a certain period of time [10]. Another description is data that exceed the capabilities of the organization to store or analyze them for accurate and timely decision-making [11,12]. It is well recognized that Big Data share "4V" characteristics, i.e., Volume, Velocity, Variety, and Value [13,14]. There are two complementary aspects of the Big Data value chain in the process of construction project management (CPM), i.e., Big Data management (BDM) and Big Data analytics and visualization (BDA&V) [8,13,15]. BDM provides an infrastructure to BDA&V, where data management techniques, tools, and platforms, including generation, acquisition, storage, processing, and security, can be applied [16].
Compared with other industries, such as service and manufacturing, the architecture, engineering, and construction (AEC) industry has unique characteristics [15,17,18]. For example, due to their large scale and complexity, construction projects not only take a long time to execute but also involve a large number of stakeholders across the entire lifecycle management (LCM). From project initialization, planning, and implementation to delivery, a huge amount of multisource heterogeneous data are produced [6]. Such datasets are so large and heterogeneous that dealing with them efficiently by means of traditional database technologies is a significant challenge. The relatively clear fact is that the construction industry still falls behind other industries in terms of Big Data utilization [9,13]. Similar to the product lifecycle in the manufacturing industry [15], the lifecycle of construction projects also contains several independent phases from planning to decommissioning, namely, the planning phase, execution phase, maintenance phase, etc. [19,20]. Although each lifecycle phase seems to produce only data related to the participants or tasks in that phase, in reality mass data flow into different phases and interact with each other [6]. Therefore, effective CPM requires adequate information sharing and data-driven decision-making across the whole project lifecycle [19,21], especially for complex megaprojects [5]. As a result, the next innovation wave of CPM uses emerging IT (or Big-Data-related technology) to achieve real-time communication between stakeholders, and utilizes valuable knowledge extracted by Big Data analytics to support decision-making [8,22]. This revolution in the construction industry gave birth to the concept of digital construction [23].
Digital construction has elicited growing attention. Previous related reviews mainly focused on using Big-Data-related technology or analytics to serve each of the construction goals and tasks separately (e.g., [6,8]). Nevertheless, a systematic review of technological change, the resulting managerial process, and organizational changes in the Big Data context, especially from the angle of the whole lifecycle of a construction project, is lacking. This study fills the void and proposes a path to achieve a comprehensive data-driven CPM scenario based on technological, process, and organizational innovations within the context of Big Data. This work is organized as follows: The authors first introduce the background, analyze the differences between previous related reviews and this work, and illustrate the research method. Next, they review the general trend in Big-Data-related technology application in terms of CPM phases, technology and application, and focused topics. Following this analysis, they discuss the particularized proposals in relation to each of the main topics within CPM. Finally, they propose future research agendas for the major gaps within the existing body of knowledge to facilitate data-driven CPM in the Big Data environment.

Related Work
The IT revolution in the construction industry gave birth to the concept of digital construction [23]. Table 1 shows the related work on literature reviews for digital construction. Related reviews focused on either a certain aspect of construction management or the application of a particular technique. For example, [5,24] and [25] focused on technology-supported supply chain, organization, and energy management, respectively; [26,27] primarily focused on the application of a particular technique (e.g. data mining or artificial intelligence) in the construction domain. Furthermore, two reviews [6,8] focused on the state of the art in the adoption of IT or Big Data in the construction industry. Nevertheless, [6] did not mention the Big Data environment or Big Data analytics but discussed the application prospects of IT in various construction management tasks in the traditional sense. Reference [8] explicitly addressed Big Data in the construction industry, but it mainly focused on using Big Data combined with specific technologies to serve different construction goals and tasks separately.
Unlike previous reviews, our work primarily discusses a comprehensive scenario of data-driven CPM from the angle of the whole project lifecycle within the context of Big Data. The comprehensive scenario of data-driven CPM is powered by AI-based Big Data analytics and a collaborative infrastructure based on state-of-the-art technological innovations. Moreover, the previous reviews mainly focused on the technological change in CPM, whereas our work further explores the changes in managerial process and organizational model that follow from technological change. For example, our work reveals that data-driven technological innovation lays the foundation for organizational and process innovation, and these innovations will further improve the management performance of construction projects.

Review Methodology
The Web of Science (WoS) core collection database contains the most reputable and influential journals, and is therefore recognized as the most authoritative data source for studying publications in most subjects [28]. Although Scopus has a wider range of coverage than WoS, there are significant overlaps between WoS and Scopus [29]. In addition, Science Direct and the American Society of Civil Engineers library have been employed as data sources in recent bibliometric reviews in the research field of construction management (e.g., [5,8,24,26]). Therefore, in order to ensure that the articles are the most representative and the number of samples is manageable, only peer-reviewed journal articles included in the Science Citation Index databases (i.e. WoS, Science Direct, and American Society of Civil Engineers library) were considered in our work. Other types of publications such as book reviews, conference proceeding papers, editorials, and news items were removed from this study. The timeframe was limited to 1999 through 2020 because the technology prototype of Big Data was introduced to the construction industry in 1999 [30].
Furthermore, a search term hypothesis was posed, and a test search was conducted to establish an overview of the topic as the foundation. Initially, a combination of keywords including Big Data (or large data/massive data/data-driven/data mining) and the construction industry (or civil engineering/AEC/construction project and management) was used, but these search key phrases returned relatively limited literature. Strategic planning for Big Data applications in the construction domain remains a largely unknown field, possibly because the technology is still in its infancy. Big-Data-related technology is conceptually related to the key information and communication technologies underlying digital construction [8,23]. Therefore, the existing emerging information/digital technologies were taken as references, similar to most of the related work [6,8,24,27,31].
Emerging information technologies encompass a broad spectrum of computing technologies that can be used to create, store, exchange, and use information in various forms, such as hardware, software, telecommunications, intranet and extranet, wireless connection, and social media [24,32,33]. This broad definition also led to varied search results in the next round. Many articles mentioned emerging information technologies without referring to a specific construction management process, explicitly or implicitly. Instead, they focused on other topics, such as pure technical system development or mathematical algorithms. To refine the search, the scope was narrowed to articles that apply these technologies to AEC management, such as improving organizational information processing capacity or facilitating project goal management. After automatic and manual screening, 96 articles were identified as the review range of this study. The analyzed proposals (see Table 2) were summarized for each reference: the main phase of the construction project, the research topic focused on, the main technology/technique used, and the data handling link mostly involved.

Different Phases of CPM.
The whole lifecycle of construction projects consists of various phases from planning, design, bid/procurement, construction, and operation/maintenance to final demolition and recycling [19,20]. As shown in Figure 1, it is worth noting that just a few articles involve the whole lifecycle of construction projects. Compared to product LCM, which integrates each phase from planning to demolition or recycling, construction project management is traditionally divided into separate phases [127]. The traditional project management model was mainly based on reductionism.
Thus, some innovative project management approaches (e.g. the meta-synthesis pattern of project management) have been put forward in a bid to improve project management performance [128,129]. Nowadays, the emergence of Big-Data-related technologies in the construction domain is more conducive to the meta-synthesis of LCM. Nearly 60 out of 96 articles focus on the construction phase, which indicates that research on Big Data in the construction phase has received great attention. Significant challenges exist in organizational coordination and objective control, which demand advanced digital tools in this phase [4,130]. The numbers of articles on the operation/maintenance and bid/procurement phases rank second and third, respectively. Although it is well recognized that the design phase plays a crucial role in improving project management performance in construction projects [131], relatively few studies have concentrated on this phase so far. Strikingly, the least research has been done on the demolition/recycling phase. In short, far fewer digital tools or methods have been applied in the other phases than in the construction phase.

Technology and Application.
Because of its unstructured traits and changing environment, the construction industry is often slow in introducing advanced technologies compared to other industries. However, with the development of diversified, advanced technologies, a growing number of practitioners and researchers are aware that the application of state-of-the-art technologies can help to improve the efficiency and effectiveness of CPM. A series of IT related to Big Data have been introduced into the field of CPM by both researchers and practitioners. The drivers are to enhance the efficiency of construction production and to acquire a better understanding of business models. The total use frequency of each technology/technique is illustrated in Figure 2. BIM and ML occur most commonly in CPM. Wireless-based networks (WBN) are widely applied as well, ranking third in use frequency. RFID and DM rank fourth. According to the application classification, the IT related to Big Data reported by previous studies falls mainly under the two aspects of BDM and BDA&V. The classification also reflects the fact that most technology applications concentrated on the data analytics and visualization aspects in the engineering context (as shown in Figure 3).

Research Topics.
The subjects of the existing studies have been categorized by a range of studies. Most of the articles can be classified into several groups despite the diverse research topics (see Figure 4). Some articles covered more than one research topic, and each topic was counted separately. As shown in Figure 4, the first group, namely, construction performance management, involves safety management, progress monitoring, cost control, quality inspection, and building energy management. The second group addresses bidding management, supply chain management, resources management, and waste management from the perspective of construction supply chain management and resources optimization. Together, these two groups account for nearly 60% of the total research topics. The research topics also cover technology innovation (i.e., real-time information communication, information integration, and knowledge management), organization innovation (i.e., organization integration and collaborative and relationship management), and process innovation (i.e., asset management, crowdsourcing, and electronic commerce model). In addition, some articles paid attention to other fields such as design support, compliance checking, claim management, emergency response, etc.
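The counting rule described above, in which an article covering several topics contributes to each of them, can be illustrated with a minimal Python sketch. The article identifiers and topic assignments below are hypothetical placeholders, not the data behind Figure 4, and the function name `topic_shares` is introduced here purely for illustration.

```python
from collections import Counter

# Hypothetical article-to-topic assignments (illustrative only, not the paper's data).
article_topics = {
    "A1": ["safety management"],
    "A2": ["progress monitoring", "cost control"],
    "A3": ["supply chain management"],
    "A4": ["knowledge management", "safety management"],
}

def topic_shares(assignments):
    """Return each topic's share of all topic assignments.

    An article with several topics is counted once per topic, so the
    denominator is the number of assignments, not the number of articles.
    """
    counts = Counter(topic for topics in assignments.values() for topic in topics)
    total = sum(counts.values())
    return {topic: n / total for topic, n in counts.items()}

shares = topic_shares(article_topics)
```

Under this rule the shares sum to one over topic assignments, which is why a group's percentage can differ from the fraction of articles that mention it.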

Discussion
Following the statistics and analysis of the existing studies, the particularized proposals in relation to each of the main topics within CPM are discussed.

Data-Driven Management in Various Phases of Construction.
Compared to the whole lifecycle of industrial products, which integrates each phase from planning to demolition or recycling, a construction project is traditionally divided into separate phases [127]. Specifically, the lifecycle of construction projects consists of various phases from planning/design, bid/procurement, construction, and operation/maintenance to final demolition and recycling [19,20]. According to the existing studies, the proposals that apply emerging IT to deal with CPM issues are as follows:

Planning/Design Phase.
In the schematic design phase, via a Big-Data-based ontology inference process and semantic reasoning rules, the process of searching for the corresponding work items can be automated, which helps to overcome the subjective bias introduced by the cost estimator during the design budgetary estimating process [47]. Similarly, the data mining technique provides an effective method for engineering design, e.g., by extracting valuable knowledge from a large amount of heterogeneous data and making it more accessible to engineering designers [101].

Bid/Procurement Phase.
A number of Big-Data-related technologies have the potential to be employed in bid/procurement activities. These include the exploration and exploitation of a tender evaluation decision support system (DSS) based on KDD [71]. Similarly, VR, Web2.0, and cloud computing technologies were employed to determine what should be purchased and how to elaborate the building procurement strategy [72]. In addition, some ML methods (e.g. SVM, decision trees, GA, and ANN) have been used to predict the deviations of duration and cost during the construction bid process [46].
Big-Data-related technologies could enhance the performance of construction supply chain management. The emerging mobile Internet has offered an opportunity to fulfill various participants' demands for sharing real-time information [5]. A few studies focused on the issue of construction materials tracking. An RFID-based materials tracking system can contribute to the efficiency of acquisition and delivery of data involved in material usage as well as storage [74]. Combined with wireless sensors, relevant experiments have been conducted on construction sites [73,75,125]. Some methods were put forward to implement the collection, storage, and retrieval of valid data involved in equipment or materials across the LCM of construction projects, for example, RFID-based quality management [59], unified architecture component and parts construction [58], and logistics management or vertical transportation [49].
Association rule-based data mining approaches can be utilized to explore cause-effect relationships of safety accidents in the construction domain [35,36,39,40]. Integrated advanced technologies (e.g., mobile Internet, WSN, and real-time data transmission) could be used to monitor the safety of crane groups [122,123]. RFID portals and tags have been used to check the safety protection equipment of workers [38]. A real-time location and tracking technology (ZigBee network or WSN) or a computer vision technique (CNN-based) can continually check the location of workers and achieve proactive accident prevention on complex construction sites [37,42,43,45]. These form an integral part of strategies for precaution and accident prevention. The second main stream of data-driven management research in this phase is quality inspection. A large amount of quality inspection data collected from construction components or structures can be processed by a machine-learning-based algorithm [63].
ANN-based performance prediction models have gained a broad range of applications in the construction domain. In particular, these models are instrumental for the fault detection of engineering structures, which helps engineers make appropriate decisions when constructing and rehabilitating structures [56,57]. Another main topic of performance management in the construction domain is schedule management.
Through an image-based reconstruction technique, an as-built building information model can be obtained from scanned point cloud models. An integrated BIM-based AR system could determine whether the construction is taking place in accordance with the planned schedule [50,54,132].
It is well recognized that improper management of resources (e.g. labor resources, materials, equipment) is a main reason for schedule delay and budget overrun in construction projects. Therefore, relevant studies on labor resources arrangement attempted to introduce KDD techniques to discover valuable knowledge from large sets of labor resources data, thereby optimizing resource arrangement practices [53]. Meanwhile, knowledge-based simulation was employed to enhance the reliability of queuing systems during the course of construction projects [124]. Construction fleet operations in megaprojects are particularly complicated. To address this problem, Akhavian and Behzadan [77] provided a knowledge-based simulation model to visualize the process of construction fleet operations through multimodal-process data mining.
New approaches to monitor on-site activities during the construction process have been proposed in previous studies. These include the integration of visualized big data with BIM, and the use of time-lapse photos or video streams [96][97][98]. In order to achieve automated compliance checking, natural language processing can be used to automatically extract concepts as well as their relationships from the text of regulations and contracts [103,104]. Similarly, it has been reported that great potential exists to further apply Big-Data-related technologies to manage claims and contracts. For instance, ML-based approaches have been employed for modeling judicial reasoning and forecasting litigation results [106,107].
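To make the association-rule idea cited above for safety accident analysis more concrete, the following minimal Python sketch computes support and confidence for single-item rules over a handful of accident records. The records, attribute names, thresholds, and function names are all hypothetical and chosen only for illustration; this is not the procedure of any cited study, and it deliberately omits the candidate pruning that a full Apriori-style miner would use.

```python
from itertools import combinations

# Hypothetical accident records: each is a set of observed attributes
# (illustrative only; not taken from any cited dataset).
records = [
    {"working_at_height", "no_harness", "fall"},
    {"working_at_height", "no_harness", "fall"},
    {"working_at_height", "harness", "no_injury"},
    {"crane_operation", "high_wind", "struck_by"},
    {"working_at_height", "no_harness", "fall"},
    {"crane_operation", "no_harness", "struck_by"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Estimate P(consequent | antecedent) as a ratio of supports."""
    base = support(antecedent, transactions)
    joint = support(set(antecedent) | set(consequent), transactions)
    return joint / base if base else 0.0

def mine_rules(transactions, min_support=0.3, min_confidence=0.8):
    """Enumerate single-item rules A -> B that pass both thresholds."""
    items = sorted(set().union(*transactions))
    rules = []
    for a, b in combinations(items, 2):
        for ante, cons in ((a, b), (b, a)):
            s = support({ante, cons}, transactions)
            c = confidence({ante}, {cons}, transactions)
            if s >= min_support and c >= min_confidence:
                rules.append((ante, cons, s, c))
    return rules

rules = mine_rules(records)
```

In this toy data the rule "fall -> no_harness" passes the confidence threshold while the reverse rule does not, which shows that confidence is directional. A high-confidence rule indicates co-occurrence only; interpreting it as cause and effect still requires domain judgment.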

Operation/Maintenance Phase.
There are two main topics in this phase, i.e., quality inspection and energy management. Quality inspection is one of the most important control objectives in both the construction phase and the maintenance phase. ML-based algorithms can be employed to help engineers make proper decisions while maintaining structures. For instance, Cheng et al. [62] demonstrated that GA-based multilevel association rule mining is an effective method to analyze the defects of buildings. Similarly, a system integrated with a PDA and wireless web could collect real-time defect data to achieve the rapid detection of defects in apartment housing projects [60]. Natural disasters often cause significant damage to construction. A robotic platform was demonstrated to efficiently identify defects in post-disaster construction by means of image-based automated 3D crack detection [61,64]. Nowadays, building energy consumption monitoring has become an important topic. Because the real-time building energy monitoring data grow continually, powerful Big Data analysis tools are needed. For example, time series temporal knowledge discovery in big BAS (building automation systems) data was successfully used by Cabrera et al. [68] for building energy management. Meanwhile, with the growing concern about the whole process management of building energy performance, the reduction of building energy consumption should rely on both effective design and efficient operation. Therefore, previous studies have applied BIM together with BMS to support visualization and improve building energy performance [69,70]. Another topic during this phase was emergency response. In order to facilitate disaster response operations, Peña-Mora et al. [108] developed an information-technology-based collaboration framework. Firstly, building-assessment-related information is stored in RFID tags, and then connected with the terminal network through digital devices.
When necessary, web-based meetings can be held to evaluate the disaster event. During the firefighting process, AR-based context-aware access assists in acquiring the precise position [109,110]. In addition, the premovement time can be shortened through an interactive system, which consists of ZigBee-based localization module and individual wearable devices in case of fire events [112].

Recycling/Demolition Phase.
With the rapid development of urbanization, the adverse impact of the bulk of construction waste on the environment has serious implications worldwide. As a result, the desire to proactively tackle construction waste at the source, in design, is also getting stronger. Bilal et al. [79] proposed a Big Data architecture for construction waste analytics, which opens up the possibility of design exploration and optimization to minimize construction waste. In addition, for waste reduction and environmental protection, EIS (Earth information system) was proposed to facilitate the sharing and reuse of surplus resources or the recycling at each construction site [78].

Data-Driven Innovation of CPM.
In addition, as shown in Figure 4, three groups about the technology innovation (i.e. real-time information communication, information integration, and knowledge management), organization innovation (i.e., organization integration and collaborative and relationship management), and process innovation (i.e. asset information delivery, crowdsourcing, and electronic commerce model) were studied, respectively.

Data-Driven Technological Innovation.
Technological innovation has presented significant opportunities to the construction industry. Big-Data-related technologies can capture multisource data and promote the real-time sharing of information across different participants on construction sites [31]. The emergence of ICT and mobile Internet technologies (such as RFID, ad hoc networks, wireless networks, 3D laser scanning, PDA, and so on) facilitates information communication and fusion on complex construction sites [81,83] and improves management performance.
Moreover, combined with VR technology, these technologies make it possible for visualized 3D models of the construction scene to be presented on a PDA [80]. AR technology is recognized as a new generation of VR. It can be used for visualizing and monitoring construction activities in CPM. For instance, Wang et al. [84] combined AR technology with BIM to visualize the on-site situation or construction operations in real time. Besides, novel sensing and tracking technologies are often operated in conjunction with AR [84]. The advent of Big-Data-related technologies is shifting the management paradigm of many industries, e.g., the progressive realization of knowledge management in CPM [41,72], and the introduction of these novel technologies into the construction engineering domain helps to enhance the level of lean construction [126].

Data-Driven Organizational Innovation.
The innovation of technology has led to changes in managerial approaches in the AEC sector. Similarly, Big-Data-related technologies are gradually changing the organization model of CPM. Utilizing Big-Data-related technologies (e.g. cloud computing, VR, web-based, and agent-based) can facilitate the flow of information within the internal microstructure and coordinate the work of each participant [89,90]. Generally, a construction project involves a multidisciplinary team with a wide geographic distribution, while a variety of incompatible management information systems are employed [133].
Thus, it is important to build a digital collaboration platform to share information effectively amongst distributed project stakeholders, as Pena-Mora and Dwivedi [87] and Han et al. [88] proposed.
In addition, some preliminary studies explored BIM and BSNS (business social networking services) federations based on cloud computing so as to enable data sharing not only by individuals or the management team but also by various organizations (enterprises) across the whole project lifecycle [85]. Similarly, aiming to support better decision-making for project stakeholders in the Big Data environment, SOA, as the software architecture, was established to foster information integration and coordination within construction supply chain management [5,92].
Figure 5: A comprehensive scenario of data-driven CPM.
In short, with the application of Big-Data-related technologies, the construction project organization will become flatter and more flexible. The innovation of the organizational model can foster more effective communication, closer collaboration, and faster decisions, all of which are critical to solving the dynamic and fragmented issues of the construction industry.

Data-Driven Process Innovation.
As construction projects are gradually entering the Big Data era, asset information is becoming a deliverable in the AEC sectors. As Morris [134] proposed with "reconstruction project management," digital technologies will change the project delivery model radically. For example, configuration management based on "Big Data analytics" delivers asset information across organizational boundaries. As a result, the asset information often contains the design rationale, sources of components, maintenance schedule, service life, etc. [93]. Indeed, for government-invested infrastructure projects, the UK government had already required that the asset information be delivered through a BIM platform [93].
Meanwhile, the digital camera is reshaping the way construction activities are monitored. Crowdsourcing, one of the novel construction management models, monitors construction activities through jobsite video streams by gathering the intelligence of the crowd [96]. Crowdsourcing often relies on an intuitive web-based platform. It has been reported that crowdsourcing could minimize the time of workface judgment. The significant advantage of this novel management model is that it allows on-site professionals to concentrate on the key node tasks, ultimately improving the performance of CPM [96][97][98].
Similar to e-commerce business, the construction industry has its own integrated electronic commerce management model. For example, Zhang and Tiong [99] established a conceptual model about integrated e-commerce management model for the construction industry. Based on the context of Big Data, construction management innovations of the e-commerce model were explored in various aspects such as interaction between cost control and scheduling process at the project level, interaction between functional divisions and scheduling process at the enterprise level, and interaction between real-time weather information and scheduling process at the industry level.
There is an increasing level of complexity in construction projects. This is especially the case in megaprojects, which are considered giant open complex systems. Big-Data-related technologies can assist the meta-synthesis of management processes, and finally achieve the emergence of meta-intelligence.

Future Research Agendas
The diversification of research topics demonstrates that various emerging IT concepts have been introduced to explore effective ways to achieve the high quality of CPM under the Big Data scenario. Nevertheless, gaps still exist between the research status quo and the actual demands, which present research opportunities and challenges in the future.

Extension to Each Phase of Project Lifecycle.
Most of the existing studies concentrated on the construction phase, while relatively few studies focused on planning/design phase even though this phase plays a key role in improving the performance of project management [131]. In current project practices, due to the diversity of systems and the limits of technique standards, such as BIM-related software, there is still poor interoperability between each other. is is despite the application of BIM being vigorously promoted in the construction industry. In addition to providing the visualization function to designers, the core role of BIM is to create a data integration platform for collaborative design of various majors and participants, in a bid to realize data (information) or knowledge sharing throughout the project lifecycle [135,136]. In particular, in the context of Big Data, the feasibility and necessity of planning and design process integrating with cloud computing should be paid more attention in advance [137]. Most of the time, however, BIM only contains the internal data of the construction industry, up until recently; this phenomenon has been changed due to the emergence of external data related to CPM. For example, some special interfaces enable the link between BIM and third-party databases or open datasets. Consequently, the historical engineering data, energy consumption data, weather variation data, or natural disasters data can be introduced. ese will lead to the integration of internal data with external data in the construction industry through BIM. erefore, Big Data analytics indeed is the key to design innovation and bring potential opportunities to optimize plan and update design.
Furthermore, some studies have attempted to apply Big-Data-related technologies in bid/procurement phase [72], such as KDD, VR, and ML. However, the degree of concern for this topic is still relatively low. For the sake of reducing the procurement cost or choosing the right contractor or supplier, the effective approach is to establish a knowledge management system, which contains materials' real-time price and qualifications of the bidder. In addition, with the emergence of IoTs, the conventional Internet is not able to cope with this new paradigm that links the physical world to the digital virtual world and to realize the real-time communication and information sharing between different participants in the construction supply chain [5]. But, mobile Internet gradually would play a crucial role in supply chain management in digital information time, especially for complex construction projects. erefore, the expanding scale of construction market urgently requires the relevant technologies combined with Big Data analytics to improve the responsiveness.
For the operation/maintenance phase, existing studies mainly concentrate on the energy management and quality inspection of buildings. There are both challenges and opportunities in using various sensor networks for emergency response [138]. Some innovative technologies have been adopted in disaster response or rescue management, e.g., RFID, WLAN, AR, and ZigBee-based technologies [109,139]. Nevertheless, future work still needs to focus on how to improve the responsiveness and accuracy of emergency response in the complex internal space of buildings. Possible directions include combining BIM with GIS data for disaster response in buildings and integrating GPS data with context-aware technology to analyze the rescuer's situation and support rescue decision-making. More effort is required to eventually build a Big Data platform through the integration of these technologies, so that digital information can assist post-disaster emergency response and rescue operations.
Moreover, it should be noted that there are few studies on the application of emerging IT in the recycling phase. How to minimize construction waste is a critical issue in the LCM of construction projects. Previous studies on waste management in construction projects mainly concerned environmental and techno-economic factors, as well as the impacts of resource policies [140][141][142]. Furthermore, existing waste analysis methods are restricted to static project attributes. However, design optimization can minimize waste and will be a future research trend. In particular, by discovering implicit interactions between the design stage and other stages, Big Data analytics would shift waste management approaches from passive governance to proactive analysis. Therefore, further investigation is warranted into how Big-Data-related technologies could be employed to consider dynamic project attributes, thereby minimizing construction waste.

Strengthening Big Data Management in CPM.
The application of emerging IT is increasingly popular in the construction industry, but it does not yet fully deliver the desired data-driven CPM when faced with massive engineering data or information [6]. This situation arises largely because most existing studies and practices concentrate on the data analytics and visualization aspects. By contrast, research on data management tasks (i.e., generation, acquisition, storage, processing, and security) in the engineering context is lacking. As the basis of data analytics, data management capability should be strengthened.
Data acquisition generally involves data collection and preprocessing [16], which are relatively simple in the manufacturing sector. Smart devices such as sensors and RFID tags can completely and accurately capture and transmit the Big Data related to the product lifecycle. Moreover, lower data noise, standardized data formats, and integrated enterprise information systems facilitate data processing and information sharing [15]. By contrast, data acquisition in the construction domain still faces numerous difficulties, such as manual data entry, dynamic and static occlusions, and the lack of a uniform data format [54,143]. All these issues result in poor interoperability and compatibility. Therefore, future studies and practices should pay attention to these areas, for example, by using text mining techniques to automate information extraction from documents and by developing a common standard format for data exchange between heterogeneous modeling software.
According to the analysis of existing research, only a small amount of big Earth data (big geospatial data) is collected in the current construction process. Big Earth data cover not only the fields of atmosphere, geography, geology, environment, and ecology but also information science, space science, cognitive science, and fields closely related to the humanities and social sciences [144]. Analyzing such massive data is expected to help describe, analyze, model, and predict the dynamic processes of the Earth system as well as the interaction between humans and the planet [144,145]. From the perspective of smart cities, BIM currently represents a relatively small part (a narrow building-level view) of the wider environmental context. External data such as atmosphere, geography, geology, environment, and ecology also need to be embedded into the BIM platform to support management and decision-making across the whole lifecycle of the construction project. Moreover, in the face of massive-volume and rapidly streaming big Earth data, cloud computing can provide critical support. It supports the processing of Big Data to address the 4Vs and obtain value for better decision support, research, and operations for various geospatial data [145]. Utilizing cloud computing to address big geospatial data challenges also applies to the construction industry. The complexity of CPM practices and the various digital information tools involved often lead to messy, multisource heterogeneous data. Data preprocessing is an important prerequisite for taking advantage of acquired data [16]. Therefore, the data must be cleaned, integrated, and transformed through various tools and technologies, such as Apache Hadoop, NoSQL databases, and MapReduce, to extract meaningful information from Big Data [146]. After the extract, transform, and load (ETL) process, the data become applicable to data mining and various online analytics [16].
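The clean, integrate, and transform steps described above can be sketched at small scale as follows. This is a minimal illustration in plain Python rather than on Hadoop or MapReduce; the two data sources, their field names, and the sample values are hypothetical and are not taken from the reviewed studies.

```python
from datetime import datetime

# Hypothetical raw readings from two site data sources with different formats.
RAW_SENSOR = [
    {"id": "S1", "ts": "2020-03-01T08:00:00", "temp_c": "21.5"},
    {"id": "S1", "ts": "2020-03-01T08:00:00", "temp_c": "21.5"},  # exact duplicate
    {"id": "S2", "ts": "2020-03-01T08:05:00", "temp_c": ""},      # missing value
]
RAW_MANUAL = [
    {"sensor": "S3", "time": "01/03/2020 08:10", "temp_f": "71.6"},
]


def clean(records):
    """Drop records with empty fields and exact duplicates."""
    seen, kept = set(), []
    for record in records:
        key = tuple(sorted(record.items()))
        if key in seen or any(value == "" for value in record.values()):
            continue
        seen.add(key)
        kept.append(record)
    return kept


def transform_sensor(record):
    """Map an automated sensor record onto the common schema."""
    return {
        "sensor_id": record["id"].lower(),
        "time": datetime.fromisoformat(record["ts"]),
        "temp_c": float(record["temp_c"]),
    }


def transform_manual(record):
    """Map a manually entered record (Fahrenheit, day-first date) onto the common schema."""
    return {
        "sensor_id": record["sensor"].lower(),
        "time": datetime.strptime(record["time"], "%d/%m/%Y %H:%M"),
        "temp_c": round((float(record["temp_f"]) - 32) * 5 / 9, 2),
    }


def integrate(*sources):
    """Merge already-transformed sources into one time-ordered list."""
    merged = [row for source in sources for row in source]
    return sorted(merged, key=lambda row: row["time"])


def etl():
    """Run clean -> transform -> integrate over both hypothetical sources."""
    sensor = [transform_sensor(r) for r in clean(RAW_SENSOR)]
    manual = [transform_manual(r) for r in clean(RAW_MANUAL)]
    return integrate(sensor, manual)
```

In a production setting the same three stages would typically run on a distributed platform; the sketch only shows why a common schema (here `sensor_id`, `time`, `temp_c`) is needed before heterogeneous records can be analyzed together.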
In addition, the storage and processing of Big Data present significant challenges as CPM becomes increasingly complex. Part of the multisource heterogeneous data will be processed in real time by digital processing equipment. By contrast, the large volume of non-real-time data will be stored in raw form for further analysis [15]. With the increasing complexity of CPM, the total amount of relevant data has grown rapidly. This growth requires data processors with stronger computing ability and data storage devices with larger storage space. Although on-site automated tracking techniques have been explored [52,133], obstacles remain in data processing, such as the lack of data fusion across various sources. In addition, data query is another issue in the Big Data era [147][148][149]. Facing a huge amount of data, traditional databases lose flexibility, especially when users attempt ad hoc queries in information management systems. Therefore, to make stored data easier to access, online analytical processing and data warehouses should be further explored and exploited. In addition, there are great potential and challenges in exploring friendlier solutions such as natural language queries in the construction domain, for example, the extraction of ontologies from the text of construction contractual documents using natural language processing.
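To make the idea of online analytical processing concrete, the following minimal sketch shows two typical ad hoc operations, slice and roll-up, over a small in-memory fact table. The table, its dimensions (`project`, `phase`, `material`), and the cost figures are hypothetical; a real data warehouse would run equivalent queries over far larger, indexed storage.

```python
from collections import defaultdict

# Hypothetical fact table: one row per recorded material cost.
FACTS = [
    {"project": "P1", "phase": "construction", "material": "steel", "cost": 120.0},
    {"project": "P1", "phase": "construction", "material": "concrete", "cost": 80.0},
    {"project": "P1", "phase": "maintenance", "material": "steel", "cost": 50.0},
    {"project": "P2", "phase": "construction", "material": "steel", "cost": 90.0},
]


def slice_facts(facts, **conditions):
    """Keep only rows whose dimension values match every given condition."""
    return [
        row for row in facts
        if all(row[dim] == value for dim, value in conditions.items())
    ]


def roll_up(facts, dims, measure="cost"):
    """Sum the measure over all rows, grouped by the chosen dimensions."""
    totals = defaultdict(float)
    for row in facts:
        key = tuple(row[dim] for dim in dims)
        totals[key] += row[measure]
    return dict(totals)
```

For example, `roll_up(FACTS, ["project"])` aggregates cost per project, while `roll_up(slice_facts(FACTS, material="steel"), ["phase"])` answers an ad hoc question about steel cost per phase without any predefined report.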

Implementation of Comprehensive Data-Driven CPM.
The traditional project management model is mainly based on reductionism. Thus, several innovative project management approaches (e.g., the meta-synthesis pattern of project management) have been put forward to improve project management performance [128,129]. Nowadays, the emergence of novel IT and data analytic methods in the construction domain is more conducive to the LCM meta-synthesis of a project. Embracing and fusing technology innovations appropriately to create the next-generation IT platform for the AEC industry could usher in long-overdue changes to the industrial landscape.
Consequently, according to the status quo of the research and potential innovation trends, a comprehensive data-driven CPM scenario should be proposed as a response to the fast-growing practical demands on digital construction. The path to achieve this scenario within the context of Big Data is shown in Figure 5. Data-driven technological innovation lays the foundation for organizational and process innovation, and these innovations will further improve the management performance of construction projects. Specifically, this data-driven CPM scenario has the following features:
(i) Containing all stakeholders of the AEC industry, such as owners, architects, designers, contractors, project managers, and public officers
(ii) Achieving automated, model-based visual analytics for decision-making in all aspects of performance management, such as progress monitoring, construction quality and safety, and building and environmental sustainability
(iii) Allowing secure, ubiquitous information retrieval as well as visualized, intuitive remote operations for performing management tasks
(iv) Covering the whole project lifecycle, including design, development, construction, operation, maintenance, repair, and retrofit of building physical infrastructure systems
(v) Forming an effective, transparent multi-stakeholder collaboration environment that minimizes conflicts among different parties and achieves maximum efficiency in planning coordination and contract execution
(vi) Integrating with the broader smart city framework and serving as a pillar to improve the urbanization quality of life from perspectives such as infrastructure availability, resiliency, and sustainability
Advanced BIM, IoT, computer vision, deep learning, cloud computing, AR, and blockchain technologies show potential in dealing with several important research challenges outlined in this proposal.

Visualization of the Digital Twin Data by Using BIM.
Building- and infrastructure-related information can be directly or indirectly integrated with available digital technologies in a BIM-enabled environment [150]. While the BIM paradigm was introduced to improve collaboration during design and construction, it quickly became involved in adjacent research areas across the built environment lifecycle at the building, infrastructure, and city levels. BIM now faces considerable challenges in automation and in the inclusion of wider environmental contexts because it lacks semantic completeness in areas beyond the scope of buildings, such as control systems (including sensor networks), social systems, and urban artifacts [150]. Thus, a holistic, scalable semantic approach that factors in dynamic data at different levels is required. The combination of Big Data, IoT, and AI is heralded as a potential solution.
Studies related to smart cities or the AEC industry consider BIM, as part of cyber-physical systems or the digital twin (DT), to be an important source of data [151]. DT integrates AI and data analytics to create dynamic digital models that can learn and update the status of the physical counterpart from multiple sources [151]. Therefore, the Big-Data-based LCM scenario is also accompanied by the concept of DT for restoring the complex construction environment, interactive design, simulated construction, and smart asset management.

Model Analytics Based on Full 3D Geometry.
Deep learning, and especially supervised convolutional neural networks (CNNs), is now the state of the art for object detection, image classification and retrieval, semantic segmentation, human pose estimation, and many other applications [152]. In the deep learning era, object classification, recognition, and retrieval in the visual analysis of 3D models could be based directly on full 3D geometry information, resulting in a more natural manner of interacting with the 3D world [153]. Therefore, with the continued fast development of computer processing power and deep learning algorithms, direct 3D object analytic methods could bring a new paradigm to the analysis of as-built and as-planned building models.

Localization and Context Awareness for Visual Analytics.
Another area on which deep learning techniques may shed new light is the common localization and context awareness challenge, which is the key to as-built visual data analytics and AR interactions. Promising works in this area demonstrate potential. For example, PoseNet [154] developed a deep-learning-based approach to metric localization by training a CNN to regress a camera pose. In the construction domain, state-of-the-art deep-learning-based localization and object recognition mechanisms, along with a priori geometric information from as-planned BIM and GIS, can be used to advance the automatic as-built 3D model building and visual analytic process.
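As a hedged illustration of what regressing a camera pose involves, the training objective of PoseNet-style networks is commonly written as a weighted sum of a position error and an orientation error. The symbols below are not defined in this review and are introduced here only for illustration: x and q denote the ground-truth camera position and orientation quaternion for an image I, hatted symbols denote the CNN's predictions, and beta is a scale factor that balances the two terms.

```latex
\mathcal{L}(I) \;=\; \left\lVert \hat{\mathbf{x}} - \mathbf{x} \right\rVert_2
\;+\; \beta \left\lVert \hat{\mathbf{q}} - \frac{\mathbf{q}}{\lVert \mathbf{q} \rVert} \right\rVert_2
```

The first term penalizes the distance between the predicted and true camera positions, and the second penalizes the deviation of the predicted orientation from the normalized true quaternion; the appropriate value of beta depends on the scale of the scene.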

Blockchain-Based Collaborative Infrastructure.
Blockchain technology has already been described as the latest disruptive technology for many industries. Blockchain's decentralization, transparency, immutability, and smart contract features can resolve many conflicts and coordination issues faced by these industries [155]. A typical project in the construction industry can consist of hundreds of processes, participants, and products, which makes communication discrepancies, multiparty conflicts, change orders, and resource and schedule overruns the norm. On the construction site, blockchain can be used to improve the reliability and trustworthiness of construction logbooks and of the records of works performed and material quantities. In the facility maintenance phase, blockchain's main potential is the secure storage of privacy-sensitive sensor data. In addition, multiparty smart contracts can be applied to guarantee payments for performance upon construction compliance [156]. When integrated with BIM, the process can leverage sensor-based monitoring or visual data analytics to confirm the completion of construction and engineering components, upon which payment can be made automatically.
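The immutability property that makes a blockchain-backed logbook trustworthy can be illustrated with a minimal hash chain. The sketch below covers only the tamper-evidence mechanism on a single machine; it deliberately omits decentralization, consensus, and smart contracts, and the logbook fields and function names are hypothetical.

```python
import hashlib
import json

GENESIS_HASH = "0" * 64  # placeholder "previous hash" for the first entry


def entry_hash(prev_hash, payload):
    """Hash an entry's payload together with the hash of the previous entry."""
    body = json.dumps({"prev": prev_hash, "payload": payload}, sort_keys=True)
    return hashlib.sha256(body.encode("utf-8")).hexdigest()


def append_entry(chain, payload):
    """Append a logbook entry that is cryptographically linked to its predecessor."""
    prev_hash = chain[-1]["hash"] if chain else GENESIS_HASH
    chain.append({
        "prev": prev_hash,
        "payload": payload,
        "hash": entry_hash(prev_hash, payload),
    })
    return chain


def verify(chain):
    """Return True only if no entry was altered, removed, or reordered."""
    prev_hash = GENESIS_HASH
    for entry in chain:
        if entry["prev"] != prev_hash:
            return False
        if entry["hash"] != entry_hash(prev_hash, entry["payload"]):
            return False
        prev_hash = entry["hash"]
    return True
```

Because each entry's hash depends on the previous one, changing a recorded material quantity after the fact invalidates every later entry, which is why `verify` fails on a modified chain. In a real deployment, copies of the chain held by multiple parties would make such tampering detectable by all of them.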

Conclusions
This study primarily aims to review and categorize data-related developments in CPM and to forecast future research challenges and opportunities. In doing so, it proposes a path to achieve a comprehensive data-driven CPM scenario based on technological, process, and organizational innovations within the context of Big Data. The review provides detailed insights to help understand the general trends in CPM phases, technologies and applications, and research topics. The major findings from this substantial review are as follows: (1) Most researchers focus on the construction phase; far fewer digital tools or methods are used in other phases than in the construction phase. (2) BIM and ML techniques occur most frequently in CPM, which also reflects that most existing studies concentrate on the data analytics and visualization aspects in the engineering context. (3) More than half of the research focuses on two topics, namely, construction performance management and construction supply chain management and resource optimization.
Following the statistics of the existing studies, specific proposals in relation to each of the main topics within CPM are discussed. Furthermore, corresponding future research agendas are proposed according to the advances and limitations of the current literature to bridge the gaps between theoretical research and practical demands: (1) extension to each phase in the whole project lifecycle, (2) strengthening Big Data management in CPM, and (3) implementation of comprehensive data-driven CPM. The path for achieving a data-driven CPM scenario within the context of Big Data is as follows: (1) visualization of DT data in CPM by using BIM, (2) model analytics based on full 3D geometry, (3) localization and context awareness for visual analytics, and (4) blockchain-based collaborative infrastructure.
This study reveals potential areas in which Big-Data-related technologies can remarkably advance the state of the art in the AEC industry. The results can help AEC practitioners better understand how to encourage and manage the diffusion of emerging technologies in CPM. In addition, this study demonstrates that the integration of technology, process, and organization is needed to realize data-driven CPM. This result can help practitioners better understand how to achieve the performance goals of construction projects in the context of Big Data.
Not all aspects of Big Data or CPM fall within the scope of this review, owing to the design of the research methodology, which leaves several threads around the subject unexplored. In addition, topics such as programming models and mathematical algorithms, which will probably remain main issues for years to come, are not addressed here. A follow-up study is needed to improve the research by providing a more specialized review of frameworks and programming models/algorithms for BDA, extending the information about their features, advantages, and limitations.

Data Availability
The data used to support the findings of this study are included within the article.

Conflicts of Interest
The authors declare that they have no conflicts of interest.