A method to enable automatic extraction of cost and quantity data from hierarchical construction information documents to enable rapid digital comparison and analysis

Adanza Dopazo, Daniel; Mahdjoubi, Lamine; Gething, Bill


Authors

Daniel Adanza Dopazo

Lamine Mahdjoubi Lamine.Mahdjoubi@uwe.ac.uk
Professor in Info. & Communication & Tech.



Abstract

Context: Despite the effort invested in developing standards for structuring construction costs, and despite strong interest in the field, most construction companies still gather and process cost data manually. Manual processing leads to inconsistencies, differing classification criteria, and misclassifications, and it is very time-consuming, particularly in large projects. Additionally, the lack of standardization makes cost estimation and comparison very difficult.

Objective: The aim of this work was to create a method to extract and organize construction cost and quantity data into a consistent format and structure to enable rapid and reliable digital comparison of the content.

Methods: The approach consisted of a two-step method. First, the system applied data mining to review the input document and determine how it was structured, based on the position, format, sequence, and content of descriptive and quantitative data. Second, the extracted data were processed and classified using a combination of data science and expert knowledge to fit a common format.

Results: A large variety of information from real historical projects was successfully extracted and processed into a common format with 97.5% accuracy, using a subset of 5770 assets located in 18 different files, building a solid base for analysis and comparison.

Conclusions: A robust and accurate method was developed for extracting hierarchical project cost data to a common machine-readable format to enable rapid and reliable comparison and benchmarking.
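To make the two-step idea in the Methods summary more concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes a hypothetical five-column row layout (code, description, quantity, unit, rate), infers the hierarchy only from a dotted numbering code such as `1.1.1` and from whether a row carries numeric data, and then emits every priced row in one common record type. The names `CostItem`, `extract_items`, and `to_number` are invented for this example; the published method draws on richer signals (position, format, sequence, and content) and on expert knowledge for classification.

```python
import re
from dataclasses import dataclass

# Assumption: hierarchy levels are encoded as dotted numbers, e.g. "2.1.3".
CODE_RE = re.compile(r"^\d+(?:\.\d+)*$")


@dataclass
class CostItem:
    """Hypothetical common format for one priced line of a cost document."""
    path: tuple          # heading descriptions from the top level down
    description: str
    quantity: float
    unit: str
    rate: float

    @property
    def total(self) -> float:
        return self.quantity * self.rate


def to_number(cell: str):
    """Return the cell as a float, or None if it is not numeric."""
    try:
        return float(cell.replace(",", ""))
    except ValueError:
        return None


def extract_items(rows):
    """Step 1: infer the hierarchy. Step 2: emit records in a common format."""
    headings = {}  # depth -> heading description currently in force
    items = []
    for code, description, quantity, unit, rate in rows:
        code = code.strip()
        if not CODE_RE.match(code):
            continue  # rows without a hierarchical code (e.g. subtotals) are skipped
        depth = code.count(".") + 1
        qty, unit_rate = to_number(quantity), to_number(rate)
        if qty is None or unit_rate is None:
            # No quantitative data: treat the row as a heading at this depth
            # and discard any deeper headings left over from a previous branch.
            headings[depth] = description.strip()
            for deeper in [d for d in headings if d > depth]:
                del headings[deeper]
        else:
            path = tuple(headings[d] for d in sorted(headings) if d < depth)
            items.append(CostItem(path, description.strip(), qty, unit.strip(), unit_rate))
    return items


# Made-up sample rows, for illustration only.
rows = [
    ("1", "Substructure", "", "", ""),
    ("1.1", "Foundations", "", "", ""),
    ("1.1.1", "Concrete strip footing", "12.5", "m3", "140"),
    ("2", "Superstructure", "", "", ""),
    ("2.1", "Brick wall", "1,200", "m2", "55.5"),
    ("", "Subtotal", "", "", "68350"),
]
items = extract_items(rows)
```

Because every extracted record shares the same fields and carries its full heading path, items from differently laid-out source files can be compared or aggregated directly once they pass through a step like this.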

Journal Article Type: Article
Acceptance Date: Sep 6, 2023
Online Publication Date: Sep 8, 2023
Publication Date: Sep 8, 2023
Deposit Date: Sep 20, 2023
Publicly Available Date: Sep 22, 2023
Journal: Buildings
Electronic ISSN: 2075-5309
Publisher: MDPI
Peer Reviewed: Peer Reviewed
Volume: 13
Issue: 9
Article Number: 2286
DOI: https://doi.org/10.3390/buildings13092286
Keywords: Building and Construction; Civil and Structural Engineering; Architecture
Public URL: https://uwe-repository.worktribe.com/output/11129303
Publisher URL: https://www.mdpi.com/2075-5309/13/9/2286
