Madenli, O., Atasoy, G. and Dikmen, I.
ORCID: https://orcid.org/0000-0002-6988-7557
(2025)
Identification and categorization of defects in construction specifications utilizing natural language processing.
Journal of Construction Engineering and Management.
ISSN 0733-9364
(In Press)
Abstract/Summary
Defective specification statements cause not only a faulty outcome but also disputes among project stakeholders, claims for project budget and time, project disruptions, and even litigation. Identifying defects in technical sections of construction specifications is challenging. This research aims to develop a structured defect framework and implement supervised natural language processing methods for identifying and categorizing defects in specifications. The dataset includes 175 specifications related to 21 different architectural works collected from 16 construction projects. Eight Machine Learning (ML) models, ranging from shallow to transformer-based, were trained and tested with combinations of different text representation techniques. Subsequently, a study with a GenAI tool, ChatGPT-4o, was conducted. Pre-trained RoBERTa model outperformed the recognition of defects in construction specifications with a macro F1 score of 91.2% and 98% accuracy. This research offers a data-driven methodology with practical tools to enhance the quality of specifications and decrease disputes by reducing the defective specification statements during design, bidding, and pre-construction.
| Item Type | Article |
| URI | https://centaur.reading.ac.uk/id/eprint/127063 |
| Refereed | Yes |
| Divisions | Science > School of the Built Environment > Construction Management and Engineering |
| Publisher | American Society of Civil Engineers |
| Download/View statistics | View download statistics for this item |
University Staff: Request a correction | Centaur Editors: Update this record
Download
Download