Enhance the Concrete Crack Classification Based on a Novel Multi-Stage YOLOV10-ViT Framework

2024

Enhancing Concrete Crack Detection with YOLOV10-ViT Framework

Sample size: 12000 publication 10 minutes Evidence: high

Author Information

Author(s): Mayya Ali Mahmoud, Alkayem Nizar Faisal

Primary Institution: Nanjing University of Posts and Telecommunications, China

Can a multi-stage deep learning framework improve the detection and classification of concrete cracks?

The proposed multi-stage YOLOV10-ViT model significantly enhances the accuracy of concrete crack classification compared to individual models.

The YOLOV10 model achieved a precision of 90.67%, recall of 90.03%, and F1-score of 90.34%.
The multi-stage model outperformed the individual ViT model by 10.9%, 19.99%, and 19.2% for precision, recall, and F1-score, respectively.
The ViT model reached a validation accuracy of 99.29% after training.

This study created a smart system that helps find and identify cracks in concrete, making buildings safer.

The study used a multi-stage framework combining YOLOV10 for detection and ViT for classification of concrete cracks.

The model may misclassify certain crack types due to resizing issues between detection and classification stages.

Access the complete publication on the publisher's website