CrackVision12K

DOI

We present the CrackVision12k dataset, a collection of 12,000 crack images derived from 13 publicly available crack datasets. The individual datasets were too small to effectively train a deep learning model. Moreover, the masks in each dataset were annotated using different standards, so unifying the annotations was necessary. To achieve this, we applied various image processing techniques to each dataset to create masks that follow a consistent standard.Crack datasets inherently suffer from class imbalance. To mitigate this issue, we selected images containing crack pixels of more than 5000 pixels and applied data augmentation techniques such as Gaussian noise and rotation. Finally, there is a corresponding refined ground truth for each crack image across the dataset to ensure uniformity and reliability.The 13 datasets we combined are as follows: Aigle-RN, ESAR, LCMS, CRACK500, CrackLS315, CRKWH100, CrackTree260, DeepCrack, GAPS384, Masonry, Stone331, CFD, and SDNet2018.Paper & Code: https://github.com/junegoo94/Hybrid-SegmentorCitation:@misc{goo2024hybridsegmentorhybridapproachautomated,title={Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure},author={June Moh Goo and Xenios Milidonis and Alessandro Artusi and Jan Boehm and Carlo Ciliberto},year={2024},eprint={2409.02866},archivePrefix={arXiv},primaryClass={cs.CV},url={https://arxiv.org/abs/2409.02866},}

Identifier
DOI https://doi.org/10.5522/04/26946472.v1
Related Identifier HasPart https://ndownloader.figshare.com/files/49023628
Metadata Access https://api.figshare.com/v2/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:figshare.com:article/26946472
Provenance
Creator Goo, June Moh; Milidonis, Xenios ORCID logo; Artusi, Alessandro (ORCID: 0000-0002-4502-663x); Boehm, Jan; Ciliberto, Carlo
Publisher University College London UCL
Contributor Figshare
Publication Year 2024
Rights https://creativecommons.org/publicdomain/zero/1.0/
OpenAccess true
Contact researchdatarepository(at)ucl.ac.uk
Representation
Language English
Resource Type Dataset
Discipline Construction Engineering and Architecture; Engineering; Engineering Sciences