Radiomics dataset from chest CT of clinically healthy adults

[thumbnail of 1-s2.0-S2352340926001095-main.pdf]
Text
· Restricted to Repository staff only
· The Copyright of this document has not been checked yet. This may affect its availability.
· Available under License Creative Commons Attribution.

Please see our End User Agreement.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Bedei, V., Ostrovskyy, M., Dey, N., Kotyk, T. and Sherratt, R. S. ORCID: https://orcid.org/0000-0001-7899-4445 (2026) Radiomics dataset from chest CT of clinically healthy adults. Data In Brief, 65. 112556. ISSN 2352-3409 doi: 10.1016/j.dib.2026.112556

Abstract/Summary

This data note describes a structured dataset of lung radiomic features derived from thoracic noncontrast computed tomography examinations of 100 subjects (47 males, 53 females; aged 15–74 years). Participants were selected on the basis of the absence of known lung, pleura, and mediastinum diseases in clinical records and radiology reports, as well as systemic diseases affecting the respiratory system. The included computed tomography studies were performed on a single multidetector CT scanner (Siemens Healthineers SOMATOM go. Now), using a uniform protocol (110 kVp; reconstructed slice thickness 0.8 mm; Br60-type lung kernel). For each case, the target thin-slice DICOM series was converted to the NIfTI format. The lung lobes (“raw” masks), vessels and air pathways were segmented automatically with TotalSegmentator. In addition to “raw” lobe masks, vessel/airway-subtracted (parenchyma) masks were generated. Lobe masks (left lung - 2, right lung - 3) were also combined into the left lung, right lung, and both lungs, resulting in eight ROIs per subject for each mask type—with (“raw”) and without vessel/air pathways (“parenchyma”). For each SubjectID×ROI, radiomic features (107 “original” features—shape, first-order, and texture families) were extracted via a PyRadiomics-based pipeline with fixed settings (B-spline interpolation; resampling to 1×1×1 mm; bin width 25 HU; absolute resegmentation) in two attenuation ranges: −1000 to +200 HU and −950 to 0 HU. The dataset is distributed as (i) a CT protocol table, (ii-iii) two feature tables (“raw” and parenchyma masks), (iv) a JSON file with a computational environment description, (v) a Python extraction script, and (vi) a dictionary file. This dataset can serve as a normative reference for lung radiomics, a benchmark for harmonization and robustness studies, and a control cohort for comparative modelling in diffuse lung diseases, as well as region-specific diseases requiring lobar or single-lung-specific radiomic features (such as emphysema, chronic obstructive pulmonary disease, Swyer–James–MacLeod syndrome, asbestosis, and silicosis).

Altmetric Badge

Dimensions Badge

Item Type Article
URI https://centaur.reading.ac.uk/id/eprint/128486
Identification Number/DOI 10.1016/j.dib.2026.112556
Refereed Yes
Divisions Life Sciences > School of Biological Sciences > Biomedical Sciences
Life Sciences > School of Biological Sciences > Department of Bio-Engineering
Publisher Elsevier
Download/View statistics View download statistics for this item

University Staff: Request a correction | Centaur Editors: Update this record