Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/48356
Title: Extracting structured data from unstructured breast imaging reports with transformer-based models
Authors: CARRILERO MARDONES, Mikel 
Perez-Martin, Jorge
Diez, Francisco Javier
BERMEJO DELGADO, Inigo 
Issue Date: 2026
Publisher: FRONTIERS MEDIA SA
Source: Frontiers in digital health, 7 (Art N° 1718330)
Abstract: Background and objective Structured clinical data is essential for research and informed decision-making, yet medical reports are frequently stored as unstructured free text. This study compared the performance of BERT-based and generative language models in converting unstructured breast imaging reports into structured, tabular data suitable for clinical and research applications.Methods A dataset of 286 anonymised breast imaging reports in Spanish was translated into English and used to evaluate five transformer-based models pre-trained in medical data: BlueBERT, BioBERT, BioMedBERT, BioGPT and ClinicalT5. Two natural language processing approaches were explored: classification of 19 categorical variables (e.g. diagnostic technique, report type, family history, BI-RADS category, tumour shape and margin) and extractive question answering of four entities (patient age, patient history, parenchymal distortion or asymmetries, and tumour size). Multiple fine-tuning strategies and input configurations were tested for each model, and performance was evaluated using accuracy and macro F1 scores.Results BioGPT demonstrated the best performance in classification tasks, achieving an overall accuracy of 96.10% and a macro F1 score of 90.30%. This was significantly better than BERT-based models (p=0.012 for accuracy and p=0.017 for F1), particularly in underrepresented categories such as tumour descriptors. In extractive question answering tasks, BioGPT achieved an average accuracy of 93.24%, which is slightly lower than that of BioMedBERT and ClinicalT5, but not significantly so. Notably, BioGPT could perform classification and extractive question answering simultaneously, which is a capability unavailable in BERT-like models.Conclusions Generative models, particularly BioGPT, offer a robust and scalable approach to automating the extraction of structured information from unstructured breast imaging reports. Their superior performance, combined with their ability to handle multiple tasks concurrently, highlights their potential to reduce the manual effort required for clinical data curation and to enable the efficient integration of imaging data into research and clinical workflows.
Notes: Carrilero-Mardones, M (corresponding author), Univ Nacl Educ Distancia UNED, Dept Artificial Intelligence, Madrid, Spain.
mcarrilero@dia.uned.es
Keywords: BI-RADS;BI-RADS;breast cancer;breast cancer;generative models;generative models;BERT models;BERT models;breast imaging;breast imaging;classification;classification;extractive question answering;extractive question answering;structured reporting TYPE Original Research PUBLISHED;structured reporting
Document URI: http://hdl.handle.net/1942/48356
e-ISSN: 2673-253X
DOI: 10.3389/fdgth.2025.1718330
ISI #: 001667593700001
Rights: 2026 Carrilero-Mardones, Pérez-Martín, Díez and Bermejo Delgado. This is an openaccess article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
Category: A1
Type: Journal Contribution
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
fdgth-07-1718330.pdfPublished version925.69 kBAdobe PDFView/Open
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.