An ultrasonography of thyroid nodules dataset with pathological diagnosis annotation for deep learning.
Scientific data
Ultrasonography (US) of thyroid nodules is often time consuming and may be inconsistent between observers, with a low positivity rate for malignancy in biopsies. Even after determining the ultrasound Thyroid Imaging Reporting and Data System (TIRADS) stage, Fine needle aspiration biopsy (FNAB) is still required to obtain a definitive diagnosis. Although various deep learning methods were developed in medical field, they tend to be trained using TI-RADS reports as image labels. Here, we present a large US dataset with pathological diagnosis annotation for each case, designed for developing deep learning algorithms to directly infer histological status from thyroid ultrasound images. The dataset was collected from two retrospective cohorts, which consists of 8508 US images from 842 cases. Additionally, we explained three deep learning models used as validation examples using this dataset.
10.1038/s41597-024-04156-5
The added value of including thyroid nodule features into large language models for automatic ACR TI-RADS classification based on ultrasound reports.
Japanese journal of radiology
OBJECTIVE:The ACR Thyroid Imaging, Reporting, and Data System (TI-RADS) uses a score based on ultrasound (US) imaging to stratify the risk of nodule malignancy and recommend appropriate follow-up. This study aims to analyze US reports and explore how Natural Language Processing (NLP) leveraging Transformers models can classify ACR TI-RADS from text reports using the description of thyroid nodule features. MATERIALS AND METHODS:This retrospective study evaluated 16,847 thyroid-free text reports from our institution. An automated system, followed by manual review by a radiologist, established baseline annotations by assigning ACR TI-RADS categories from 1 to 5. Two types of systems were evaluated and compared in the dataset. The first by performing a multiclass classification to detect the associated ACR TI-RADS, and the second by extracting thyroid nodule features from the textual reports and incorporating them into the classifier. RESULTS:Our study showed that models enhanced with specific features systematically outperformed those without. Particularly, the BERTIN model, to which additional features were added, achieved the highest level of accuracy, with a score of 0.8426. Moreover, we found a correlation between the presence of punctate echogenic foci, a feature often linked to malignant thyroid lesions, and increased ACR TI-RADS scores. CONCLUSIONS:The features of the thyroid nodules described in thyroid US reports, such as composition, echogenicity, shape, margin or echogenic foci, help the NLP classifier to predict the associated ACR TI-RADS most accurately.
10.1007/s11604-024-01707-z
Nomogram for predicting cervical lymph node metastasis of papillary thyroid carcinoma using deep learning-based super-resolution ultrasound image.
Discover oncology
OBJECTIVES:To investigate the feasibility and effectiveness of a deep learning (DL) super-resolution (SR) ultrasound image reconstruction model for predicting cervical lymph node status in patients with papillary thyroid carcinoma(PTC). METHODS:In this retrospective study, researchers recruited 544 patients with PTC and randomly assigned them to training and test sets. SR ultrasound images were acquired using SR technology to improve image resolution, and artificial features and DL features were extracted from the original (OR) and SR images, respectively, to construct a ML, DL model. The best model was selected and aggregated with clinical parameters to construct the nomogram. The performance of the model is evaluated by ROC curves, calibration curves and decision curves. RESULTS:In distinguishing the presence or absence of metastatic lymph nodes, the predictive performance of the SR_ResNet 101 and SR_SVM models based on SR outperformed those based on OR. In the test set, SR_SVM AUC was 0.878 (95% CI 0.8203-0.9358), accuracy 0.854, while OR_SVM AUC was 0.822 (95% CI 0.7500-0.8937), accuracy 0.665. SR_ResNet 101 AUC was 0.799 (95% CI 0.7175-0.8806), accuracy 0.793, and OR_ResNet101 AUC was 0.751 (95% CI 0.6620-0.8401), accuracy 0.713. Subsequently, Nomogram_A and Nomogram_B were constructed by integrating the SR_SVM model and SR_ResNet 101 model, respectively, with clinical parameters, while Nomogram_C was constructed solely based on clinical indicators. In the test set, Nomogram_A demonstrated the best performance with an AUC of 0.930 (95% CI 0.8913-0.9682) and accuracy was 0.829. Nomogram_B AUC 0.868 (95% CI 0.8102-0.9261) and accuracy was 0.829, while Nomogram_C AUC 0.880 (95% CI 0.8257-0.9349) and accuracy was 0.787. The DeLong test revealed that the diagnostic performance of Nomogram_A based on SR_SVM was significantly higher than that of Nomogram_B, Nomogram_C, and the level of Radiologist (P < 0.05). The calibration curves and Hosmer-Lemeshow tests confirmed a high degree of fit, and the decision curve analysis demonstrated clinical value and potential patient benefit. CONCLUSIONS:The predictive model constructed using SR reconstructed ultrasound images demonstrated superior performance in predicting preoperative cervical lymph node metastasis in PTC compared to OR images. The nomogram prediction model based on SR images has the potential to enhance the accuracy of predictive models and aid in clinical decision-making.
10.1007/s12672-024-01601-0