Ultrasonography

Kim, Cheon, Choi, Hwang, Shin, Cho, Lee, and Lee: Feasibility of a deep learning artificial intelligence model for the diagnosis of pediatric ileocolic intussusception with grayscale ultrasonography

Original Article

Published online: October 10, 2023

DOI: https://doi.org/10.14366/usg.23153

Feasibility of a deep learning artificial intelligence model for the diagnosis of pediatric ileocolic intussusception with grayscale ultrasonography

Se Woo Kim¹,², Jung-Eun Cheon²,³, Young Hun Choi³, Jae-Yeon Hwang⁴, Su-Mi Shin⁵, Yeon Jin Cho³, Seunghyun Lee³, Seul Bi Lee³

¹Department of Radiology, Seoul National University Hospital, Seoul, Korea

²Department of Radiology, Seoul National University College of Medicine, Seoul, Korea

³Department of Radiology, Seoul National University Children’s Hospital, Seoul, Korea

⁴Department of Radiology, Pusan National University Yangsan Hospital, Yangsan, Korea

⁵Department of Radiology, Seoul National University Seoul Metropolitan Government Boramae Medical Center, Seoul, Korea

Correspondence to: Jung-Eun Cheon, MD, PhD, Department of Radiology, Seoul National University College of Medicine, Seoul National University Children’s Hospital, 101 Daehak-ro, Jongno-gu, Seoul 03080, Korea Tel. +82-2-2072-3608 Fax. +82-2-747-5781 E-mail: cheonje@snu.ac.kr

Received: August 7, 2023; Revised: October 6, 2023; Accepted: October 10, 2023

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

ABSTRACT

Purpose: This study explored the feasibility of utilizing a deep learning artificial intelligence (AI) model to detect ileocolic intussusception on grayscale ultrasound images.

Methods: This retrospective observational study incorporated ultrasound images of children who underwent emergency ultrasonography for suspected ileocolic intussusception. After excluding video clips, Doppler images, and annotated images, 40,765 images from two tertiary hospitals were included (positive-to-negative ratio: hospital A, 2,775:35,373; hospital B, 140:2,477). Images from hospital A were split into a training set, a tuning set, and an internal test set (ITS) at a ratio of 7:1.5:1.5. Images from hospital B comprised an external test set (ETS). For each image indicating intussusception, two radiologists provided a bounding box as the ground-truth label. If intussusception was suspected in the input image, the model generated a bounding box with a confidence score (0-1) at the estimated lesion location. Average precision (AP) was used to evaluate overall model performance. The performance of practical thresholds for the model-generated confidence score, as determined from the ITS, was verified using the ETS.
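The 7:1.5:1.5 split of the hospital A images can be illustrated with a minimal sketch. It assumes a simple random split at the image level with a fixed seed; the abstract does not state whether the split was stratified or performed per patient, and the function name `split_images` is hypothetical.

```python
import random


def split_images(image_ids, ratios=(7.0, 1.5, 1.5), seed=0):
    """Shuffle image IDs and split them into training, tuning, and internal test sets.

    Assumption: a plain random image-level split; the article may have used
    a stratified or patient-level split instead.
    """
    ids = list(image_ids)
    random.Random(seed).shuffle(ids)
    total = sum(ratios)
    n_train = round(len(ids) * ratios[0] / total)
    n_tune = round(len(ids) * ratios[1] / total)
    train = ids[:n_train]
    tune = ids[n_train:n_train + n_tune]
    test = ids[n_train + n_tune:]  # remainder goes to the internal test set
    return train, tune, test


# 38,148 images from hospital A (2,775 positive + 35,373 negative)
train, tune, test = split_images(range(38148))
print(len(train), len(tune), len(test))
```

Under this sketch the internal test set holds 15% of the hospital A images; 15% of the 2,775 positive images is about 416, which agrees with the recall denominator reported for the ITS in Table 2.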

Results: The AP values for the ITS and ETS were 0.952 and 0.936, respectively. Two confidence thresholds, CT_opt and CT_precision, were set at 0.557 and 0.790, respectively. For the ETS, the per-image precision and recall were 95.7% and 80.0% with CT_opt, and 98.4% and 44.3% with CT_precision. For per-patient diagnosis, the sensitivity and specificity were 100.0% and 97.1% with CT_opt, and 100.0% and 99.0% with CT_precision. The average number of false positives per patient was 0.04 with CT_opt and 0.01 with CT_precision.
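How per-image detections might be aggregated into a per-patient diagnosis can be sketched as follows. The aggregation rule used here (a patient is called positive if any image yields a detection whose confidence exceeds the threshold) is an assumption, as the abstract does not spell it out; the toy data and function names are hypothetical.

```python
def patient_level_calls(confidences_by_patient, threshold):
    """Call a patient positive if any detection confidence exceeds the threshold.

    Assumption: "any detection above threshold" is the aggregation rule.
    """
    return {
        pid: any(c > threshold for c in confs)
        for pid, confs in confidences_by_patient.items()
    }


def sensitivity_specificity(calls, truth):
    """Compute per-patient sensitivity and specificity from boolean calls."""
    tp = sum(1 for pid in truth if truth[pid] and calls[pid])
    fn = sum(1 for pid in truth if truth[pid] and not calls[pid])
    tn = sum(1 for pid in truth if not truth[pid] and not calls[pid])
    fp = sum(1 for pid in truth if not truth[pid] and calls[pid])
    return tp / (tp + fn), tn / (tn + fp)


# Toy data: detection confidences pooled over all images of each patient
detections = {"p1": [0.91, 0.60], "p2": [0.40], "p3": [], "p4": [0.60]}
truth = {"p1": True, "p2": False, "p3": False, "p4": False}

sens_opt, spec_opt = sensitivity_specificity(patient_level_calls(detections, 0.557), truth)
sens_prec, spec_prec = sensitivity_specificity(patient_level_calls(detections, 0.790), truth)
print(sens_opt, spec_opt)
print(sens_prec, spec_prec)
```

In this toy example, raising the threshold from 0.557 to 0.790 removes the single false-positive patient while the true-positive patient is still detected, which mirrors the direction of the reported change in specificity.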

Conclusion: The feasibility of using an AI model to diagnose ileocolic intussusception on ultrasonography was demonstrated. However, further study involving bias-free data is warranted for robust clinical validation.

Keywords: Intussusception, Ultrasound, Deep learning, Artificial intelligence, Pediatric emergency medicine

Key point

A deep learning model based on the YOLOv5 architecture, with a speed of several tens of frames per second, successfully diagnosed intussusception on grayscale ultrasound images with acceptable accuracy. The applicability of this deep learning model in the development of real-time ultrasound diagnostic assistance software for point-of-care ultrasound requires further verification.

Graphic Abstract

Introduction

Materials and Methods

Results

Discussion

NOTES

Author Contributions

Conceptualization: Kim SW, Cheon JE, Choi YH. Data acquisition: Kim SW, Cheon JE, Choi YH, Hwang JY, Lee S, Lee SB. Data analysis or interpretation: Kim SW, Shin SM. Drafting of the manuscript: Kim SW. Critical revision of the manuscript: Kim SW, Cheon JE, Choi YH, Hwang JY, Shin SM, Lee S, Lee SB. Approval of the final version of the manuscript: all authors.

Jung-Eun Cheon serves as an Editor for Ultrasonography but has no role in the decision to publish this article. All remaining authors have declared no conflicts of interest.

ACKNOWLEDGMENTS

Young Hun Choi received a research grant from the Seoul National University Research and Development Foundation (SNU R&DB Foundation; Research No. 800-20170130).

Supplementary Material

Fig. 1.

Patient inclusion and data preparation.

ER, emergency room; US, ultrasonography; Cvx, convex transducer image; Lin, linear transducer image; LinTz, linear transducer image with trapezoidal window; (+), images with intussusception; (-), images without intussusception.

Fig. 2.

Schematic of YOLOv5 architecture and example images.

A. The input image, with a size of N×N, passes through convolutional and fully connected layers. B. Concurrently with (A), the input image is divided into a grid of A×A cells. C. For each grid cell, the model generates B predictions (bounding boxes with a width of W_i and a height of H_i) centered at a point (X_i, Y_i) within the cell. Each grid cell has C class probabilities (Pr_obj×Pr_class_i). D. Following the application of non-maximum suppression, the remaining bounding box or boxes are proposed as the final prediction, along with the class probability.
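The non-maximum suppression step in (D) can be illustrated with a minimal pure-Python sketch. It is a generic greedy variant, not the YOLOv5 implementation; the IoU threshold of 0.45 and the helper names are illustrative assumptions and are not taken from the article.

```python
def to_corners(xc, yc, w, h):
    """Convert a centre-format box (X_i, Y_i, W_i, H_i) to (x1, y1, x2, y2)."""
    return (xc - w / 2, yc - h / 2, xc + w / 2, yc + h / 2)


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def non_max_suppression(predictions, iou_threshold=0.45):
    """Greedy NMS: keep the highest-scoring box, drop boxes that overlap it.

    predictions: list of (box, score) with box as (x1, y1, x2, y2).
    iou_threshold: illustrative value (assumption).
    """
    kept = []
    for box, score in sorted(predictions, key=lambda p: p[1], reverse=True):
        if all(iou(box, kept_box) <= iou_threshold for kept_box, _ in kept):
            kept.append((box, score))
    return kept


# Two heavily overlapping candidates and one separate candidate
candidates = [
    (to_corners(30, 30, 40, 40), 0.90),
    (to_corners(32, 32, 40, 40), 0.80),
    (to_corners(120, 120, 40, 40), 0.70),
]
final = non_max_suppression(candidates)
print([score for _, score in final])
```

The second candidate overlaps the first with an IoU of about 0.82 and is suppressed, so only the boxes scored 0.90 and 0.70 remain.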

Fig. 3.

Precision-recall (PR) curves and F1 score curves of the model for the internal and external test sets.

A,B. PR curve and F1 score curve for the internal test set are depicted. C,D. PR curve and F1 score curve for the external test set are depicted. AUC, area under the curve; CT, confidence threshold.
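A PR curve, its area (AP), and an F1-maximizing confidence threshold can be derived from ranked detections as in the sketch below. It uses all-point interpolation for AP, which is one common convention and may differ from the computation used in the article. Choosing a threshold by maximizing F1 is an assumption about how a threshold such as CT_opt could be selected. All names and the toy data are hypothetical.

```python
def pr_points(scores, is_true_positive, n_ground_truth):
    """Return (threshold, precision, recall) for each detection, ranked by confidence."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    tp = fp = 0
    points = []
    for i in order:
        if is_true_positive[i]:
            tp += 1
        else:
            fp += 1
        points.append((scores[i], tp / (tp + fp), tp / n_ground_truth))
    return points


def average_precision(points):
    """Area under the PR curve with all-point interpolation."""
    precisions = [p for _, p, _ in points]
    recalls = [r for _, _, r in points]
    # Make precision monotonically non-increasing as recall grows
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])
    ap, previous_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - previous_recall) * p
        previous_recall = r
    return ap


def best_f1_threshold(points):
    """Return the confidence value at which the F1 score is highest."""
    def f1(p, r):
        return 2 * p * r / (p + r) if p + r > 0 else 0.0
    return max(points, key=lambda t: f1(t[1], t[2]))[0]


# Toy data: four detections, four ground-truth lesions
points = pr_points([0.9, 0.8, 0.7, 0.6], [True, False, True, True], n_ground_truth=4)
ap = average_precision(points)
threshold = best_f1_threshold(points)
print(ap, threshold)
```

For this toy input the AP is 0.625 and the F1 score peaks at a confidence of 0.6.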

Fig. 4.

Representative images of false positive (FP) and false negative (FN) cases.

These images are from six children (a 22-month-old girl, a 16-month-old boy, a 24-month-old boy, a 19-month-old boy, a 17-month-old girl, and a 19-month-old boy, corresponding to A-F, respectively). A-C. In the FP cases, normal bowel (A) or kidney (B) was erroneously identified as ileocolic intussusception. Occasionally, the model detected suspected intussusception in images that had been left unlabeled because of image quality issues such as posterior shadowing (C). D-F. The FN cases included a variety of transverse and longitudinal images captured using both linear and curved transducers.

Fig. 5.

Receiver operating characteristic (ROC) curve, free-response ROC (FROC) curve, and alternative free-response ROC (AFROC) curve for the external test set.

A. The ROC curve for per-patient diagnosis in the external test set is depicted. B, C. FROC and AFROC curves for the external test set are depicted. AUROC, area under receiver operating characteristic curve; AUAFROC, area under the alternative free-response receiver operating characteristic curve; CT, confidence threshold; FP, false positive.
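An FROC curve plots lesion-level sensitivity against the mean number of false positives per patient as the confidence threshold varies. The sketch below computes such points from a list of scored detections; it assumes that false positives are counted per detection and averaged over all patients, and the toy data and function name are hypothetical.

```python
def froc_points(detections, n_lesions, n_patients, thresholds):
    """Return (mean FPs per patient, lesion-level sensitivity) for each threshold.

    detections: list of (confidence, is_true_positive) pooled over all images.
    Assumption: FPs are counted per detection and divided by the patient count.
    """
    points = []
    for t in thresholds:
        tp = sum(1 for conf, hit in detections if conf > t and hit)
        fp = sum(1 for conf, hit in detections if conf > t and not hit)
        points.append((fp / n_patients, tp / n_lesions))
    return points


# Toy data: five detections, four labeled lesions, ten patients
toy = [(0.95, True), (0.85, True), (0.70, False), (0.60, True), (0.30, False)]
curve = froc_points(toy, n_lesions=4, n_patients=10, thresholds=[0.790, 0.557])
print(curve)
```

Under the same definition, the external test set counts in Table 2 give 5 false-positive detections (117 − 112) with CT_opt and 1 (63 − 62) with CT_precision over 124 patients, that is, about 0.04 and 0.01 per patient, which agrees with the values reported in the abstract.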

Table 1.

Baseline characteristics of datasets

	Development dataset	External test set	P-value
No. of patients	2,438	124
Age (mo)^a)	21.7 (1.4-59.9)	21.8 (2.2-54.1)	0.914^b)
Sex (M:F)	1,557:881	84:40	0.380
Diagnosis^c)	26.1 (636/2,438)	16.1 (20/124)	0.013
Images	38,148	2,617
Lesion (+)^c)	7.5 (2,775/37,148)	5.3 (140/2,617)	<0.001
Cvx:Lin:LinTz	1,098:846:831	56:38:46	0.646
Lesion (−)^c)	92.5 (35,373/37,148)	94.7 (2,477/2,617)	<0.001
Cvx:Lin:LinTz	12,254:11,471:11,648	1,229:644:604	<0.001

M, male; F, female; Cvx, convex transducer image; Lin, linear transducer image; LinTz, linear transducer image with trapezoidal window.

^a) Data represent average age in months. The data in parentheses represent age ranges.

^b) An independent t-test was performed for this row and chi-square tests for the other rows to evaluate statistical significance. P<0.05 indicates statistical significance.

^c) Data are presented as percentages, while the data in parentheses are the numbers used to calculate percentages.
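The chi-square comparisons in Table 1 can be checked with a short sketch. It assumes a Pearson chi-square test on a 2×2 table without continuity correction, an assumption that happens to reproduce the tabulated P-values for sex and diagnosis; the function name is hypothetical.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square test (no continuity correction) for the table [[a, b], [c, d]].

    Returns (chi2, p). With one degree of freedom, p = erfc(sqrt(chi2 / 2)).
    """
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    p = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p


# Diagnosis: 636 of 2,438 patients (development) vs. 20 of 124 (external)
chi2_diag, p_diag = chi_square_2x2(636, 2438 - 636, 20, 124 - 20)

# Sex (M:F): 1,557:881 (development) vs. 84:40 (external)
chi2_sex, p_sex = chi_square_2x2(1557, 881, 84, 40)

print(round(p_diag, 3), round(p_sex, 3))
```

The resulting P-values are approximately 0.013 for diagnosis and 0.380 for sex.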

Table 2.

Diagnostic performance of deep learning-based model

	Per-image		Per-patient
	ITS	ETS	ETS
mAP_0.5	0.952	0.932	AUROC >0.999
CT_opt, >0.557
Precision (%)	94.5 (377/399)	95.7 (112/117)	Sensitivity (%)=100.0 (20/20)
Recall (%)	90.6 (377/416)	80.0 (112/140)	Specificity (%)=97.1 (101/104)
CT_precision, >0.790
Precision (%)	98.1 (264/269)	98.4 (62/63)	Sensitivity (%)=100.0 (20/20)
Recall (%)	63.5 (264/416)	44.3 (62/140)	Specificity (%)=99.0 (103/104)

The data in parentheses are the numbers used to calculate percentages.

ITS, internal test set; ETS, external test set; mAP_0.5, mean average precision at intersection-over-union threshold of 0.5; AUROC, area under receiver operating characteristic curve; CT, confidence threshold.
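The percentages in Table 2 follow directly from the counts in parentheses, as the arithmetic sketch below shows for the external test set. The helper name `pct` and the dictionary layout are illustrative only.

```python
def pct(numerator, denominator):
    """Percentage rounded to one decimal place, as reported in Table 2."""
    return round(100.0 * numerator / denominator, 1)


# External test set counts taken from Table 2
ets = {
    "CT_opt": {"tp": 112, "detections": 117, "lesions": 140, "tn_patients": 101},
    "CT_precision": {"tp": 62, "detections": 63, "lesions": 140, "tn_patients": 103},
}
positive_patients, negative_patients = 20, 104

results = {}
for name, c in ets.items():
    results[name] = {
        "precision": pct(c["tp"], c["detections"]),   # TP / all detections
        "recall": pct(c["tp"], c["lesions"]),         # TP / all labeled lesions
        "sensitivity": pct(positive_patients, positive_patients),  # 20/20 at both thresholds
        "specificity": pct(c["tn_patients"], negative_patients),
    }
    print(name, results[name])
```

The stricter threshold trades per-image recall (80.0% to 44.3%) for precision (95.7% to 98.4%), while per-patient sensitivity is unchanged.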
