New Scoring Systems Increase Accuracy of AI-Generated Radiology Reports
By MedImaging International staff writers | Posted on 07 Aug 2023

Artificial intelligence (AI) tools that efficiently produce detailed narrative reports of CT scans or X-rays can significantly lighten the workload of busy radiologists. These AI reports go beyond simple identification of abnormalities and instead provide complex diagnostic information, detailed descriptions, nuanced findings, and appropriate degrees of uncertainty, similar to how human radiologists describe scan results. While several AI models capable of generating such detailed medical imaging reports have emerged, the automated scoring systems meant to assess these tools do a poor job of gauging their performance, according to a new study.
In the study, researchers at Harvard Medical School (Boston, MA, USA) tested various scoring metrics on AI-generated narrative reports and had six human radiologists read these reports. The analysis revealed that automated scoring systems performed poorly compared to human radiologists when it came to evaluating AI-generated reports. These systems misinterpreted and even missed significant clinical errors made by the AI tool. Ensuring the reliability of scoring systems is crucial for AI tools to continue improving and gaining clinicians' trust. However, the metrics tested in the study failed to reliably identify clinical errors in the AI reports, highlighting an urgent need for improvement and the development of high-fidelity scoring systems that accurately monitor tool performance.
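One common way to quantify how well an automated metric agrees with human readers is a rank correlation between the metric's scores and the number of errors radiologists find in the same reports. The article does not say which statistic the researchers used, so the sketch below is only an illustration: it uses Kendall's tau, and the function name and all data values are hypothetical.

```python
from itertools import combinations


def kendall_tau(xs, ys):
    """Kendall tau-a: (concordant pairs - discordant pairs) / all pairs."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    concordant = discordant = 0
    for i, j in combinations(range(len(xs)), 2):
        product = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
        # Tied pairs (product == 0) count toward neither total.
    n_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / n_pairs


# Hypothetical data: one automated score and one radiologist error count
# per AI-generated report (these numbers are made up for illustration).
metric_scores = [0.91, 0.40, 0.75, 0.62, 0.30]
radiologist_errors = [0, 4, 1, 2, 5]

tau = kendall_tau(metric_scores, radiologist_errors)
print(f"Kendall tau between metric and radiologist error counts: {tau:.2f}")
```

Under this setup, a metric that tracks radiologist judgment well would give a tau close to -1 (higher scores for reports with fewer errors), while a value near 0 would indicate that the metric is missing what radiologists see.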
To create better scoring metrics, the research team designed a new method called RadGraph F1 for evaluating the performance of AI tools that generate radiology reports from medical images. They also created a composite evaluation tool called RadCliQ, which combines multiple metrics into a single score that aligns more closely with how a human radiologist would assess an AI model's performance. Using these new scoring tools, the researchers evaluated several state-of-the-art AI models and found a notable gap between the scores the models achieved and the best possible scores.
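The article does not describe how RadGraph F1 or RadCliQ are computed, so the following is not the researchers' method. It is a minimal generic sketch of the two ideas named above: an F1 overlap score between the findings in a generated report and a reference report, and a composite that folds several metrics into one number. The finding tuples, metric names, and weights are all hypothetical.

```python
def set_f1(predicted, reference):
    """F1 overlap between two collections of items, compared as sets."""
    predicted, reference = set(predicted), set(reference)
    if not predicted and not reference:
        return 1.0  # two empty reports agree trivially
    overlap = len(predicted & reference)
    if overlap == 0:
        return 0.0
    precision = overlap / len(predicted)
    recall = overlap / len(reference)
    return 2 * precision * recall / (precision + recall)


def composite_score(metric_values, weights):
    """Weighted average of metric values, each assumed to lie in [0, 1]."""
    total_weight = sum(weights[name] for name in metric_values)
    weighted_sum = sum(weights[name] * value for name, value in metric_values.items())
    return weighted_sum / total_weight


# Hypothetical (finding, status) pairs extracted from two reports.
reference_findings = {
    ("effusion", "present"),
    ("pneumothorax", "absent"),
    ("cardiomegaly", "present"),
}
generated_findings = {
    ("effusion", "present"),
    ("pneumothorax", "present"),  # clinically significant disagreement
    ("cardiomegaly", "present"),
}

finding_f1 = set_f1(generated_findings, reference_findings)

# Hypothetical second metric and hypothetical weights; in practice the
# weights would be chosen so the composite tracks radiologist judgment.
scores = {"finding_f1": finding_f1, "text_overlap": 0.5}
weights = {"finding_f1": 3.0, "text_overlap": 1.0}

print(f"finding F1: {finding_f1:.3f}")
print(f"composite:  {composite_score(scores, weights):.3f}")
```

Comparing structured findings rather than raw wording is what lets a score like this penalize a report that reads fluently but states the wrong finding, and weighting that score heavily in the composite keeps surface text similarity from masking such errors.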
Going forward, the researchers envision building generalist medical AI models capable of performing various complex tasks, including solving novel problems. Such AI systems could effectively communicate with radiologists and physicians about medical images, assisting in diagnosis and treatment decisions. The team also aims to develop AI assistants that can explain imaging findings directly to patients using everyday language, enhancing patient understanding and engagement. Ultimately, these advancements could revolutionize medical imaging practices, improving efficiency, accuracy, and patient care.
“Accurately evaluating AI systems is the critical first step toward generating radiology reports that are clinically useful and trustworthy,” said study senior author Pranav Rajpurkar, assistant professor of biomedical informatics in the Blavatnik Institute at HMS. “By aligning better with radiologists, our new metrics will accelerate development of AI that integrates seamlessly into the clinical workflow to improve patient care.”
Related Links:
Harvard Medical School