Researchers Publish Chest X-Ray Dataset to Train AI Models
By MedImaging International staff writers Posted on 20 Feb 2019 |

Image: The CheXpert dataset of chest X-rays is designed for automated chest X-ray interpretation (Photo courtesy of Stanford University School of Medicine).
Researchers from the Stanford University School of Medicine (Stanford, CA, USA) have published CheXpert, a large dataset of chest X-rays and competition for automated chest X-ray interpretation, which features uncertainty labels and radiologist-labeled reference standard evaluation sets. Automated chest radiograph interpretation at the level of practicing radiologists could provide substantial benefit in many medical settings, from improved workflow prioritization and clinical decision support to large-scale screening and global population health initiatives.
CheXpert consists of 224,316 chest radiographs of 65,240 patients collected from Stanford Hospital that were performed between October 2002 and July 2017 in both inpatient and outpatient centers, along with their associated radiology reports. The dataset was co-released with MIMIC-CXR, a large dataset of 371,920 chest X-rays associated with 227,943 imaging studies sourced from the Beth Israel Deaconess Medical Center between 2011-2016.
One of the main obstacles in the development of chest radiograph interpretation models has been the lack of datasets with strong radiologist-annotated groundtruth and expert scores against which researchers can compare their models. CheXpert is expected to address that gap, making it easy to track the progress of models over time on a clinically important task.
The researchers have also developed and open-sourced the CheXpert labeler, an automated rule-based labeler to extract observations from the free text radiology reports to be used as structured labels for the images. This is expected to help other institutions extract structured labels from their reports and release other large repositories of data that will allow for cross-institutional testing of medical imaging models. The dataset is expected to help in the development and validation of chest radiograph interpretation models towards improving healthcare access and delivery worldwide.
Related Links:
Stanford University School of Medicine
CheXpert consists of 224,316 chest radiographs of 65,240 patients collected from Stanford Hospital that were performed between October 2002 and July 2017 in both inpatient and outpatient centers, along with their associated radiology reports. The dataset was co-released with MIMIC-CXR, a large dataset of 371,920 chest X-rays associated with 227,943 imaging studies sourced from the Beth Israel Deaconess Medical Center between 2011-2016.
One of the main obstacles in the development of chest radiograph interpretation models has been the lack of datasets with strong radiologist-annotated groundtruth and expert scores against which researchers can compare their models. CheXpert is expected to address that gap, making it easy to track the progress of models over time on a clinically important task.
The researchers have also developed and open-sourced the CheXpert labeler, an automated rule-based labeler to extract observations from the free text radiology reports to be used as structured labels for the images. This is expected to help other institutions extract structured labels from their reports and release other large repositories of data that will allow for cross-institutional testing of medical imaging models. The dataset is expected to help in the development and validation of chest radiograph interpretation models towards improving healthcare access and delivery worldwide.
Related Links:
Stanford University School of Medicine
Latest Industry News News
- GE HealthCare and NVIDIA Collaboration to Reimagine Diagnostic Imaging
- Patient-Specific 3D-Printed Phantoms Transform CT Imaging
- Siemens and Sectra Collaborate on Enhancing Radiology Workflows
- Bracco Diagnostics and ColoWatch Partner to Expand Availability CRC Screening Tests Using Virtual Colonoscopy
- Mindray Partners with TeleRay to Streamline Ultrasound Delivery
- Philips and Medtronic Partner on Stroke Care
- Siemens and Medtronic Enter into Global Partnership for Advancing Spine Care Imaging Technologies
- RSNA 2024 Technical Exhibits to Showcase Latest Advances in Radiology
- Bracco Collaborates with Arrayus on Microbubble-Assisted Focused Ultrasound Therapy for Pancreatic Cancer
- Innovative Collaboration to Enhance Ischemic Stroke Detection and Elevate Standards in Diagnostic Imaging
- RSNA 2024 Registration Opens
- Microsoft collaborates with Leading Academic Medical Systems to Advance AI in Medical Imaging
- GE HealthCare Acquires Intelligent Ultrasound Group’s Clinical Artificial Intelligence Business
- Bayer and Rad AI Collaborate on Expanding Use of Cutting Edge AI Radiology Operational Solutions
- Polish Med-Tech Company BrainScan to Expand Extensively into Foreign Markets
- Hologic Acquires UK-Based Breast Surgical Guidance Company Endomagnetics Ltd.
Channels
Radiography
view channel
World's Largest Class Single Crystal Diamond Radiation Detector Opens New Possibilities for Diagnostic Imaging
Diamonds possess ideal physical properties for radiation detection, such as exceptional thermal and chemical stability along with a quick response time. Made of carbon with an atomic number of six, diamonds... Read more
AI-Powered Imaging Technique Shows Promise in Evaluating Patients for PCI
Percutaneous coronary intervention (PCI), also known as coronary angioplasty, is a minimally invasive procedure where small metal tubes called stents are inserted into partially blocked coronary arteries... Read moreMRI
view channel
AI Tool Tracks Effectiveness of Multiple Sclerosis Treatments Using Brain MRI Scans
Multiple sclerosis (MS) is a condition in which the immune system attacks the brain and spinal cord, leading to impairments in movement, sensation, and cognition. Magnetic Resonance Imaging (MRI) markers... Read more
Ultra-Powerful MRI Scans Enable Life-Changing Surgery in Treatment-Resistant Epileptic Patients
Approximately 360,000 individuals in the UK suffer from focal epilepsy, a condition in which seizures spread from one part of the brain. Around a third of these patients experience persistent seizures... Read more
AI-Powered MRI Technology Improves Parkinson’s Diagnoses
Current research shows that the accuracy of diagnosing Parkinson’s disease typically ranges from 55% to 78% within the first five years of assessment. This is partly due to the similarities shared by Parkinson’s... Read more
Biparametric MRI Combined with AI Enhances Detection of Clinically Significant Prostate Cancer
Artificial intelligence (AI) technologies are transforming the way medical images are analyzed, offering unprecedented capabilities in quantitatively extracting features that go beyond traditional visual... Read moreUltrasound
view channel
AI Identifies Heart Valve Disease from Common Imaging Test
Tricuspid regurgitation is a condition where the heart's tricuspid valve does not close completely during contraction, leading to backward blood flow, which can result in heart failure. A new artificial... Read more
Novel Imaging Method Enables Early Diagnosis and Treatment Monitoring of Type 2 Diabetes
Type 2 diabetes is recognized as an autoimmune inflammatory disease, where chronic inflammation leads to alterations in pancreatic islet microvasculature, a key factor in β-cell dysfunction.... Read moreNuclear Medicine
view channel
Novel PET Imaging Approach Offers Never-Before-Seen View of Neuroinflammation
COX-2, an enzyme that plays a key role in brain inflammation, can be significantly upregulated by inflammatory stimuli and neuroexcitation. Researchers suggest that COX-2 density in the brain could serve... Read more
Novel Radiotracer Identifies Biomarker for Triple-Negative Breast Cancer
Triple-negative breast cancer (TNBC), which represents 15-20% of all breast cancer cases, is one of the most aggressive subtypes, with a five-year survival rate of about 40%. Due to its significant heterogeneity... Read moreGeneral/Advanced Imaging
view channel
AI-Powered Imaging System Improves Lung Cancer Diagnosis
Given the need to detect lung cancer at earlier stages, there is an increasing need for a definitive diagnostic pathway for patients with suspicious pulmonary nodules. However, obtaining tissue samples... Read more
AI Model Significantly Enhances Low-Dose CT Capabilities
Lung cancer remains one of the most challenging diseases, making early diagnosis vital for effective treatment. Fortunately, advancements in artificial intelligence (AI) are revolutionizing lung cancer... Read moreImaging IT
view channel
New Google Cloud Medical Imaging Suite Makes Imaging Healthcare Data More Accessible
Medical imaging is a critical tool used to diagnose patients, and there are billions of medical images scanned globally each year. Imaging data accounts for about 90% of all healthcare data1 and, until... Read more