A multi-institutional machine-learning score that predicts bacteremia and gram class at blood-culture initiation

December 7, 2025

Authors: Seongman Bae, Se Yoon Park, Myung-Soo Kim, Gyucheol Choi, Yongseop Lee, Euijin Chang, Sungwon Lim, Jamin Koo, Sung-Han Kim

Abstract

Bacteremia is a clinically significant condition due to its potential to rapidly progress to sepsis, a life-threatening complication if not promptly managed. Early identification and timely initiation of appropriate antibiotic therapy are crucial for improving outcomes. However, because blood culture results typically require more than 48 hours, clinicians often face challenges in making timely decisions, resulting in the overuse or misuse of antibiotics and associated adverse effects. To address this gap, we developed and externally validated a multi-institutional machine-learning score that, at the moment a blood culture is ordered, delivers both the probability of bloodstream infection and the predicted Gram class of the causative pathogen.

We extracted 58 structured EHR covariates—age, sex, 11 chronic‐disease flags, and 46 routinely obtained laboratory and vital-sign variables—from all adult inpatients who underwent blood-culture testing at Asan Medical Center (AMC) between March 2019 and December 2024 (development cohort, n = 86,802). For model derivation, we retained encounters with ≤ 5 % missing features, yielding 18,870 culture events (1,305 bacteremia-positive, 17,565 negative); these were randomly partitioned 4:1 into training and internal-test subsets. A stacked ensemble comprising gradient-boosted decision trees and a transformer encoder was optimized to maximize F₁ score for bacteremia detection. Gram-class prediction was trained on the 1,216 mono-microbial bacteremia cases (835 Gram-negative, 381 Gram-positive; ≤ 3 % missingness) using the same 4:1 split. External validation employed all blood-culture encounters from Severance Hospital during March 2022–December 2024 (n = 39,861), with no feature exclusion except unit harmonization. Models were locked before external inference; performance was reported as ROC-AUC, sensitivity, specificity, and calibration metrics for both the AMC hold-out set and the full Severance cohort.

In the internal hold-out set from AMC, the bacteremia model achieved an ROC-AUC of 0.854. When applied, without retraining, to the full external test (Severance) cohort (≤ 25 % feature-missingness, n = 23,189), discrimination remained clinically robust (ROC-AUC = 0.800). Tightening the inclusion criterion to ≤5 % missingness—thereby restricting the Severance test set to 6,016 culture events—raised performance to ROC-AUC of 0.902, confirming that data completeness was a principal driver of cross-site accuracy. Exploratory stratification revealed marked differences in baseline comorbidities, which may account for inter-site differences in predictive performance.

For culture-positive encounters, the Gram-class classifier distinguished Gram-negative from Gram-positive bacteremia with an ROC-AUC of 0.802 on the AMC internal test set. At the operating point selected during training, the model correctly identified 75 % of Gram-positive cases and 74 % of Gram-negative cases; post-test probability was strongest for Gram-negative infection, with a positive-predictive value of 0.87. Permutation-based feature-importance analysis singled out procalcitonin, erythrocyte sedimentation rate, and alkaline phosphatase as the highest-impact covariates, consistent with known pathogen-specific inflammatory profiles. These results suggest that, once bacteremia is flagged, the model provides clinically actionable guidance on likely organism polarity, offering a biologically plausible path to more precise empiric antibiotic selection.

Presentations

Quantitative ex vivo synergy profiling uncovers heterogeneous combination responses in AML primary samples

67th ASH Annual Meeting and Exposition
Learn More →

Ex vivo drug sensitivity testing in Korean AML patients: Integration of functional and genomic profiles for predicting clinical response and survival

67th ASH Annual Meeting and Exposition
Learn More →

Large-Scale Validation of a Flow Cytometry Platform for Comprehensive Immunophenotyping, Cell Size Determination, and Prognostic Stratification in Canine Lymphomas

Veterinary Cancer Society Annual Conference 2025
Learn More →

Lumps, Lab Work, and a Little AI: A Tech’s Guide to Canine Lymphoma

Veterinary Cancer Society Annual Conference 2025
Learn More →

When AI Disappoints: Improving Outcomes with Real-World Clinical Data

Veterinary Cancer Society Annual Conference 2025
Learn More →

Ethos Discovery Stratified Outcomes for Lymphoma in Dogs (SOLID) Study: Informing Subclassification & Prognosis

ACVIM Forum 2025
Learn More →

Chemotherapy selection for advanced or metastatic pancreatic cancer using machine learning models trained with multi-center datasets

2025 ASCO Gastrointestinal Cancers Symposium
Learn More →

Optimizing the first line therapy of canine B-cell lymphomas via machine learning

Symposium on Artificial Intelligence in Veterinary Medicine (SAVY) 2.0
Learn More →

A study on the relationship between ex vivo drug sensitivity and clinical outcome of acute lymphoblastic leukemia

AACR Special Conference in Cancer Research: Functional and Genomic Precision Medicine in Cancer: Different Perspectives, Common Goals
Learn More →

Boosting the power of AI-based treatment outcome predictions for canine lymphoma by combining tumor and germline genetic biomarkers and live-cell analytics

Veterinary Cancer Society Annual Conference 2024
Learn More →

Decoding Feline Lymphoma: From Diagnosis to Prognosis

Veterinary Cancer Society Annual Conference 2024
Learn More →

Novel Genetic Biomarkers of Chemotherapeutic Response in Canine Lymphoma and Improved Predictive Power With Integration of Machine Learning

Veterinary Cancer Society Annual Conference 2024
Learn More →

Predicting Chemotherapy Response in Patients With Advanced or Metastatic Pancreatic Cancer Using Machine Learning

ASCO Breakthrough 2024
Learn More →

Improving Canine Lymphoma Treatment Outcomes by Individualizing Drug Selection using Machine-Learning-Based Predictive Models

Symposium on Artificial Intelligence in Veterinary Medicine (SAVY)
Learn More →

Treatment-specific risk stratification of feline lymphoma based on unsupervised clustering of flow cytometry results

World Veterinary Cancer Congress 2024
Learn More →

Dramatically increased clinical remission rates and survival times in dogs with high-grade T-cell lymphoma and relapsed B-cell lymphoma in clinical study of AI decision support

World Veterinary Cancer Congress 2024
Learn More →

AI-driven Personalized Medicine for Cancer Care

Precision Medicine World Conference 2024
Learn More →

Increased survival and remission rates in prospective study of relapsed B-cell lymphoma patients treated by oncologists using ImpriMed's AI predictions

Veterinary Cancer Society Annual Conference 2023
Learn More →

Identification of drug response biomarkers for canine lymphoma in large-scale NGS screen

Veterinary Cancer Society Annual Conference 2023
Learn More →

Impact of AI on clinical practice and outcome

Veterinary Cancer Society Annual Conference 2023
Learn More →

Canine Lymphoma Diagnostics Past, Present and Future: How AI technology and genetics are pushing the boundaries in veterinary medicine

Veterinary Cancer Society Annual Conference 2023
Learn More →

ImpriMed: AI-driven Personalized Medicine for Pet Cancer Care

Veterinary Cancer Society-Veterinary Society of Surgical Oncology Collaborative Conference 2023
Learn More →

Reinventing precision medicine: from dogs to humans

Precision Medicine World Conference 2023
Learn More →

ImpriMed Research Update

Veterinary Cancer Society Annual Conference 2022
Learn More →

Identification of Novel Predictive Biomarkers of Anticancer Drug Responses in Canine B-Cell Lymphoma Using Targeted NGS

Veterinary Cancer Society Annual Conference 2022
Learn More →

ImpriMed Research Update

Veterinary Cancer Society Mid-Year Conference 2022
Learn More →

Drugs predicted to be effective using artificial intelligence (AI) double the clinical response rate in canines with relapsed B-cell lymphoma

Veterinary Cancer Society Mid-Year Conference 2022
Learn More →

Precision medicine for pet cancer care

Animal Health, Nutrition and Technology Innovation Europe 2022
Learn More →

ImpriMed: Experiences from the eyes of an oncologist in clinical practice

Veterinary Cancer Society Annual Conference 2021
Learn More →

Predicting dynamic clinical outcomes of (L-)CHOP chemotherapy for canine lymphoma patients using an artificial intelligence model

Veterinary Cancer Society Annual Conference 2021
Learn More →

ImpriMed: A data-driven, personalized chemotherapy drug testing service for canine blood cancer patients

Veterinary Cancer Society Annual Conference 2021
Learn More →

Predicting likelihood of in vivo chemotherapy response in canine lymphoma using ex vivo drug sensitivity and immunophenotyping data in a machine learning model

Veterinary Cancer Society Annual Conference 2020
Learn More →

Individualizing chemotherapy for canine lymphoma based on ex vivo drug sensitivity test

World Veterinary Cancer Congress 2020
Learn More →

From dog to human: precision medicine for comparative oncology

Precision Medicine World Conference 2020
Learn More →

ImpriMed: precision medicine for pet cancer care

NYC Oncology Conference 2019
Learn More →

Individualizing chemotherapy for canine lymphoma based on ex vivo drug sensitivity test

Veterinary Cancer Society Annual Conference 2019
Learn More →