Machine Learning Improves External Validation for Early GDM Risk Prediction

Featured

Off

Page Content

#ffffff

Early identification of women at risk of GDM may allow timely intervention and reduce adverse pregnancy outcomes. A study published in Diabetes Research and Clinical Practice developed and validated prediction models using routinely collected early-pregnancy clinical data.

The derivation dataset included 20,435 pregnant women enrolled at Beijing Obstetrics and Gynecology Hospital. Model performance was evaluated in a temporal validation cohort of 1,997 women and an external validation cohort of 100 women from additional subcenters. Two modeling approaches were assessed: logistic regression and XGBoost.

In temporal validation, both models demonstrated comparable discrimination. Logistic regression achieved an AUC of 0.738 (95% CI, 0.707 to 0.771), while XGBoost achieved an AUC of 0.737 (95% CI, 0.706 to 0.767).These findings indicate similar internal and temporal performance.

Differences emerged in external validation. Logistic regression showed an AUC of 0.674 (95% CI, 0.440 to 0.879), whereas XGBoost demonstrated an AUC of 0.737 (95% CI, 0.510 to 0.929). These results suggest that while both approaches perform adequately in internal settings, XGBoost shows greater robustness when applied to external populations.

Anonymous user

Authenticated user

Premium

Paid / Sponsored

Key highlights

Early-pregnancy prediction models were developed for gestational diabetes mellitus (GDM).
Logistic regression and eXtreme Gradient Boosting (XGBoost) models showed similar performance in temporal validation.
The XGBoost model demonstrated more robust discrimination in external validation.

Source

Liu X, Hu F, Yan R, et al. Development and validation of prediction models for gestational diabetes mellitus in first-trimester pregnant women. Diabetes Res Clin Pract. 2025;Articles in Press:113048. Published online December 16, 2025. doi:10.1016/j.diabres.2025.113048

Thumbnail