BACKGROUND

The American Academy of Pediatrics recommends up to 7 days of observation for neonatal opioid withdrawal syndrome (NOWS) in infants with chronic opioid exposure. However, many of these infants will not develop NOWS, and infants with seemingly less exposure to opioids may develop severe NOWS that requires in-hospital pharmacotherapy. We adapted and validated a prediction model to help clinicians identify infants at birth who will develop severe NOWS.

METHODS

This prognostic study included 33 991 births. Severe NOWS was defined as administration of oral morphine. We applied logistic regression with a least absolute shrinkage selection operator approach to develop a severe NOWS prediction model using 37 predictors. To contrast the model with guideline screening criteria, we conducted a decision curve analysis with chronic opioid exposure defined as the mother receiving a diagnosis for opioid use disorder (OUD) or a prescription for long-acting opioids before delivery.

RESULTS

A total of 108 infants were treated with oral morphine for NOWS, and 1243 infants had chronic opioid exposure. The model was highly discriminative, with an area under the receiver operating curve of 0.959 (95% CI, 0.940–0.976). The strongest predictor was mothers’ diagnoses of OUD (adjusted odds ratio, 47.0; 95% CI, 26.7–82.7). The decision curve analysis shows a higher benefit with the model across all levels of risk, compared with using the guideline criteria.

CONCLUSION

Risk prediction for severe NOWS at birth may better support clinicians in tailoring nonpharmacologic measures and deciding whether to extend birth hospitalization than screening for chronic opioid exposure alone.

What’s Known on This Subject:

The risk for severe neonatal opioid withdrawal syndrome (NOWS) is difficult to determine at the time of birth because other factors can modify the impact of opioid exposure, and these factors are unassessed in guideline screening criteria.

What This Study Adds:

We adapted and validated a prediction model for NOWS using electronic health record data that are routinely collected and available at the time of birth. Our findings suggest predicting severe NOWS may be superior to relying on opioid exposure alone.

Neonatal opioid withdrawal syndrome (NOWS) affects nearly 1 in 100 births nationwide, resulting in longer and more complicated hospital stays.1,2 Infants diagnosed with NOWS spend, on average, 14 more days in the hospital compared with infants without NOWS.1–3 Despite this, there are few tools to guide early treatment of infants at risk for developing NOWS given that the severity is multifactorial and can be difficult to assess at birth.4,5 

The American Academy of Pediatrics (AAP) recommends at least 3 days of observation for infants exposed to short-acting opioids and up to 7 days for infants exposed to long-acting opioids, buprenorphine, or methadone.4 However, characterizing opioid exposure can be difficult on screening, especially in the milieu of stigma and legal complications, potentiating exposure from tobacco and certain prescription medications, and variation of infant metabolism.4 Furthermore, many opioid-exposed infants could be safely discharged without extended hospital observation because they will never be diagnosed with NOWS or only have relatively mild clinical signs.6 The decision to observe, perform further tests, or initiate treatment occurs soon after birth and often depends on balancing the anticipated severity of NOWS with the burden of treatment.4 Failing to identify infants at risk or idle monitoring of infants at mistakenly low risk can delay interventions and assessments, such as breastfeeding and toxicology.4,5,7 If we can accurately assess the likelihood of developing severe NOWS soon after birth, then we can tailor the intensity of interventions and anticipate the duration of in-hospital treatment.

Given the multifactorial nature of NOWS severity and the limitations of guideline screening that relies on characterizing opioid exposure, prognostic models have been developed.8–11 In prior work, we developed a NOWS prediction model using state-wide administrative claims.9 Although this model had good performance, it was limited as a clinical tool because it primarily relied on diagnostic codes and did not use infant treatment data. This could hamper the ability to assess the severity of NOWS and can lead to model miscalibration.12 Moreover, the model did not incorporate potentially important predictors available at the time of birth to assess risk, such as the Apgar score. Therefore, we aimed to develop a more robust clinical tool by adapting our prior model and externally validating the approach with electronic health record (EHR) data. 8,9 Our objective was to adapt and validate a prognostic model that will help clinicians identify infants at the time of birth who are at risk for developing severe NOWS. We also sought to assess the benefit of the model relative to an EHR definition of chronic opioid exposure based on guideline screening criteria.

In this prognostic study, we used EHR data from Vanderbilt University Medical Center, a tertiary medical center in the Southeast United States. The cohort included births in the medical center between November 2017 and January 2024. We included all live infants with a gestational age of more than 33 weeks and excluded infants who were deceased on discharge or were transferred from another hospital. We extracted data for 33 991 infants and 29 001 mothers (mothers could have more than 1 birth). This study was approved by the Vanderbilt University Medical Center Institutional Review Board, and we used the TRIPOD reporting guidelines for prediction models (Supplemental Figure 1).13 

We adapted a model to predict severe NOWS, which was defined as administration of oral morphine. Oral morphine is the standard pharmacotherapy for NOWS at our institution, and we identified administration within 7 days of birth.4 On chart review, we found that some infants who developed severe NOWS would be missed by limiting the cohort to those with high-risk exposure (ie, hepatitis C or medications for opioid use disorder [OUD]); therefore, we fit the model with the entire cohort of 33 991 births, which aligns with our prior general population model that used administrative data.9 By including the full cohort according to our inclusion criteria, the model can be used to screen infants among the general population and not just those known to be at high risk.10 To assess for oral morphine use with indications other than NOWS, we reviewed charts for 27 infants who received oral morphine without the previously validated International Classification of Diseases, Tenth Revision, Clinical Modification P96.1; all were found to have clinical diagnoses of NOWS.14 In 2022, our institution transitioned from the Finnegan Score to Eat, Sleep, Console to guide treatment. Throughout the study period, however, our institution has emphasized nonpharmacologic measures, prior to pharmacotherapy, with care outside the neonatal intensive care unit (NICU) and keeping the maternal-infant dyad intact.15 

We identified relevant predictors a priori from our previous model and additional studies.4,8–10,12,16 In total, we extracted 37 candidate predictors for mothers and infants, including medications, infectious disease, social history, and physiologic measures at birth. As proxy indicators of illicitly manufactured fentanyl use, we included intravenous (IV) drug use and diagnosis of hepatitis C and OUD. Supplemental Table 1 provides candidate predictors and definitions. Race and insurance at delivery were used to characterize the cohort but excluded as predictors.17,18 Chosen predictors were routinely collected, available at the time of birth, and amenable to automated extraction and processing in the EHR. We tested for redundancy by examining the variance in each candidate predictor that could be explained by other candidate predictors using flexible parametric additive models.19 Exposure to any prescription opioids, which grouped short- and long-acting opioid prescriptions together, was removed because of redundancy. We included, however, the individual predictors for short- and long-acting opioid prescriptions separated.

TABLE 1.

Cohort Characteristics and Candidate Predictors by Chronic Exposure to Opioids and Severe Neonatal Opioid Withdrawal Syndrome

CharacteristicFull CohortChronic Exposure to OpioidsSevere NOWS
33 991 1243 108 
Infants 
 Gestational age, mean (SD), weeks 37.9 (2.8) 37.3 (3.0) 37.6 (2.2) 
  Missing, n (%) 498 (1.5) 17 (1.4) 2 (1.9) 
 Birth weight, mean (SD), grams 3277.6 (538.5) 3137.2 (562.4) 3141.5 (467.1) 
  Missing, n (%) 4163 (12.3) 240 (19.3) 33 (30.6) 
 Head circumference, mean (SD), inches 10.5 (13.4) 9.4 (6.1) 7.3 (6.7) 
  Missing, n (%) 7668 (22.6) 364 (29.3) 49 (45.4) 
 Sex, n (%) 
  Female 16 405 (48.3) 663 (53.3) 44 (40.7) 
  Male 17 586 (51.7) 580 (46.7) 64 (59.3) 
 Race, n (%)a 
  White 22 053 (64.9) 998 (80.3) 97 (89.8) 
  Black 6214 (18.3) 233 (18.7) 17 (15.7) 
  Other 196 (0.6) 5 (0.4) 0 (0.0) 
 Apgar (0 to 10) 
  1-min score, mean (SD) 7.4 (1.7) 7.3 (1.9) 7 (2.1) 
  5-min score, mean (SD) 8.5 (1.3) 8.3 (1.6) 7.9 (2.1) 
  10-min score, mean (SD) 0.4 (1.8) 0.6 (2.0) 1.2 (2.7) 
Mothers 
 Age, mean (SD), years 29.5 (5.7) 29.8 (5.4) 30.6 (5.5) 
  Missing, n (%) 271 (0.8) 17 (1.4) 5 (4.6) 
 Insurance, n (%)a 
  Medicaid 12 543 (36.9) 509 (40.9) 45 (41.7) 
  Private 11 862 (34.9) 380 (30.6) 30 (27.8) 
  Other 6450 (19.0) 226 (18.2) 21 (19.4) 
  Uninsured 3136 (9.2) 128 (10.3) 12 (11.1) 
 Opioid use disorder, n (%) 1154 (3.4) 1154 (92.8) 86 (79.6) 
 Intravenous drug use, n (%) 82 (0.2) 66 (5.3) 10 (9.3) 
 Hepatitis C diagnosis, n (%) 320 (0.9) 182 (14.6) 21 (19.4) 
 Cigarette smoking documentation in last 30 d of pregnancy, n (%) 
  Yes 2473 (7.3) 613 (49.3) 68 (63.0) 
  Packs/day 0 (0.1) 0.2 (0.4) 0.3 (0.4) 
  Pack-years 0.3 (2.0) 2.1 (5.0) 1.9 (4.5) 
 Cigarette smoking documentation during pregnancy to last 31 d, n (%) 
  Yes 2669 (7.9) 563 (45.3) 57 (52.8) 
  Packs/day 0 (0.2) 0.2 (0.4) 0.3 (0.4) 
  Pack-years 0.3 (1.8) 1.6 (4.4) 1.9 (4.5) 
 Opioid exposure in last 30 to 2 d of pregnancy 
  Any opioid, n (%) 1505 (4.4) 430 (34.6) 40 (37.0) 
  Any opioid, mean (SD), count 0.2 (1.4) 2.1 (5.2) 2.6 (4.2) 
  Short-acting opioid, n (%) 1137 (3.3) 75 (6.0) 8 (7.4) 
  Long-acting opioid, n (%) 384 (1.1) 384 (30.9) 37 (37.3) 
 Opioid exposure in last 90 to 31 d of pregnancy 
  Any opioid, n (%) 815 (2.4) 419 (33.7) 44 (40.7) 
  Any opioid, mean (SD), count 0.2 (1.7) 3.5 (7.3) 4.1 (6.5) 
  Short-acting opioid, n (%) 453 (1.3) 60 (4.8) 5 (4.6) 
  Long-acting opioid, n (%) 391 (1.2) 391 (31.5) 44 (40.7) 
 Medication exposures in last 30 to 2 d of pregnancy, n (%) 
  Gabapentin 70 (0.2) 23 (1.9) 3 (2.8) 
  SSRI 394 (1.2) 53 (4.3) 10 (9.3) 
  Benzodiazepine 22 (0.1) 7 (0.6) 1 (0.9) 
  Sedative hypnotic 25 (0.1) 0 (0.0) 0 (0.0) 
  Atypical antipsychotic 61 (0.2) 37 (3.0) 7 (6.5) 
  Antinausea 2387 (7.0) 183 (14.7) 27 (25.0) 
  Bupropion 43 (0.1) 11 (0.9) 0 (0.0) 
 Medication exposure during pregnancy to last 31 d, n (%) 
  Gabapentin 137 (0.4) 32 (2.6) 6 (5.6) 
  SSRI 1499 (4.4) 121 (9.7) 23 (21.3) 
  Benzodiazepine 119 (0.4) 46 (3.7) 9 (8.3) 
  Sedative hypnotic 25 (0.1) 0 (0) 0 (0.0) 
  Atypical antipsychotic 153 (0.5) 70 (5.6) 10 (9.3) 
  Antinausea 7888 (23.2) 445 (35.8) 40 (37.0) 
  Bupropion 171 (0.5) 28 (2.3) 3 (2.8) 
CharacteristicFull CohortChronic Exposure to OpioidsSevere NOWS
33 991 1243 108 
Infants 
 Gestational age, mean (SD), weeks 37.9 (2.8) 37.3 (3.0) 37.6 (2.2) 
  Missing, n (%) 498 (1.5) 17 (1.4) 2 (1.9) 
 Birth weight, mean (SD), grams 3277.6 (538.5) 3137.2 (562.4) 3141.5 (467.1) 
  Missing, n (%) 4163 (12.3) 240 (19.3) 33 (30.6) 
 Head circumference, mean (SD), inches 10.5 (13.4) 9.4 (6.1) 7.3 (6.7) 
  Missing, n (%) 7668 (22.6) 364 (29.3) 49 (45.4) 
 Sex, n (%) 
  Female 16 405 (48.3) 663 (53.3) 44 (40.7) 
  Male 17 586 (51.7) 580 (46.7) 64 (59.3) 
 Race, n (%)a 
  White 22 053 (64.9) 998 (80.3) 97 (89.8) 
  Black 6214 (18.3) 233 (18.7) 17 (15.7) 
  Other 196 (0.6) 5 (0.4) 0 (0.0) 
 Apgar (0 to 10) 
  1-min score, mean (SD) 7.4 (1.7) 7.3 (1.9) 7 (2.1) 
  5-min score, mean (SD) 8.5 (1.3) 8.3 (1.6) 7.9 (2.1) 
  10-min score, mean (SD) 0.4 (1.8) 0.6 (2.0) 1.2 (2.7) 
Mothers 
 Age, mean (SD), years 29.5 (5.7) 29.8 (5.4) 30.6 (5.5) 
  Missing, n (%) 271 (0.8) 17 (1.4) 5 (4.6) 
 Insurance, n (%)a 
  Medicaid 12 543 (36.9) 509 (40.9) 45 (41.7) 
  Private 11 862 (34.9) 380 (30.6) 30 (27.8) 
  Other 6450 (19.0) 226 (18.2) 21 (19.4) 
  Uninsured 3136 (9.2) 128 (10.3) 12 (11.1) 
 Opioid use disorder, n (%) 1154 (3.4) 1154 (92.8) 86 (79.6) 
 Intravenous drug use, n (%) 82 (0.2) 66 (5.3) 10 (9.3) 
 Hepatitis C diagnosis, n (%) 320 (0.9) 182 (14.6) 21 (19.4) 
 Cigarette smoking documentation in last 30 d of pregnancy, n (%) 
  Yes 2473 (7.3) 613 (49.3) 68 (63.0) 
  Packs/day 0 (0.1) 0.2 (0.4) 0.3 (0.4) 
  Pack-years 0.3 (2.0) 2.1 (5.0) 1.9 (4.5) 
 Cigarette smoking documentation during pregnancy to last 31 d, n (%) 
  Yes 2669 (7.9) 563 (45.3) 57 (52.8) 
  Packs/day 0 (0.2) 0.2 (0.4) 0.3 (0.4) 
  Pack-years 0.3 (1.8) 1.6 (4.4) 1.9 (4.5) 
 Opioid exposure in last 30 to 2 d of pregnancy 
  Any opioid, n (%) 1505 (4.4) 430 (34.6) 40 (37.0) 
  Any opioid, mean (SD), count 0.2 (1.4) 2.1 (5.2) 2.6 (4.2) 
  Short-acting opioid, n (%) 1137 (3.3) 75 (6.0) 8 (7.4) 
  Long-acting opioid, n (%) 384 (1.1) 384 (30.9) 37 (37.3) 
 Opioid exposure in last 90 to 31 d of pregnancy 
  Any opioid, n (%) 815 (2.4) 419 (33.7) 44 (40.7) 
  Any opioid, mean (SD), count 0.2 (1.7) 3.5 (7.3) 4.1 (6.5) 
  Short-acting opioid, n (%) 453 (1.3) 60 (4.8) 5 (4.6) 
  Long-acting opioid, n (%) 391 (1.2) 391 (31.5) 44 (40.7) 
 Medication exposures in last 30 to 2 d of pregnancy, n (%) 
  Gabapentin 70 (0.2) 23 (1.9) 3 (2.8) 
  SSRI 394 (1.2) 53 (4.3) 10 (9.3) 
  Benzodiazepine 22 (0.1) 7 (0.6) 1 (0.9) 
  Sedative hypnotic 25 (0.1) 0 (0.0) 0 (0.0) 
  Atypical antipsychotic 61 (0.2) 37 (3.0) 7 (6.5) 
  Antinausea 2387 (7.0) 183 (14.7) 27 (25.0) 
  Bupropion 43 (0.1) 11 (0.9) 0 (0.0) 
 Medication exposure during pregnancy to last 31 d, n (%) 
  Gabapentin 137 (0.4) 32 (2.6) 6 (5.6) 
  SSRI 1499 (4.4) 121 (9.7) 23 (21.3) 
  Benzodiazepine 119 (0.4) 46 (3.7) 9 (8.3) 
  Sedative hypnotic 25 (0.1) 0 (0) 0 (0.0) 
  Atypical antipsychotic 153 (0.5) 70 (5.6) 10 (9.3) 
  Antinausea 7888 (23.2) 445 (35.8) 40 (37.0) 
  Bupropion 171 (0.5) 28 (2.3) 3 (2.8) 

Abbreviations: n, number of infants; N, total number of infants; NOWS, neonatal opioid withdrawal syndrome; SD, standard deviation; SSRI, selective serotonin reuptake inhibitor.

Infants were considered to have chronic exposure to opioids if the mother received a diagnosis for opioid use disorder at any time prior to delivery or received a prescription for long-acting opioids, buprenorphine, or methadone between 90 and 2 days prior to delivery. “Severe NOWS” was defined as administering oral morphine up to 7 days after birth. “Missing” is the number and percent prior to imputation.

a

Race and insurance were not included as candidate predictors in the model.

We summarized cohort characteristics descriptively using means and standard deviations for continuous variables and counts with percentages for categorical variables. To predict severe NOWS, we used logistic regression with a least absolute shrinkage and selection operator (LASSO) approach, which allowed us to avoid overfitting the model by using the most influential predictors. Predictors were standardized (scaled and centered) prior to running the LASSO regression. LASSO introduces a penalty term to the standard regression model, which forces some of the regression coefficients to shrink toward zero, effectively performing predictor selection.20 We chose to use LASSO regression, rather than backward selection with our prior model, to better handle the high-dimensional EHR data with a relatively low number of NOWS observations.9 To design for clinical use, missing numeric measures for mother’s age as well as the infant’s gestational age, birth weight, and head circumference were to be imputed with the cohort median for when measures become available.21 

We evaluated model discrimination and calibration using the area under the curve (AUC), Brier score, calibration slope and intercept, integrated calibration index (ICI), and visual examination of calibration curves.22,23 The Brier score is an overall accuracy metric with both discrimination and calibration elements and is calculated by the squared difference between the prediction (0 to 1) and outcome (0 = no NOWS and 1 = NOWS).22 The ICI is a numeric summary of model calibration across the predicted probabilities.23 It is the weighted average of the absolute difference between the observed and predicted probabilities; therefore, a lower ICI indicates better calibration. A slope greater than 1 indicates potential underfitting and a slope less than 1 indicates potential overfitting.23 The model was validated using bootstrap cross-validation with replacement.24 

We conducted a decision curve analysis to assess the clinical utility of the model compared to an EHR definition of chronic opioid exposure based on the AAP-recommended screening criteria.25–28 Aligning with the AAP guideline and our prior work defining opioid exposure, we defined chronic opioid exposure as the mother receiving a diagnosis of OUD at any time before delivery or receiving a prescription for long-acting opioids, buprenorphine, or methadone between 90 and 2 days before delivery.12 A decision curve analysis can evaluate the consequences of diagnostic tests, biomarkers, and prognostic models by incorporating the benefits and harms associated with testing. 25–28 The net benefit combines the number of true positives (eg, treating infants with NOWS) and false positives (eg, treating infants without NOWS) at different threshold probabilities representing the cutoff point for treatment. For instance, if the predicted probability of severe NOWS is 5% or higher, the cutoff point assumes that an infant should receive treatment (eg, prolong observation or administer oral morphine). That is, clinicians would prefer to treat 20 infants (who may or may not develop severe NOWS) to prevent severe NOWS complications in 1 infant. To assess the relative utility of our model compared to guideline criteria, we plotted the net benefit across a range of threshold probabilities. All analyses were performed in R version 4.3.3.

Of 33 991 births, the mean (SD) age was 29.5 (5.7) years for mothers and 37.9 (2.8) weeks for infants. Most infants were characterized as either white (64.9%) or Black (18.3%). A total of 1243 (4%) infants were considered to have chronic opioid exposure according to our EHR-based definition of the guideline criteria, and 108 (0.3%) infants were treated with oral morphine for NOWS (Table 1). Of the 108 infants, 18 (17%) did not meet the criteria for chronic opioid exposure and 4 (4%) were given oral morphine after the birth hospitalization.

Among 37 candidate predictors, the LASSO approach helped identify 7 that were necessary to accurately predict severe NOWS. Included predictors in order of decreasing strength were OUD, cigarette smoking, 5-minute Apgar score (0–10; lower score indicates more severe illness at 5 minutes of life), and prescriptions for selective serotonin reuptake inhibitor (SSRI), antinausea, long-acting opioid, and gabapentin medications (Figure 1). Among the strongest predictors, the association with severe NOWS was significant for OUD (adjusted odds ratio [aOR], 47.0; 95% CI, 26.7–82.7; P < .001), cigarette smoking (aOR, 2.8; 95% CI, 1.8–4.6; P < .001), 5-minute Apgar score (aOR, 0.82; 95% CI, 0.75–0.90; P < .001), and SSRI (aOR, 2.5; 95% CI, 1.5–4.3; P < .001). On validation of the model, we observed a high level of discrimination (AUC, 0.959; 95% CI, 0.940–0.976) and good calibration as measured by the ICI (0.001), Brier score (0.003), and inspection of the calibration curve (Figure 2).

FIGURE 1.

Predictors included in the model by decreasing importance as measured by the adjusted strength of the predictor. OR and 95% CI provide the association and variance of each predictor with neonatal opioid withdrawal syndrome pharmacotherapy.

Abbreviations: OR, odds ratio; SSRI, selective serotonin reuptake inhibitor.
FIGURE 1.

Predictors included in the model by decreasing importance as measured by the adjusted strength of the predictor. OR and 95% CI provide the association and variance of each predictor with neonatal opioid withdrawal syndrome pharmacotherapy.

Abbreviations: OR, odds ratio; SSRI, selective serotonin reuptake inhibitor.
Close modal
FIGURE 2.

Calibration curve for predicting neonatal opioid withdrawal syndrome treated with pharmacotherapy. Logistic calibration (solid line) represents parameter-based calibration (logistic regression model fit between predicted and observed values). Nonparametric calibration (dotted line) represents locally estimated scatterplot smoothing between predicted and observed values.

FIGURE 2.

Calibration curve for predicting neonatal opioid withdrawal syndrome treated with pharmacotherapy. Logistic calibration (solid line) represents parameter-based calibration (logistic regression model fit between predicted and observed values). Nonparametric calibration (dotted line) represents locally estimated scatterplot smoothing between predicted and observed values.

Close modal

We compared the AAP screening criteria to the prediction model with a decision curve analysis. Figure 3 shows the model provided a higher net benefit across all threshold probabilities, compared with screening with the criteria for chronic opioid exposure. At a risk threshold of 5%, the difference in net benefit suggests that the model compared to guideline criteria would identify 3 additional infants per 10 000 who develop severe NOWS. This would occur without resulting in unnecessary treatments, such as extended observation due to identifying false positive cases (Table 2). Moreover, compared to the guideline criteria, the model would reduce the number of false positives or unnecessary extended hospitalizations by 50%. For risk thresholds greater than 8%, the guideline criteria provided a negative net benefit primarily due to the increased number of false positives.

FIGURE 3.

Decision curve analysis for severe NOWS compared to guideline criteria for chronic opioid exposure. Chronic exposure to opioids was defined as the mother receiving a diagnosis of opioid use disorder at any time prior to delivery or receiving a prescription for long-acting opioids, buprenorphine, or methadone between 90 and 2 days prior to delivery. A higher net benefit represents a higher proportion of true positives to false positives. The threshold probability represents clinician preference for balancing potentially unnecessary treatment with potential undertreatment of NOWS cases, with a lower threshold meaning that clinicians prefer to not miss cases at the risk of potentially overtreating. The NOWS prediction model (blue line) provides higher net benefit across all threshold probabilities when compared with the guideline criteria for chronic opioid exposure (purple line). “Treat All” (red line) assumes that all infants are treated with pharmacotherapy, and “Treat None” assumes no infants were treated.

Abbreviation: NOWS, neonatal opioid withdrawal syndrome.
FIGURE 3.

Decision curve analysis for severe NOWS compared to guideline criteria for chronic opioid exposure. Chronic exposure to opioids was defined as the mother receiving a diagnosis of opioid use disorder at any time prior to delivery or receiving a prescription for long-acting opioids, buprenorphine, or methadone between 90 and 2 days prior to delivery. A higher net benefit represents a higher proportion of true positives to false positives. The threshold probability represents clinician preference for balancing potentially unnecessary treatment with potential undertreatment of NOWS cases, with a lower threshold meaning that clinicians prefer to not miss cases at the risk of potentially overtreating. The NOWS prediction model (blue line) provides higher net benefit across all threshold probabilities when compared with the guideline criteria for chronic opioid exposure (purple line). “Treat All” (red line) assumes that all infants are treated with pharmacotherapy, and “Treat None” assumes no infants were treated.

Abbreviation: NOWS, neonatal opioid withdrawal syndrome.
Close modal
TABLE 2.

Comparison of Net Benefit for Pharmacotherapy Model and Guideline Criteria to Predict Severe Neonatal Opioid Withdrawal Syndrome at Risk Threshold of 5%

StatisticResult
Default strategies 
 Net benefit of treating all infants −0.04929 
 Net benefit of treating no infants 
NOWS model 
 Net benefit 0.00113 
 Identified infants that develop severe NOWS without unnecessary treatment 11.3 per 10 000 infants 
 True positives 20 per 10 000 infants 
 False positives 171 per 10 000 infants 
Guideline criteria 
 Net benefit 0.00085 
 Identified infants that develop severe NOWS without unnecessary treatment 8.5 per 10 000 infants 
 True positives 27 per 10 000 infants 
 False positives 339 per 10 000 infants 
Difference in net benefit between model and guideline criteria 0.00027 
Additional infants identified using the model that develop severe NOWS (without unnecessary treatment, such as extended observation and nonpharmacologic measures) 2.7 per 10 000 infants 
Calculation 
 Net benefit = true-positive rate − (false-positive rate × factor) 
 Factor = threshold probability/(1 − threshold probability) 
StatisticResult
Default strategies 
 Net benefit of treating all infants −0.04929 
 Net benefit of treating no infants 
NOWS model 
 Net benefit 0.00113 
 Identified infants that develop severe NOWS without unnecessary treatment 11.3 per 10 000 infants 
 True positives 20 per 10 000 infants 
 False positives 171 per 10 000 infants 
Guideline criteria 
 Net benefit 0.00085 
 Identified infants that develop severe NOWS without unnecessary treatment 8.5 per 10 000 infants 
 True positives 27 per 10 000 infants 
 False positives 339 per 10 000 infants 
Difference in net benefit between model and guideline criteria 0.00027 
Additional infants identified using the model that develop severe NOWS (without unnecessary treatment, such as extended observation and nonpharmacologic measures) 2.7 per 10 000 infants 
Calculation 
 Net benefit = true-positive rate − (false-positive rate × factor) 
 Factor = threshold probability/(1 − threshold probability) 

Abbreviation: NOWS, neonatal opioid withdrawal syndrome.

At a risk threshold of 5%, the NOWS model and guideline criteria identify a similar number of infants who develop severe NOWS; however, the guideline criteria misidentify far more infants who will not develop severe NOWS. Moreover, with a similar number of infants treated unnecessarily, the NOWS model correctly identified nearly 3 additional infants who developed severe NOWS and were not considered to be at risk according to the guideline criteria. Guideline criteria for chronic exposure to opioids: diagnosis of opioid use disorder at any time before delivery or receiving a prescription for long-acting opioids, buprenorphine, or methadone between 90 and 2 days before delivery. True positive is when the model or guideline criteria correctly identify severe NOWS. False positive is when the model or guideline criteria incorrectly identify an infant to develop severe NOWS.

We adapted and validated a model to predict severe NOWS using EHR data that are routinely collected and available at the time of birth. The model incorporated multiple predictors beyond opioid exposure to accurately predict treatment of NOWS with pharmacotherapy. Rather than relying solely on guideline criteria for chronic opioid exposure, using multivariable prediction as a clinical tool may help clinicians identify exposed infants who are at low risk of severe NOWS for earlier discharge. Following clinical trials examining the NOWS prediction tool, this approach may guide extended observation periods and early nonpharmacologic measures for infants likely to develop severe NOWS and allow for discharge of infants who are unlikely to return to the hospital for pharmacologic treatment later.

We found that polysubstance exposure was strongly associated with NOWS. Similar to our prior model, we found exposure to cigarette smoking, SSRIs, antinausea medications, and gabapentin were important predictors of severe NOWS.9 We did not, however, find that exposure to short-acting opioids was an important predictor, which may reflect a decrease in opioid prescribing in the past decade.29 Our model included OUD treatment with buprenorphine and methadone under “long-acting opioids,” with approximately 97% of mothers with this exposure being treated with buprenorphine.30 Although cigarette smoking documentation in the social history section of the EHR can be inaccurate, we found it to be one of the strongest predictors.31 This study provides additional evidence that exposure to multiple substances, beyond opioids, may augment onset and severity of NOWS and should be considered in screening for in-hospital observation.

Few models to predict NOWS have been published and none have demonstrated the performance necessary for use as a clinical tool or have been implemented in routine patient care.8–11 Isemann and colleagues developed a prediction tool for administration of pharmacotherapy within 36 hours from birth among 264 infants with in utero exposure to opioids.8 The tool incorporated type of opioid exposure and predictive signs from the Modified Finnegan Score, including increased muscle tone, tremors when disturbed, and excoriation. In addition to concerns with relatively low sensitivity (38%) associated with the tool, the Finnegan Score is subjective and may be decreasing in use as the Eat, Sleep, Console assessment is becoming more broadly adopted.4,5 Our prior model used Medicaid data between 2009 and 2014 to predict diagnosis of NOWS during the birth hospitalization among 218 020 maternal-infant dyads, from which 3208 infants were diagnosed with NOWS.9 Although the discrimination was good (AUC, 0.89), calibration figures show that in infants with low and high risk levels, the model overestimated and underestimated risk, respectively. Furthermore, reliance on administrative data could limit the clinical utility, and use of data prior to 2014 may not reflect current opioid prescribing and widespread use of illicitly manufactured fentanyl.

The current model enhances clinical utility by predicting the need for NOWS pharmacotherapy at birth using routinely collected EHR data, making it a valuable clinical tool. Our previous model included diagnosis of hepatitis C as a proxy for illicitly manufactured fentanyl; however, we added maternal history of OUD and IV drug use to further address this important predictor. Additionally, infants’ head circumference and Apgar scores are incorporated, as they are recorded in the EHR at birth for all deliveries and can reflect variation in infant health and metabolism. We tested the model’s ability to predict NOWS diagnosis within our cohort and found calibration issues, indicating challenges with using diagnostic codes for outcomes, likely due to inconsistencies and temporal changes in coding practices (Supplemental Figure 2).32 We maintained a similar approach for assessing medication exposure, focusing on the 30 days prior to delivery.

Although we anticipate this model could support clinicians in identifying infants who may develop severe NOWS, informing clinical decision-making to extend birth hospitalization observation periods or safely discharging low-risk infants, implementing a clinical tool is not without implications for clinician workload and administrative oversight; therefore, it is important to first understand the value of the information the tool provides and then determine real-world effectiveness of the tool with a pragmatic trial.33–35 A decision curve analysis is an underutilized method to evaluate the benefit of prognostic models before pursuing resource-intensive assessments. Core to the decision curve analysis is the threshold probability, which in our use case relates to the relative value of an infant receiving proactive treatment and possibly extended hospitalization if NOWS is present or avoiding these interventions if NOWS is not present. Although certain nonpharmacologic interventions, such as breastfeeding and parent coaching, can be considered minimally invasive, managing infants in the NICU or extending hospitalization for observation are both costly and invasive. For example, if the mother-infant dyad could remain together outside the NICU, the threshold probability would be relatively low, meaning the concern of overtreating is offset by the concern of delaying early interventions. Meshing evidence-based treatment with hospital protocols will require a participatory approach with partners including clinicians, health information technology administrators, and health system leaders.36–38 We implemented the model in the EHR, separate from patient care, to gather prospective data and initiate partner discussions, prepare for a pragmatic trial, and plan for a sustainable intervention.21 Supplemental Table 2 provides the equation and predictor coefficients we used to implement the model in the EHR.

We used multiple approaches to develop the NOWS prediction model so it can be disseminated to other health care institutions. First, we used medication and diagnostic code sets created by the EHR vendor (Epic) and based on the National Library of Medicine, Value Set Authority Center.39,40 Second, we used data that are common among institutions, structured, and available without additional user input. Third, we used a regression approach that is readily interpretable by users and programable in the EHR. Although we had sufficient sample size to include important features without concern of overfitting the model, the NOWS event rate was low. Among our cohort, we found the incidence rate of NOWS was 3.2 cases per 1000 births, which is within range of what others have observed (2.5–16.2 cases per 1000 births).3,4,6 We anticipate the lower observed rate is due to a subset of NOWS that is treated with pharmacotherapy. A low event rate may overestimate performance of the model due to accurately identifying many negative cases, which is why routine maintenance is crucial for clinical prediction tools.41,42 Furthermore, inclusion of preterm infants aged 35 weeks or younger may require additional investigation for confounding variables that impact NOWS severity and management. We may have missed medication exposure if patients received prescriptions outside the institution. Because the cohort was from a comprehensive regional pediatric center with a neonatal intensive care unit, the model may not generalize to other institutions with different services or protocols, geographic coverage, OUD treatment coverage, or EHR system.

We adapted a model to accurately predict severe NOWS at the time of birth using EHR data. As a clinical tool, the model has the potential to improve the accuracy of screening infants for guideline-recommended observation and guide nonpharmacologic and pharmacologic treatment immediately after birth.

Dr Reese conceptualized and designed the study, acquired the data, conducted primary analyses, drafted the initial manuscript, and revised the manuscript for important intellectual content. Dr Patrick conceptualized and designed the study, drafted the initial manuscript, and revised the manuscript for important intellectual content. Drs Wiese and Leech conceptualized and designed the study and revised the manuscript for important intellectual content. Mr Domenico, Ms McNeer, and Dr Davis assisted with analyses and revised the manuscript for important intellectual content. Drs Matheny and Wright assisted with interpretation of data and revised the manuscript for important intellectual content. All authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work.

CONFLICT OF INTEREST DISCLOSURES: The authors have no conflicts of interest relevant to this article to disclose.

FUNDING: Agency for Healthcare Research and Quality: K08HS029695 (T.J.R.); National Institutes of Health: P50DA046351 (T.J.R., S.W.P., A.D.W., A.L., E.M.), K01DA051683 (A.D.W.), and K01DA050740 (A.A.L.). The funder/sponsor did not participate in the work.

AAP

American Academy of Pediatrics

AUC

area under the curve

EHR

electronic health record

ICI

integrated calibration index

IV

intravenous

LASSO

least absolute shrinkage and selection operator

NICU

neonatal intensive care unit

NOWS

neonatal opioid withdrawal syndrome

OUD

opioid use disorder

SSRI

selective serotonin reuptake inhibitor

1
Hirai
AH
,
Ko
JY
,
Owens
PL
,
Stocks
C
,
Patrick
SW
.
Neonatal abstinence syndrome and maternal opioid-related diagnoses in the US, 2010–2017
.
JAMA.
2021
;
325
(
2
):
146
155
. PubMed doi: 10.1001/jama.2020.24991
2
Strahan
AE
,
Guy
GP
Jr
,
Bohm
M
,
Frey
M
,
Ko
JY
.
Neonatal abstinence syndrome incidence and health care costs in the United States, 2016
.
JAMA Pediatr.
2020
;
174
(
2
):
200
202
. PubMed doi: 10.1001/jamapediatrics.2019.4791
3
Patrick
SW
,
Davis
MM
,
Lehmann
CU
,
Cooper
WO
.
Increasing incidence and geographic distribution of neonatal abstinence syndrome: United States 2009 to 2012
.
J Perinatol.
2015
;
35
(
8
):
650
655
. PubMed doi: 10.1038/jp.2015.36
4
Patrick
SW
,
Barfield
WD
,
Poindexter
BB
, et al;
Committee on Fetus and Newborn, Committee on Substance Use and Prevention
.
Neonatal opioid withdrawal syndrome
.
Pediatrics.
2020
;
146
(
5
):
e2020029074
. PubMed doi: 10.1542/peds.2020-029074
5
Mascarenhas
M
,
Wachman
EM
,
Chandra
I
,
Xue
R
,
Sarathy
L
,
Schiff
DM
.
Advances in the care of infants with prenatal opioid exposure and neonatal opioid withdrawal syndrome
.
Pediatrics.
2024
;
153
(
2
):
e2023062871
. PubMed doi: 10.1542/peds.2023-062871
6
Conradt
E
,
Flannery
T
,
Aschner
JL
, et al
.
Prenatal opioid exposure: neurodevelopmental consequences and future research priorities
.
Pediatrics.
2019
;
144
(
3
):
e20190128
. PubMed doi: 10.1542/peds.2019-0128
7
Wabuyele
SL
,
Colby
JM
,
McMillin
GA
.
Detection of drug-exposed newborns
.
Ther Drug Monit.
2018
;
40
(
2
):
166
185
. doi: 10.1097/FTD.0000000000000485
8
Isemann
BT
,
Stoeckle
EC
,
Taleghani
AA
,
Mueller
EW
.
Early prediction tool to identify the need for pharmacotherapy in infants at risk of neonatal abstinence syndrome
.
Pharmacotherapy.
2017
;
37
(
7
):
840
848
. PubMed doi: 10.1002/phar.1948
9
Patrick
SW
,
Slaughter
JC
,
Harrell
FE
Jr
, et al
.
Development and validation of a model to predict neonatal abstinence syndrome
.
J Pediatr.
2021
;
229
:
154
160.e6
. PubMed doi: 10.1016/j.jpeds.2020.10.030
10
Kaltenbach
K
,
Holbrook
AM
,
Coyle
MG
, et al
.
Predicting treatment for neonatal abstinence syndrome in infants born to women maintained on opioid agonist medication
.
Addiction.
2012
;
107
(
suppl 1
):
45
52
. PubMed doi: 10.1111/j.1360-0443.2012.04038.x
11
Oji-Mmuo
CN
,
Schaefer
EW
,
Liao
L
,
Kaiser
JR
,
Sekhar
DL
.
The possibility of early discharge for newborns being monitored for neonatal abstinence syndrome based on Modified Finnegan Score distributions
.
Clin Pediatr (Phila).
2019
;
58
(
6
):
641
646
. PubMed doi: 10.1177/0009922819832022
12
Wiese
AD
,
Phillippi
JC
,
Muhar
A
, et al
.
Performance of phenotype algorithms for the identification of opioid-exposed infants
.
Hosp Pediatr.
2024
;
14
(
6
):
438
447
. PubMed doi: 10.1542/hpeds.2023-007546
13
Collins
GS
,
Reitsma
JB
,
Altman
DG
,
Moons
KGM
;
members of the TRIPOD group
.
Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement
.
Eur Urol.
2015
;
67
(
6
):
1142
1151
. PubMed doi: 10.1016/j.eururo.2014.11.025
14
Maalouf
FI
,
Cooper
WO
,
Stratton
SM
, et al
.
Positive predictive value of administrative data for neonatal abstinence syndrome
.
Pediatrics.
2019
;
143
(
1
):
e20174183
. doi: 10.1542/peds.2017-4183
15
Crook
TW
,
Munn
EK
,
Scott
TA
, et al
.
Improving the discharge process for opioid-exposed neonates
.
Hosp Pediatr.
2019
;
9
(
8
):
643
648
. PubMed doi: 10.1542/hpeds.2019-0088
16
Morimoto
D
,
Washio
Y
,
Sato
T
, et al
.
Prediction model for nonopiate-induced neonatal abstinence syndrome
.
Pediatr Int.
2023
;
65
(
1
):
e15435
. PubMed doi: 10.1111/ped.15435
17
Landy
R
,
Gomez
I
,
Caverly
TJ
, et al
.
Methods for using race and ethnicity in prediction models for lung cancer screening eligibility
.
JAMA Netw Open.
2023
;
6
(
9
):
e2331155
. PubMed doi: 10.1001/jamanetworkopen.2023.31155
18
Patrick
SW
,
Dupont
WD
,
McNeer
E
, et al
.
Association of individual and community factors with hepatitis C infections among pregnant people and newborns
.
JAMA Health Forum.
2021
;
2
(
10
):
e213470
. PubMed doi: 10.1001/jamahealthforum.2021.3470
19
Harrell
FE
.Multivariable modeling strategies. In:
Harrell
FE
, ed.
Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis.
2nd ed.
Springer
;
2015
:
63
102
.
20
Tibshirani
R
.
Regression shrinkage and selection via the Lasso
.
J R Stat Soc Series B Stat Methodol.
1996
;
58
(
1
):
267
288
. doi: 10.1111/j.2517-6161.1996.tb02080.x
21
Reese
TJ
,
Domenico
HJ
,
Hernandez
A
, et al
.
Implementable prediction of pressure injuries in hospitalized adults: model development and validation
.
JMIR Med Inform.
2024
;
12
:
e51842
. PubMed doi: 10.2196/51842
22
Assel
M
,
Sjoberg
DD
,
Vickers
AJ
.
The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models
.
Diagn Progn Res.
2017
;
1
(
1
):
19
. PubMed doi: 10.1186/s41512-017-0020-3
23
Austin
PC
,
Steyerberg
EW
.
The integrated calibration index (ICI) and related metrics for quantifying the calibration of logistic regression models
.
Stat Med.
2019
;
38
(
21
):
4051
4065
. PubMed doi: 10.1002/sim.8281
24
Steyerberg
EW
,
Harrell
FE
,
Borsboom
GJJM
, et al
.
Internal validation of predictive models: efficiency of some procedures for logistic regression analysis
.
J Clin Epidemiol.
2001
;
54
(
8
):
774
781
.
25
Vickers
AJ
,
Elkin
EB
.
Decision curve analysis: a novel method for evaluating prediction models
.
Med Decis Making.
2006
;
26
(
6
):
565
574
. PubMed doi: 10.1177/0272989X06295361
26
Fitzgerald
M
,
Saville
BR
,
Lewis
RJ
.
Decision curve analysis
.
JAMA.
2015
;
313
(
4
):
409
410
. PubMed doi: 10.1001/jama.2015.37
27
Siddiqui
MM
,
Rais-Bahrami
S
,
Turkbey
B
, et al
.
Comparison of MR/ultrasound fusion-guided biopsy with ultrasound-guided biopsy for the diagnosis of prostate cancer
.
JAMA.
2015
;
313
(
4
):
390
397
. PubMed doi: 10.1001/jama.2014.17942
28
Van Calster
B
,
Vickers
AJ
.
Calibration of risk prediction models: impact on decision-analytic performance
.
Med Decis Making.
2015
;
35
(
2
):
162
169
. PubMed doi: 10.1177/0272989X14547233
29
Reese
TJ
,
Nelson
SD
,
Marcovitz
D
, et al
.
Evaluation of compensatory prescribing after opioid-restricting legislation
.
J Gen Intern Med.
2023
;
38
(
5
):
1338
1340
. Published online 2022 PubMed doi: 10.1007/s11606-022-07941-3
30
Kast
KA
,
Le
TDV
,
Stewart
LS
, et al
.
Impact of inpatient addiction psychiatry consultation on opioid use disorder outcomes
.
Am J Addict.
2024
;
33
(
5
):
543
550
. Published online March 28, 2024 PubMed doi: 10.1111/ajad.13540
31
Kukhareva
PV
,
Caverly
TJ
,
Li
H
, et al
.
Inaccuracies in electronic health records smoking data and a potential approach to address resulting underestimation in determining lung cancer screening eligibility
.
J Am Med Inform Assoc.
2022
;
29
(
5
):
779
788
. PubMed doi: 10.1093/jamia/ocac020
32
Henkhaus
LE
,
Buntin
MB
,
Henderson
SC
,
Lai
P
,
Patrick
SW
.
Disparities in receipt of medications for opioid use disorder among pregnant women
.
Subst Abus.
2022
;
43
(
1
):
508
513
. PubMed doi: 10.1080/08897077.2021.1949664
33
Topol
EJ
.
High-performance medicine: the convergence of human and artificial intelligence
.
Nat Med.
2019
;
25
(
1
):
44
56
. PubMed doi: 10.1038/s41591-018-0300-7
34
Byrne
DW
.
Artificial Intelligence for Improved Patient Outcomes: Principles for Moving Forward with Rigorous Science.
Lippincott Williams & Wilkins
;
2022
.
35
Cabitza
F
,
Zeitoun
JD
.
The proof of the pudding: in praise of a culture of real-world validation for medical artificial intelligence
.
Ann Transl Med.
2019
;
7
(
8
):
161
. PubMed doi: 10.21037/atm.2019.04.07
36
Reese
TJ
,
Mixon
AS
,
Matheny
ME
, et al
.
Using intervention mapping to design and implement a multicomponent intervention to improve antibiotic and NSAID prescribing
.
Transl Behav Med.
2023
;
13
(
12
):
928
943
. PubMed doi: 10.1093/tbm/ibad063
37
Schlechter
CR
,
Reese
TJ
,
Wirth
J
, et al
.
Rapid-cycle designs to adapt interventions for COVID-19 in safety-net healthcare systems
.
Transl Behav Med.
2023
;
13
(
6
):
389
399
. PubMed doi: 10.1093/tbm/ibac101
38
Reese
TJ
,
Liu
S
,
Steitz
B
, et al
.
Conceptualizing clinical decision support as complex interventions: a meta-analysis of comparative effectiveness trials
.
J Am Med Inform Assoc.
2022
;
29
(
10
):
1744
1756
. PubMed doi: 10.1093/jamia/ocac089
39
Reese
T
,
Wright
A
,
Liu
S
, et al
.
Improving the specificity of drug-drug interaction alerts: can it be done?
Am J Health Syst Pharm.
2022
;
79
(
13
):
1086
1095
. PubMed doi: 10.1093/ajhp/zxac045
40
National Library of Medicine
.
NLM Value Set Authority Center: Code Systems and Tools.
National Library of Medicine
. Updated August 9,
2024
. Accessed September 19, 2023. https://www.nlm.nih.gov/vsac/support/authorguidelines/code-systems.html
41
Davis
SE
,
Greevy
RA
Jr
,
Lasko
TA
,
Walsh
CG
,
Matheny
ME
.
Detection of calibration drift in clinical prediction models to inform model updating
.
J Biomed Inform.
2020
;
112
:
103611
. PubMed doi: 10.1016/j.jbi.2020.103611
42
Davis
SE
,
Lasko
TA
,
Chen
G
,
Siew
ED
,
Matheny
ME
.
Calibration drift in regression and machine learning models for acute kidney injury
.
J Am Med Inform Assoc.
2017
;
24
(
6
):
1052
1061
. PubMed doi: 10.1093/jamia/ocx030

Supplementary data