OBJECTIVES:

To identify potentially modifiable or actionable factors related to study completion among healthy mother-infant dyads participating in prospective research.

PATIENTS/METHODS:

We conducted a secondary analysis of completion data from a prospective study on newborn jaundice in the first week of life at a tertiary-care hospital in Philadelphia, PA, from 2015 to 2019. Participation in the original study involved enrollment before newborn discharge and subsequent follow-up for a jaundice assessment between 2 and 6 days of life. For this study, our primary outcome was completion of all study procedures. Associations between predictor variables and the outcome were assessed using bivariate and multivariable analyses. We fit a predictive model of study completion using logistic regression and validated the model using 5-fold cross-validation.

RESULTS:

Of 501 mother-infant dyads enrolled in the original study, 304 completed the study. Median maternal age was 28 years and 81.8% of mothers delivered via vaginal birth. Study completion was associated with colocation of the study visit with the initial well-child visit (adjusted odds ratio [aOR], 2.99, 95% confidence interval [CI], 2.01–4.46) and provision of an alternate phone number by the participant (aOR, 1.99; 95% CI, 1.34–2.96). The cross-validated model performed similarly to our final predictive model and had an average area under the receiver operating characteristic curve of 0.67 (range, 0.59-0.72), with a sensitivity of 68% and specificity of 60%.

CONCLUSIONS:

Findings demonstrate the importance of communication and patient-centric approaches for recruitment and retention in newborn research. Future work should incorporate these approaches while continuing to evaluate study retention strategies.

Enrolling and retaining newborns in prospective research studies remains a priority for researchers and funding agencies. Previous studies involving newborns have primarily focused on demographic and socioeconomic predictors of study retention and completion in high-risk cohorts, including preterm and low birth weight infants, with variable findings.15  Factors associated with retention among these higher risk cohorts include higher maternal education status, higher socioeconomic status and greater income, access to social support, having health insurance, higher infant birth weight, and full-term gestational age. Factors associated with a lack of retention include larger household size, delivery via cesarean section, higher parity, and worse maternal mental health status, whereas factors such as maternal race have yielded varying results.13,68 

Despite these previous findings, few studies have evaluated factors affecting retention of well newborns in prospective research. Recent initiatives such as the Better Outcomes through Research for Newborns network have highlighted the importance of research (and by extension, participant retention) among healthy newborns.9  One example area of research in this population is neonatal jaundice, which occurs in up to 60% of full-term newborns with follow-up usually occurring within a week of birth and, if left untreated, can lead to potentially serious complications, including kernicterus.10,11 

For the current study, we used secondary data from a prospective study designed to test the efficacy of a smartphone application to detect neonatal jaundice12  and constructed a predictive model of study completion. Our overall objective was to focus on identifying participant and study-related factors, especially modifiable and/or actionable factors that are predictive of study completion and that may be generally applicable to prospective research involving healthy newborns. Additionally, we take our predictive modeling 1 step further than many retention studies in the literature by conducting tests of internal validation using the k-fold cross-validation technique, as described in the Methods.

Participants in the current study were drawn from a single tertiary-care urban hospital in Philadelphia, PA, as part of an original study to assess the efficacy of a smartphone application to detect and measure neonatal jaundice in the first week of life among singleton births >35 weeks’ gestational age. Full details of the original study are described elsewhere.12  Exclusion criteria for the original study included any infants who were transferred to the NICU or had undergone phototherapy treatment. In short, mothers were approached between July 2015 and March 2019 during the birth hospitalization to enroll their newborns in the study. Consenting mothers were asked to complete a brief instrument capturing sociodemographic characteristics and contact information. The study visit was scheduled to occur at an outpatient laboratory within walking distance of the birth hospital between 2 and 6 days of life and was scheduled to coincide with the newborns’ first well visit with their pediatrician. Following enrollment, multiple reminder texts were sent to participants instructing them of their follow-up visit; additional attempts and phone calls were made in further efforts to contact participants in the absence of affirmative text responses. At the scheduled outpatient laboratory visit, the smartphone application to assess for jaundice was used, immediately after which a serum bilirubin was obtained. Data for this study were abstracted from the electronic health record (EHR) and participant study records from the original study retrospectively; any participants who completed their study in hospital were also excluded from analysis. The current study of mother-infant dyads was approved by the university’s institutional review board.

Variables that were identified as empirically important to our study outcome based on their clinical relevance or evidence from previous studies were abstracted from 1 of 2 sources: the EHR or primary study materials. Data abstracted from the EHR included maternal age, parity, delivery mode, length of stay, insurance status, discharge day, and follow-up health care provider location. Follow-up location was recoded into a binary variable of follow-up “colocation,” in which “colocation” consisted of visits occurring in 1 of 3 physician practices across the street from or in the same building as the laboratory where the study visit occurred, and “no colocation” for all other locations. Original primary data from the parent study used for this substudy included infant race/ethnicity as specified by the mother and whether an alternate phone number was provided. Of note, information on maternal race was not collected. Participants in the original study were given $50 gift cards for participation; if their follow-up visit was not colocated, they were given an additional $10 gift card to cover the cost of transit and/or parking for the study.

Our primary outcome for all analyses was study completion for the newborn jaundice study, defined as completing both the smartphone application procedure and blood collection at the scheduled study visit.

Variables identified via the literature as having a possible role in study completion including maternal age, parity, delivery mode, length of stay, maternal and infant insurance status, discharge day, follow-up location, infant race/ethnicity, and provision of an alternate contact number were abstracted from the EHR or primary study materials; univariate and bivariate analyses were conducted. After assessing for evidence of collinearity and correlation in several variables of concern (maternal and infant insurance, delivery mode, and length of stay), we determined to include only maternal age, parity, delivery mode, maternal insurance status, discharge day, follow-up location, infant race/ethnicity, and provision of an alternate contact number in our final models, excluding length of stay and infant insurance status. Estimates are shown as odds ratios (ORs) with corresponding 95% confidence intervals (CIs).

To evaluate the predictive validity of the final study completion model, we calculated the following measures of accuracy: sensitivity, specificity, positive and negative predictive values, and percent correctly classified. Receiver operating characteristic curves, which plot the true positive rate of the model against the false-positive rate of the model at various thresholds, were used to compare the predictive power of the model. We report the area under the receiver operating characteristic (AUROC) curve, to quantitatively assess performance of the model. Internal validation of the model was then conducted using k-fold validation, with 5 folds (n = 100-101 per fold) to ensure that each testing fold had a sufficient sample size.1315  The resultant predictive outcomes from the cross-validation model were compared with the predictive outcomes from the full predictive model via confusion matrix, which compares actual versus predicted outcomes using cross-tables, as well as via comparison of measures of predictive validity (eg, sensitivity, specificity) between the full and cross-validated models.

All statistical analyses were conducted using SAS 9.4 (SAS Corporation, Cary, NC).

Of the 558 participants in the original study, 501 (89.8%) met the stated inclusion criteria for this study. Among those included, 60.7% (n = 304) went on to complete their study visit (study completers), whereas 39.3% (n = 197) did not complete the study visit (study noncompleters). In our sample, 47.9% (n = 240) reported infant race/ethnicity as non-Hispanic Black/African American. The median age of mothers was 28 years (mean, 28.2), median parity level was 1 previous child (mean, 1.2), and median length of stay was 2 days (mean, 2.2). Approximately 81.8% (n = 410) delivered via vaginal birth.

Bivariate analyses of study completers versus study noncompleters revealed statistically significant differences based on maternal length of stay (2.1 days vs 2.3 days, P = .01) vaginal versus cesarean section delivery (84.5% vs 77.7%, P = .05), follow-up colocation versus no colocation (75.7% vs 53.3%, P < .001), and provision of an alternate phone number versus no alternate number (49.0% vs 36.0%, P = .004). Full descriptive statistics and bivariate analyses can be found in Table 1.

TABLE 1

Descriptive Statistics and Bivariate Analyses (by Completion Status)

Total Enrollees (n = 501)Study Completers (n = 304)Study Noncompleters (n = 197)
 Mean (SD) or Percent (nMean (SD) or Percent (nMean (SD) or Percent (nP 
Maternal age, y 28.2 (5.7) 28.3 (5.7) 28.2 (5.7) .82 
Length of stay, d 2.2 (0.5) 2.1 (0.4) 2.3 (0.6) .01* 
Parity 1.2 (1.3) 1.2 (1.4) 1.2 (1.3) .84 
Delivery mode     
 Vaginal 81.8 (410) 84.5 (257) 77.7 (153)  
 Cesarean section 18.2 (91) 15.5 (47) 22.3 (44) .05* 
Infant insurance status     
 Medicaid insurance/uninsured 67.3 (337) 66.8 (203) 68.0 (134)  
 Private insurance 32.7 (164) 33.2 (101) 32.0 (63) .77 
Maternal insurance status     
 Medicaid insurance/uninsured 64.5 (323) 63.8 (194) 65.5 (129)  
 Private insurance 35.5 (178) 36.2 (110) 34.5 (68) .70 
Day of discharge     
 Monday 9.6 (48) 7.9 (24) 12.2 (24)  
 Tuesday 13.0 (65) 13.5 (41) 12.2 (24)  
 Wednesday 11.4 (57) 12.2 (37) 10.2 (20)  
 Thursday 18.2 (91) 18.4 (56) 17.8 (35)  
 Friday 17.6 (88) 17.1 (52) 18.3 (36)  
 Saturday 18.2 (91) 19.4 (59) 16.2 (32)  
 Sunday 12.2 (61) 11.5 (35) 13.2 (26) .69 
Follow-up colocation     
 Colocated 66.9 (335) 75.7 (230) 53.3 (105)  
 Not colocated 33.1 (166) 24.3 (74) 46.7 (92) <.001* 
Infant race/ethnicity     
 Non-Hispanic Asian 4.0 (20) 4.0 (12) 4.1 (8)  
 Non-Hispanic white 18.0 (90) 16.8 (51) 19.8 (39)  
 Non-Hispanic African American/Black 47.9 (240) 50.3 (153) 44.2 (87) .50 
 Hispanic 18.0 (90) 18.4 (56) 17.2 (34)  
 Non-Hispanic Asian American and Pacific Islander, other, or multiracial 12.2 (61) 10.5 (32) 14.7 (29)  
Alternate phone number     
 Provided 43.9 (220) 49.0 (149) 36.0 (71)  
 Not provided 56.1 (281) 51.0 (155) 64.0 (126) .004* 
Total Enrollees (n = 501)Study Completers (n = 304)Study Noncompleters (n = 197)
 Mean (SD) or Percent (nMean (SD) or Percent (nMean (SD) or Percent (nP 
Maternal age, y 28.2 (5.7) 28.3 (5.7) 28.2 (5.7) .82 
Length of stay, d 2.2 (0.5) 2.1 (0.4) 2.3 (0.6) .01* 
Parity 1.2 (1.3) 1.2 (1.4) 1.2 (1.3) .84 
Delivery mode     
 Vaginal 81.8 (410) 84.5 (257) 77.7 (153)  
 Cesarean section 18.2 (91) 15.5 (47) 22.3 (44) .05* 
Infant insurance status     
 Medicaid insurance/uninsured 67.3 (337) 66.8 (203) 68.0 (134)  
 Private insurance 32.7 (164) 33.2 (101) 32.0 (63) .77 
Maternal insurance status     
 Medicaid insurance/uninsured 64.5 (323) 63.8 (194) 65.5 (129)  
 Private insurance 35.5 (178) 36.2 (110) 34.5 (68) .70 
Day of discharge     
 Monday 9.6 (48) 7.9 (24) 12.2 (24)  
 Tuesday 13.0 (65) 13.5 (41) 12.2 (24)  
 Wednesday 11.4 (57) 12.2 (37) 10.2 (20)  
 Thursday 18.2 (91) 18.4 (56) 17.8 (35)  
 Friday 17.6 (88) 17.1 (52) 18.3 (36)  
 Saturday 18.2 (91) 19.4 (59) 16.2 (32)  
 Sunday 12.2 (61) 11.5 (35) 13.2 (26) .69 
Follow-up colocation     
 Colocated 66.9 (335) 75.7 (230) 53.3 (105)  
 Not colocated 33.1 (166) 24.3 (74) 46.7 (92) <.001* 
Infant race/ethnicity     
 Non-Hispanic Asian 4.0 (20) 4.0 (12) 4.1 (8)  
 Non-Hispanic white 18.0 (90) 16.8 (51) 19.8 (39)  
 Non-Hispanic African American/Black 47.9 (240) 50.3 (153) 44.2 (87) .50 
 Hispanic 18.0 (90) 18.4 (56) 17.2 (34)  
 Non-Hispanic Asian American and Pacific Islander, other, or multiracial 12.2 (61) 10.5 (32) 14.7 (29)  
Alternate phone number     
 Provided 43.9 (220) 49.0 (149) 36.0 (71)  
 Not provided 56.1 (281) 51.0 (155) 64.0 (126) .004* 
*

Significance at a P < .05 value.

Of the variables included in our logistic regression model, only follow-up colocation, provision of an alternate phone number, and delivery mode remained statistically significant predictors of study completion. Holding all other covariates constant, infants who had follow-up colocation were just under 3 times as likely to be study completers versus those that did not (adjusted OR [aOR], 2.99; 95% CI, 2.01–4.46); mothers who provided an alternate phone number were just under twice as likely to be study completers compared with those who had not (aOR, 1.99; 95% CI, 1.34–2.96); and those who delivered via vaginal birth were just over 1.5 times as likely to complete the study (aOR, 1.65; 95% CI,1.01–2.70). Full model information can be found in Table 2.

TABLE 2

Full Predicted Model of Neonatal Study Completion

Predictors of Study CompletionAdjusted Odds of Study Completion aOR (95% CI)
Maternal age, ya 1.01 (0.97–1.05) 
Paritya 1.03 (0.87–1.21) 
Vaginal delivery 1.65 (1.01–2.70)a 
Mother on Medicaid 0.81 (0.52–1.27) 
 Infant race  
 Non-Hispanic white Ref 
 Hispanic 1.70 (0.88–3.30) 
 Non-Hispanic Black 1.78 (0.99–3.21) 
 Non-Hispanic Asian 1.51 (0.54–4.23) 
 Non-Hispanic AAPI, other, or multiracial 1.01 (0.50–2.05) 
Follow-up colocated 2.99 (2.01–4.46)b 
Alternate phone number provided 1.99 (1.34–2.96)b 
Predictors of Study CompletionAdjusted Odds of Study Completion aOR (95% CI)
Maternal age, ya 1.01 (0.97–1.05) 
Paritya 1.03 (0.87–1.21) 
Vaginal delivery 1.65 (1.01–2.70)a 
Mother on Medicaid 0.81 (0.52–1.27) 
 Infant race  
 Non-Hispanic white Ref 
 Hispanic 1.70 (0.88–3.30) 
 Non-Hispanic Black 1.78 (0.99–3.21) 
 Non-Hispanic Asian 1.51 (0.54–4.23) 
 Non-Hispanic AAPI, other, or multiracial 1.01 (0.50–2.05) 
Follow-up colocated 2.99 (2.01–4.46)b 
Alternate phone number provided 1.99 (1.34–2.96)b 

Continuous variable, aOR measures change in odds per unit increase. AAPI, Asian American and Pacific Islander; aOR, adjusted odds ratio.

a

P < .05.

b

P < .001.

The final model had an AUROC of 0.68 (95% CI, 0.63–0.73), indicating statistically better than chance prediction of study completion. Sensitivity of the final model indicates that 69% of study completers were correctly predicted to complete the study, and specificity indicates that 62% of those study noncompleters were correctly predicted not to complete the study.

Cross-validation of the model showed accurate performance with little difference between the predictive model and the cross-validated model, which had an average AUROC of 0.65, a sensitivity of 68%, and a specificity of 60%. The confusion matrix, which shows differences in predicted and actual outcome between the predictive and cross-validated model, can be found in Table 3. These results showed that classification of final and cross-validated models had near-total concordance, with only 5 predictions shifting in the cross-validation. Full diagnostic measures of the predictive and cross-validated models can be found in Table 4 for comparison. An overlay of the final model AUROC curve and the cross-validation AUROC curves is shown in Fig 1 and demonstrates visual similarity, noting that the range of cross-validation values (0.59-0.72) only slightly differs from the CI of the full model (95% CI, 0.63–0.73).

FIGURE 1

Overlay of ROC curves between full prediction model and cross-validated test models. The full prediction model ROC is labeled predict, with cross-validation ROCs labeled CV1-CV5. ROC, receiver operating characteristic.

FIGURE 1

Overlay of ROC curves between full prediction model and cross-validated test models. The full prediction model ROC is labeled predict, with cross-validation ROCs labeled CV1-CV5. ROC, receiver operating characteristic.

Close modal
Table 3

Cross-Validation Confusion Matrix

Predictive Outcome
Predicted Model 
  Noncompletion Completion Total 
True outcome Noncompletion 81 116 197 
Completion 50 254 304 
Total 131 370 501 
Cross-Validated Model 
  Noncompletion Completion Total 
True outcome Noncompletion 78 119 197 
Completion 52 252 304 
Total 130 371 501 
Predictive Outcome
Predicted Model 
  Noncompletion Completion Total 
True outcome Noncompletion 81 116 197 
Completion 50 254 304 
Total 131 370 501 
Cross-Validated Model 
  Noncompletion Completion Total 
True outcome Noncompletion 78 119 197 
Completion 52 252 304 
Total 130 371 501 

Table displays actual and model-predicted outcomes in both the full predicted model (as shown in Table 2) in the top half of the table, and the cross-validation model directly below. Column variables for noncompletion and completion indicate actual outcomes; row variables indicate the outcomes predicted by the model.

Table 4

Cross-Validation Diagnostic Evaluation

Predicted Full ModelCross-Validated Model
 Percent (95% CI) Percent (95% CI) 
PPV 83.6 (80.2–86.5) 82.9 (79.45–85.8) 
NPV 41.1 (36.3–46.1) 39.6 (34.8–44.6) 
Sensitivity 68.7 (63.7–73.3) 67.9 (62.9–72.7) 
Specificity 61.8 (52.9–70.2) 60.0 (51.1–68.5) 
correctly classified 66.9 (62.6–71.0) 65.9 (61.5–70.0) 
AUROCa 0.68 (0.63–0.73) 0.65 (0.59–0.72) 
Predicted Full ModelCross-Validated Model
 Percent (95% CI) Percent (95% CI) 
PPV 83.6 (80.2–86.5) 82.9 (79.45–85.8) 
NPV 41.1 (36.3–46.1) 39.6 (34.8–44.6) 
Sensitivity 68.7 (63.7–73.3) 67.9 (62.9–72.7) 
Specificity 61.8 (52.9–70.2) 60.0 (51.1–68.5) 
correctly classified 66.9 (62.6–71.0) 65.9 (61.5–70.0) 
AUROCa 0.68 (0.63–0.73) 0.65 (0.59–0.72) 
a

The AUROC for the cross-validated model is presented as the average with range in parentheses. AUROC, area under the receiver operating characteristic; NPV, negative predictive value; PPV, positive predictive value.

Among mother-infant dyads enrolled in a prospective cohort study on neonatal jaundice, we identified a few factors predictive of higher odds of study completion. In our bivariate analysis, we identified maternal length of stay, delivery mode, follow-up colocation, and provision of alternate phone number as predictors of completion. In our final model, delivery mode, physical colocation of the first infant follow-up visit with the study visit, and provision of an alternate phone number retained significance; no other variables included were significant predictors. These findings offer guidance to researchers, particularly in the design phase of studies, who conduct hospital-based studies involving prospective recruitment of healthy newborns and their families.

We identified delivery mode as a potentially important and actionable predictor of study completion. Our findings suggest that delivery via cesarean section may be slightly predictive of noncompletion. Given that a full recovery following cesarean section can take significant time after hospital discharge and may cause reduced mobility and increased pain, as well as other physical and mental health issues,16  women who undergo a cesarean section delivery may require additional accommodations for study completion, particularly when the study requires follow-up visits. Additionally, it has been noted that timing of first outpatient well visits for those delivering via cesarean section is later than those delivering via vaginal birth,17  which may corroborate evidence of issues of mobility and may indicate a need for improving the ease of participation for mothers delivering via cesarean section.

Additionally, colocation of the first infant follow-up and the study visit was an important highly predictive factor of study completion. It is likely that part of the explanation for these findings is that the coordination of the study visit with the outpatient visit saves significant time and effort for families wanting to participate and allows for a more uniform and personalized approach (eg, the research team member maintains consistent communication leading up to the study visit and conducts the study visit first, then subsequently guides the mother-infant dyad to their well visit) compared with those following up at an office that is remote from the study visit. This finding is also especially notable considering that participants who did not have colocated follow-up were given additional financial compensation for the cost of travel. This suggests that additional barriers to participation were not sufficiently addressed by additional financial incentives. In addition, one must also consider that local context may play a role. Nearly one-quarter of Philadelphians rely on public transit18 ; given the already difficult task of transportation after giving birth, this may represent an additional burden to study completion beyond financial compensation. Although it is difficult with our current data to confirm that a significant proportion of our sample did rely on public transit, data from other studies have suggested that transit may play an important role in hospital access, and particularly for low-income populations,19,20  which is important to consider given that nearly 70% of our sample was uninsured or on Medicaid.

Other cohort studies of longer duration have shown that consistent communication is an important predictor of study completion21 ; in our sample, the mother providing an alternate phone number may have facilitated easier/more consistent communication by offering multiple points of contact, given the reliance on phone communication methods in this study as described in the Methods. It is also possible that by obtaining contact information directly from the participants (as opposed to via extraction from EHR data) may have led to improved accuracy of contact information compared with what can be obtained from EHR data, which may be prone to error or inaccuracy. Although outside the scope of our current study, future research on study design and implementation may investigate whether contact information provided by study participants to research coordinators is more accurate than EHR-derived contact information. It is also possible that provision of an alternate contact number may in part be a function of social support because individuals without social support may not have a second point of contact to provide, though we did not collect information on social support.

Our findings highlight potential barriers to study completion in otherwise willing participants. These findings strengthen existing evidence and show that some factors identified in high-risk infants, such as staff-participant communication, are also applicable to studies with healthy newborns. As a means of improving retention, our findings support obtaining alternate phone numbers directly from participants when possible and colocate study visits with other necessary clinical care. Although not directly supported by our analysis, we argue, based on informed speculation from our findings regarding predictiveness of delivery mode as well as other literature on the topic, that researchers focusing on well-newborn populations should also seek to address potential barriers to participation a priori during study design, (eg, how the burden of cesarean section delivery or the use of public transit may hamper participation after enrollment). In addition, future studies should collect completion and retention data as well as sociodemographic and other study factors previously identified in the literature to evaluate study recruitment and retention practices. Accurate collection of these data in real time can help to improve researchers’ abilities to prospectively identify and address potential socioeconomic or structural barriers participants may face in participating in research, which may also improve equity of participation in research.

Finally, as noted in the Results, many factors previously identified as potentially predictive (eg, race/ethnicity, insurance status, parity) were not predictive in our models. This contradiction should be considered in combination with our diagnostic statistics, which indicate our model is better than chance at predicting study completion, but not perfect. It is possible that our findings represent a refutation of previous research models predicting study completion. However, based on our experience as clinicians and epidemiologists, we suspect it is more likely that prediction models, and predictive indicators, may vary substantially from population to population, even when those population-level differences are minor (eg, primarily healthy newborns versus high-risk newborns). Thus, before using these models in other settings, investigators should consider whether variables such as race/ethnicity, insurance status, and parity might modify the observed associations, with careful consideration of how this could affect external validity of the models.

These findings should be considered in light of several strengths and limitations. First, our methods allowed us to assess the predictive accuracy of our model, providing additional confidence in our results going a step further than many studies that look at factors of study completion. Second, our large sample of healthy newborns represents a relatively understudied subpopulation in the context of neonatal research.

One important limitation of this study was the absence of potentially important but unmeasured predictors of study completion (eg, maternal health literacy, attitudes toward research, maternal race/ethnicity [particularly as a proxy for racism]),3,6  which may have contributed to the observed modest predictive accuracy of our model. Taken in the context of previous studies with conflicting results on predictors of study completion, results of this study highlight the need for a nuanced approach to prediction and the general difficulty of constructing predictive models for study completion. The positive and negative predictive values reported from our analyses should be interpreted cautiously given that these measures are sensitive to prevalence of the outcome; as such, we recommend focusing on sensitivity and specificity.

Because our data came from a single hospital, generalizability is also a potential concern. However, this single urban tertiary-care hospital serves a diverse patient population from all parts of Philadelphia and surrounding counties such as Bucks, Montgomery, and Delaware.22  We also note that given the period of the current study (occurring within the first week of life), our findings may not generalize to well-newborn research requiring follow-up over on longer time scales. Furthermore, because phototherapy was an exclusion factor for the original study, it is likely that severity of hyperbilirubinemia was associated with eligibility for the current study and thus could cause some degree of selection/sampling bias. However, the outcome of our current study (study completion) was agnostic to neonatal bilirubin level, making selection bias less likely to present as a major issue. Although outside of the scope of this manuscript, methods for quantitative assessment of selection bias are available and may be an area of focus for future research.23,24 

We identified several factors predictive of study completion in a cohort of healthy newborns. These factors emphasize the importance of good participant-coordinator communication and more patient-centric approaches in clinical research to improve retention, such as those aimed at reducing participant burden. Additionally, our findings present future researchers with cost-effective guidance on study design, which may improve participant retention among diverse urban populations of healthy newborns. Future work should aim to prioritize these and other factors that maximize completion while continuing to collect information important to evaluating retention strategies. In particular, evaluating and addressing potential socioeconomic barriers to study completion may increase equity in prospective clinical research.

The authors thank the sitewide principal investigator for the original study, Dr James Taylor, as well as previous research coordinators Mitali Panchani and Rachel Powell for their invaluable help in collecting the data necessary to conduct this analysis.

FUNDING: No external funding.

CONFLICT OF INTEREST DISCLOSURES: The authors have indicated they have no potential conflicts of interest to disclose.

Mr Fusfeld conducted data collection in parent study, designed and conducted current substudy, and fully wrote and edited manuscript, with assistance from coauthors in editing. Dr Goyal was coprimary advisor on project, assisted in design and analysis of substudy, helped throughout during manuscript preparation, was site co-principal investigator for the parent study. Dr Goldstein was statistical advisor on project, assisted in analysis of substudy, and helped throughout during manuscript preparation. Dr Chung was primary advisor on project, assisted in design and analysis of substudy, helped throughout during manuscript preparation, and was site principal investigator for the parent study.

1.
Aylward
GP
,
Hatcher
RP
,
Stripp
B
,
Gustafson
NF
,
Leavitt
LA
.
Who goes and who stays: subject loss in a multicenter, longitudinal follow-up study
.
J Dev Behav Pediatr
.
1985
;
6
(
1
):
3
8
2.
Callanan
C
,
Doyle
L
,
Rickards
A
,
Kelly
E
,
Ford
G
,
Davis
N
.
Children followed with difficulty: how do they differ?
J Paediatr Child Health
.
2001
;
37
(
2
):
152
156
3.
Constantine
WL
,
Haynes
CW
,
Spiker
D
,
Kendall-Tackett
K
,
Constantine
NA
.
Recruitment and retention in a clinical trial for low birth weight, premature infants
.
J Dev Behav Pediatr
.
1993
;
14
(
1
):
1
7
4.
DeMauro
SB
,
Bellamy
SL
,
Fernando
M
,
Hoffmann
J
,
Gratton
T
,
Schmidt
B
;
PROP Investigators
.
Patient, family, and center-based factors associated with attrition in neonatal clinical research: a prospective study
.
Neonatology
.
2019
;
115
(
4
):
328
334
5.
Wolke
D
,
Söhne
B
,
Ohrt
B
,
Riegel
K
.
Follow-up of preterm children: important to document dropouts
.
Lancet
.
1995
;
345
(
8947
):
447
6.
Cunningham
M
,
Thomson
V
,
McKiever
E
,
Dickinson
LM
,
Furniss
A
,
Allison
MA
.
Infant, maternal, and hospital factors’ role in loss to follow-up after failed newborn hearing screening
.
Acad Pediatr
.
2018
;
18
(
2
):
188
195
7.
Sullivan
JA
,
Wiese
AM
,
Boone
KM
,
Rausch
J
,
Keim
SA
.
To attend, or not to attend: examining caregiver intentions and study compliance in a pediatric, randomized controlled trial
.
Clin Trials
.
2020
;
17
(
2
):
223
230
8.
Zeitlin
W
,
Auerbach
C
,
Mason
S
,
Spivak
L
,
Erdman
A
.
Factors predicating loss to follow-up with rescreening in early hearing detection and intervention programs
.
Fam Soc
.
2019
;
100
(
2
):
213
223
9.
Simpson
E
,
Goyal
NK
,
Dhepyasuwan
N
, et al
.
Prioritizing a research agenda: a Delphi study of the better outcomes through research for newborns (BORN) network
.
Hosp Pediatr
.
2014
;
4
(
4
):
195
202
10.
Hamza
A
.
Kernicterus
.
Autops Case Rep
.
2019
;
9
(
1
):
e2018057
11.
Porter
ML
,
Dennis
BL
.
Hyperbilirubinemia in the term newborn
.
Am Fam Physician
.
2002
;
65
(
4
):
599
606
12.
Taylor
JA
,
Stout
JW
,
de Greef
L
, et al
.
Use of a smartphone app to assess neonatal jaundice
.
Pediatrics
.
2017
;
140
(
3
):
e20170312
13.
Flatley
C
,
Gibbons
K
,
Hurst
C
,
Flenady
V
,
Kumar
S
.
Cross-validated prediction model for severe adverse neonatal outcomes in a term, non-anomalous, singleton cohort
.
BMJ Paediatr Open
.
2019
;
3
(
1
):
e000424
14.
Steyerberg
EW
,
Harrell
FE
Jr
,
Borsboom
GJ
,
Eijkemans
MJ
,
Vergouwe
Y
,
Habbema
JD
.
Internal validation of predictive models: efficiency of some procedures for logistic regression analysis
.
J Clin Epidemiol
.
2001
;
54
(
8
):
774
781
15.
Steyerberg
EW
,
Vickers
AJ
,
Cook
NR
, et al
.
Assessing the performance of prediction models: a framework for traditional and novel measures
.
Epidemiology
.
2010
;
21
(
1
):
128
138
16.
Kealy
MA
,
Small
RE
,
Liamputtong
P
.
Recovery after caesarean birth: a qualitative study of women’s accounts in Victoria, Australia
.
BMC Pregnancy Childbirth
.
2010
;
10
:
47
17.
O’Donnell
HC
,
Trachtman
RA
,
Islam
S
,
Racine
AD
.
Factors associated with timing of first outpatient visit after newborn hospital discharge
.
Acad Pediatr
.
2014
;
14
(
1
):
77
83
18.
Budick
S
.
The cost of commuting for Philadelphians
.
19.
Goldstein
ND
,
Kahal
D
,
Testa
K
,
Burstyn
I
.
Inverse probability weighting for selection bias in a Delaware community health center electronic medical record study of community deprivation and hepatitis C prevalence
.
Ann Epidemiol
.
2021
;
60
:
1
7
20.
Heaps
WA
,
Abramsohn
E
,
Skillen
E
.
Public transportation in the US: a driver of health and equity
.
21.
MacBean
V
,
Drysdale
SB
,
Zivanovic
S
,
Peacock
JL
,
Greenough
A
.
Participant retention in follow-up studies of prematurely born children
.
BMC Public Health
.
2019
;
19
(
1
):
1233
22.
Thomas Jefferson University Hospital
.
Southeastern Pennsylvania community health needs assessment
.
23.
Lash
TL
,
Fink
AK
,
Fox
MP
.
A guide to implementing quantitative bias analysis. In: Lash TL, Fox MP, Fink AK, eds
.
Applying Quantitative Bias Analysis to Epidemiologic Data
.
New York, NY
;
2009
:
13
32
24.
Lash
TL
,
Fink
AK
,
Fox
MP
.
Data sources for bias analysis
. In:
Lash
TL
,
Fox
MP
,
Fink
AK
, eds.
Applying Quantitative Bias Analysis to Epidemiologic Data
.
New York, NY
:
Springer
;
2009
:
33
41