Video Abstract

Video Abstract

Close modal
OBJECTIVES

To evaluate the diagnostic accuracy of the Early Autism Evaluation (EAE) Hub system, a statewide network that provides specialized training and collaborative support to community primary care providers in the diagnosis of young children at risk for autism spectrum disorder (ASD).

METHODS

EAE Hub clinicians referred children, aged 14 to 48 months, to this prospective diagnostic study for blinded follow-up expert evaluation including assessment of developmental level, adaptive behavior, and ASD symptom severity. The primary outcome was agreement on categorical ASD diagnosis between EAE Hub clinician (index diagnosis) and ASD expert (reference standard).

RESULTS

Among 126 children (mean age: 2.6 years; 77% male; 14% Latinx; 66% non-Latinx white), 82% (n = 103) had consistent ASD outcomes between the index and reference evaluation. Sensitivity was 81.5%, specificity was 82.4%, positive predictive value was 92.6%, and negative predictive value was 62.2%. There was no difference in accuracy by EAE Hub clinician or site. Across measures of development, there were significant differences between true positive and false negative (FN) cases (all Ps < .001; Cohen’s d = 1.1–1.4), with true positive cases evidencing greater impairment.

CONCLUSIONS

Community-based primary care clinicians who receive specialty training can make accurate ASD diagnoses in most cases. Diagnostic disagreements were predominately FN cases in which EAE Hub clinicians had difficulty differentiating ASD and global developmental delay. FN cases were associated with a differential diagnostic and phenotypic profile. This research has significant implications for the development of future population health solutions that address ASD diagnostic delays.

What’s Known on This Subject:

Finding effective and scalable solutions to address autism diagnostic delays and disparities is a public health imperative. Tiered community-based approaches that enhance the capacity of primary care clinicians to provide diagnostic evaluations hold promise for addressing this problem.

What This Study Adds:

Primary care clinicians trained as part of a statewide system can make accurate ASD diagnoses in a majority of cases, extending evidence that tiered, community-based models may be a valid approach for reducing ASD diagnostic delays.

Autism spectrum disorder (ASD) is a complex neurodevelopmental disorder defined by impairments in social communication and the presence of restricted and repetitive behaviors,1  with estimated prevalence of 1 in 36 8-year-old children.2  Although reliable ASD diagnosis is often possible in the second year of life,3  the median age of diagnosis in the United States is 49 months.2  For many children, this delay46  is because of a bottleneck in access to diagnostic evaluations.7  Shortages of specialists8  trained to provide diagnostic evaluations and clustering of available specialists in metropolitan areas911 result in excessive family travel requirements, lost wages, and need to find alternative caregivers for other children or dependents.1215  Further, labor- and cost-intensive evaluation models and assessment tools limit efficiency and contribute to organizational1618  and family19  burden. These factors, together with systemic influences on socioeconomic status, cultural stigma, and reduced access to information, education, and community resources, contribute to substantial diagnostic disparities for historically minoritized children and families.2022  ASD diagnostic delays impede enrollment in targeted interventions, with cascading individual2325  and societal2628  consequences. As such, finding feasible, equitable, and scalable solutions that address ASD diagnostic delays and disparities is a public health imperative.

In recent years, tension between the notion that ASD diagnostic evaluation must be expert-driven to maintain quality standards29  and the very significant demands for increasing capacity of diagnostic service systems18  has grown. Yet, there seems to be increasing recognition that tiered, community-based approaches that enhance the capacity of primary care providers (PCP) to conduct diagnostic evaluations of young children at risk for ASD hold promise for reducing delays and disparities.30,31  Both the American Academy of Pediatrics32  and the Canadian Pediatric Society33  now recognize that general pediatricians with training in application of Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition, ASD criteria can make a clinical diagnosis of ASD. Although nonspecialist providers such as PCPs will not have the expertise to make a definitive diagnosis for all at-risk children (ie, given the substantial heterogeneity of the disorder), there is mounting evidence to suggest that PCPs can make an initial clinical diagnosis to facilitate initiation of services for many young children with clear ASD symptoms.18,30,31,34 

Data on the implementation of novel diagnostic technologies35,36  and streamlined training30  and evaluation models3740  are emerging rapidly. Findings suggest that these innovative diagnostic approaches may shift clinician knowledge and perceived competency, improve access, and result in moderate to high degree of diagnostic accuracy. However, studies on PCP training in ASD diagnosis have been limited by small sample sizes, variable methodological quality, and heterogenous design and selection of outcome variables.30  Guan et al30  recently called for more rigorous studies of PCP evaluation models that include demographic characteristics of clinicians and patients, comprehensive assessments of outcome, and data on diagnostic accuracy.

Community-based, PCP-delivered ASD diagnostic evaluation models have high potential to reduce diagnostic delays and disparities. As such, our goal in the present prospective diagnostic study was to evaluate the diagnostic accuracy of a statewide model of early ASD evaluation in the primary care setting. Specifically, we present indices of diagnostic accuracy between PCPs trained to deliver ASD evaluations and comprehensive expert ASD evaluation, as well as differences in diagnostic, demographic, and phenotypic characteristics across diagnostic accuracy groups.

This study took place within the Early Autism Evaluation (EAE) Hub system, a statewide network of community PCPs trained to provide streamlined diagnostic evaluations for young children, aged 14 to 48 months, at risk for ASD.39  As outlined in McNally Keehn et al,39  EAE Hub clinician training involved didactics in ASD evaluation of young children, as well as a clinical practicum with practice-based coaching until mastery of all components of the standard clinical diagnostic evaluation was achieved. After initial training, all PCPs participated in a virtual longitudinal learning collaborative (which meets every month) with content on challenging case presentations, updated ASD diagnosis and care management practice standards, billing and coding guidance, and information on statewide ASD resources. EAE Hubs receive referrals from regional PCPs for evaluation of children determined to be at risk for ASD on the basis of surveillance and/or developmental screening, and then follow a standard clinical evaluation protocol including administration/review of standard developmental and autism screening tools (ie, Ages and Stages Questionnaire-3 and Modified Checklist for Autism in Toddlers–Revised With Follow-Up), a developmental history and Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition, focused ASD clinical interview, physical examination, and administration of an observational assessment of ASD (Screening Tool for Autism in Toddlers41 ). The EAE Hub clinician then issues a best-estimate ASD diagnosis and report with clinical recommendations, including information on community and statewide interventions and resources for children with ASD and developmental disabilities.

Standards for Reporting Diagnostic Accuracy guidelines were followed in the design and conduct of this prospective diagnostic study. Seven EAE Hubs set within primary care practices (ie, including 6 large health system group practices and 1 private practice) referred children, aged 14 to 48 months evaluated for ASD in the community primary care setting, to the study between June 2019 and August 2022. To be included, children were aged 14 to 48 months at time of EAE Hub evaluation and had an English-speaking primary caregiver/guardian. This study was approved by the Indiana University School of Medicine institutional review board, and written informed consent was obtained from legal guardians of all participants.

EAE Hubs were recruited into the study in a nonrandom, staggered manner during the study period; site startup was impacted by coronavirus disease 2019 institutional research and patient care regulations. Each site referred a prospective, consecutive sample of children who received an EAE Hub evaluation after site startup until ∼20 children from each site were enrolled (note: Site 1 recruited a greater number of participants because they served as a pilot and study site; Site 4 recruited fewer participants because of relocation of the EAE Hub clinician during the study period). This recruitment procedure allowed the study team to maintain diagnostic blindness and assess children with both ASD and non-ASD outcomes without referral bias. During EAE Hub evaluations, clinicians (or a member of the EAE Hub clinical team; eg, nurse or medical assistant) provided caregivers of children evaluated with a study brochure, brief verbal description of the study, and obtained signed consent to share contact information with the study team for recruitment and enrollment. Once enrolled, an electronic caregiver-report survey (ie, caregiver-reported demographic data on child race/ethnicity and caregiver/family income and education level) and EAE Hub evaluation data (ie, index ASD diagnosis and clinician diagnostic certainty) were collected by a member of the study team (Fig 1). The study team, consisting of a licensed clinical psychologist (R.M.K., B.E., or T.R.) with expertise in evaluation of ASD in toddlers and young children and clinical research technician (advanced graduate student or postdoctoral fellow: G.K., L.H., or A.M.M.), traveled to the EAE Hub to conduct a follow-up gold-standard ASD diagnostic assessment within 16 weeks of the initial EAE Hub evaluation. Figure 1 and Supplemental Information detail the outcome measures, including child, caregiver, and clinician measures, administered to obtain a best-estimate ASD diagnosis (reference standard diagnosis). Participants were compensated with a gift card in the amount of $25 per hour of completed research evaluation (up to a maximum of $75).

FIGURE 1

Study procedure.

FIGURE 1

Study procedure.

Close modal

Data analyses were performed using SPSS (IBM SPSS Statistics, Version 28, Armonk, NY: IBM Corp) and JMP (JMP, Version 13.0, Cary, NC: SAS Institute Inc.). Continuous variables are reported as means and SDs, and categorical variables are reported as absolute frequencies and percentages. Diagnostic accuracy, the primary outcome of interest, was calculated by comparing percentage agreement between the EAE Hub diagnosis and ASD specialist on categorical ASD diagnosis (ASD; non-ASD). Chance-corrected agreement (κ) and accuracy indices of sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and the 95% confidence interval (CI) are reported. There were no missing data or indeterminate results regarding index or reference diagnoses. Variability in diagnostic accuracy by EAE Hub site and clinician was examined via a series of χ2 and Fisher’s exact tests. Exploratory post-hoc analyses were conducted to examine diagnostic, demographic, and phenotypic differences between true positive (TP) and false negative (FN) outcome groups to understand differences between children who were correctly diagnosed with ASD and those who were missed (or unable to be definitively diagnosed) by nonspecialist clinicians. χ2 and 2-sided t tests were used to compare categorical and continuous variables, respectively. Finally, an additional subset analysis of accuracy indices for cases with EAE Hub clinician diagnostic certainty ratings of Highly or Completely Certain was conducted to examine whether agreement improved on the basis of clinician perceptions of diagnostic confidence. A sample size of N = 126 provided an upper/lower limit 95% CI width of 0.067 for overall diagnostic agreement, assuming 82% agreement (ie, as observed in this study).

Participant flow through the study is detailed in Fig 2. Of 182 referred children, 131 enrolled, and index and reference standard diagnosis evaluations were included in the final analysis for 126 children across 6 EAE Hubs. Ten clinicians conducted a mean of 12.6 (SD = 7.4) evaluations (Supplemental Table 4 for EAE Hub clinician demographics and learning collaborative participation). Mean age of children was 2.6 years; 77% (n = 97) were male, 14% (n = 18) were Hispanic/Latinx, and 66% (n = 84) were non-Latinx white (Table 1). Across all children evaluated, scores on measures of developmental and adaptive skills fell well below the average range and ASD symptom severity was in the moderate range.42  Seventy-five percent (n = 94) of children had a reference diagnosis of ASD; 10% had global developmental delay (GDD) (n = 13), 10% had language delay (n = 13), and 5% (n = 6) had another emotional, behavioral, or medical concern (n = 6).

FIGURE 2

EAE Hub site and participant flow through the study. Site 1 served as a pilot and study site, resulting in > 20 participants referred. Site 4 ended recruitment before meeting criterion because of relocation of the EAE Hub clinician during the study period. Startup and recruitment duration for some sites was impacted by coronavirus disease 2019 institutional research and patient care regulations.

FIGURE 2

EAE Hub site and participant flow through the study. Site 1 served as a pilot and study site, resulting in > 20 participants referred. Site 4 ended recruitment before meeting criterion because of relocation of the EAE Hub clinician during the study period. Startup and recruitment duration for some sites was impacted by coronavirus disease 2019 institutional research and patient care regulations.

Close modal
TABLE 1

Child Demographics and Clinical Characteristics

All Included
N (%)
Number of participants 126 
Age, m (SD), in mo 2.6 (0.6) 
Sex  
 Male 97 (77) 
 Female 29 (23) 
Race/ethnicity  
 >1 race 7 (6) 
 Asian American 1 (1) 
 Black 10 (8) 
 Hispanic/Latinx, any race 18 (14) 
 White 84 (66) 
 Unknown/not reported 6 (5) 
Yearly household income  
 <$25 000 21 (17) 
 $25 000–$49 999 40 (32) 
 $50 000–$74 999 29 (23) 
 $75 000–$99 000 12 (10) 
 $100 000–$149 999 13 (10) 
 >$150 000 2 (2) 
Primary caregiver education level  
 Less than high school diploma 8 (6) 
 High school diploma/GED 36 (29) 
 Some college, no degree 32 (25) 
 Associate degree/postsecondary certificate 14 (11) 
 Bachelor’s degree 24 (19) 
 Graduate degree 12 (10) 
MSEL ELC, m (SD) 61.8 (14.7) 
MSEL nonverbal DQ, m (SD) 70.4 (17.3) 
MSEL verbal DQ, m (SD) 52.1 (24.2) 
Vineland-3 ABC, m (SD) 67.9 (12.0) 
ADOS-2 CSS, m (SD) 7.0 (3.0) 
Reference diagnosis of ASD 94 (75) 
Reference diagnosis of GDD 13 (10) 
Reference diagnosis of LD 13 (10) 
Reference diagnosis of othera 5 (4) 
Reference diagnosis of known genetic syndrome 1 (1) 
All Included
N (%)
Number of participants 126 
Age, m (SD), in mo 2.6 (0.6) 
Sex  
 Male 97 (77) 
 Female 29 (23) 
Race/ethnicity  
 >1 race 7 (6) 
 Asian American 1 (1) 
 Black 10 (8) 
 Hispanic/Latinx, any race 18 (14) 
 White 84 (66) 
 Unknown/not reported 6 (5) 
Yearly household income  
 <$25 000 21 (17) 
 $25 000–$49 999 40 (32) 
 $50 000–$74 999 29 (23) 
 $75 000–$99 000 12 (10) 
 $100 000–$149 999 13 (10) 
 >$150 000 2 (2) 
Primary caregiver education level  
 Less than high school diploma 8 (6) 
 High school diploma/GED 36 (29) 
 Some college, no degree 32 (25) 
 Associate degree/postsecondary certificate 14 (11) 
 Bachelor’s degree 24 (19) 
 Graduate degree 12 (10) 
MSEL ELC, m (SD) 61.8 (14.7) 
MSEL nonverbal DQ, m (SD) 70.4 (17.3) 
MSEL verbal DQ, m (SD) 52.1 (24.2) 
Vineland-3 ABC, m (SD) 67.9 (12.0) 
ADOS-2 CSS, m (SD) 7.0 (3.0) 
Reference diagnosis of ASD 94 (75) 
Reference diagnosis of GDD 13 (10) 
Reference diagnosis of LD 13 (10) 
Reference diagnosis of othera 5 (4) 
Reference diagnosis of known genetic syndrome 1 (1) 

ADOS-2 CSS, Autism Diagnostic Observation Schedule, Second Edition, Calibrated Severity Score; DQ, developmental quotient; LD, language delay; m, mean; MSEL ELC, Mullen Scales of Early Learning Early Learning Composite; Vineland-3 ABC, Vineland Adaptive Behavior Scales, Third Edition, Adaptive Behavior Composite.

a

Other diagnosis includes emotional behavioral concerns such as separation anxiety, sensory processing impairment, and parent–child relational problem.

Of 126 children evaluated, ASD diagnosis was consistent between the EAE Hub evaluation (index diagnosis) and expert research evaluation (reference diagnosis) for 82% (n = 103) of cases (Table 2). Chance-corrected diagnostic agreement was moderate, κ = 0.580 (95% CI, 0.429–0.731). Sensitivity, or correct classification of ASD diagnosis, was 81.5% (95% CI, 72.4–88.1), whereas specificity, or correct classification of non-ASD diagnosis, was 82.4% (95% CI, 66.5–91.7). PPV was 92.6% (95% CI, 84.8–96.6) and NPV was 62.2% (95% CI, 47.6–74.9). Overall, 60% (n =75) of cases were TP (index = ASD; reference = ASD), 5% (n = 6) were false positive (FP: index = ASD; reference = non-ASD), 22% (n = 28) were true negative (TN: index = non-ASD; reference = non-ASD), and 14% (n = 17) were FN (index = non-ASD; reference = ASD) Table 2).

TABLE 2

Agreement Between Index and Reference Diagnosis (N = 126)

Reference Standard Diagnosis
ASDNon-ASD
Index diagnosis ASD 75 (60) 6 (5) 
Non-ASD 17 (14) 28 (22) 
Reference Standard Diagnosis
ASDNon-ASD
Index diagnosis ASD 75 (60) 6 (5) 
Non-ASD 17 (14) 28 (22) 

Data represented as number (%). Index diagnosis is based on EAE Hub evaluation. Reference standard diagnosis based on blinded expert research diagnosis.

There was no difference between EAE Hub sites in overall accuracy (ie, accurate versus not; TP + TN versus FP + FN; P = .89) or proportion of FN (compared with TP) cases (P = .67) (Supplemental Table 5). Similarly, there was no difference in overall accuracy (P = .24) or proportion of FN cases (P = .09) by EAE Hub clinician for those submitting data for ≥5 children (n = 8).

Descriptive, clinical, and phenotypic data by diagnostic group can be found in Supplemental Table 6. To address the question of what diagnostic, demographic, and/or phenotypic factors may be associated with FN diagnoses made by trained PCPs, we conducted an exploratory analysis of differences between TP and FN cases (Table 3). There was a significant difference in dichotomized (ie, Highly or Completely Certain versus all other ratings) index clinician diagnostic certainty ratings between TP and FN groups, P = .002, with a higher proportion of Highly–Completely Certain ratings for the TP (95%; 71 of 75), as compared with the FN (65%; 11 of 17) group. Similarly, index clinicians flagged a significantly higher proportion of FN cases (69% of FN; 17% TP) for specialty follow-up evaluation, P ≤ .001. There were no demographic differences by age or sex (Ps > .20). Across measures of developmental and adaptive skills (ie, Mullen Scales of Early Learning Verbal Developmental Quotient, Nonverbal Developmental Quotient, and Early Learning Composite; Vineland-3 Adaptive Behavior Composite), there were significant differences between TP and FN cases (all Ps < .001), with the TP group evidencing significantly greater impairment as compared with the FN group. There was no significant difference between Autism Diagnostic Observation Schedule, Second Edition, Calibrated Severity Scores between the TP and FN groups (P = .28), suggesting no meaningful differences in ASD symptom severity between groups. To address factors that may be associated with FP diagnoses, we examined differences between TN and FP cases (Supplemental Table 7). Although these results must be interpreted with substantial caution given the small sample size, it appears that there may be a trend toward older age in the FP group.

TABLE 3

Diagnostic, Demographic, and Phenotypic Differences between True Positive and False Negative Cases

TPFN
n = 75n = 17PEffect Size
Clinical characteristics 
Index diagnostic certainty, n (%)   .002 9.7 
 Completely/highly certain 71 (95) 11 (65)  [2.4–39.9]a 
 All other ratings 4 (5) 6 (35)   
Specialty evaluation, n (%)   <.001 10.6 
 Referral not recommended 58 (83) 5 (31)  [3.1–36.2]a 
 Referral recommended 12 (17) 11 (69)   
Demographic characteristics 
Age, m (SD) y 2.7 (0.6) 2.7 (0.6) .89 0.04b 
Sex, n (%)   .20 2.1 
 Male 63 (84) 12 (71)  [0.7–7.4]a 
 Female 12 (16) 5 (29)   
Phenotypic characteristics 
 MSEL, m (SD)     
 Verbal DQ 39.8 (17.9) 60.1 (18.7) <.001 1.4b 
 Nonverbal DQ 63.0 (15.4) 78.7 (10.8) <.001 1.1b 
 MSEL ELC 54.7 (7.6) 64.5 (12.0) <.001 1.1b 
Vineland-3 ABC, m (SD) 62.3 (9.6) 72.1 (4.8) <.001 1.1b 
ADOS-2 CSS, m (SD) 8.6 (1.6) 8.1 (2.0) .28 0.3b 
TPFN
n = 75n = 17PEffect Size
Clinical characteristics 
Index diagnostic certainty, n (%)   .002 9.7 
 Completely/highly certain 71 (95) 11 (65)  [2.4–39.9]a 
 All other ratings 4 (5) 6 (35)   
Specialty evaluation, n (%)   <.001 10.6 
 Referral not recommended 58 (83) 5 (31)  [3.1–36.2]a 
 Referral recommended 12 (17) 11 (69)   
Demographic characteristics 
Age, m (SD) y 2.7 (0.6) 2.7 (0.6) .89 0.04b 
Sex, n (%)   .20 2.1 
 Male 63 (84) 12 (71)  [0.7–7.4]a 
 Female 12 (16) 5 (29)   
Phenotypic characteristics 
 MSEL, m (SD)     
 Verbal DQ 39.8 (17.9) 60.1 (18.7) <.001 1.4b 
 Nonverbal DQ 63.0 (15.4) 78.7 (10.8) <.001 1.1b 
 MSEL ELC 54.7 (7.6) 64.5 (12.0) <.001 1.1b 
Vineland-3 ABC, m (SD) 62.3 (9.6) 72.1 (4.8) <.001 1.1b 
ADOS-2 CSS, m (SD) 8.6 (1.6) 8.1 (2.0) .28 0.3b 

Index diagnosis is based on EAE Hub evaluation. Categorical variables presented as number (%); continuous variables presented as mean (SD). P values represent 2-sided significance of t test for continuous variables and χ2 or Fisher’s exact test for categorical variables. ADOS-2 CSS, Autism Diagnostic Observation Schedule, Second Edition, Calibrated Severity Score; DQ, developmental quotient; m, mean; MSEL ELC, Mullen Scales of Early Learning Early Learning Composite; Vineland-3 ABC, Vineland Adaptive Behavior Scales, Third Edition, Adaptive Behavior Composite.

a

Effect size reported as odds ratio [95% CI].

b

Effect size reported as Cohen’s d.

Among the subset of cases in which an EAE Hub clinician rated diagnostic certainty to be Highly or Completely Certain (N = 105), sensitivity was 84.5% (95% CI, 75.3–90.7) and specificity was 90.5% (95% CI, 71.1–97.3). PPV was 97.3% (95% CI, 90.5–99.2) and NPV was 59.4% (95% CI, 42.3–74.5).

In this prospective diagnostic study, we found 82% agreement between trained primary care clinicians and blinded expert research evaluation on categorical ASD diagnosis of children aged 14 to 48 months. Accuracy indices of sensitivity, specificity, and PPV were high (ie, 81.5%, 82.4%, and 92.6%, respectively), whereas NPV was substantially lower (ie, 62.2%). There were no statistically significant differences in accuracy by EAE Hub site or clinician. Diagnostic disagreements were predominately FN cases in which EAE Hub clinicians had difficulty differentiating ASD and GDD. Clinicians flagged most of these cases for follow-up specialty evaluation. FN (as compared with TP) cases were associated with a differential diagnostic and phenotypic, but not demographic, profile. When analysis was restricted to cases in which EAE Hub clinicians rated their diagnostic certainty high, measures of sensitivity, specificity, and PPV improved. To our knowledge, this is the largest study to date that evaluates diagnostic accuracy of a coordinated system of diagnosis in the primary care setting. Notable strengths of this study include the diversity of included primary care index clinicians (ie, from large health system group practices, federally qualified health centers, and private practices) and children evaluated (ie, from diverse socioeconomic and family education backgrounds), large sample size, and rigorous methodology (ie, including blinded reference standard evaluations).

Although existing reports of primary care-based models of ASD diagnosis show promising evidence for improved service access and acceptable accuracy, studies have been limited by small sample size and reduced methodological rigor, or have not used a standard approach for training and diagnostic evaluation.30  Nonetheless, our 82% diagnostic agreement between index and reference diagnosis is comparable to that of others who have reported rates of agreement between PCP and expert evaluation between 71% and 85%.34,43,44  Importantly, across EAE Hub sites and clinicians, including PCPs and nurse practitioners, there was no difference in overall accuracy or rate of FN cases. Given the small number of NPs in our study, future examination of accuracy with a larger sample of diverse clinicians is needed.

Our study is the first to report on ASD accuracy metrics between nonspecialist clinicians and expert diagnosis when a standard training and clinical pathway is followed. Findings suggest that PCPs who receive specialty training are highly reliable when they confirm an ASD diagnosis, as evidenced by our very low rate of FP cases (6%) and high PPV (92.6%). Clinicians were unable to make a definitive diagnosis or missed ASD in 14% of cases, resulting in low NPV (62.2%). Similar to the findings of Penner et al,34  FN cases evidenced higher verbal and nonverbal developmental level and adaptive skills, though most in our study still met criteria for GDD. Notably, there was no difference in ASD symptom severity between TP and FN cases, suggesting that index clinicians may place more emphasis on developmental impairment than ASD-specific symptoms when making diagnostic decisions. FN cases were associated with lower index clinician diagnostic certainty and higher rates of referral (69% of cases) for specialty evaluation, suggesting that clinicians recognized that these children demonstrated a more complex profile, making differential diagnosis between GDD and ASD challenging. When analysis of diagnostic agreement was restricted to only cases for which index diagnostic certainty was high, sensitivity, specificity, and PPV increased, suggesting that primary care clinicians perceive their ability to render a correct ASD diagnosis with high accuracy. Future research should evaluate whether triaging cases for specialty evaluation on the basis of the child’s overall developmental level (ie, those with higher developmental skills) and/or low clinician diagnostic certainty may mitigate the rate of FN diagnosis in the primary care setting.

A primary limitation of the current study is the high proportion (75%) of reference ASD diagnosis in the sample, resulting in low sample size for comparisons by accuracy subgroup. Because of small sample size, we did not use modeling approaches to adjust for potential correlated outcomes of patients clustered within EAE Hub sites or clinicians; however, we did examine site and clinician differences on overall accuracy and proportion of FN cases, and results were not significant. We also did not collect data on those children evaluated in the EAE Hub system who did not consent to participate, and thus we cannot rule out unmeasured bias in our findings or confirm the generalizability of our findings to all young children who require ASD evaluation. Inclusion of only children with English-speaking caregivers limits the generalizability of our findings, and further solutions to ensure equitable access to diagnostic evaluations are necessary. Although we asked clinicians to flag children that required specialty follow-up evaluation, we designed the study to force index clinicians to make a binary (ASD present/absent) choice about ASD outcome, perhaps artificially deflating accuracy indices because of caution against overdiagnosis. Finally, although an initial ASD diagnosis is needed to access specific services, longitudinal developmental evaluation is important for individualized intervention and prognosis planning.45  As such, tiered diagnostic approaches represent an important and promising solution to one component of the larger ASD service bottleneck problem.

Tiered diagnostic approaches, including primary care-based models such as the EAE Hub system, are now being tested as a solution to address the need for increased ASD diagnostic evaluation capacity.18,31  The EAE Hub system was developed with the primary goal of lowering the age of ASD diagnosis through providing streamlined access to localized diagnostic evaluation within the primary care setting. We have previously shown that the EAE Hub model, which involves intensive training for PCPs in ASD diagnosis and ongoing participation in a longitudinal learning collaborative, is feasible,39  with >4000 children evaluated since 2012. In the present prospective diagnostic study, we extend the empirical support for this model by demonstrating a high level of diagnostic accuracy (ie, 82%) in a sample of diverse community PCPs and at-risk children. Additional research is needed to understand implementation promotors and barriers to broad scale-up of community-based ASD models, as well as replication and comparative effectiveness studies that allow for determination of the key components of training and model implementation necessary for success. Testing strategies aimed to mitigate FN cases will be an essential next step in ensuring the accuracy and quality of streamlined, community-based ASD evaluations. Further, it will be critical to develop and test adaptations of tiered diagnostic approaches for non-English speaking children, as well as to determine whether these types of models lead to earlier intervention enrollment, both efforts currently underway in the EAE Hub system. Collectively, the study of innovative diagnostic models has important implications for how future population health solutions that address the ASD diagnosis crisis are designed and implemented. ASD experts, self-advocates and families, and other health service stakeholders (ie, insurers, service providers) must work together to construct and put into action flexible, evolving, evidence-driven health policies that account for scientific innovation and advancements in ASD diagnosis.17,46 

We thank Mary Delaney for her work as a practice liaison for the Indiana University School of Medicine EAE Hub leadership team, as well as Yvonne Purcell for her support to the team during the data collection phase of the study. We also thank the EAE Hub clinicians for their commitment to the children and families of Indiana, our learning collaborative network, and their participation in this research. Finally, and most importantly, we thank the children and caregivers who participated in this study.

Drs McNally Keehn and Keehn conceptualized and designed the study, designed the data collection instruments, led data collection, analysis, and interpretation, and drafted the initial manuscript; Drs Swigonski, Monahan, and Ciccarelli contributed to data analysis and interpretation efforts; Drs Enneking, Ryan, Martin, Hamrick, and Kadlaskar, and Ms Paxton contributed to acquisition of data; and all authors reviewed and revised the manuscript for important intellectual content, approved the final manuscript as submitted, and agree to be accountable for all aspects of the work.

Deidentified individual participant data has been shared as part of the National Institute of Mental Health Data Archive.

COMPANION PAPER: A companion to this article can be found online at www.pediatrics.org/cgi/doi/10.1542/peds.2023-062279.

FUNDING: Supported by the National Institute of Mental Health grant #R21MH121747 (Drs McNally Keehn, Keehn, Swigonski), pilot funding from the Indiana Clinical and Translational Sciences Institute grant #UL1TR002529 (Dr McNally Keehn), and Purdue Big Idea Challenge 2.0.

CONFLICT OF INTEREST DISCLOSURES: The authors have indicated they have no conflicts of interest relevant to this article to disclose.

ASD

autism spectrum disorder

CI

confidence interval

EAE

Early Autism Evaluation

FN

false negative

FP

false positive

GDD

global developmental delay

NPV

negative predictive value

PCP

primary care provider

PPV

positive predictive value

TN

true negative

TP

true positive

1
American Psychiatric Association
, ed.
Diagnostic and Statistical Manual of Mental Disorders
, 5th ed.
Washington, DC
:
American Psychiatric Publishing
;
2013
2
Maenner
MJ
,
Warren
Z
,
Williams
AR
, et al
.
Prevalence and characteristics of autism spectrum disorder among children aged 8 years–Autism and Developmental Disabilities Monitoring Network, 11 sites, United States, 2020
.
MMWR Surveill Summ
.
2023
;
72
(
2
):
1
14
3
Pierce
K
,
Gazestani
VH
,
Bacon
E
, et al
.
Evaluation of the diagnostic stability of the early autism spectrum disorder phenotype in the general population starting at 12 months
.
JAMA Pediatr
.
2019
;
173
(
6
):
578
587
4
Wiggins
LD
,
Baio
J
,
Rice
C
.
Examination of the time between first evaluation and first autism spectrum diagnosis in a population-based sample
.
J Dev Behav Pediatr
.
2006
;
27
(
2 Suppl
):
S79
S87
5
Zuckerman
K
,
Lindly
OJ
,
Chavez
AE
.
Timeliness of autism spectrum disorder diagnosis and use of services among US elementary school-aged children
.
Psychiatr Serv
.
2017
;
68
(
1
):
33
40
6
Jimenez
ME
,
Martinez Alcaraz
E
,
Williams
J
,
Strom
BL
.
Access to developmental pediatrics evaluations for at-risk children
.
J Dev Behav Pediatr
.
2017
;
38
(
3
):
228
232
7
Gordon-Lipkin
E
,
Foster
J
,
Peacock
G
.
Whittling down the wait time: exploring models to minimize the delay from initial concern to diagnosis and treatment of autism spectrum disorder
.
Pediatr Clin North Am
.
2016
;
63
(
5
):
851
859
8
Bridgemohan
C
,
Bauer
NS
,
Nielsen
BA
, et al
.
A workforce survey on developmental-behavioral pediatrics
.
Pediatrics
.
2018
;
141
(
3
):
e20172164
9
Drahota
A
,
Sadler
R
,
Hippensteel
C
,
Ingersoll
B
,
Bishop
L
.
Service deserts and service oases: Utilizing geographic information systems to evaluate service availability for individuals with autism spectrum disorder
.
Autism
.
2020
;
24
(
8
):
2008
2020
10
Martinez
M
,
Thomas
KC
,
Williams
CS
, et al
.
Family experiences with the diagnosis of autism spectrum disorder: System barriers and facilitators of efficient diagnosis
.
J Autism Dev Disord
.
2018
;
48
(
7
):
2368
2378
11
Kalkbrenner
AE
,
Daniels
JL
,
Emch
M
,
Morrissey
J
,
Poole
C
,
Chen
J-C
.
Geographic access to health services and diagnosis with an autism spectrum disorder
.
Ann Epidemiol
.
2011
;
21
(
4
):
304
310
12
Petruccelli
M
,
Ramella
L
,
Schaefer
AJ
, et al
.
A taxonomy of reported harms in pediatric autism spectrum disorder screening: provider and parent perspectives
.
J Autism Dev Disord
.
2022
;
52
(
2
):
647
673
13
Patten
E
,
Baranek
GT
,
Watson
LR
,
Schultz
B
.
Child and family characteristics influencing intervention choices in autism spectrum disorders
.
Focus Autism Other Dev Disabl
.
2013
;
28
(
3
):
138
146
14
Pickard
KE
,
Ingersoll
BR
.
Quality versus quantity: the role of socioeconomic status on parent-reported service knowledge, service use, unmet service needs, and barriers to service use
.
Autism
.
2016
;
20
(
1
):
106
115
15
Porterfield
SL
,
McBride
TD
.
The effect of poverty and caregiver education on perceived need and access to health services among children with special health care needs
.
Am J Public Health
.
2007
;
97
(
2
):
323
329
16
Gwynette
MF
,
McGuire
K
,
Fadus
MC
,
Feder
JD
,
Koth
KA
,
King
BH
.
Overemphasis of the Autism Diagnostic Observation Schedule (ADOS) evaluation subverts a clinician’s ability to provide access to autism services
.
J Am Acad Child Adolesc Psychiatry
.
2019
;
58
(
12
):
1222
1223
17
McNally Keehn
R
,
Tomlin
A
,
Ciccarelli
MR
.
COVID-19 pandemic highlights access barriers for children with autism spectrum disorder
.
J Dev Behav Pediatr
.
2021
;
42
(
7
):
599
601
18
Zwaigenbaum
L
,
Warren
Z
.
Commentary: embracing innovation is necessary to improve assessment and care for individuals with ASD: a reflection on Kanne and Bishop (2020)
.
J Child Psychol Psychiatry
.
2021
;
62
(
2
):
143
145
19
Broder-Fingert
S
,
Qin
S
,
Goupil
J
, et al
.
A mixed-methods process evaluation of Family Navigation implementation for autism spectrum disorder
.
Autism
.
2019
;
23
(
5
):
1288
1299
20
Aylward
BS
,
Gal-Szabo
DE
,
Taraman
S
.
Racial, ethnic, and sociodemographic disparities in diagnosis of children with autism spectrum disorder
.
J Dev Behav Pediatr
.
2021
;
42
(
8
):
682
689
21
Zeleke
WA
,
Hughes
TL
,
Drozda
N
.
Disparities in diagnosis and service access for minority children with ASD in the United States
.
J Autism Dev Disord
.
2019
;
49
(
10
):
4320
4331
22
Wiggins
LD
,
Durkin
M
,
Esler
A
, et al
.
Disparities in documented diagnoses of autism spectrum disorder based on demographic, individual, and service factors
.
Autism Res
.
2020
;
13
(
3
):
464
473
23
Dimian
AF
,
Symons
FJ
,
Wolff
JJ
.
Delay to early intensive behavioral intervention and educational outcomes for a Medicaid-enrolled cohort of children with autism
.
J Autism Dev Disord
.
2021
;
51
(
4
):
1054
1066
24
Fuller
EA
,
Kaiser
AP
.
The effects of early intervention on social communication outcomes for children with autism spectrum disorder: a meta-analysis
.
J Autism Dev Disord
.
2020
;
50
(
5
):
1683
1700
10.1007/s10803-019-03927-z
25
Steinbrenner
JR
,
Hume
K
,
Odom
SL
, et al
.
Evidence-based practices for children, youth, and young adults with autism
.
FPG Child Development Institute
.
2020
26
Cidav
Z
,
Munson
J
,
Estes
A
,
Dawson
G
,
Rogers
S
,
Mandell
D
.
Cost offset associated with early start Denver model for children with autism
.
J Am Acad Child Adolesc Psychiatry
.
2017
;
56
(
9
):
777
783
27
Penner
M
,
Rayar
M
,
Bashir
N
,
Roberts
SW
,
Hancock-Howard
RL
,
Coyte
PC
.
Cost-effectiveness analysis comparing pre-diagnosis autism spectrum disorder (ASD)-targeted intervention with Ontario’s autism intervention program
.
J Autism Dev Disord
.
2015
;
45
(
9
):
2833
2847
28
Peters-Scheffer
N
,
Didden
R
,
Korzilius
H
,
Matson
J
.
Cost comparison of early intensive behavioral intervention and treatment as usual for children with autism spectrum disorder in The Netherlands
.
Res Dev Disabil
.
2012
;
33
(
6
):
1763
1772
29
Kanne
SM
,
Bishop
SL
.
Editorial perspective: the autism waitlist crisis and remembering what families need
.
J Child Psychol Psychiatry
.
2021
;
62
(
2
):
140
142
30
Guan
X
,
Zwaigenbaum
L
,
Sonnenberg
LK
.
Building capacity for community pediatric autism diagnosis: a systemic review of physician training programs
.
J Dev Behav Pediatr
.
2022
;
43
(
1
):
44
54
31
Wieckowski
AT
,
Zuckerman
KE
,
Broder-Fingert
S
,
Robins
DL
.
Addressing current barriers to autism diagnoses through a tiered diagnostic approach involving pediatric primary care providers
.
Autism Res
.
2022
;
15
(
12
):
2216
2222
32
Hyman
SL
,
Levy
SE
,
Myers
SM
.
Council on Children With Disabilities, Section on Developmental and Behavioral Pediatrics
.
Identification, evaluation, and management of children with autism spectrum disorder
.
Pediatrics
.
2020
;
145
(
1
):
e20193447
33
Brian
JA
,
Zwaigenbaum
L
,
Ip
A
.
Standards of diagnostic assessment for autism spectrum disorder
.
Paediatr Child Health
.
2019
;
24
(
7
):
444
460
34
Penner
M
,
Senman
L
,
Andoni
L
, et al
.
Concordance of diagnosis of autism spectrum disorder made by pediatricians versus a multidisciplinary specialist team
.
JAMA Netw Open
.
2023
;
6
(
1
):
e2252879
35
Kanne
SM
,
Carpenter
LA
,
Warren
Z
.
Screening in toddlers and preschoolers at risk for autism spectrum disorder: evaluating a novel mobile-health screening tool
.
Autism Res
.
2018
;
11
(
7
):
1038
1049
36
Megerian
JT
,
Dey
S
,
Melmed
RD
, et al
.
Evaluation of an artificial intelligence-based medical device for diagnosis of autism spectrum disorder
.
NPJ Digit Med
.
2022
;
5
(
1
):
57
37
Hine
JF
,
Allin
J
,
Allman
A
, et al
.
Increasing access to autism spectrum disorder diagnostic consultation in rural and underserved communities: streamlined evaluation within primary care
.
J Dev Behav Pediatr
.
2020
;
41
(
1
):
16
22
38
Mazurek
MO
,
Curran
A
,
Burnette
C
,
Sohl
K
.
ECHO autism STAT: accelerating early access to autism diagnosis
.
J Autism Dev Disord
.
2019
;
49
(
1
):
127
137
39
McNally Keehn
R
,
Ciccarelli
M
,
Szczepaniak
D
,
Tomlin
A
,
Lock
T
,
Swigonski
N
.
A statewide tiered system for screening and diagnosis of autism spectrum disorder
.
Pediatrics
.
2020
;
146
(
2
):
e20193876
40
Rotholz
DA
,
Kinsman
AM
,
Lacy
KK
,
Charles
J
.
Improving early identification and intervention for children at risk for autism spectrum disorder
.
Pediatrics
.
2017
;
139
(
2
):
e20161061
41
Stone
WL
,
Coonrod
EE
,
Turner
LM
,
Pozdol
SL
.
Psychometric properties of the STAT for early autism screening
.
J Autism Dev Disord
.
2004
;
34
(
6
):
691
701
42
Lord
C
,
Rutter
M
,
Dilavore
P
,
Risi
S
,
Gotham
K
,
Bishop
S
.
Autism Diagnostic Observation Schedule, Second Edition (ADOS-2)
.
Torrance, CA
:
Western Psychological Services
;
2012
43
Swanson
AR
,
Warren
ZE
,
Stone
WL
,
Vehorn
AC
,
Dohrmann
E
,
Humberd
Q
.
The diagnosis of autism in community pediatric settings: does advanced training facilitate practice change?
Autism
.
2014
;
18
(
5
):
555
561
44
Warren
Z
,
Stone
W
,
Humberd
Q
.
A training model for the diagnosis of autism in community pediatric practice
.
J Dev Behav Pediatr
.
2009
;
30
(
5
):
442
446
45
Lord
C
,
Charman
T
,
Havdahl
A
, et al
.
The Lancet Commission on the future of care and clinical research in autism
.
Lancet
.
2022
;
399
(
10321
):
271
334
46
Barbaresi
W
,
Cacia
J
,
Friedman
S
, et al
.
Clinician diagnostic certainty and the role of the autism diagnostic observation schedule in autism spectrum disorder diagnosis in young children
.
JAMA Pediatr
.
2022
;
176
(
12
):
1233
1241

Supplementary data