CONTEXT:

Recommendations conflict regarding universal application of formal screening instruments in primary care (PC) and PC-like settings for autism spectrum disorder (ASD).

OBJECTIVES:

We systematically reviewed evidence for universal screening of children for ASD in PC.

DATA SOURCES:

We searched Medline, PsychInfo, Educational Resources Informational Clearinghouse, and Cumulative Index of Nursing and Allied Health Literature.

STUDY SELECTION:

We included studies in which researchers report psychometric properties of screening tools in unselected populations across PC and PC-like settings.

DATA EXTRACTION:

At least 2 authors reviewed each study, extracted data, checked accuracy, and assigned quality ratings using predefined criteria.

RESULTS:

We found evidence for moderate to high positive predictive values for ASD screening tools to identify children aged 16 to 40 months and 1 study for ≥48 months in PC and PC-like settings. Limited evidence evaluating sensitivity, specificity, and negative predictive value of instruments was available. No studies directly evaluated the impact of screening on treatment or harm.

LIMITATIONS:

Potential limitations include publication bias, selective reporting within studies, and a constrained search.

CONCLUSIONS:

ASD screening tools can be used to accurately identify percentages of unselected populations of young children for ASD in PC and PC-like settings. The scope of challenges associated with establishing direct linkage suggests that clinical and policy groups will likely continue to guide screening practices. ASD is a common neurodevelopmental disorder associated with significant life span costs.1,2  Growing evidence supports functional gains and improved outcomes for young children receiving intensive intervention, so early identification on a population level is a pressing public health challenge.3,4 

Historically, children with ASD have been identified in pediatric primary care (PC) through soliciting parental concerns, as part of provider-driven surveillance, and periodic developmental screening using a standardized tool. Although developmental vulnerabilities associated with ASD are often identifiable in toddlers, the median age of diagnosis in the United States is >4 years.5  Given concerns that (1) delays in diagnosis limit access to intervention,6  (2) brief PC observations may fail to elicit concerns,7  and (3) ethnic, racial, and linguistic disparities exist in early identification,810  the American Academy of Pediatrics recommends the use of screening tools as part of developmental surveillance at well-child visits and ASD-specific screeners at the 18- and 24-month visits.11  However, in 2016, the US Preventive Services Task Force (USPSTF) issued a report noting insufficient evidence for or against universal screening for ASD in a well-child asymptomatic population.12  Although there was evidence that screening tools had sufficient accuracy, the USPSTF found insufficient evidence regarding the impact of early treatment on cases detected by screening compared with usual case detection.12  This discrepancy between American Academy of Pediatrics guidance and insufficient evidence in the USPSTF report has proven confusing and controversial to the scientific community, pediatric providers, and those they serve.13 

Since the USPSTF report on screening, several groups have conducted systematic reviews and meta-analyses of ASD screening.1420  Many of these reviews, including both meta-analyses, were focused solely on younger children or specific tools.14,1719  There are compelling data that many young children are still not screened for ASD by pediatricians,21  and limited work has been done to interpret available universal screening applications. Furthermore, there is growing awareness that ASD screening and surveillance tools and processes may need to be considered throughout childhood (ie, many children still are not identified as toddlers; large populations of school-aged children with ASD as a primary concern). As a result, this systematic review was undertaken to determine if an updated review of additional studies might provide data to support screening for autism using available universal screening tools from infancy through early adolescence in nonreferred populations.

This work is a systematic review of evidence for screening for ASD in PC or PC-like settings for children through age 13. We defined “PC-like settings” as places that screened an entire population (ie universal screening). Although there is substantial evidence, including several recent systematic reviews,1418,20  supporting the reliability of ASD screeners within and across preselected and referral populations,22  this review was focused on studies using nonreferred populations seen in general PC.

On the basis of a preliminary literature review, we developed a list of 4 key questions (KQs) around autism screening. KQ 1 examined the methods and tools used in PC and PC-like settings to screen for children in 3 age ranges (0–2, 3–5, and 6–12 years). KQ 2 reviewed the psychometric performance characteristics associated with these screening methods. KQ 3 reviewed whether there were adequate data to examine a possible association between early screening and improved outcomes. Finally, KQ 4 reviewed evidence for harm of screening. See Supplemental Tables 7 through 10 for details of KQs.

Study inclusion and exclusion criteria are listed in Tables 1 and 2. We searched Medline through OVID, PsycINFO, Educational Resources Informational Clearinghouse, and Cumulative Index of Nursing and Allied Health Literature for English-language articles published from January 2000 to June 2018. The start date of 2000 coincides with the publication of the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR) and the wide-scale availability of gold-standard autism assessment instruments. Search strategies and terms included a combination of medical subject headings and keywords relevant to screening for ASD and the KQs (see Supplemental Tables 7 through 10). After the initial literature search, reference lists of recent narrative and systematic reviews were reviewed to identify missed articles.

TABLE 1

KQs

Description of KQs
KQ 1 Available tools 
 What are the screening methods and tools used in PC to screen for ASD for children aged (1) 0–2 y, (2) 3–5 y, and/or (3) 6–12 y? 
KQ 2 Accurate identification of ASD 
 What are the performance characteristics associated with the screening methods and tools used in PC? 
  Do risk factors for ASD modify performance? (Prematurity, sibling status) 
  Does age modify performance characteristics? (Optimal range for instruments) 
  Do characteristics of the child or family modify performance characteristics? (ID, SES, caregiver literacy, race and ethnicity, language preference) 
KQ 3 Association with improved outcome 
 Is there an association between early screening and outcome measures? 
 Example measures: referral completion, timely diagnosis, treatment activation, child development, ASD-related symptoms, reduction in family stress 
KQ 4 Harm 
 What is the harm of screening for the child and family? 
Description of KQs
KQ 1 Available tools 
 What are the screening methods and tools used in PC to screen for ASD for children aged (1) 0–2 y, (2) 3–5 y, and/or (3) 6–12 y? 
KQ 2 Accurate identification of ASD 
 What are the performance characteristics associated with the screening methods and tools used in PC? 
  Do risk factors for ASD modify performance? (Prematurity, sibling status) 
  Does age modify performance characteristics? (Optimal range for instruments) 
  Do characteristics of the child or family modify performance characteristics? (ID, SES, caregiver literacy, race and ethnicity, language preference) 
KQ 3 Association with improved outcome 
 Is there an association between early screening and outcome measures? 
 Example measures: referral completion, timely diagnosis, treatment activation, child development, ASD-related symptoms, reduction in family stress 
KQ 4 Harm 
 What is the harm of screening for the child and family? 

ID, intellectual disability; SES, socioeconomic status.

TABLE 2

Inclusion and Exclusion Criteria

CategoryInclusionExclusion
Participants (study population) Definition of disease: clinical diagnosis of ASD Studies that exclusively focus on infants, older children, or adults or that assess general developmental screening 
Young children ages 0–12 y undergoing screening for ASD in a PC or PC-like setting Non-PC setting 
Screener evaluated in preselected/diagnoses sample 
Interventions Methods, tools, and approaches used specifically to screen for ASD Studies of screening for other conditions, general developmental screening, or genetic or biomarker screening 
Comparison No screening or alternate screening approaches when comparing 2 or more approaches  
Outcomes Accurate identification properties, timing of referral and diagnosis, timing of access to intervention)  
Study design   
 Size Any study design with N ≥ 100 Studies with N < 100 
 Setting PC settings and PC-like settings (eg, well screening across early intervention and home visit settings) Studies not conducted in 1 of the following countries were rated as “very high human development” on the United Nations Development Program’s International Human Development Index: Andorra, Argentina, Australia, Austria, Bahrain, Barbados, Belgium, Brunei Darussalam, Canada, Chile, Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hong Kong (China), Hungary, Iceland, Ireland, Israel, Italy, Japan, Republic of Korea, Latvia, Liechtenstein, Lithuania, Luxembourg, Malta, the Netherlands, New Zealand, Norway, Poland, Portugal, Qatar, Singapore, Slovakia, Slovenia, Spain, Sweden, Switzerland, United Arab Emirates, United Kingdom, United States 
CategoryInclusionExclusion
Participants (study population) Definition of disease: clinical diagnosis of ASD Studies that exclusively focus on infants, older children, or adults or that assess general developmental screening 
Young children ages 0–12 y undergoing screening for ASD in a PC or PC-like setting Non-PC setting 
Screener evaluated in preselected/diagnoses sample 
Interventions Methods, tools, and approaches used specifically to screen for ASD Studies of screening for other conditions, general developmental screening, or genetic or biomarker screening 
Comparison No screening or alternate screening approaches when comparing 2 or more approaches  
Outcomes Accurate identification properties, timing of referral and diagnosis, timing of access to intervention)  
Study design   
 Size Any study design with N ≥ 100 Studies with N < 100 
 Setting PC settings and PC-like settings (eg, well screening across early intervention and home visit settings) Studies not conducted in 1 of the following countries were rated as “very high human development” on the United Nations Development Program’s International Human Development Index: Andorra, Argentina, Australia, Austria, Bahrain, Barbados, Belgium, Brunei Darussalam, Canada, Chile, Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hong Kong (China), Hungary, Iceland, Ireland, Israel, Italy, Japan, Republic of Korea, Latvia, Liechtenstein, Lithuania, Luxembourg, Malta, the Netherlands, New Zealand, Norway, Poland, Portugal, Qatar, Singapore, Slovakia, Slovenia, Spain, Sweden, Switzerland, United Arab Emirates, United Kingdom, United States 

Multiple reviewers independently screened the titles and abstracts for inclusion and exclusion criteria. We identified 5255 unique citations addressing screening. After abstract review, we excluded 4846 publications, leaving 409 articles for full text review. See Fig 1 for a flow diagram of process. Two reviewers examined each article for inclusion and exclusion, with disagreements resolved by consensus of 4 authors (S.E.L., A.W., J.F., Z.W.). Articles that did not address the KQs, were conducted outside of countries rated as “very high human development” by the United Nations Development Program, included high-risk or referral populations, or with a sample size of <100 subjects were excluded.

FIGURE 1

Systematic multistage review process.

FIGURE 1

Systematic multistage review process.

Close modal

Team members entered information for each KQ in the 27 included articles into a structured abstraction form. A second team member independently reviewed the abstraction. We abstracted or calculated sensitivity, specificity, positive and negative predictive values (NPVs), and identification rates wherever possible. Then, using a systematic quality rating tool developed through key stakeholder appraisal of current tools and processes, 2 reviewers independently rated the quality of each study as good, fair, or poor. The tool established clear benchmarks for quality on the basis of sample size, reference standard, use in general population, and inclusion of participants with negative screening results (see Table 3). Discrepancies in quality rating were resolved through team consensus.

TABLE 3

Quality Rating Definition

QualityCharacteristics
Good Evaluates relevant available screening test 
 Uses a credible reference standard 
 Interprets reference standard independently of screening test 
 Reliability of test assessed 
 Has few or handles indeterminate results in a reasonable manner 
 Includes large No. patients (>500) with and without condition 
 Includes participants drawn from the general population 
 Follows a sampling method of participants who screened negative 
Fair Evaluated relevant available screening test 
 Uses a reference standard that is reasonable although not best 
 Interprets reference standard independent of screening test 
 Moderate sample size (≥200) 
Poor Has “fatal” flaw(s) such as: 
  Uses inappropriate reference standard 
  Screening test improperly administered 
  Biased ascertainment of reference standard 
  Small sample size (100–199) 
QualityCharacteristics
Good Evaluates relevant available screening test 
 Uses a credible reference standard 
 Interprets reference standard independently of screening test 
 Reliability of test assessed 
 Has few or handles indeterminate results in a reasonable manner 
 Includes large No. patients (>500) with and without condition 
 Includes participants drawn from the general population 
 Follows a sampling method of participants who screened negative 
Fair Evaluated relevant available screening test 
 Uses a reference standard that is reasonable although not best 
 Interprets reference standard independent of screening test 
 Moderate sample size (≥200) 
Poor Has “fatal” flaw(s) such as: 
  Uses inappropriate reference standard 
  Screening test improperly administered 
  Biased ascertainment of reference standard 
  Small sample size (100–199) 

Among the 27 unique studies included in this review, 7 were good quality (Table 4), 18 were fair quality (Table 5), and 2 were poor quality (Table 6). Articles primarily addressed the known screening tests and their psychometric properties (KQ 1 and KQ 2), with notable exception of limited data surrounding comparative psychometrics based on risk factors, child age, and child and family characteristics; little evidence for improved outcomes (KQ 3); and no results regarding harm (KQ 4).

TABLE 4

Good-Quality Studies (N = 8)

Reference and CountryToolAges (mo)nReference InstrumentsSensitivity (95% CI)Specificity (95% CI)PPV (95% CI)NPV (95% CI)Location of Screening and Comments
Baird et al,23  United Kingdom CHAT 16–20 16 235 ADI-R 0.213 0.99 0.833a
0.588b 
NR Screened at PC medical visit 
Wetherby et al,24  United States ITC 6–24 5385 MSEL, Vineland, ADOS 0.933 NR 0.73c (0.69–0.77) 0.93c (0.91–0.95) Screened by health care or child care provider; 23% of sample had multiple ITCs and included in multiple age groups 
Pierce et al,25  United States ITC 10–16 10 479 ADOS, Mullen NR NR 0.18c (0.15–21) NR Screened at 1-y well-infant visit 
Robins et al,26  United States M-CHAT; M-CHAT-R/F 18–24 16 071 ADOS, CARS, DSM-IV 0.91 (0.86–0.96) 0.96 (0.95–0.96) 0.138 0.509d 0.999 Screened at pediatric well-child visits; do not have data on follow-up on 99.7% of negative screen results 
Sturner et al,27  United States M-CHAT; M-CHAT/F 14.7–40.8 5071 ADOS-2, MSEL, DSM-5 0.59d (0.49–0.69) 0.71d (0.62–0.80) 0.40 (0.30–0.50)
0.58d (0.48–0.67) 
NR Screened at routine 18- and 24-mo visits 
Baduel et al,28  France M-CHAT; M-CHAT/F 24 1227 ADOS-G, PEP-R, Vineland 0.67 (0.41–0.86) 0.94
0.99d 
0.14
0.60d 
0.99 Screened in PC or day care 
Zahorodny et al,29  United States PDQ-1 Young = 18–24
Middle = 25–30
Old = 31–36 
1959 ADI-R Young = 0.8462
Middle = 0.7143
Old = 1.00 
Young = 0.9962
Middle = 1.00
Old = 1.00 
Young = 0.7857
Middle = 1.00
Old = 1.00 
Young = 0.9975
Middle = 0.9965
Old = 1.00 
PC practices and programs 
Reference and CountryToolAges (mo)nReference InstrumentsSensitivity (95% CI)Specificity (95% CI)PPV (95% CI)NPV (95% CI)Location of Screening and Comments
Baird et al,23  United Kingdom CHAT 16–20 16 235 ADI-R 0.213 0.99 0.833a
0.588b 
NR Screened at PC medical visit 
Wetherby et al,24  United States ITC 6–24 5385 MSEL, Vineland, ADOS 0.933 NR 0.73c (0.69–0.77) 0.93c (0.91–0.95) Screened by health care or child care provider; 23% of sample had multiple ITCs and included in multiple age groups 
Pierce et al,25  United States ITC 10–16 10 479 ADOS, Mullen NR NR 0.18c (0.15–21) NR Screened at 1-y well-infant visit 
Robins et al,26  United States M-CHAT; M-CHAT-R/F 18–24 16 071 ADOS, CARS, DSM-IV 0.91 (0.86–0.96) 0.96 (0.95–0.96) 0.138 0.509d 0.999 Screened at pediatric well-child visits; do not have data on follow-up on 99.7% of negative screen results 
Sturner et al,27  United States M-CHAT; M-CHAT/F 14.7–40.8 5071 ADOS-2, MSEL, DSM-5 0.59d (0.49–0.69) 0.71d (0.62–0.80) 0.40 (0.30–0.50)
0.58d (0.48–0.67) 
NR Screened at routine 18- and 24-mo visits 
Baduel et al,28  France M-CHAT; M-CHAT/F 24 1227 ADOS-G, PEP-R, Vineland 0.67 (0.41–0.86) 0.94
0.99d 
0.14
0.60d 
0.99 Screened in PC or day care 
Zahorodny et al,29  United States PDQ-1 Young = 18–24
Middle = 25–30
Old = 31–36 
1959 ADI-R Young = 0.8462
Middle = 0.7143
Old = 1.00 
Young = 0.9962
Middle = 1.00
Old = 1.00 
Young = 0.7857
Middle = 1.00
Old = 1.00 
Young = 0.9975
Middle = 0.9965
Old = 1.00 
PC practices and programs 

ADI-R, Autism Diagnostic Interview–Revised; ADOS, Autism Diagnostic Observation Schedule; ADOS-2, Autism Diagnostic Observation Schedule, Second Edition; ADOS-G, Autism Diagnostic Observation Schedule–Generic; CARS, Childhood Autism Rating Scale; DSM-5, Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition; MSEL, Mullen Scales of Early Learning; NR, not reported; PEP-R, Psychoeducational Profile–Revised.

a

PPV for high-risk threshold.

b

PPV for medium-risk threshold.

c

Calculated by author J.F.

d

Based on M-CHAT/F interview.

TABLE 5

Fair-Quality Studies (N = 17 Unique Studies)

Reference and CountryToolAges (mo)nReference InstrumentsSensitivity (95% CI)Specificity (95% CI)PPV (95% CI)NPV (95% CI)Location of Screening and Comments
Robins et al,30  United States M-CHAT 18–30 1293a VABS, CARS, DSM-IV 0.87 0.99 0.80 0.99 Mixed sample (87% PC + 13% early intervention) 
Dietz et al,31  the Netherlands ESAT 14–15 31 724 ADOS-G, clinical assessment NR NR 0.25 NR Brief prescreen in PC, then positive results were screened with ESAT 
VanDenHeuvel et al,32  Ireland CHAT 18 2117 Clinical assessment NR NR 0.43b (0.24–0.62) NR From child development screening clinics 
Kleinman et al,33  United States M-CHAT; M-CHAT/F 16–30 3793a ADI-R, ADOS, DSM-IV 0.91 NR 0.36 (0.31–0.40)
0.74c (0.68–0.80) 
NR Mixed sample (PC + early intervention) 
Pandey et al,34  United States M-CHAT; M-CHAT/F Young: 16–23 6050 ADI-R, ADOS, DSM-IV, CARS NR NR Young = 0.04 NR Mixed sample (PC + early intervention); 56% of sample same as Kleinman et al.33  Psychometrics calculated for PC only 
Young = 4265 0.28c (0.20–0.35) 
Old: 24–30 Old = 1785 Old = 0.16 
0.61c (0.53–0.70) 
Robins,35  United States M-CHAT; M-CHAT/F 14–27 4797 ADI-R, ADOS, DSM-IV, CARS NR NR 0.058
0.57c 
NR Screened at well visit in PC setting 
Honda et al,36  Japan YACHT 18 2814 Clinical assessment 0.6 NR NR NR PC setting. For DD, sensitivity = 0.826, specificity = 0.839 
Canal-Bedia et al,37  Spain M-CHAT 18–36 S1: 2480
S2: 2055 
ADOS, DSM-IV S1: 1.0
S2: 1.0 
S1: 0.98 (0.98–0.99)
S2: 0.98 (0.98–0.99) 
S1: 0.35 (0.23–0.46)
S2: 0.19 (0.05–0.33) 
S1: 1.0
S2:1.0 
Well-child visit 
Miller et al,38  United States M-CHAT; ITC 14–30 796 ADOS-T, Mullen, DSM-IV NR NR 0.27d (0.20–0.33) NR PC setting 
Nygren et al,39  Sweden M-CHAT/F 30 3999 ADOS, VABS, DISCO-11 0.767c (0.614–0.882) NR 0.917c (0.775–0.982) NR Child Health Clinics 
Chlebowski et al,40  United States M-CHAT; M-CHAT/F 18–24 18 898 ADOS, ADI-R, Mullen, Vineland, DSM-IV NR NR 0.06
0.54d 
NR Screened at pediatric well-child visits 
Kamio et al,41  Japan M-CHAT; M-CHAT/F 4–26 1187 CARS, ADOS, ADI-R, DSM-IV 0.725
0.476c 
0.843
0.986c 
0.116
0.455c 
NR Report results of evaluation at age 2 y 
Khowaja et al,42  United States M-CHAT/F; M-CHAT-R/F 16–30 11 845 MSEL, Vineland, BASC, ADI-R, ADOS, CARS, DSM-IV-TR NR NR 0.617e (0.511–0.723)
0.522f (0.404–0.640) 
NR In PCP offices during well-child visits 
Samango-Sprouse et al,43  United States PDDST-II 4–9 1024 MSEL, PLS-4, ELM-2, DSM-IV NR NR 0.48b (0.39–0.57) NR Screening at 4-, 6-, and 9-mo well-child visit 
Kamio et al,44  Japan M-CHAT-JV 18–36 1851 DSM-IV-TR, ADI-R, ADOS, CARS 0.231 (0.087–0.425) 0.991 (0.988–0.995) 0.333 (0.126–0.614) NR Community cohort representing 82% of children in Fukuoka 
Hardy et al,45  United States M-CHAT-R; M-CHAT-R/F 16–31 2848 Vineland II, ASI, ADOS, MSEL 0.81 (0.72–0.90) NR 0.076b (0.06–0.09)
0.27b,c (0.22–0.32) 
NR In PCP office 
Eugenin et al,10  Chile M-CHAT; M-CHAT/F 16–30 200 Developmental evaluation by specialist NR NR 0.045b (0.014–0.077)
0.0.40b,c (0.18–0.62) 
NR Primary health care centers 
Janvier et al,8  United States M-CHAT; M-CHAT/F 16–76 351 ADOS NR NR 0.15 (0.11–0.18)b
0.37b,c (0.29–0.44) 
NR Head start and child care program 
Janvier et al,8  United States SCQ 16–76 848 ADOS NR NR 0.75b (0.67–0.83) NR Head start and child care program 
Reference and CountryToolAges (mo)nReference InstrumentsSensitivity (95% CI)Specificity (95% CI)PPV (95% CI)NPV (95% CI)Location of Screening and Comments
Robins et al,30  United States M-CHAT 18–30 1293a VABS, CARS, DSM-IV 0.87 0.99 0.80 0.99 Mixed sample (87% PC + 13% early intervention) 
Dietz et al,31  the Netherlands ESAT 14–15 31 724 ADOS-G, clinical assessment NR NR 0.25 NR Brief prescreen in PC, then positive results were screened with ESAT 
VanDenHeuvel et al,32  Ireland CHAT 18 2117 Clinical assessment NR NR 0.43b (0.24–0.62) NR From child development screening clinics 
Kleinman et al,33  United States M-CHAT; M-CHAT/F 16–30 3793a ADI-R, ADOS, DSM-IV 0.91 NR 0.36 (0.31–0.40)
0.74c (0.68–0.80) 
NR Mixed sample (PC + early intervention) 
Pandey et al,34  United States M-CHAT; M-CHAT/F Young: 16–23 6050 ADI-R, ADOS, DSM-IV, CARS NR NR Young = 0.04 NR Mixed sample (PC + early intervention); 56% of sample same as Kleinman et al.33  Psychometrics calculated for PC only 
Young = 4265 0.28c (0.20–0.35) 
Old: 24–30 Old = 1785 Old = 0.16 
0.61c (0.53–0.70) 
Robins,35  United States M-CHAT; M-CHAT/F 14–27 4797 ADI-R, ADOS, DSM-IV, CARS NR NR 0.058
0.57c 
NR Screened at well visit in PC setting 
Honda et al,36  Japan YACHT 18 2814 Clinical assessment 0.6 NR NR NR PC setting. For DD, sensitivity = 0.826, specificity = 0.839 
Canal-Bedia et al,37  Spain M-CHAT 18–36 S1: 2480
S2: 2055 
ADOS, DSM-IV S1: 1.0
S2: 1.0 
S1: 0.98 (0.98–0.99)
S2: 0.98 (0.98–0.99) 
S1: 0.35 (0.23–0.46)
S2: 0.19 (0.05–0.33) 
S1: 1.0
S2:1.0 
Well-child visit 
Miller et al,38  United States M-CHAT; ITC 14–30 796 ADOS-T, Mullen, DSM-IV NR NR 0.27d (0.20–0.33) NR PC setting 
Nygren et al,39  Sweden M-CHAT/F 30 3999 ADOS, VABS, DISCO-11 0.767c (0.614–0.882) NR 0.917c (0.775–0.982) NR Child Health Clinics 
Chlebowski et al,40  United States M-CHAT; M-CHAT/F 18–24 18 898 ADOS, ADI-R, Mullen, Vineland, DSM-IV NR NR 0.06
0.54d 
NR Screened at pediatric well-child visits 
Kamio et al,41  Japan M-CHAT; M-CHAT/F 4–26 1187 CARS, ADOS, ADI-R, DSM-IV 0.725
0.476c 
0.843
0.986c 
0.116
0.455c 
NR Report results of evaluation at age 2 y 
Khowaja et al,42  United States M-CHAT/F; M-CHAT-R/F 16–30 11 845 MSEL, Vineland, BASC, ADI-R, ADOS, CARS, DSM-IV-TR NR NR 0.617e (0.511–0.723)
0.522f (0.404–0.640) 
NR In PCP offices during well-child visits 
Samango-Sprouse et al,43  United States PDDST-II 4–9 1024 MSEL, PLS-4, ELM-2, DSM-IV NR NR 0.48b (0.39–0.57) NR Screening at 4-, 6-, and 9-mo well-child visit 
Kamio et al,44  Japan M-CHAT-JV 18–36 1851 DSM-IV-TR, ADI-R, ADOS, CARS 0.231 (0.087–0.425) 0.991 (0.988–0.995) 0.333 (0.126–0.614) NR Community cohort representing 82% of children in Fukuoka 
Hardy et al,45  United States M-CHAT-R; M-CHAT-R/F 16–31 2848 Vineland II, ASI, ADOS, MSEL 0.81 (0.72–0.90) NR 0.076b (0.06–0.09)
0.27b,c (0.22–0.32) 
NR In PCP office 
Eugenin et al,10  Chile M-CHAT; M-CHAT/F 16–30 200 Developmental evaluation by specialist NR NR 0.045b (0.014–0.077)
0.0.40b,c (0.18–0.62) 
NR Primary health care centers 
Janvier et al,8  United States M-CHAT; M-CHAT/F 16–76 351 ADOS NR NR 0.15 (0.11–0.18)b
0.37b,c (0.29–0.44) 
NR Head start and child care program 
Janvier et al,8  United States SCQ 16–76 848 ADOS NR NR 0.75b (0.67–0.83) NR Head start and child care program 

ADI-R, Autism Diagnostic Interview–Revised; ADOS, Autism Diagnostic Observation Schedule; ADOS-G, Autism Diagnostic Observation Schedule–Generic; ADOS-T, autism diagnostic observation schedule (toddler version); ASI, Autism Spectrum Interventions; ASQ, Ages and Stages Questionnaire; BASC, Behavior Assessment system for Children; CARS, Childhood Autism Rating Scale; DD, developmental delay; DISCO-11, Diagnostic Interview of Social and Communication Disorders (version 11); ELM-2, Early Language Milestone Scale (second edition); FUI, follow-up interview; M-CHAT-JV, Modified Checklist for Autism in Toddlers, Japanese Version; MSEL, Mullen Scales of Early Learning; NR, not reported; PCP, primary care provider; PDDST-II, Pervasive Developmental Disorders Screening Test II; PEP-R, Psychoeducational Profile–Revised; PLS-4, Preschool Language Scale (version 4); S1, low and high risk; S2, low risk; SCQ, Social Communication Questionnaire; VABS, Vineland Adaptive Behavior Scale; YACHT, Young Autism and Other Developmental Disorders Checkup Tool.

a

Used a mixed sample of general population and early intervention.

b

Calculated by author J.F.

c

M-CHAT/F interview.

d

PPV for positive result on either M-CHAT or ITC.

e

Racial or ethnic minority sample.

f

Non-Hispanic or Non-Latino white sample only.

TABLE 6

Poor-Quality Studies (N = 2)

Reference and CountryToolAges (mo)nReference InstrumentsSensitivity (95% CI)Specificity (95% CI)PPV (95% CI)NPV (95% CI)Location of Screening and Comments
Seung et al,46  Korea K-M-CHAT 16–36 2048 None NR NR NR NR Screened at day care centers, public health centers, hospitals, private pediatric center 
Cuesta-Gómez et al,9  Argentina M-CHAT; M-CHAT/F 18–24 420 DSM-IV-TR NR NR 0.111a (0.37–0.185)
1.00a,b 
NR Health centers, community action centers 
Reference and CountryToolAges (mo)nReference InstrumentsSensitivity (95% CI)Specificity (95% CI)PPV (95% CI)NPV (95% CI)Location of Screening and Comments
Seung et al,46  Korea K-M-CHAT 16–36 2048 None NR NR NR NR Screened at day care centers, public health centers, hospitals, private pediatric center 
Cuesta-Gómez et al,9  Argentina M-CHAT; M-CHAT/F 18–24 420 DSM-IV-TR NR NR 0.111a (0.37–0.185)
1.00a,b 
NR Health centers, community action centers 

K-M-CHAT, Korean M-CHAT; NR, not reported.

a

Calculated by author J.F.

b

M-CHAT/F interview.

This specific review included studies of screening strategies for young children (<13 years of age) in PC and PC-like settings. No studies were identified meeting inclusion criteria for children in the 6 to 12 age range.

Checklist for Autism in Toddlers

The Checklist for Autism in Toddlers (CHAT) was developed to identify ASD at 18 months via parent report and practitioner observation of early red flags (eg, lack of pointing, pretend play).23  Two studies were included, of good and fair quality, respectively.23,32 

In the good-quality study, researchers managed a birth cohort of children screened in PC at 18 months.23  ASD diagnostic data were linked to CHAT scores and clinical diagnosis for psychometric performance characteristics. Use of the CHAT resulted in substantial underidentification of ASD (ie, low sensitivity and modest positive predictive value [PPV]) in a low-risk population. In the fair-quality study of the CHAT, it was not possible to extract true sensitivity, specificity, or NPV. A PPV of 0.43 was calculated by author J.F.; however, this number should be interpreted with caution because of the small sample size and large loss to follow-up.32 

Modified Checklist for Autism in Toddlers

The Modified Checklist for Autism in Toddlers (M-CHAT) is a parent-completed questionnaire to identify children 16 to 30 months at risk for a diagnosis of ASD.30  In several population-based studies, researchers have examined its psychometric characteristics, and revisions of the form and content have decreased the number of questions from the original 23 (M-CHAT) to 20, with additional items as follow-up questions for screen failures (Modified Checklist for Autism in Toddlers, Revised with Follow-Up [M-CHAT-R/F]).26,41  This instrument has been translated into several languages and studied in multiple countries.37,39,41,44,46,47  Nineteen studies of the M-CHAT were included, with 3 being good quality, 14 being fair quality, and 2 being poor quality (Tables 4 through 6).

In the largest M-CHAT study,40  presenting cumulative data from a group or series of investigations, children were screened with the M-CHAT at well-child visits. Of the positive M-CHAT screening results, 74.6% completed the Modified Checklist for Autism in Toddlers–Follow-Up (M-CHAT/F), and 21% remained screen positive. Subsequently, 60.7% of those who continued to screen positive after follow-up completed a diagnostic evaluation. Over half (53.8%) received an ASD diagnosis, and 97.7% were identified as having a Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) diagnosis or developmental concerns. Because not all children screened received a diagnostic evaluation, sensitivity and specificity could not be calculated. The authors attempted to identify missed cases through multiple methods (eg, provider concerns, administration of other screeners), but the rules and data related to this procedure were not clear enough to calculate psychometric characteristics. In the same capacity, given limitations regarding these data (eg, nonrandom sampling, unclear rules for pediatrician flags and screening), there are insufficient data for understanding true sensitivity, specificity, and NPV, and therefore the study received a fair rating.

In a second good-quality study, the researchers examined the reliability and validity of the M-CHAT and M-CHAT-R/F in a large population of toddlers at well-child visits.26  Of M-CHAT positive screening results, 82% were followed-up with the M-CHAT-R/F, and 36.8% remained screen positive. Of those who screened positive for both stages of the M-CHAT-R/F, 63% had a diagnostic evaluation. Almost half of this group (48%) received an ASD diagnosis, 47% were classified as having other concerns, and 5% were classified as typically developing. PPV was higher when taking into account all actionable developmental concerns (0.95 vs 0.48). To test potential missed cases, one site conducted a stratified sampling of children who screened positive on M-CHAT-R/F who passed the follow-up interview. Some children who screened negative on M-CHAT-R/F were also identified as at risk by physician concerns and other screeners, and additional children were identified with ASD.40  Given the limitations regarding these data (eg, nonrandom sampling, unclear rules for pediatrician flags, and alternate scoring), there are still concerns about understanding true sensitivity, specificity, and NPV. Additional good-quality studies report PPVs for the M-CHAT between 0.14 and 0.5 and M-CHAT/F between 0.48 and 0.67.27,28 

Seven investigators reported using translations of the M-CHAT or M-CHAT/F.10,28,37,39,41,44,46  One study in France was of good quality, with moderate sensitivity (0.67), high specificity (0.94 and 0.99 with follow-up), low to moderate PPV (0.14 and 0.60 with follow-up), and high NPV (0.99).28  The other studies were of fair or poor quality.10,37,39,41,44,46  Given substantial modifications in use and translation, it is not clear how these instruments might relate to other US population studies.

Although the quality of the English studies of the M-CHAT and M-CHAT/F limits interpretation of population-level psychometric properties, data are available in which it is suggested that screening identified children before documented parent and provider concerns. In one fair-quality study, <20% of children screening at risk were flagged by surveillance.35  An additional fair-quality study in the United States screened toddlers at all scheduled medical appointments (well-child, sick, follow-up, injection visit) over 6 months at a large community-based pediatric practice.38  They used a combination of the M-CHAT and the Infant Toddler Checklist (ITC) and managed failures in either instrument. Although no PPV can be calculated for the M-CHAT independently, the combined ITC and/or M-CHAT PPV is estimated at 0.27. In this study, less than two-thirds (60%) of cases were identified before parent concerns, and 50% were identified before pediatrician concerns.38 

ITC

The ITC is a communication and broad developmental screener that is part of a larger assessment system, the Communication and Symbolic Behavior Scale (CSBS) Developmental Profile.24  The ITC was not designed as a stand-alone screener for ASD but has been studied in relation to identification of ASD. The ITC is a 24-item questionnaire of specific social communication milestones, with an overall question regarding presence of concerns. Cutoff scores are available from normative samples for children <24 months of age. Three studies were included, 2 good quality24,25  and 1 fair quality.38 

In the first good-quality study, children (6–24 months) were recruited from PC settings.24  Children identified by screening positive or from parental concerns were further assessed with the CSBS sampling technique (ie, brief interactive assessment coded by researchers). Data were linked to an ASD prevalence database to identify ASD risk through questionnaire results, CSBS assessment data, and state administrative data. In the study, the authors reported a sensitivity of 0.933 and a calculated PPV of 0.69 to 0.77. There was substantial variability in repeated screens and timing of positive screens, which limits the understanding of the ITC as a single point and time screener. In addition, the population questionnaire surveillance technique resulted in a low yield rate, which may affect understanding of true sensitivity of the measure. In the second good-quality study, researchers examined the effectiveness of the ITC as an ASD screener at 12-month well-child visits conducted by 137 pediatricians.25  A small subset of children who screened negative was referred to as a nonrandom “typically developing” control sample. A total of 73% of this group was ultimately designated with normal development. Of screen failures, 17% were diagnosed with ASD, 30% had language delay, 5% had developmental delay, 20% had another diagnosis, and 25% were false-positive results. PPV was substantially increased when inclusive of all developmental diagnoses to 0.75. In the fair-quality study, researchers screened toddlers with the ITC and the M-CHAT, as reviewed above.38 

Psychological Development Questionnaire–1

The Psychological Development Questionnaire–1 (PDQ-1) is a 10-item questionnaire that primarily reflects caregiver evaluation of their child’s level of communication and social orientation. It was developed in 2018 and therefore has only been evaluated in one known study by the measure authors.29  This study was of good quality and included a large and diverse sample.29  Particularly among children 31 to 36 months, the PDQ-1 exhibited very high psychometric properties, with high sensitivity (0.71–1.0), high specificity (0.99–1.0), PPV of 0.79–1.0, and NPV of 0.99–1.0.

Other Screeners

In additional studies in the United States included in this review, researchers used the Social Communication Questionnaire and the Pervasive Developmental Disorders Screening Test II as potential screeners, but only 1 fair-quality study of each was documented, which limited our ability to understand potential value,8,43  particularly in children >31 months.8  Two other instruments, the Early Screening of Autistic Traits (ESAT) and the Young Autism and Other Developmental Disorders Checkup Tool, were used in other countries for unselected population screening in children <18 months of age.31,36  Because of limited relevance to common PC practices (eg, involving extensive procedural evaluation), they are not discussed in this review.

No studies directly addressed the benefits of referral for early treatment after screening within the study. Available data are limited to either screening in relation to initiation of service24,25,35,38,40,48  or studies of intervention with no direct link to screening and identification.4,49 

No included studies addressed harm of screening.

Results of this systematic review documents that widely available ASD screeners, when incorporated into universal screening of unselected pediatric populations between 16 and 40 months of age, can accurately identify many children with ASD. The most extensively studied tool (ie, M-CHAT family of screeners) has demonstrated PPVs of 0.06 to 0.60 in the highest-quality studies.28,40  However, this variability highlights that the screener differs in its effectiveness to identify ASD in children across ages, settings, and contexts, with psychometrics varying substantially on the basis of use of follow-up interview for validation. Importantly, when other developmental concerns are incorporated into assessment of PPV, false-positives are infrequent. Only one included study examined psychometric properties or characteristics of universal screening for children >40 months.8 

Despite these modest to high PPVs, methods employed to study ASD screening at present remain insufficient to understand additional psychometric properties. Specifically, few of the studies provided adequate data of random or cohort samples to determine the frequency of missed cases. In addition, the attrition seen in study validation is substantial across almost all employed tools. In future studies, researchers should focus on longitudinal population-based samples to address issues of NPV as well as provide clearer data regarding the link of screening to improved outcomes.

Comparing estimated prevalence rates of ASD in community population studies (ie, 1 in 591  in 8-year-olds and 1 in 75 in 4-year-olds50 ) is a gross validation method for evaluation of population identification. Calculated rates for studies included in this review appeared modest at 1 in 149 and 1 in 300.26,30,40  These numbers are below the conservative estimated prevalence bands based on epidemiological studies.1,5,50  It is important to note that almost all of these screening studies have (1) involved research opt-in follow-up procedures, which may skew the sample; (2) experienced significant attrition; and (3) excluded patients previously referred for concerns based on developmental surveillance. In this capacity, the PPVs reported may represent conservative estimates of the ability of screening tools and broader surveillance processes to identify children with ASD. Such selection and evaluation within this systematic review are intentional because it approximates screening in the absence of concerns in the PC office, where parents (or providers) may not have identified concerns to discuss during routine care. However, in most cases, screening in the absence of concerns represents only a portion of our understanding of the process of early identification of ASD. If children are referred earlier to services for general developmental or specific ASD concerns and are subsequently excluded from this type of evaluation of screening results, one must be cautious in interpreting base rates of identification. It is possible that systematic specific ASD screening of young children when combined with identification of concerns via surveillance and general developmental screening before designated screening time points may identify a much larger percentage of children. Unfortunately, few studies have included methodologies for systematic evaluation of deployment of hybrid surveillance screening procedures. This is unfortunate because real-world pediatric care involves not just simple point-in-time screening but a variety of deployed surveillance and repeated screening practices over time. This suggests the need for pragmatic clinical trials of such strategies to optimize our ability to identify young children as well as develop coherent strategies for monitoring for ASD symptoms at later ages, given the near absence of data to this end.

Other well-studied tools in comparison with the MCHAT-R/F have demonstrated modest psychometric characteristics when used as single-point screeners of well-child populations. The ITC can be used to identify many children with actionable developmental concerns, before parental and clinician concern between 12 and 24 months of age, but using it in isolation may label ≥10% of the total low-risk population at risk for ASD or other communication disorders.24  Other screening measures included in this review currently do not have replicated results suggestive of superior performance, although new instruments and approaches may be able to improve psychometric functioning.29  There are some emerging data from non-US–based programs and adaptations regarding the potential for combined screening and surveillance and prospective strategies involving community health visitation and trained providers to identify ASD at early ages.49,51  However, these tools have not yet been studied in PC settings and practices in the United States.

In several included articles, it is highlighted that disparities in access to use of screeners and diagnostic resources exist across racial, ethnic, linguistic, and socioeconomic lines.810  Even among those who have been screened, children of families with lower education or who identify as racial minorities were more likely to screen as a false-positive.42  Additionally, those with lower educational attainment had lower odds of completing the follow-up interview compared with those with more education.42  Disparities may be reduced by having early child care providers screen for ASD in their classrooms8  or having the screen administered by an interviewer instead of being self-administered.9 

Although there have been many studies of ASD screeners for young children in clinical and convenience samples, there are few studies of universal screening practices in PC settings, and there is a dearth of knowledge about screening beyond the early toddler and preschool years. We explicitly excluded studies of screeners used in preidentified risk or diagnostic samples because screening these groups may represent a different population than children identified during universal screening in general pediatric care. Including children who have been referred may give information about factors associated with an early diagnosis, which is not the aim of this review. Consequently, excluding these studies from our review resulted in a small number of included studies. Findings of this report should be interpreted with significant caution in that they represent studies not of “universal screening” in the broadest form but rather “universal screening” excluding children who have been previously referred for risk or previously identified with neurodevelopment concerns. In reality, data inclusive of children identified by surveillance and screening procedures would ultimately provide a more-complete picture of how effective these processes may be.

Other major limitations are lack of data and methods to examine screening processes and tools and little information on how to capture potential false-negatives during the screening process. Few included studies beyond the original CHAT investigation adequately follow a large enough random sample of children to be able to comment on NPV, specificity, or sensitivity in population form.23  The available included studies also had high rates of attrition, although our assessment of quality did not evaluate studies on the basis of attrition. There is little information available about children who fail to complete referrals for diagnostic evaluations. Data from these studies lack how psychometric performance characteristics of screeners in PC settings may vary on the basis of child and family risk factors. Similarly, included studies of the psychometric functioning of early screeners were almost universally accomplished with the aid of specific research supports and processes, with limited data about how feasible all stages of the screeners may be in pragmatic implementation (ie, required 2-stage implementation of screener and algorithmic interview). Finally, in the data, it was suggested that children whose screening results placed them in the at-risk category and who were later diagnosed with ASD received higher levels of service in comparison with children without such screening results.52  However, systematic prospective studies have not yet been conducted that examine service access and the impact of service delivery postdiagnosis for children screened at risk.

To date, most available data regarding screening for risk for a diagnosis of ASD involve preschool children. Children with ASD who are identified after the age of 4 years may reflect differences in development of social skills in the context of adequate language development and/or milder behavioral issues. There is a critical need for development of enhanced tools and systematic study of older children.

As ASD screening takes place in a broader surveillance process wherein pediatric providers monitor many aspects of growth, development, and health, and it is urgent to develop streamlined ways to integrate these screeners that can be evaluated in a methodologically rigorous manner (ie, pragmatic trials). Specifically, data inclusive of children identified by surveillance and screening procedures would provide a more-complete picture of how effective these processes may be in shaping child outcomes. Similarly, studies of comparative implementation of different screening and surveillance practices and tools (eg, repeated screening procedures, combined use of measures and observers, direct comparison of available tools, validation strategies) would be valuable. Despite resource challenges, research adequately evaluating participants who screened negative would provide valuable data. Similarly, data directly evaluating traditional intermediate and longer-term health outcomes, as well as pertinent outcomes for the ASD population (eg, quality of child and family life, adaptive and social integration measures), are lacking. There are potential financial and human costs and harm to individuals, families, and systems related to optimal deployment of screening tools and processes; however, little empirical evidence of these factors exists. Moreover, given the fundamental heterogeneity of ASD of developmental and behavioral presentation, comprehensive study of the characteristics of children identified at certain ages with specific tools is necessary to the development of processes designed to pick up heterogeneous profiles at later ages.

We found significant evidence that formal ASD screening of children at young ages (16–40 months) in general pediatric practices has PPVs for identifying children with ASD of ∼50%.23,24,2629,40  When considering the ability of this screening to identify other developmental conditions, the PPV rises to ∼95%.26,40  Little information is available on the degree to which screening tools miss positive cases. In addition, although there are separate studies of potential benefits of screening in initiation of services and of potential intervention benefit, no studies directly relating screening to clinical outcomes and/or harm exist. Given the pragmatic, ethical, and political challenges associated with establishing this linkage, clinical and policy groups will likely have to continue to guide screening practices in absence of this evidence. Essentially, such recommendations and practice behaviors must act on data documenting (1) that screening unselected young children will pick up many, but not all, children and (2) that early treatment can improve certain outcomes for children, with substantial variation in terms of what treatments are available and what benefits will be seen.

In this capacity, practical implications of these findings are simultaneously supportive of (1) potential use of validated ASD screening tools at young ages in PC in unselected populations, acknowledging known psychometric and service limits of available use, and (2) the urgent need for rigorous development and evaluation of enhanced strategies for more effectively identifying even-larger groups of young and school-aged children with ASD across varied systems of care.

Drs Levy and Warren participated in data collection (review of abstracts and full studies) and interrater reliability activities for abstraction and quality determination, drafted the initial manuscript, and reviewed and revised the manuscript; Ms Wolfe participated in coordinating data collection, data collection (review of abstracts and full studies), and contributed to writing and editing the manuscript; Dr Coury and Mr Farmer participated in data collection (review of abstracts and full studies) and reviewed and revised the manuscript; Drs Duby, Schor, and Van Cleave participated in data collection (review of full studies) and reviewed and revised the manuscript; and all authors participated in conceptualizing and designing the study and data collection instruments, approved the final manuscript as submitted, and agree to be accountable for all aspects of the work.

FUNDING: Supported by the Health Resources and Services Administration of the US Department of Health and Human Services under cooperative agreement UA3 MC11054–Autism Intervention Research Network on Physical Health. This information or content and conclusions are those of the authors and should not be construed as the official position or policy of, nor should any endorsements be inferred by, the Health Resources and Services Administration, US Department of Health and Human Services, or the US Government. This work was conducted through the Autism Speaks Autism Treatment Network serving as the Autism Intervention Research Network on Physical Health.

ASD

autism spectrum disorder

CHAT

Checklist for Autism in Toddlers

CSBS

Communication and Symbolic Behavior Scales

DSM-IV

Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition

DSM-IV-TR

Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision

ESAT

Early Screening of Autistic Traits

ITC

Infant Toddler Checklist

KQ

key question

M-CHAT

Modified Checklist for Autism in Toddlers

M-CHAT/F

Modified Checklist for Autism in Toddlers–Follow-Up

M-CHAT-R/F

Modified Checklist for Autism in Toddlers, Revised with Follow-Up

NPV

negative predictive value

PC

primary care

PDQ-1

Psychological Development Questionnaire–1

PPV

positive predictive value

USPSTF

US Preventive Services Task Force

1
Christensen
DL
,
Baio
J
,
Van Naarden Braun
K
, et al;
Centers for Disease Control and Prevention (CDC)
.
Prevalence and characteristics of autism spectrum disorder among children aged 8 years–autism and developmental disabilities monitoring network, 11 sites, United States, 2012
.
MMWR Surveill Summ
.
2016
;
65
(
3
):
1
23
2
Mandell
DS
,
Cao
J
,
Ittenbach
R
,
Pinto-Martin
J
.
Medicaid expenditures for children with autistic spectrum disorders: 1994 to 1999
.
J Autism Dev Disord
.
2006
;
36
(
4
):
475
485
3
Warren
Z
,
McPheeters
ML
,
Sathe
N
,
Foss-Feig
JH
,
Glasser
A
,
Veenstra-Vanderweele
J
.
A systematic review of early intensive intervention for autism spectrum disorders
.
Pediatrics
.
2011
;
127
(
5
).
4
Weitlauf
AS
,
McPheeters
ML
,
Peters
B
, et al
.
Therapies for Children With Autism Spectrum Disorder: Behavioral Interventions Update [Internet]
.
Rockville, MD
:
Agency for Healthcare Research and Quality
;
2014
5
Baio
J
,
Wiggins
L
,
Christensen
DL
, et al
.
Prevalence of autism spectrum disorder among children aged 8 years - autism and developmental disabilities monitoring network, 11 sites, United States, 2014
.
MMWR Surveill Summ
.
2018
;
67
(
6
):
1
23
6
Stahmer
AC
,
Mandell
DS
.
State infant/toddler program policies for eligibility and services provision for young children with autism
.
Adm Policy Ment Health
.
2007
;
34
(
1
):
29
37
7
Gabrielsen
TP
,
Farley
M
,
Speer
L
,
Villalobos
M
,
Baker
CN
,
Miller
J
.
Identifying autism in a brief observation
.
Pediatrics
.
2015
;
135
(
2
).
8
Janvier
YM
,
Harris
JF
,
Coffield
CN
, et al
.
Screening for autism spectrum disorder in underserved communities: early childcare providers as reporters
.
Autism
.
2016
;
20
(
3
):
364
373
9
Cuesta-Gómez
J
,
Andrea Manzone
A
,
Posada-De-La-Paz
M
.
Modified checklist for autism in toddlers cross-cultural adaptation for Argentina
.
Int J Dev Disabil
.
2016
;
62
(
2
):
117
123
10
Eugenin
M
,
Moore
R
,
Martinez-Gutierrez
J
,
Perez
C
,
Carolina
A
,
Valenzuela
P
.
Screening for autism in Santiago Chile: community perspectives
.
Intl J Child Adolesc Health
.
2015
;
8
(
4
):
439
448
11
Johnson
CP
,
Myers
SM
;
American Academy of Pediatrics Council on Children With Disabilities
.
Identification and evaluation of children with autism spectrum disorders
.
Pediatrics
.
2007
;
120
(
5
):
1183
1215
12
Siu
AL
,
Bibbins-Domingo
K
,
Grossman
DC
, et al;
US Preventive Services Task Force (USPSTF)
.
Screening for autism spectrum disorder in young children: US preventive services task force recommendation statement
.
JAMA
.
2016
;
315
(
7
):
691
696
13
Coury
DL
.
Babies, bathwater, and screening for autism spectrum disorder: comments on the USPSTF recommendations for autism spectrum disorder screening
.
J Dev Behav Pediatr
.
2015
;
36
(
9
):
661
663
14
Al Maskari
TS
,
Melville
CA
,
Willis
DS
.
Systematic review: cultural adaptation and feasibility of screening for autism in non-English speaking countries
.
Int J Ment Health Syst
.
2018
;
12
:
22
15
Stewart
LA
,
Lee
LC
.
Screening for autism spectrum disorder in low- and middle-income countries: a systematic review
.
Autism
.
2017
;
21
(
5
):
527
539
16
Baghdadli
A
,
Russet
F
,
Mottron
L
.
Measurement properties of screening and diagnostic tools for autism spectrum adults of mean normal intelligence: a systematic review
.
Eur Psychiatry
.
2017
;
44
:
104
124
17
Towle
PO
,
Patrick
PA
.
Autism spectrum disorder screening instruments for very young children: a systematic review
.
Autism Res Treat
.
2016
;
2016
:
4624829
18
Yuen
T
,
Penner
M
,
Carter
MT
,
Szatmari
P
,
Ungar
WJ
.
Assessing the accuracy of the Modified Checklist for Autism in Toddlers: a systematic review and meta-analysis
.
Dev Med Child Neurol
.
2018
;
60
(
11
):
1093
1100
19
Sánchez-García
AB
,
Galindo-Villardón
P
,
Nieto-Librero
AB
,
Martín-Rodero
H
,
Robins
DL
.
Toddler screening for autism spectrum disorder: a meta-analysis of diagnostic accuracy
.
J Autism Dev Disord
.
2019
;
49
(
5
):
1837
1852
20
Hirota
T
,
So
R
,
Kim
YS
,
Leventhal
B
,
Epstein
RA
.
A systematic review of screening tools in non-young children and adults for autism spectrum disorder
.
Res Dev Disabil
.
2018
;
80
:
1
12
21
Arunyanart
W
,
Fenick
A
,
Ukritchon
S
,
Imjaijitt
W
,
Northrup
V
,
Weitzman
C
.
Developmental and autism screening: a survey across six states
.
Infants Young Child
.
2012
;
25
(
3
):
175
187
22
Hampton
J
,
Strand
PS
.
A review of level 2 parent-report instruments used to screen children aged 1.5-5 for autism: a meta-analytic update
.
J Autism Dev Disord
.
2015
;
45
(
8
):
2519
2530
23
Baird
G
,
Charman
T
,
Baron-Cohen
S
, et al
.
A screening instrument for autism at 18 months of age: a 6-year follow-up study
.
J Am Acad Child Adolesc Psychiatry
.
2000
;
39
(
6
):
694
702
24
Wetherby
AM
,
Brosnan-Maddox
S
,
Peace
V
,
Newton
L
.
Validation of the Infant-Toddler Checklist as a broadband screener for autism spectrum disorders from 9 to 24 months of age
.
Autism
.
2008
;
12
(
5
):
487
511
25
Pierce
K
,
Carter
C
,
Weinfeld
M
, et al
.
Detecting, studying, and treating autism early: the one-year well-baby check-up approach
.
J Pediatr
.
2011
;
159
(
3
):
458
465–e6
26
Robins
DL
,
Casagrande
K
,
Barton
M
,
Chen
CM
,
Dumont-Mathieu
T
,
Fein
D
.
Validation of the Modified Checklist for Autism in Toddlers, Revised with Follow-up (M-CHAT-R/F)
.
Pediatrics
.
2014
;
133
(
1
):
37
45
27
Sturner
R
,
Howard
B
,
Bergmann
P
, et al
.
Autism screening with online decision support by primary care pediatricians aided by M-CHAT/F
.
Pediatrics
.
2016
;
138
(
3
):
e20153036
28
Baduel
S
,
Guillon
Q
,
Afzali
MH
,
Foudon
N
,
Kruck
J
,
Rogé
B
.
The French version of the Modified-Checklist for Autism in Toddlers (M-CHAT): a validation study on a French sample of 24 month-old children
.
J Autism Dev Disord
.
2017
;
47
(
2
):
297
304
29
Zahorodny
W
,
Shenouda
J
,
Mehta
U
, et al
.
Preliminary evaluation of a brief autism screener for young children
.
J Dev Behav Pediatr
.
2018
;
39
(
3
):
183
191
30
Robins
DL
,
Fein
D
,
Barton
ML
,
Green
JA
.
The Modified Checklist for Autism in Toddlers: an initial study investigating the early detection of autism and pervasive developmental disorders
.
J Autism Dev Disord
.
2001
;
31
(
2
):
131
144
31
Dietz
C
,
Swinkels
S
,
van Daalen
E
,
van Engeland
H
,
Buitelaar
JK
.
Screening for autistic spectrum disorder in children aged 14-15 months. II: population screening with the Early Screening of Autistic Traits Questionnaire (ESAT). Design and general findings
.
J Autism Dev Disord
.
2006
;
36
(
6
):
713
722
32
VanDenHeuvel
A
,
Fitzgerald
M
,
Greiner
B
,
Perry
IJ
.
Screening for autistic spectrum disorder at the 18-month developmental assessment: a population-based study
.
Ir Med J
.
2007
;
100
(
8
):
565
567
33
Kleinman
JM
,
Robins
DL
,
Ventola
PE
, et al
.
The Modified Checklist for Autism in Toddlers: a follow-up study investigating the early detection of autism spectrum disorders
.
J Autism Dev Disord
.
2008
;
38
(
5
):
827
839
34
Pandey
J
,
Verbalis
A
,
Robins
DL
, et al
.
Screening for autism in older and younger toddlers with the Modified Checklist for Autism in Toddlers
.
Autism
.
2008
;
12
(
5
):
513
535
35
Robins
DL
.
Screening for autism spectrum disorders in primary care settings
.
Autism
.
2008
;
12
(
5
):
537
556
36
Honda
H
,
Shimizu
Y
,
Nitto
Y
, et al
.
Extraction and Refinement Strategy for detection of autism in 18-month-olds: a guarantee of higher sensitivity and specificity in the process of mass screening
.
J Child Psychol Psychiatry
.
2009
;
50
(
8
):
972
981
37
Canal-Bedia
R
,
García-Primo
P
,
Martín-Cilleros
MV
, et al
.
Modified checklist for autism in toddlers: cross-cultural adaptation and validation in Spain
.
J Autism Dev Disord
.
2011
;
41
(
10
):
1342
1351
38
Miller
JS
,
Gabrielsen
T
,
Villalobos
M
, et al
.
The each child study: systematic screening for autism spectrum disorders in a pediatric setting
.
Pediatrics
.
2011
;
127
(
5
):
866
871
39
Nygren
G
,
Sandberg
E
,
Gillstedt
F
,
Ekeroth
G
,
Arvidsson
T
,
Gillberg
C
.
A new screening programme for autism in a general population of Swedish toddlers
.
Res Dev Disabil
.
2012
;
33
(
4
):
1200
1210
40
Chlebowski
C
,
Robins
DL
,
Barton
ML
,
Fein
D
.
Large-scale use of the modified checklist for autism in low-risk toddlers
.
Pediatrics
.
2013
;
131
(
4
).
41
Kamio
Y
,
Inada
N
,
Koyama
T
,
Inokuchi
E
,
Tsuchiya
K
,
Kuroda
M
.
Effectiveness of using the Modified Checklist for Autism in Toddlers in two-stage screening of autism spectrum disorder at the 18-month health check-up in Japan
.
J Autism Dev Disord
.
2014
;
44
(
1
):
194
203
42
Khowaja
MK
,
Hazzard
AP
,
Robins
DL
.
Sociodemographic barriers to early detection of autism: screening and evaluation using the M-CHAT, M-CHAT-R, and follow-up
.
J Autism Dev Disord
.
2015
;
45
(
6
):
1797
1808
43
Samango-Sprouse
CA
,
Stapleton
EJ
,
Aliabadi
F
, et al
.
Identification of infants at risk for autism spectrum disorder and developmental language delay prior to 12 months
.
Autism
.
2015
;
19
(
3
):
327
337
44
Kamio
Y
,
Haraguchi
H
,
Stickley
A
,
Ogino
K
,
Ishitobi
M
,
Takahashi
H
.
Brief report: best discriminators for identifying children with autism spectrum disorder at an 18-month health check-up in Japan
.
J Autism Dev Disord
.
2015
;
45
(
12
):
4147
4153
45
Hardy
S
,
Haisley
L
,
Manning
C
,
Fein
D
.
Can screening with the ages and stages questionnaire detect autism?
J Dev Behav Pediatr
.
2015
;
36
(
7
):
536
543
46
Seung
H
,
Ji
J
,
Kim
SJ
, et al
.
Examination of the Korean modified checklist of autism in toddlers: item response theory
.
J Autism Dev Disord
.
2015
;
45
(
9
):
2744
2757
47
Wong
V
,
Hui
LH
,
Lee
WC
, et al
.
A modified screening tool for autism (Checklist for Autism in Toddlers [CHAT-23]) for Chinese children
.
Pediatrics
.
2004
;
114
(
2
).
48
Guevara
JP
,
Gerdes
M
,
Localio
R
, et al
.
Effectiveness of developmental screening in an urban setting
.
Pediatrics
.
2013
;
131
(
1
):
30
37
49
Reichow
B
,
Hume
K
,
Barton
E
,
Boyd
B
.
Early intensive behavioral intervention (EIBI) for young children with autism spectrum disorders (ASD)
.
Cochrane Database Syst Rev
.
2018
;
5
:
CD009260
50
Christensen
DL
,
Bilder
DA
,
Zahorodny
W
, et al
.
Prevalence and characteristics of autism spectrum disorder among 4-year-old children in the autism and developmental disabilities monitoring network
.
J Dev Behav Pediatr
.
2016
;
37
(
1
):
1
8
51
Dietz
C
,
Swinkels
SH
,
van Daalen
E
,
van Engeland
H
,
Buitelaar
JK
.
Parental compliance after screening social development in toddlers
.
Arch Pediatr Adolesc Med
.
2007
;
161
(
4
):
363
368
52
Rotholz
DA
,
Kinsman
AM
,
Lacy
KK
,
Charles
J
.
Improving early identification and intervention for children at risk for autism spectrum disorder
.
Pediatrics
.
2017
;
139
(
2
):
e20161061

Competing Interests

POTENTIAL CONFLICT OF INTEREST: The authors have indicated they have no potential conflicts of interest to disclose.

FINANCIAL DISCLOSURE: The authors have indicated they have no financial relationships relevant to this article to disclose.