Video Abstract

Video Abstract

Close modal
BACKGROUND

Early development of gross motor skills is foundational for the upcoming neurocognitive performance. Here, we studied whether at-home wearable measurements performed by the parents could be used to quantify and track infants’ developing motor abilities.

METHODS

Unsupervised at-home measurements of the infants’ spontaneous activity were made repeatedly by the parents using a multisensor wearable suit (altogether 620 measurements from 134 infants at age 4–22 months). Machine learning-based algorithms were developed to detect the reaching of gross motor milestones (GMM), to measure times spent in key postures, and to track the overall motor development longitudinally. Parental questionnaires regarding GMMs were used for developing the algorithms, and the results were benchmarked with the interrater agreement levels established by World Health Organization (WHO). A total of 97 infants were used for the algorithm development and cross-validation, whereas an external validation was done using 37 infants from an independent recruitment in the same hospital.

RESULTS

The algorithms detected the reaching of GMMs very accurately (cross-validation: accuracy, 90.9%-95.5%; external validation, 92.4%-96.8%), which compares well with the human experts in the WHO reference study. The wearable-derived postural times showed strong correlation to parental assessments (ρ = .48–.81). Individual trajectories of motor maturation showed strong correlation to infants’ age (ρ = .93).

CONCLUSIONS

These findings suggest that infants’ gross motor skills can be quantified reliably and automatically from unsupervised at-home wearable recordings. Such methodology could be used in health care practice and in all developmental studies for gaining real-world quantitation and tracking of infants’ motor abilities.

What’s Known on This Subject:

Gross motor skills are considered an efficient proxy of infants’ wider neurodevelopment, and they are used as key benchmarks in early interventions. The current approach is to survey motor milestones based on parental interviews or brief observations during clinical visits.

What This Study Adds:

We report development and external validation of a wearable method for assessing infants’ gross motor skills. Infant cohorts were longitudinally measured at homes by their parents, and automated analysis accuracy was comparable to experts’ agreement levels reported by the World Health Organization.

Assessing gross motor skills and the reaching of milestones are part of the mainstay of well-child care worldwide.1,2 They are typically assessed several times during the first 2 years of life to ensure an age-appropriate development. The assessment usually relies on parental interviews complemented by brief observations as feasible in the given pediatric clinics.3 Gross motor skills are considered to provide efficient surrogate measures of wider neurodevelopment, with significant links to earlier adversities, environmental and other acquired effects, as well as later neurodevelopmental compromise.3–10 

The global use of reaching gross motor milestones (GMMs) is based on the implicit assumptions that (1) they would be universally present and (2) that their recognition would be unequivocal to lay observers such as the child’s parent. However, the everyday experience and several studies challenge both assumptions. First, there are notable cultural, environmental, and historical variations in the developmental expression of milestone skills.3,11–13 Second, GMM assessment relies heavily on parental information that is unavoidably subjective and potentially challenging in multicultural and/or multilinguistic environments.14 In addition, using discrete milestones for longitudinal tracking of development is often challenged by the considerable intra- and interindividual variation over time.15–17 These considerations together underscore the scarcity of, and the unmet need for, genuinely objective, quantifiable, and ecologically valid measures for longitudinal neurodevelopmental follow-up.

Intuitively, infants’ performance is optimally assessed at home during spontaneous daily activity.18,19 At-home assessments became possible recently with advanced multisensor wearable suits coupled with machine learning-based analyses,20–23 which together may provide accurate and explainable analyses of infants’ motor performance during early development.21,22 Here, we studied whether wearable methodology with at-home measurements could be used for the clinically important aspects of early motor assessment, including (1) detection of reaching the key GMMs,4,24 (2) quantification of times spent in different postures, and (3) longitudinal tracking of individual’s gross motor development.

The overall study design (Figure 1A) included repeated at-home wearable measurements by the parents,21,25 which were used to develop and validate automated algorithms for the following purposes: (1) to detect the reaching of GMMs, (2) to quantify times spent in different postures, and (3) to longitudinally track an individual’s gross motor development. The study was reviewed by the Ethics Committee and approved by the New Children’s Hospital at the Helsinki University Hospital, Helsinki, Finland (HUS/80/2021), and written informed consent was obtained from the legal guardians.

FIGURE 1.

Overview of the study. (A) Wearable recordings during spontaneous activity were automatically processed for second-by-second posture and movement detections. They were used to develop the novel algorithms to detect the reaching of GMM (Figures 2 and 3), to measure times spent in selected postures (Figure 4), and to longitudinally track the holistic gross motor development (Figure 5B and C). The reference information was obtained from parental questionnaires. (B) Visual summary of the posture and movement detections (“MAIJU features”) used for the automated GMM detections. Illustration is reprinted and modified from21 under the Creative Commons license. (C) Photograph of the MAIJU wearable suit (left) an infant wearing the MAIJU suit (right). Photograph published with written parental consent.

Abbreviations: GMM, gross motor milestone; MAIJU, Motor Ability Assessment on Infants With a JUmpsuit.
FIGURE 1.

Overview of the study. (A) Wearable recordings during spontaneous activity were automatically processed for second-by-second posture and movement detections. They were used to develop the novel algorithms to detect the reaching of GMM (Figures 2 and 3), to measure times spent in selected postures (Figure 4), and to longitudinally track the holistic gross motor development (Figure 5B and C). The reference information was obtained from parental questionnaires. (B) Visual summary of the posture and movement detections (“MAIJU features”) used for the automated GMM detections. Illustration is reprinted and modified from21 under the Creative Commons license. (C) Photograph of the MAIJU wearable suit (left) an infant wearing the MAIJU suit (right). Photograph published with written parental consent.

Abbreviations: GMM, gross motor milestone; MAIJU, Motor Ability Assessment on Infants With a JUmpsuit.
Close modal

Two infant cohorts were prospectively and independently recruited for a longitudinal follow-up between ages 4 and 22 months (Table 1; Supplementary Figure 1). Cohort 1, with typically developing infants, was used for the classifier development and cross-validation. Cohort 2 was used for external validation, consisting of infants attending pediatric neurology follow-up due to an identified or suspected developmental risk. This cohort was considered external for the purpose of testing model generalizability because it was recruited separately from a different clinical follow-up program. The recruitment is described in detail in Supplementary Methods 1 and 2. By the time of concluding the present work, the infants in cohort 2 presented with the following clinical characteristics: 8 infants had a spastic hemiplegia, 3 infants had motor and cognitive delay, and 2 infants had motor delay demanding ongoing follow-up. The remaining 24 infants of cohort 2 had transient motor asymmetry, extensor tone, or mild motor delay, which normalized later.

TABLE 1.

Participant and Dataset Characteristics for Both Cohorts

Cohort 1Cohort 2P value
Participants total, n 97 37  
 Male, no. (%) 45 (46) 27 (73) <.001a 
MAIJU recordings total, n 463 157  
Male recordings, n (%) 210 (45) 102(65) <.001a 
Age at recording, mo 
 Mean (SD) 11.7 (3.9) 12.6 (3.9) .01b 
 Median (range) 11.5 (4.5–19.0) 12.4 (4.2–21.9) 
Free playtime within recording, h 
 Mean (SD) 2.2 (1.5) 1.9 (1.3) .03b 
 Median (range) 1.8 (0.5–11.3) 1.4 (0.5–6.9) 
Cohort 1Cohort 2P value
Participants total, n 97 37  
 Male, no. (%) 45 (46) 27 (73) <.001a 
MAIJU recordings total, n 463 157  
Male recordings, n (%) 210 (45) 102(65) <.001a 
Age at recording, mo 
 Mean (SD) 11.7 (3.9) 12.6 (3.9) .01b 
 Median (range) 11.5 (4.5–19.0) 12.4 (4.2–21.9) 
Free playtime within recording, h 
 Mean (SD) 2.2 (1.5) 1.9 (1.3) .03b 
 Median (range) 1.8 (0.5–11.3) 1.4 (0.5–6.9) 

Abbreviation: MAIJU, MAIJU, Motor Ability Assessment on Infants With a JUmpsuit. Statistical differences between the dataset are tested with the χ2 test for categorical variables, and with the Wilcoxon rank-sum test for continuous variables. The associated P values are reported in the fourth column.

a

P < 0.001.

b

P < 0.05.

At-Home Measurements With the MAIJU Wearable

The wearable suit (Motor Assessment of Infants with a JUmpsuit [MAIJU]; Figure 1) is a commonplace infant garment with movement sensors attached to standard locations in each limb.21,22,25 In this study, the sensors collected accelerometer and gyroscope data at 52 Hz (1024 samples per second) over Bluetooth connection to a nearby mobile device. We used local courier service for delivering the suit to the homes and back. The parents dressed the infant in the suit and encouraged the infant to be spontaneously active (ie, “free play”) for the rest of the day or for at least 1 hour.25 Upon returning to the laboratory, the data were uploaded from the mobile device to a computational server “Babacloud” for a fully automated analysis pipeline21,25 (Supplementary Methods 1 and 2).

Parental Questionnaire

The parental questionnaires were collected at each measurement session to serve as the reference for the algorithm development. The parents were requested to assess the infant’s typical performance during the previous week. Multioption scales26 were used to mitigate ambiguities in the verbal GMM definitions4 and to enable alternative definitions during analyses (Supplementary Figure 2). The parents also estimated the typical amounts of time spent in each posture from 1 (ie, least time) to 9 (ie, most time), mimicking real-life health care visits (Supplementary Methods 3).

FIGURE 2.

Cohort-level reaching of the GMMs. Reaching of GMMs as a function of age for the individual recordings (dots; cohort 1) and as trajectories (curves) depicting the empirical cumulative distributions. Note the highly comparable distributions between wearable (MAIJU, blue) and parental (yellow) results, (two-sample K-S). The black bar shows the 1st through 99th percentile age range from the WHO reference study.27 

Abbreviations: GMM, gross motor milestone; K-S, Kolmogorov-Smirnov test; MAIJU, Motor Ability Assessment on Infants With a JUmpsuit; WHO, World Health Organization.
FIGURE 2.

Cohort-level reaching of the GMMs. Reaching of GMMs as a function of age for the individual recordings (dots; cohort 1) and as trajectories (curves) depicting the empirical cumulative distributions. Note the highly comparable distributions between wearable (MAIJU, blue) and parental (yellow) results, (two-sample K-S). The black bar shows the 1st through 99th percentile age range from the WHO reference study.27 

Abbreviations: GMM, gross motor milestone; K-S, Kolmogorov-Smirnov test; MAIJU, Motor Ability Assessment on Infants With a JUmpsuit; WHO, World Health Organization.
Close modal

Choosing of GMMs

To maximize global validity, we adopted GMMs from the WHO’s multicenter reference study,4,24,27 which provides 2 benchmarking measures: (1) the cohort-level age range of reaching the GMMs27 and (2) detection accuracy estimated by expert’s interrater agreement levels.24 We focused primarily on the GMMs performed without support, ie, sitting, 4-limb crawling, standing, and walking. Additionally, “prone crawling” (ie, commando crawling) was taken to reflect early independent moving. The supported types of standing and walking are reported in Supplementary Figure 3 for a full parallel to the WHO’s reference study.4,24 

FIGURE 3.

Accuracy of the automated GMM detections from the wearable measurements. The accuracy (left) and Cohen’s κ (right) of the automated GMM detections compared with the parental information. The MAIJU-based results (blue bars with +/−95% CI; thick, development cohort 1; thin, external validation cohort 2) also compare well with the interrater agreement levels in the WHO reference study24 (black bars). The gray bars depict the clinical baseline where GMM detections are assumed from infant’s age only.

Abbreviations: GMM, gross motor milestone; MAIJU, Motor Ability Assessment on Infants With a JUmpsuit; WHO, World Health Organization.
FIGURE 3.

Accuracy of the automated GMM detections from the wearable measurements. The accuracy (left) and Cohen’s κ (right) of the automated GMM detections compared with the parental information. The MAIJU-based results (blue bars with +/−95% CI; thick, development cohort 1; thin, external validation cohort 2) also compare well with the interrater agreement levels in the WHO reference study24 (black bars). The gray bars depict the clinical baseline where GMM detections are assumed from infant’s age only.

Abbreviations: GMM, gross motor milestone; MAIJU, Motor Ability Assessment on Infants With a JUmpsuit; WHO, World Health Organization.
Close modal

Developing Automated Detection Algorithm for Reaching of GMMs

For assessing the reaching of each GMM from the wearable data, we developed GMM-specific support vector machine classifiers using parental questionnaires as the target. These GMM classifiers were based on the second-by-second-level detections of infants’ postures and movements (“MAIJU features”; Figure 1B) during the measured play time, which all come automatically from MAIJU’s analysis pipeline.21,28 Here, we used those postures and movements that were behaviorally relevant for each specific GMM (Figure 1B; Supplementary Tables 1–2). This top-down approach supports maximal transparency, interpretability, and a full backward tracking of the results in each individual infant. Notably, it follows the well-known rationale in other areas of medicine, such as cardiac monitoring, in which underlying R-peak detections are used for higher-level presentations of the cardiorespiratory status (arrythmias, heart rate variability, etc).29 

The GMM classifiers for 4-limb crawling, sitting, and prone crawling were binary (ie, yes/no), whereas the GMM classifiers for walking and standing consisted of 3 alternative categories (ie, no/supported/unsupported). Because it is possible that an infant does not spontaneously show all of the learned milestone skills during each measurement session, we devised the common clinical rationale as a decision tree algorithm (Supplementary Table 3): reaching a more advanced GMM implies reaching the less advanced GMMs as well. For instance, detection of walking would indicate that the child has also reached the ability to crawl even if it was not directly measured during the given session. All classifiers were developed using MATLAB software (MathWorks, Inc.).

Holistic Estimate of Motor Maturation With the BIMS Score

We recently showed21 that the combination of all of an infant’s postures and movements (Figure 1A) during a wearable measurement session can provide a holistic estimate of motor maturity (Supplementary Table 2). Such quantified measure of a holistic gross motor maturation was conceptualized as a normalized index BABA infant motor score (BIMS). In the BIMS scale, 0 denotes motor performance with supine-dominant preference without rolling (corresponding to an infant aged approximately 4 months), and 100 denotes motor performance with dominant fluent walking (corresponding to a an infant aged approximately 16 months). Therefore, the BIMS of a typically developing infant is expected to correlate strongly with the chronological age (therefore also called “developmental age prediction”).22 

Statistical Analysis

Leave-one-subject-out cross-validation (cohort 1 data) was used to assess performance of the GMM detection algorithms. This means that all measurements from a single subject were used as the test data at a time, and all other measurements were used for model development. The external validation (cohort 2) was done with algorithms developed and validated with the whole cohort 1.

A two-sample Kolmogorov-Smirnov test (K-S, two-tailed P values) was used to compare empirical cumulative distribution functions (Figure 2) with the null hypothesis that they are drawn from the same underlying normal distribution.

A bootstrapping method with 10 000 randomly drawn samples was used to obtain the 95% CI for the GMM detection metrics (Figure 3).

Spearman’s ρ was used to assess correlations between wearable-derived quantitations of infants’ time spent in each posture and the parental assessments of typical postural prevalences (Figure 4), as well as between BIMS and infants’ age at the time of measurement (Figure 5C). Because individual recordings are statistically independent against the comparison targets (ie, parental assessment, age), the correlations and two-tailed P values are reported without controlling for infant identities (IDs).

FIGURE 4.

Wearable-based quantitation of the time spent in key postures. The scatter plots show the relationship between the wearable-derived, objectively quantified time spent in different postures (y axis; proportion of all analyzed time) and the parent-reported “typical amount of time spent” in the given posture (x axis; cohort 1). Note how the parental reporting scale (1–9) compares with widely different ranges of objectively quantified postures, eg, in the crawl posture, range is 0% to 40%, whereas standing posture range is 0% to 80%. The data represent n = 468 recording sessions, and the ρ values are obtained using Spearman’s test.

FIGURE 4.

Wearable-based quantitation of the time spent in key postures. The scatter plots show the relationship between the wearable-derived, objectively quantified time spent in different postures (y axis; proportion of all analyzed time) and the parent-reported “typical amount of time spent” in the given posture (x axis; cohort 1). Note how the parental reporting scale (1–9) compares with widely different ranges of objectively quantified postures, eg, in the crawl posture, range is 0% to 40%, whereas standing posture range is 0% to 80%. The data represent n = 468 recording sessions, and the ρ values are obtained using Spearman’s test.

Close modal
FIGURE 5.

Tracking gross motor development with the reaching of discrete GMMs and wearable recordings. (A) The GMM-based prediction of motor maturity yields a clearly stepwise tracking, whereas (B) the wearable recordings give a continuous spectrum of motor maturity levels. The individual recordings (dots) are colored according to the parentally informed reaching of GMMs by the time of wearable recording. (C) Longitudinal tracking of individuals’ holistic gross motor development with the automated BIMS index strikingly shows consistent individual trajectories (N = 90 infants; colors indicate mean distance from the cohort average). The shaded blue colors in the background show the smoothed cohort 1 average (dark blue), ±1SD (blue), and ±2SD (light blue). Motor maturation in all plots is shown using the same unit, ie, the BIMS index, which ranges from 0 to 100.

Abbreviations: BIMS, BABA infant motor score; GMM, gross motor milestone.
FIGURE 5.

Tracking gross motor development with the reaching of discrete GMMs and wearable recordings. (A) The GMM-based prediction of motor maturity yields a clearly stepwise tracking, whereas (B) the wearable recordings give a continuous spectrum of motor maturity levels. The individual recordings (dots) are colored according to the parentally informed reaching of GMMs by the time of wearable recording. (C) Longitudinal tracking of individuals’ holistic gross motor development with the automated BIMS index strikingly shows consistent individual trajectories (N = 90 infants; colors indicate mean distance from the cohort average). The shaded blue colors in the background show the smoothed cohort 1 average (dark blue), ±1SD (blue), and ±2SD (light blue). Motor maturation in all plots is shown using the same unit, ie, the BIMS index, which ranges from 0 to 100.

Abbreviations: BIMS, BABA infant motor score; GMM, gross motor milestone.
Close modal

Linear mixed-effects modeling (LME) modeling was used to account for constant ID-specific random effects in the age-dependent BIMS scores (Figure 5). In practice, this approach accounts for the different levels of individual growth trajectories that remained strikingly stable over time. The LME model implementation was obtained from MATLAB Statistics & Machine Learning Toolbox (MathWorks, Inc.) with the following formula: Age ∼ BIMS + (1|ID).

Cohort 1 (used for the development and cross-validation of the algorithm) consisted of 97 typically developing infants (47 boys) who were measured repeatedly (1–8 times per infant; median, 5) at age 4 to 19 months for a total number of N = 463 measurements. The external validation cohort (cohort 2) consisted of 37 infants (23 boys) measured repeatedly (1–7 times per infant; median, 6) at age 4 to 22 months for a total number of N = 157 measurements. Detailed comparison of the cohorts is presented in Supplementary Table 4 and Supplementary Figure 1.

We first visualized the cohort-level reaching of each GMM, allowing a convenient comparison among the GMM algorithm, parental information, and the prior multicenter reference cohort reported in the WHO study.27 The cohort-level reaching of GMMs (Figure 2) was nearly identical between automated detections and the parental questionnaires in all milestones (K-S P > .1). Comparison with the WHO reference age windows27 shows high concordance, except for “sitting without support,” which showed later reaching in our study. This is likely due to different definitions; in our study, the parents assessed the child’s typical spontaneous behavior, whereas the WHO study tested the child’s ability to sit when purposefully tested.4 Taken together, the cohort-level reaching of GMMs is comparable between the classifier and parents, and our infant population compares well with the WHO reference data.

Accuracies of the GMM detection algorithms were assessed by comparing them with the parental questionnaire information. We also benchmarked the algorithms’ GMM detection accuracy with the empirical threshold of human equivalent performance,20,30 ie, the level of interrater agreement among trained experts in the WHO data.24 Finally, algorithm accuracies were compared with the “clinical baseline,” in which reaching of GMMs is predicted from an infant’s age alone.

The automated detection of reaching the GMMs (Figure 3; Supplementary Figure 4) was very accurate for independent walking and 4-limb crawling (accuracy and κ ranging from 94.3% to 96.8% and .86 to .93, respectively, for both cohorts) and almost as high for standing (90.9%-92.4% and .82-.84, respectively), sitting (93.1%-95.8% and .79-.84, respectively), and prone crawling (94.3%-95.5% and .77-.81, respectively). Importantly, these accuracies are at a par with human-level performance, as they clearly overlap with the interrater agreement levels in the WHO reference study.24 The algorithms’ accuracies were also found to generalize very well to the external validation data (cohort 2). As a positive control, GMM predictions from the infants’ age alone (“clinical baseline”) was clearly less accurate in infants attending neurological follow-up clinics (cohort 2). The effect sizes and other finding details are presented in Supplementary Table 4 and Supplementary Figure 4.

The fully automated quantitation of infants’ postures was then compared with the parental estimates of time that their child is typically spending in the given postures (numerical scale 1 to 9, with higher number indicating more time spent). We found strong and highly significant correlations between wearable-derived measures and parental assessments (Figure 4), with the strongest correlations for “prone,” “crawl,” and “standing” postures (Spearman’s ρ = .68, .76, .81, respectively; P < .001). The findings also reflect the fundamental problem in assessing quantities with questionnaires: the parental response is, by default, unitless, ordinal more than quantitative, and not normalized across postures. The actually measured and true times spent in different postures can vary widely and irrespective of their correlations with parental assessments. For instance, parents’ ratings of 1 to 9 for crawl posture correspond to 0% to 40% of the time spent in crawling posture, whereas parents’ ratings of 1 to 9 for standing correspond to 0% to 80% of the time spent in standing posture.

We then compared longitudinal tracking of infants’ gross motor development with 2 approaches: (1) By using the conventional, parentally informed reaching of GMMs (dichotomic, ie, yes/no) and (2) by computing the holistic motor maturation measured BIMS from the wearable data. In the typically developing infants (cohort 1), an ideal developmental measure should correlate with the infant’s age, ie, support predicting the age of the given infant. Such age prediction from the parentally informed GMMs was distinctly stepwise (Figure 5A). In contrast, the BIMS measure produced a strikingly smooth continuum across the full age range (Figure 5B), suggesting far richer information related to the gross motor development (see also Supplementary Figures 5–7 for further details). This finding prompted assessing further how BIMS allows longitudinal tracking of an individual’s developing gross motor performance. Pooling 456 repeated measures from 90 typically developing infants (cohort 1) showed that BIMS values from individual measurements correlates strongly with infants’ age (Spearman’s ρRAW = .88, P < .001). Moreover, the individuals’ BIMS trajectories appeared to proceed consistently on their respective percentile levels (“growth channels”), and the age correlations were even higher when accounting for the individual’s mean percentile level (Spearman’s ρLME = .93, P < .001; Figure 5C).

Our study shows that unsupervised at-home wearable measurements with fully automated analyses can very accurately (1) detect reaching of GMMs, (2) quantify the time spent in different postures, and (3) track individual-level gross motor development. Importantly, the accuracy in automated milestone detections were comparable with the experts’ agreement levels reported by the WHO.24 The novel wearable MAIJU accurately reflects the natural continuum of maturing gross motor skills and supports constructing individual-level growth charts of gross motor development.21,22 All of these individual-level data could be used for an objective and effective clinical support in developmental pediatrics, to benchmark therapeutic interventions or other counselling,31 to advance health care equity,32 and to support a wide range of developmental studies across disciplines.3,5–7 

Prior wearable studies on infants’ motor development have only quantified the total amount of gross movements33,34 or selected postures.23,35 The present MAIJU method introduces a clear conceptual advance: Here, all gross motor assessments are based on the fully automated and validated algorithmic detections of the infants’ canonical postures and movements.21,22,25 Such a “bottom-up” approach renders the results fully transparent and explainable, even at the individual level. Meanwhile, the practical ease of the multisensor wearable measurements makes them widely scalable across different health care, societal, and research settings.

Wearable measurements combined to intuitively understandable and transparent analytics hold promise for overcoming many daily challenges in the early neurodevelopmental assessments, such as infants’ stress-related behavior, denial to show skills in a new environment at an outpatient clinic, or inappropriate time schedule overlapping with the infant’s nap or mealtime. The reliability and accuracy of parental information is often unknown,14 whereas the limited hospital or laboratory observations are always ecologically deficient with respect to infants’ natural behavior at home.36 The wearable methods may provide complementary tools by supporting fully unsupervised, at-home measurements that can last long enough to reliably capture children’s typical gross motor abilities during their natural activity (here referred to as “free playtime”).

Introducing novel electronic health technologies to clinical use calls for a careful consideration of the aimed use case and the biological and medical validity of the newly proposed measures.32,37,38 We chose the GMMs as the primary output measure due to their worldwide recognition and routine use as proxies of motor development. The routine dichotomic reporting of reaching the milestones ignores the natural and concurrently evolving spectrum of all the acquired motor skills.3,8,12,15 We devised 2 further measures: First, the time spent in each posture, which has wide implications in developmental3,7,8,39 and clinical studies,9 because recent studies have indicated strong posture-dependence in the early language, social, and other perceptual development.6–8,16,40,41 Second, the holistic measure of motor maturation (BIMS; Figure 5B-C), which combines the situational variability in infants’ motor activity12,15 in a way that mimics the intuitive experts’ assessment. Moreover, such measures can even support building motor growth charts with strikingly stable individual trajectories.

Our work has potential limitations. First, parental assessment of time spent in postures (scale 1–9) is unitless and nonnormalized across postures, which may deflate the reported correlations. Second, the infant cohorts used for the algorithm development and external validation came from distinct populations, but they may not represent all of the variability related to cohorts recruited elsewhere or with other criteria. Our fully automated and fixed analysis pipeline removes operator-related variability, and our experience to date from several ongoing studies in other countries and settings supports strong generalizability. Nevertheless, it is essential to validate the utility of these methods in different use cases and environments.

This is the first study, to our knowledge, to demonstrate that at-home wearable measurements provide objective, reliable, quantified, scalable, and ecologically valid assessments of infants’ gross motor skills. The measures presented are biologically and behaviorally reasoned, transparent, and intuitively explainable, and they hold promise for a wide range of health and research applications such as the following: (1) detecting developmental delays earlier among infants at risk for neurodevelopmental delays, (2) studying the elusive links between gross motor delay and delays in other developmental areas, (3) supporting comparative studies across different neurodevelopmental entities and disease levels,42,43 (4) using as a clinical support system in well-child care, especially in ambiguous situations, (5) providing surrogate outcome measures to benchmark any observational and interventional studies, including global health projects,44 (6) providing actionable, individual-level targets and efficacy metrics for early (physio-)therapy, (7) devising evidence-based and even globally applicable guidelines for the early developmental care,1,13 (8) providing culture- and language-free infant assessments that directly facilitate international studies and very large-scale (big) data pooling, and, finally, (9) improving equity in infant health care by providing the same affordable electronic health solutions. With respect to developmental science in general, the ability to track an individual’s gross motor development with growth charts shows that the overall gross motor development follows predictable trajectories,3,8,22,39 with precision that goes far beyond the conventionally assessed, discrete reaching of milestones.

Dr Airaksinen conceptualized the study, wrote the original draft, performed all analyses, and created all visualizations. Dr Vanhatalo and Dr Haataja were involved in conceptualization, writing of the original draft, and project supervision. Ms Gallen, Mrs Taylor, Ms de Sena, and Mrs Palsa conducted the investigation and data collection, and contributed to the reviewing and editing of the manuscript. All authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work. The data or materials for the experiments reported here can be made available at reasonable request and within relevant legal constraints. Please contact Dr Vanhatalo ([email protected]) for requests.

CONFLICT OF INTEREST DISCLOSURES: The authors do not have conflicts of interest to disclose.

FUNDING: This research was supported by the Research Council of Finland, Suomen Aivosäätiö, Sigrid Juselius Foundation, and HUS New Children’s Hospital/HUS Diagnostic Center research funds. The grants include the following: Research Council of Finland grant 343498 (to Dr Airaksinen); Research Council of Finland grant 335788 (to Dr Vanhatalo); Research Council of Finland grant 332017 (to Dr Vanhatalo); Finnish Pediatric Foundation (Lastentautien säätiö; to Dr Vanhatalo); Aivosäätiö (to Dr Vanhatalo); Sigrid Juselius Foundation (to Dr Vanhatalo); HUS New Children’s Hospital/HUS Diagnostic Center research funds (to Dr Vanhatalo and Dr Airaksinen). The funders did not have any role in study design, data collection, data analyses, interpretation or writing of the report.

BIMS

BABA infant motor score

GMM

gross motor milestone

ID

identity

K-S

Kolmogorov-Smirnov test

LME

linear mixed-effects model

MAIJU

Motor Ability Assessment on Infants With a JUmpsuit

WHO

World Health Organization

1
World Health Organization
.
Motor development milestones
. Published
2006
. Accessed December 7, 2022. https://www.who.int/tools/child-growth-standards/standards/motor-development-milestones
2
Centers for Disease Control and Prevention
.
CDC’s Developmental Milestones
. Published
2023
. Accessed April 29, 2024. https://www.cdc.gov/ncbddd/actearly/milestones/index.html
3
DeMasi
A
,
Schoneveld
E
,
Berger
SE
.
Three decades of infant motor development: cohort effects in motor skill onsets
.
Dev Psychol.
2024
;
60
(
2
):
243
254
. PubMed doi: 10.1037/dev0001628
4
Wijnhoven
TMA
,
de Onis
M
,
Onyango
AW
, et al
.
Assessment of gross motor development in the WHO Multicentre Growth Reference Study
.
Food Nutr Bull.
2004
;
25
(1)(
suppl
):
S37
S45
. PubMed doi: 10.1177/15648265040251S106
5
Adolph
KE
,
Hoch
JE
.
Motor development: embodied, embedded, enculturated, and enabling
.
Annu Rev Psychol.
2019
;
70
(
1
):
141
164
. PubMed doi: 10.1146/annurev-psych-010418-102836
6
Franchak
JM
,
Kadooka
K
,
Fausey
CM
.
Longitudinal relations between independent walking, body position, and object experiences in home life
.
Dev Psychol.
2024
;
60
(
2
):
228
242
. PubMed doi: 10.1037/dev0001678
7
Schneider
JL
,
Iverson
JM
.
Cascades in action: how the transition to walking shapes caregiver communication during everyday interactions
.
Dev Psychol.
2022
;
58
(
1
):
1
16
. PubMed doi: 10.1037/dev0001280
8
Adolph
KE
,
Franchak
JM
.
The development of motor behavior
.
Wiley Interdiscip Rev Cogn Sci.
2017
;
8
(
1–2
):
e1430
. PubMed doi: 10.1002/wcs.1430
9
Boonzaaijer
M
,
Suir
I
,
Mollema
J
,
Nuysink
J
,
Volman
M
,
Jongmans
M
.
Factors associated with gross motor development from birth to independent walking: a systematic review of longitudinal research
.
Child Care Health Dev.
2021
;
47
(
4
):
525
561
. PubMed doi: 10.1111/cch.12830
10
Hewitt
L
,
Kerr
E
,
Stanley
RM
,
Okely
AD
.
Tummy time and infant health outcomes: a systematic review
.
Pediatrics.
2020
;
145
(
6
):
e20192168
. PubMed doi: 10.1542/peds.2019-2168
11
Karasik
LB
,
Tamis-LeMonda
CS
,
Adolph
KE
,
Bornstein
MH
.
Places and postures: a cross-cultural comparison of sitting in 5-month-olds
.
J Cross Cult Psychol.
2015
;
46
(
8
):
1023
1038
. PubMed doi: 10.1177/0022022115593803
12
Schneider
JL
,
Iverson
JM
.
Equifinality in infancy: the many paths to walking
.
Dev Psychobiol.
2023
;
65
(
2
):
e22370
. PubMed doi: 10.1002/dev.22370
13
Karasik
LB
,
Robinson
SR
.
Milestones or millstones: how standard assessments mask cultural variation and misinform policies aimed at early childhood development
.
Policy Insights Behav Brain Sci.
2022
;
9
(
1
):
57
64
. doi: 10.1177/23727322211068546
14
Mendonça
B
,
Sargent
B
,
Fetters
L
.
Cross-cultural validity of standardized motor development screening and assessment tools: a systematic review
.
Dev Med Child Neurol.
2016
;
58
(
12
):
1213
1222
. PubMed doi: 10.1111/dmcn.13263
15
Tupsila
R
,
Siritaratiwat
W
,
Bennett
S
,
Mato
L
,
Keeratisiroj
O
.
Intra-individual variability in gross motor development in healthy full-term infants aged 0–13 months and associated factors during child rearing
.
Children (Basel).
2022
;
9
(
6
):
801
. PubMed doi: 10.3390/children9060801
16
Iverson
JM
.
Developmental variability and developmental cascades: lessons from motor and language development in infancy
.
Curr Dir Psychol Sci.
2021
;
30
(
3
):
228
235
. PubMed doi: 10.1177/0963721421993822
17
Hadders-Algra
M
.
Early human motor development: from variation to the ability to vary and adapt
.
Neurosci Biobehav Rev.
2018
;
90
:
411
427
. PubMed doi: 10.1016/j.neubiorev.2018.05.009
18
Sacks
L
,
Kunkoski
E
,
Noone
M
.
Digital health technologies in pediatric trials
.
Ther Innov Regul Sci.
2022
;
56
(
6
):
929
933
. PubMed doi: 10.1007/s43441-021-00374-w
19
Subbiah
V
.
The next generation of evidence-based medicine
.
Nat Med.
2023
;
29
(
1
):
49
58
. PubMed doi: 10.1038/s41591-022-02160-z
20
Airaksinen
M
,
Räsänen
O
,
Ilén
E
, et al
.
Automatic posture and movement tracking of infants with wearable movement sensors
.
Sci Rep.
2020
;
10
(
1
):
169
. PubMed doi: 10.1038/s41598-019-56862-5
21
Airaksinen
M
,
Gallen
A
,
Kivi
A
, et al
.
Intelligent wearable allows out-of-the-lab tracking of developing motor abilities in infants
.
Commun Med (Lond).
2022
;
2
(
1
):
69
. PubMed doi: 10.1038/s43856-022-00131-6
22
Airaksinen
M
,
Taylor
E
,
Gallen
A
, et al
.
Charting infants’ motor development at home using a wearable system: validation and comparison to physical growth charts
.
EBioMedicine.
2023
;
92
:
104591
. PubMed doi: 10.1016/j.ebiom.2023.104591
23
Franchak
JM
,
Tang
M
,
Rousey
H
,
Luo
C
.
Long-form recording of infant body position in the home using wearable inertial sensors
.
Behav Res Methods.
2024
;
56
(
5
):
4982
5001
. PubMed doi: 10.3758/s13428-023-02236-9
24
WHO Multicentre Growth Reference Study Group
.
Reliability of motor development data in the WHO Multicentre Growth Reference Study
.
Acta Paediatr.
2006
;
95
(
S450
):
47
55
. doi: 10.1111/j.1651-2227.2006.tb02375.x
25
Taylor
E
,
Airaksinen
M
,
Gallen
A
, et al
.
Quantified assessment of infant’s gross motor abilities using a multisensor wearable
.
J Vis Exp.
2024
;
207
(
207
). PubMed doi: 10.3791/65949
26
Haataja
L
,
Mercuri
E
,
Regev
R
, et al
.
Optimality score for the neurologic examination of the infant at 12 and 18 months of age
.
J Pediatr.
1999
;
135
(
2 Pt 1
):
153
161
. PubMed doi: 10.1016/S0022-3476(99)70016-8
27
WHO Multicentre Growth Reference Study Group
.
WHO Motor Development Study: windows of achievement for six gross motor development milestones
.
Acta Paediatr.
2006
;
95
(
S450
):
86
95
. doi: 10.1111/j.1651-2227.2006.tb02379.x
28
Airaksinen
M
,
Vaaras
E
,
Haataja
L
,
Räsänen
O
,
Vanhatalo
S
.
Automatic assessment of infant carrying and holding using at-home wearable recordings
.
Sci Rep.
2024
;
14
(
1
):
4852
. PubMed doi: 10.1038/s41598-024-54536-5
29
Rajendra Acharya
U
,
Paul Joseph
K
,
Kannathal
N
,
Lim
CM
,
Suri
JS
.
Heart rate variability: a review
.
Med Biol Eng Comput.
2006
;
44
(
12
):
1031
1051
. PubMed doi: 10.1007/s11517-006-0119-0
30
Tapani
KT
,
Vanhatalo
S
,
Stevenson
NJ
.
Time-varying EEG correlations improve automated neonatal seizure detection
.
Int J Neural Syst.
2019
;
29
(
4
):
1850030
. PubMed doi: 10.1142/S0129065718500302
31
Vladescu
JC
,
Schnell
LK
,
Day-Watkins
J
.
Infant positioning: a brief review
.
J Appl Behav Anal.
2020
;
53
(
3
):
1237
1241
. PubMed doi: 10.1002/jaba.746
32
Bourgeois
FT
,
Espinoza
JC
.
Advancing equity in medical device development for children
.
JAMA Pediatr.
2023
;
177
(
6
):
561
562
. PubMed doi: 10.1001/jamapediatrics.2023.0790
33
Smith
BA
,
Vanderbilt
DL
,
Applequist
B
,
Kyvelidou
A
.
Sample entropy identifies differences in spontaneous leg movement behavior between infants with typical development and infants at risk of developmental delay
.
Technologies (Basel).
2017
;
5
(
3
):
55
. PubMed doi: 10.3390/technologies5030055
34
Wilson
RB
,
Vangala
S
,
Elashoff
D
,
Safari
T
,
Smith
BA
.
Using wearable sensor technology to measure motion complexity in infants at high familial risk for autism spectrum disorder
.
Sensors (Basel).
2021
;
21
(
2
):
616
. PubMed doi: 10.3390/s21020616
35
Franchak
JM
,
Scott
V
,
Luo
C
.
A contactless method for measuring full-day, naturalistic motor behavior using wearable inertial sensors
.
Front Psychol.
2021
;
12
:
701343
. PubMed doi: 10.3389/fpsyg.2021.701343
36
Tamis-LeMonda
CS
.
The mountain stream of infant development
.
Infancy.
2023
;
28
(
3
):
468
491
. PubMed doi: 10.1111/infa.12538
37
Canali
S
,
Schiaffonati
V
,
Aliverti
A
.
Challenges and recommendations for wearable devices in digital health: data quality, interoperability, health equity, fairness
.
PLOS Digit Health.
2022
;
1
(
10
):
e0000104
. PubMed doi: 10.1371/journal.pdig.0000104
38
Espinoza
J
,
Shah
P
,
Nagendra
G
,
Bar-Cohen
Y
,
Richmond
F
.
Pediatric medical device development and regulation: current state, barriers, and opportunities
.
Pediatrics.
2022
;
149
(
5
):
e2021053390
. PubMed doi: 10.1542/peds.2021-053390
39
Moore
C
,
Dailey
S
,
Garrison
H
,
Amatuni
A
,
Bergelson
E
.
Point, walk, talk: links between three early milestones, from observation and parental report
.
Dev Psychol.
2019
;
55
(
8
):
1579
1593
. PubMed doi: 10.1037/dev0000738
40
Gallen
A
,
Taylor
E
,
Salmi
J
et al
.
Early gross motor performance is associated with concurrent prelinguistic and social development
.
Pediatr Res.
2025
. PubMed doi: 10.1038/s41390-025-03832-5
41
Hoch
JE
,
O’Grady
SM
,
Adolph
KE
.
It’s the journey, not the destination: locomotor exploration in infants
.
Dev Sci.
2019
;
22
(
2
):
e12740
. PubMed doi: 10.1111/desc.12740
42
van Dommelen
P
,
van Dijk
O
,
de Wilde
JA
,
Verkerk
PH
.
Early developmental milestones in Duchenne muscular dystrophy
.
Dev Med Child Neurol.
2020
;
62
(
10
):
1198
1204
. PubMed doi: 10.1111/dmcn.14623
43
Baumer
N
,
DePillis
R
,
Pawlowski
K
,
Zhang
B
,
Mazumdar
M
.
Developmental milestones for children with Down syndrome
.
Pediatrics.
2024
;
154
(
4
):
e2023065402
. PubMed doi: 10.1542/peds.2023-065402
44
World Health Organization
.
Guidance for best practices for clinical trials
. Published
2024
. Accessed October 17, 2024. https://www.who.int/publications/i/item/9789240097711
This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits noncommercial distribution and reproduction in any medium, provided the original author and source are credited.

Supplementary data