Video Abstract

Video Abstract

Close modal
CONTEXT

Approximately 10% to 20% of patients with Kawasaki disease (KD) are refractory to initial intravenous immunoglobulin (IVIG) therapy. KD is mainly associated with coronary artery abnormalities.

OBJECTIVES

To identify and evaluate all developed prediction models for IVIG resistance in patients with KD and synthesize evidence from external validation studies that evaluated their predictive performances.

DATA SOURCES

PubMed Medline, Dialog Embase, the Cochrane Central Register of Controlled Trials, the World Health Organization International Clinical Trials Registry Platform, and ClinicalTrials.gov were searched from inception until October 5, 2021.

STUDY SELECTION

All cohort studies that reported patients diagnosed with KD who underwent an initial IVIG of 2 g/kg were selected.

DATA EXTRACTION

Study and patient characteristics and model performance measures. Two authors independently extracted data from the studies.

RESULTS

The Kobayashi, Egami, Sano, Formosa, and Harada scores were the only prediction models with 3 or more external validation of the161 model analyses in 48 studies. The summary C–statistics were 0.65 (95% confidence interval [CI]: 0.57–0.73), 0.63 (95% CI: 0.55–0.71), 0.58 (95% CI: 0.55–0.60), 0.50 (95% CI: 0.36–0.63), and 0.63 (95% CI: 0.44–0.78) for the Kobayashi, Egami, Sano, Formosa, and Harada models, respectively. All 5 models showed low positive predictive values (0.14–0.39) and high negative predictive values (0.85–0.92).

LIMITATIONS

Potential differences in the characteristics of the target population among studies and lack of assessment of calibrations.

CONCLUSIONS

None of the 5 prediction models with external validation accurately distinguished between patients with and without IVIG resistance.

Kawasaki disease (KD) is an acute autoimmune systemic vasculitis that is typically self-limited. However, vasculitis occurs mainly in small- and medium-sized vessels, causing coronary artery abnormalities (CAAs) in 5% to 7% of children who receive standard treatment.1  This sometimes leads to life-threatening complications, such as myocardial infarction, arrhythmia, and heart failure.

High–dose intravenous immunoglobulin (IVIG) therapy is a well-established treatment of KD.24  However, approximately 10% to 20% of patients with KD are refractory to this treatment and develop persistent or recurrent fever associated with CAA development after initial IVIG therapy.2,5,6  Clinicians may consider adjuvant therapy with additional initial IVIG therapy for patients at high-risk for IVIG resistance and subsequent CAA.2,4  A systematic review and meta-analysis revealed that combining corticosteroids and initial IVIG therapy for such patients was more effective in preventing CAAs than IVIG alone.7  Thus, reliable IVIG resistance prediction could contribute to decision-making for initial treatment and help prevent CAA.

However, IVIG resistance prediction to reduce the CAA incidence is challenging. Three risk prediction models from Japan810  have been developed and externally validated. These scoring models are considered useful in Japan3  but were less beneficial in non-Japanese populations.11  There are other new prediction models besides these 3 models; however, their validity requires further examination.12,13  Therefore, this systematic review aimed to identify all available developed prediction models to predict IVIG resistance in patients with KD and synthesize available evidence from external validation studies evaluating their predictive performances.

This systematic review and meta-analysis followed the Preferred Reporting Item for Systematic Review and Meta-analysis (PRISMA) guidelines.14  We developed a detailed protocol and registered our study with the International Prospective Register of Systematic Reviews (PROSPERO: CRD42021283276).

Types of Studies

We included all prospective and retrospective cohort studies. We did not apply language or country restrictions. We included all papers, including published articles, unpublished articles, conference abstracts, and letters. We did not include case-control studies (including nested case-control), case reports, and case series. We did not exclude studies based on the observation period or publication year.

Target Population

We included studies reporting any patients diagnosed with KD and undergoing initial IVIG therapy. Studies including participants with initial IVIG doses other than the standard treatment of 2 g/kg24  or those without reported IVIG doses and studies including patients using drugs other than aspirin concurrently with initial IVIG therapy were excluded.

Types of Prognostic and Predictive Models

We included studies aiming to develop, validate (internally or externally), or update prediction models for initial IVIG resistance in patients with KD and prediction models using available predictors in clinical practice before starting IVIG administration. We a priori decided to include models that were not originally developed to predict IVIG resistance but commonly used or validated to predict the outcome. We excluded studies aiming to establish independent prognostic factors associated with IVIG resistance and not to develop a prognostic model by combining 2 or more predictors.

Types of Outcomes to be Predicted

IVIG resistance is generally defined as a recrudescent or persistent fever at least 36 h2  or 24 h3,8  after IVIG administration completion. Other definitions, such as the attending physician’s judgment or a recrudescent or persistent fever for at least 48 h, were also accepted.

Search Methods for Study Identification

We searched the following databases: Medline (PubMed), the Cochrane Central Register of Controlled Trials (Cochrane Library) and Embase (Dialog) from inception to October 2021. For the assessment of reporting bias, we also searched the following databases for ongoing or unpublished trials: the World Health Organization International Clinical Trials Platform Search Portal and ClinicalTrials.gov. Supplemental Table 2 shows the details of search terms for each database. We checked the reference lists of studies, including international guidelines, such as American Heart Association,2  JCS/JSCS 2020,3  and SHARE recommendation,4  and the reference lists of eligible studies and articles that cite eligible studies. The authors of the original studies were contacted when a study was uncertain to be eligible.

Selection of Studies

Two of 6 independent reviewers (Y.K., M.B., S.T., T.A., N.T., and H.T.) screened titles and abstracts, followed by an eligibility assessment based on the full texts. We contacted the original authors if relevant data are missing. Disagreements between the 2 reviewers were resolved by discussion, with a third reviewer as an arbiter for failed discussion. Study selection was documented in a detailed flowchart based on the PRISMA guidelines.14 

Data Extraction and Management

Two of 6 independent reviewers (Y.K., M.B., S.T., T.A., N.T., and H.T.) extracted data from the selected articles following the critical appraisal and data extraction for systematic reviews of prediction modeling studies checklist.15  We categorized the types of model analyses as follows: development model analysis with or without internal validation, validation analysis with a dataset that combines the original dataset for derivation and another dataset, internal-external validation, and external validation analysis.16  The internal-external validation analysis was defined as a validation using a dataset unused for model development in the same study. External validation analysis used datasets from a different study. Discrimination (C–statistic), calibration (observed–to–expected [OE] ratio, calibration–in–the–large and slope, calibration plot, and Hosmer–Lemeshow test), and classification (sensitivity, specificity, positive predictive value [PPV] and negative predictive value [NPV]) were separately extracted for each model if a study reported 2 or more models with different predictors or prediction rules. Cut-offs of 3,17  4,8  or 53,18  points were used for external validation of Kobayashi scores. Studies without the cutoff values were classified as 4 or 5 points based on cited articles.8,18 

Risk of Bias and Applicability Assessment

We used the Prediction model Risk of Bias Assessment Tool (PROBAST) to assess the risk of bias and applicability of each model analysis.19 

Two of 6 independent reviewers (Y.K., M.B., S.T., T.A., N.T., and H.T.) independently assessed these individual studies. The questions assessed the risk of bias for all analyses reported in the selected articles following the 4 domains, namely, participants, predictors, outcome, and analysis. The applicability was assessed through the questions following the 3 domains, namely, participants, predictors, and outcome. We evaluated each domain for risk of bias as “high-risk of bias,” “unclear risk of bias,” “low-risk of bias,” and for applicability as “high concern,” “unclear concern,” and “low concern.” An overall judgment was evaluated as “high-risk of bias” or “high concern” if at least 1 domain of risk of bias or applicability was “high.” Any disagreements were resolved by discussion, with a third reviewer as an arbiter for failed discussion.

In general, prediction models show poorer performance in external validation studies than in their original developmental studies.20  Thus, external validations in a wholly independent sample are warranted to assess the reproducibility and generalizability of newly developed prediction models.21  We planned to perform meta-analyses of C–statistic, OE ratio, or PPV/NPV using the external validation analyses results if more than 3 studies that externally validated the same model were identified. We computed the summary C–statistic for discrimination and PPV and NPV for classification with a 95% confidence interval (CI). If both SE and 95% CI in C–statistic was not reported, we calculated them using the methods presented below (https://www.lerner.ccf.org/qhs/):
c is the C–statistic, n1 is the number of IVIG resistance, and n2 is the number of IVIG responders.

We applied a random-effects model that pools logit transformations of C–statistics. We adopted the restricted maximum likelihood estimation and the Hartung–Knapp–Sidik–Jonkman.22  We used an inverse-variance-weighted random-effects model with the DerSimonian-Laird estimator to estimate the between-study variance and normal approximation intervals based on summary measures to calculate CIs for individual study results and estimate the pooled PPV and NPV.23  To stabilize variances, we transformed the data with the Freeman–Tukey double arcsine transformation.

We computed the 95% prediction interval, which provided a range for the potential performance of a model in a new validation study. We also computed I2 and τ2 statistics to quantify the extent of the heterogeneity. All statistical analyses were performed using R software (version 4·0·3) including “meta,” “metamisc,” and “metafor” packages.

Study location (East Asia, Middle East Asia, Europe, or North America) or timing of outcome definition (recrudescent or persistent fever at least 24, 36, or 48 h after IVIG administration completion) subgroups were analyzed to explore the cause and extent of heterogeneity.

Figure 1 displays a PRISMA 2020 flowchart detailing the study selection process, including the reason for exclusion. We screened 2043 records after removing duplicates and assessed 193 full texts. The clinical trial registry system screened 27 records, and no records were included. We excluded 25 studies having participants with initial IVIG doses other than 2 g/kg or those without reported IVIG doses,2426  and 9 studies with patients using drugs other than aspirin concurrently with initial IVIG therapy (Supplemental Table 3). Finally, the analyses included 48 studies.813,17,18,2766 

FIGURE 1

Preferred reporting items for systematic reviews and meta-analyses diagram of literature search and study selection. Supplemental Table 2 lists the excluded studies with reasons.

FIGURE 1

Preferred reporting items for systematic reviews and meta-analyses diagram of literature search and study selection. Supplemental Table 2 lists the excluded studies with reasons.

Close modal

Supplemental Table 4 shows the characteristics of the 48 included studies that evaluated 161 model analyses. Of which, 35, 6, 4, and 3 were reported from East Asia, Middle East Asia, North America, and Europe, respectively. Calibration performance was unavailable in most studies.

We applied the PROBAST tool to the 161 model analyses (Supplemental Fig 5, Supplemental Table 5). All development model analyses were classified as an overall “high-risk of bias.” All analyses were rated “high-risk of bias” in the analysis domain because of (1) insufficient number of outcome events relative to the parameters among candidate predictors (low events per variable), (2) inappropriate missing data handling, (3) inappropriate selection, and (4) no calibration. Most development model analyses were classified as “low concerns of applicability” (94%; 48 of 51).

All external validation analyses were classified as “high-risk of bias” because of (1) inappropriate missing data handling and (2) no calibration. Approximately two-thirds (74 of 110) of external validation analyses were classified as “low concern of applicability.”

Models with 3 or more external validation analyses in different studies are the Kobayashi, Egami, Sano, Formosa, and Harada scores (Supplemental Table 6). The Kobayashi,8  Egami,9  Sano,10  and Harada67  scores were developed in Japan, whereas the Formosa37  score was developed in Taiwan. The Harada Score67  was originally developed in 1961, not to predict IVIG resistance, but to predict CAA development and determine whether to implement IVIG therapy. However, several studies have evaluated the external validity of the Harada score as a model for predicting IVIG resistance; thus, it was included in the evaluation in this review. Table 1 presents a result summary of the external validation analyses of the 5 prediction models. The included analyses in the meta-analysis revealed a 14% (12% to 20%) median (interquartile range) prevalence of IVIG resistance.

TABLE 1

Prognostic Models for Predicting the Resistance of Initial IVIG Therapy in Patients With KD

ModelMeasureNo of Studies (IVIG Resistance %, Median [IQR]) No of PatientsSummary MeasureRisk Group, No. of Patients IVIG Resistance, n (%)Pooled Result (95% CI)Overall Risk of BiasComments and Predictors
Kobayashi score Discrimination 13 studies C–statistic NA 0.65 (0.57–0.73) High Age ≤ 12 months, 1 point
Illness days ≤ 4 days, 2 points
CRP ≥ 10 mg/dL, 1 point
AST ≥ 100 IU/L, 2 points
PLT < 300,000/mm3, 1 point
Na ≤ 133 mmol/L, 2 points 
 Classification; Cutoff, 5 points 5 studies (22 [15–24])
655 patients 
PPV High-risk (score, 5–11); 197
77 (39) 
0.39 (0.32–0.46) High 
   NPV Low-risk (score, 0–4); 458
71 (16) 
0.85 (0.81–0.89) High 
 Classification;
Cutoff, 4 points 
18 studies (14 [12–17])
6097 patients 
PPV High (score, 4–11); 1,561
483 (31) 
0.21 (0.12–0.31) High 
   NPV Low (score, 0–3); 4,356
442 (10) 
0.89 (0.84–0.92) High 
Egami score Discrimination 12 studies C–statistic NA 0.63 (0.55–0.71) High Age ≤ 6 months, 1 point
Illness days ≤ 4 days, 1 point
CRP ≥ 8 mg/dL, 1 point
ALT ≥ 80 IU/L, 2 points
PLT ≤ 300,000/mm3, 1 point 
 Classification;
Cutoff, 3 points 
20 studies (15 [12–21])
6723 patients 
PPV High (score, 3–6); 1,580
485 (31) 
0.27 (0.18–0.37) High 
   NPV Low (score, 0–2); 5,143
571 (11) 
0.88 (0.83–0.91) High 
Sano score Discrimination 7 studies C–statistic NA 0.58 (0.55–0.60) High CRP ≥ 7.0 mg/dL, 1 point
TB ≥ 0.9 mg/dL, 1 point
AST ≥ 200 IU/L, 1 point 
 Classification;
Cutoff, 2 points 
15 studies (17 [12–24])
3,875 patients 
PPV High (score, 2–3); 527
208 (39) 
0.35 (0.26–0.45) High 
   NPV Low (score, 0–1); 3,348
594 (18) 
0.85 (0.77–0.91) High 
Formosa score Discrimination 5 studies C–statistic NA 0.50 (0.36–0.63) High Lymphadenopathy, 1 point
N% ≥ 60%, 2 points
Alb < 35 g/L, 1 point 
 Classification;
Cutoff, 3 points 
6 studies (13 [7–15])
2579 patients 
PPV High (score, 3–4); 1,244
117 (9.4) 
0.14 (0.07–0.23) High 
   NPV Low (score, 0–2); 1,335
104 (7.8) 
0.92 (0.88–0.95) High 
Harada score Discrimination 4 studies C–statistic NA 0.63 (0.44–0.78) High WBC ≥ 12,000/mm3, 1 point
PLT < 350,000/mm3, 1 point
CRP ≥ 4.0 mg/dL, 1 point
Ht < 35%, 1 point
Alb < 35 g/L, 1 point
Age < 12 months, 1 point
Male, 1 point 
 Classification;
Cutoff, 4 points 
2 studies (12% and 13%)
379 patients 
PPV High (score, 4–7); 198
33 (17) 
0.16 (0.11–0.22) High 
   NPV Low (score, 0–3); 181
15 (8.3) 
0.91 (0.79–0.99) High 
ModelMeasureNo of Studies (IVIG Resistance %, Median [IQR]) No of PatientsSummary MeasureRisk Group, No. of Patients IVIG Resistance, n (%)Pooled Result (95% CI)Overall Risk of BiasComments and Predictors
Kobayashi score Discrimination 13 studies C–statistic NA 0.65 (0.57–0.73) High Age ≤ 12 months, 1 point
Illness days ≤ 4 days, 2 points
CRP ≥ 10 mg/dL, 1 point
AST ≥ 100 IU/L, 2 points
PLT < 300,000/mm3, 1 point
Na ≤ 133 mmol/L, 2 points 
 Classification; Cutoff, 5 points 5 studies (22 [15–24])
655 patients 
PPV High-risk (score, 5–11); 197
77 (39) 
0.39 (0.32–0.46) High 
   NPV Low-risk (score, 0–4); 458
71 (16) 
0.85 (0.81–0.89) High 
 Classification;
Cutoff, 4 points 
18 studies (14 [12–17])
6097 patients 
PPV High (score, 4–11); 1,561
483 (31) 
0.21 (0.12–0.31) High 
   NPV Low (score, 0–3); 4,356
442 (10) 
0.89 (0.84–0.92) High 
Egami score Discrimination 12 studies C–statistic NA 0.63 (0.55–0.71) High Age ≤ 6 months, 1 point
Illness days ≤ 4 days, 1 point
CRP ≥ 8 mg/dL, 1 point
ALT ≥ 80 IU/L, 2 points
PLT ≤ 300,000/mm3, 1 point 
 Classification;
Cutoff, 3 points 
20 studies (15 [12–21])
6723 patients 
PPV High (score, 3–6); 1,580
485 (31) 
0.27 (0.18–0.37) High 
   NPV Low (score, 0–2); 5,143
571 (11) 
0.88 (0.83–0.91) High 
Sano score Discrimination 7 studies C–statistic NA 0.58 (0.55–0.60) High CRP ≥ 7.0 mg/dL, 1 point
TB ≥ 0.9 mg/dL, 1 point
AST ≥ 200 IU/L, 1 point 
 Classification;
Cutoff, 2 points 
15 studies (17 [12–24])
3,875 patients 
PPV High (score, 2–3); 527
208 (39) 
0.35 (0.26–0.45) High 
   NPV Low (score, 0–1); 3,348
594 (18) 
0.85 (0.77–0.91) High 
Formosa score Discrimination 5 studies C–statistic NA 0.50 (0.36–0.63) High Lymphadenopathy, 1 point
N% ≥ 60%, 2 points
Alb < 35 g/L, 1 point 
 Classification;
Cutoff, 3 points 
6 studies (13 [7–15])
2579 patients 
PPV High (score, 3–4); 1,244
117 (9.4) 
0.14 (0.07–0.23) High 
   NPV Low (score, 0–2); 1,335
104 (7.8) 
0.92 (0.88–0.95) High 
Harada score Discrimination 4 studies C–statistic NA 0.63 (0.44–0.78) High WBC ≥ 12,000/mm3, 1 point
PLT < 350,000/mm3, 1 point
CRP ≥ 4.0 mg/dL, 1 point
Ht < 35%, 1 point
Alb < 35 g/L, 1 point
Age < 12 months, 1 point
Male, 1 point 
 Classification;
Cutoff, 4 points 
2 studies (12% and 13%)
379 patients 
PPV High (score, 4–7); 198
33 (17) 
0.16 (0.11–0.22) High 
   NPV Low (score, 0–3); 181
15 (8.3) 
0.91 (0.79–0.99) High 

Population targeted: Patients diagnosed with KD and undergoing an initial 2 g/kg of IVIG therapy. Index model: Multivariable prediction models with at least three external validation analyses. Outcome to be predicted: IVIG resistance. Timing: Timing of making the prediction is just before IVIG administration. Period of the prediction is approximately 24, 36, or 48 h after IVIG completion. Setting: Not specified. NA, not applicable.

Kobayashi Score3,8,18 

This meta-analysis included 35 studies and 33 external validation analyses (Supplemental Table 6). Figure 2A shows low discrimination (C–statistic: 0.65; 95% CI: 0.57–0.73), without differences in the C–statistic between studies from East and Middle East Asia (Supplemental Table 7). Europe has only 1 and North America has no report. Additionally, the C–statistic between studies with different IVIG response determination timing has no difference (Supplemental Table 7). PPV was higher at the cutoff score of 5 points (0.39; 95% CI: 0.32–0.46) (Fig 3A), whereas NPV was higher at the cutoff score of 4 points (0.89; 95% CI: 0.84–0.92) (Fig 4B).

FIGURE 2

Forest plot of C–statistic in external validation analyses. (A) Kobayashi score; (B) Egami score; (C) Sano score; (D) Formosa score; (E) Harada score; 95% CI: 95% confidence interval.

FIGURE 2

Forest plot of C–statistic in external validation analyses. (A) Kobayashi score; (B) Egami score; (C) Sano score; (D) Formosa score; (E) Harada score; 95% CI: 95% confidence interval.

Close modal
FIGURE 3

Forest plot of positive predictive value in external validation analyses. (A) Kobayashi score (≥5); (B) Kobayashi score (≥4); (C) Egami score; (D) Sano score; (E) Formosa score; (F) Harada score.

FIGURE 3

Forest plot of positive predictive value in external validation analyses. (A) Kobayashi score (≥5); (B) Kobayashi score (≥4); (C) Egami score; (D) Sano score; (E) Formosa score; (F) Harada score.

Close modal
FIGURE 4

Forest plot of negative predictive value in external validation analyses. (A) Kobayashi score (≥5); (B) Kobayashi score (≥4); (C) Egami score; (D) Sano score; € Formosa score; (F) Harada score.

FIGURE 4

Forest plot of negative predictive value in external validation analyses. (A) Kobayashi score (≥5); (B) Kobayashi score (≥4); (C) Egami score; (D) Sano score; € Formosa score; (F) Harada score.

Close modal

Egami Score3,9 

This review included 31 studies and 30 external validation analyses (Supplemental Table 6). Figure 2B shows low discrimination (C–statistic: 0.63; 95% CI: 0.55–0.71), without differences in the C–statistic between studies from East and Middle East Asia (Supplemental Table 7). Europe has only 1 and North America has no report. Additionally, the C–statistic between studies with differentIVIG response determination timing has no difference (Supplemental Table 7). Figure 3C and Fig 4C shows low PPV (0.27; 95% CI: 0.18–0.37) and high NPV (0.88; 95% CI: 0.83–0.91).

Sano Score3,10 

This review included 21 studies and 20 external validation analyses (Supplemental Table 6). Figure 2C shows low discrimination (C–statistic: 0.58; 95% CI: 0.55–0.60), without differences in the C–statistic between studies from East Asia and Middle East Asia (Supplemental Table 7). Europe and North America have no reports. Additionally, the C–statistic between studies with different IVIG response determination timing has no difference (Supplemental Table 7). Figure 3D and Fig 4D shows low PPV (0.35; 95% CI: 0.26–0.45) and high NPV (0.85; 95% CI: 0.77–0.91).

Formosa Score37 

This review included 9 studies and eight external validation analyses (Supplemental Table 6). Figure 2D shows low discrimination (C–statistic: 0.50; 95% CI: 0.36–0.63), without differences in the C–statistic between studies from East Asia and Middle East Asia (Supplemental Table 7). Europe has only 1 and North America has no reports. In all studies, IVIG response was determined at 36 h after the end of administration (Supplemental Table 7). Figure 3E and Fig 4E shows low PPV (0.14; 95% CI: 0.07–0.23) and high NPV (0.92; 95% CI: 0.88–0.95).

Harada Score67 

This review included 5 studies and 5 external validation analyses (Supplemental Table 6). Figure 2E shows low discrimination (C–statistic: 0.63; 95% CI: 0.44–0.78), without differences in the C–statistic between studies from East Asia and Middle East Asia (Supplemental Table 7). Europe and North America have no reports. Additionally, the C–statistic between studies with different IVIG response determination timing has no difference (Supplemental Table 7). Figure 3F and Fig 4F shows low PPV (0.16; 95% CI: 0.11–0.22) and high NPV (0.91; 95% CI: 0.79–0.99).

This systematic review of 161 model analyses in 48 studies summarized the currently available evidence on performing prediction models for initial IVIG resistance in patients with KD. Our meta-analyses suggested that none of the prediction models have properly distinguished between patients with and without IVIG resistance.

None of the 5 models that were externally validated could not discriminate well. The summary C–statistics, which integrates the external validation results, was <0.75,22  although the Kobayashi score has the highest C–statistic of 0.65. The poor performance of the existing 5 predictive models may be attributed to the fact that KD is a heterogeneous syndrome. KD is speculated to occur as a result of immune responses to a variety of infectious etiologies, including viral, fungal, and bacterial agents, as well as environmental stimuli.68,69  Genetic factors and racial differences may also lead to variation in clinical symptoms and laboratory parameters, thereby making these predictive models difficult to apply to populations other than the development cohort.

The existing prediction models showed NPV of 85% to 92% and PPV of 14% to 39%, indicating that approximately 10% to 20% of patients classified as low-risk are IVIG resistant, whereas only 15% to 40% of patients classified as high-risk do need adjuvant therapy with additional initial IVIG therapy. The clinically acceptable NPV and PPV on this research topic have no established consensus. Thus, the optimum cutoff point for the existing models should be based on the clinical context in each case. For instance, higher NPV (ie, missing fewer patients with IVIG resistance) would be desirable in patients with a severe condition. A lower cutoff point in the existing prediction models might be more appropriate for such patients. However, the model performance in the studies included in the present review was generally assessed only using a cutoff point that was reported (and arbitrarily determined) in the original developmental studies. Therefore, the model’s performance could not be evaluated with different cutoff points.

In clinical settings, collectively considering the prediction model score and other independent predictors might be practical to make an overall decision,34  because a highly reliable prediction model remained unavailable. Each model had a higher rate of IVIG resistance in the high-risk group than in the low-risk group. The higher the point in the model, the higher the risk of IVIG resistance in the Kobayashi and Formosa scores. For example, a score of 8 on the Kobayashi score is associated with a higher-risk IVIG resistance than a score of 5. Rather than only focusing on the dichotomized judgment (ie, positive or negative) of the prediction models, it is more informative to refer to the score itself. Several predictors as a sole predictor have also been reported.70,71  Furthermore, the development and validation of IVIG adjuvant therapy as the standard first-line therapy, which is more effective in preventing CAA formation than conventional IVIG therapy, may also be warranted if developing a prediction model with good performance is difficult.

Model performance for studies reported from East Asia and Middle East Asia was comparable. Contrastingly, model performance from East Asia and Europe or North America is not comparable because of the limited studies incorporated in this review. This was inconsistent with the previous indications that models developed in Japan might have performance differences between Japanese and non-Japanese patients. Recent studies from the United Kingdom,24  Germany,25  and Spain26  demonstrated that the prediction models developed by the Japanese research group were not as effective for non-Japanese patients. However, these studies did not describe the IVIG dose and administration method. Therefore, further studies with well-defined IVIG doses and administration methods are needed to clarify differences in model performance by race or study area.

This was the first systematic review of prediction models’ performance for initial IVIG resistance in patients with KD. We included 48 studies and performed meta-analyses of the 5 different prediction models using only the external validation results in different studies. Referring to the critical appraisal and data extraction for systematic reviews of prediction modeling studies checklist and PROBAST tool, we could extract and assess detailed data on study characteristics and model performance metrics.

This study had some limitations. First, the effects of differences in inclusion criteria between studies, such as the timing of IVIG administration or complete or incomplete type, were not accounted for. Potential differences in the prediction model’s performance between different timings of initial treatment7  and between complete and incomplete KD have been suggested.35  Second, the calibration, such as OE ratios, could not be assessed because they are not reported in external validation analyses.

The external validity remains unknown for most existing prediction models to predict IVIG resistance in patients with KD. Additionally, externally validated models have no sufficient performance to distinguish between patients with and without IVIG resistance. Therefore, further studies are required to develop and validate prediction models with reliable performance.

We are very grateful to Dr. Jun Watanabe, Department of Surgery, Division of Gastroenterological, General and Transplant Surgery, Jichi Medical University, for providing support to our research; and Enago (www.enago.jp) for their English language review.

COMPANION PAPER: A companion to this article can be found online at http://www.pediatrics.org/cgi/doi/10.1542/peds.2022-060423.

Dr Kuniyoshi contributed to the conceptualization, study design, search strategy development, screening, data extraction, analysis, and writing (original draft preparation, review, and editing); Drs Tsujimoto and Takada contributed to the study design, search strategy development, supervision, and writing (review and editing); Drs Banno, Taito, and Ariie contributed to the study design, search strategy development, screening, data extraction, and writing (review and editing); Drs Takahashi and Tokutake contributed to screening, data extraction, and writing (review and editing); and all authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work.

FUNDING: No external funding.

CONFLICT OF INTEREST DISCLOSURES: The authors have indicated they have no conflicts of interest relevant to this article to disclose.

CAA

coronary artery abnormality

CI

confidence interval

IVIG

intravenous immunoglobulin

KD

Kawasaki disease

NPV

negative predictive value

OE

observed–to–expected

PRISMA

Preferred Reporting Item for Systematic Review and Meta-analysis

PROBAST

Prediction model Risk of Bias Assessment Tool

PPV

positive predictive value

1
Noval Rivas
M
,
Arditi
M
.
Kawasaki disease: pathophysiology and insights from mouse models
.
Nat Rev Rheumatol
.
2020
;
16
(
7
):
391
405
2
McCrindle
BW
,
Rowley
AH
,
Newburger
JW
, et al;
American Heart Association Rheumatic Fever, Endocarditis, and Kawasaki Disease Committee of the Council on Cardiovascular Disease in the Young
;
Council on Cardiovascular and Stroke Nursing
;
Council on Cardiovascular Surgery and Anesthesia
;
and Council on Epidemiology and Prevention
.
Diagnosis, treatment, and long-term management of Kawasaki disease: a scientific statement for health professionals from the American Heart Association
.
Circulation
.
2017
;
135
(
17
):
e927
e999
3
Fukazawa
R
,
Kobayashi
J
,
Ayusawa
M
, et al;
Japanese Circulation Society Joint Working Group
.
JCS/JSCS 2020 guideline on diagnosis and management of cardiovascular sequelae in Kawasaki disease
.
Circ J
.
2020
;
84
(
8
):
1348
1407
4
de Graeff
N
,
Groot
N
,
Ozen
S
, et al
.
European consensus-based recommendations for the diagnosis and treatment of Kawasaki disease - the SHARE initiative
.
Rheumatology (Oxford)
.
2019
;
58
(
4
):
672
682
5
Yan
F
,
Pan
B
,
Sun
H
,
Tian
J
,
Li
M
.
Risk factors of coronary artery abnormality in children with Kawasaki disease: a systematic review and meta-analysis
.
Front Pediatr
.
2019
;
7
:
374
6
Makino
N
,
Nakamura
Y
,
Yashiro
M
, et al
.
Nationwide epidemiologic survey of Kawasaki disease in Japan, 2015-2016
.
Pediatr Int
.
2019
;
61
(
4
):
397
403
7
Chen
S
,
Dong
Y
,
Kiuchi
MG
, et al
.
Coronary artery complication in Kawasaki disease and the importance of early intervention: a systematic review and meta-analysis
.
JAMA Pediatr
.
2016
;
170
(
12
):
1156
1163
8
Kobayashi
T
,
Inoue
Y
,
Takeuchi
K
, et al
.
Prediction of intravenous immunoglobulin unresponsiveness in patients with Kawasaki disease
.
Circulation
.
2006
;
113
(
22
):
2606
2612
9
Egami
K
,
Muta
H
,
Ishii
M
, et al
.
Prediction of resistance to intravenous immunoglobulin treatment in patients with Kawasaki disease
.
J Pediatr
.
2006
;
149
(
2
):
237
240
10
Sano
T
,
Kurotobi
S
,
Matsuzaki
K
, et al
.
Prediction of non-responsiveness to standard high-dose gamma-globulin therapy in patients with acute Kawasaki disease before starting initial treatment
.
Eur J Pediatr
.
2007
;
166
(
2
):
131
137
11
Sleeper
LA
,
Minich
LL
,
McCrindle
BM
, et al;
Pediatric Heart Network Investigators
.
Evaluation of Kawasaki disease risk-scoring systems for intravenous immunoglobulin resistance
.
J Pediatr
.
2011
;
158
(
5
):
831
835.e3
12
Tremoulet
AH
,
Best
BM
,
Song
S
, et al
.
Resistance to intravenous immunoglobulin in children with Kawasaki disease
.
J Pediatr
.
2008
;
153
(
1
):
117
121
13
Yang
S
,
Song
R
,
Zhang
J
,
Li
X
,
Li
C
.
Predictive tool for intravenous immunoglobulin resistance of Kawasaki disease in Beijing
.
Arch Dis Child
.
2019
;
104
(
3
):
262
267
14
Page
MJ
,
McKenzie
JE
,
Bossuyt
PM
, et al
.
The PRISMA 2020 statement: an updated guideline for reporting systematic reviews
.
BMJ
.
2021
;
372
:
n71
15
Moons
KGM
,
de Groot
JAH
, %
Bouwmeester
W
, et al
.
Critical appraisal and data extraction for systematic reviews of prediction modelling studies: the CHARMS checklist
.
PLoS Med
.
2014
;
11
(
10
):
e1001744
16
Steyerberg
EW
,
Harrell
FE
Jr
.
Prediction models need appropriate internal, internal-external, and external validation
.
J Clin Epidemiol
.
2016
;
69
:
245
247
17
Qian
W
,
Tang
Y
,
Yan
W
,
Sun
L
,
Lv
H
.
A comparison of efficacy of six prediction models for intravenous immunoglobulin resistance in Kawasaki disease
.
Ital J Pediatr
.
2018
;
44
(
1
):
33
18
Seki
M
,
Kobayahsi
T
,
Kobayashi
T
, et al
.
External validation of a risk score to predict intravenous immunoglobulin resistance in patients with Kawasaki disease
.
Pediatr Intfect Dis J
.
2011
;
30
(
2
):
145
147
19
Wolff
RF
,
Moons
KGM
,
Riley
RD
, et al;
PROBAST Group
.
PROBAST: a tool to assess the risk of bias and applicability of prediction model studies
.
Ann Intern Med
.
2019
;
170
(
1
):
51
58
20
Siontis
GCM
,
Tzoulaki
I
,
Castaldi
PJ
,
Ioannidis
JPA
.
External validation of new risk prediction models is infrequent and reveals worse prognostic discrimination
.
J Clin Epidemiol
.
2015
;
68
(
1
):
25
34
21
Ramspek
CL
,
Jager
KJ
,
Dekker
FW
, %
Zoccali
C
,
van Diepen
M
.
External validation of prognostic models: what, why, how, when and where?
Clin Kidney J
.
2020
;
14
(
1
):
49
58
22
Debray
TPA
,
Damen
JAAG
,
Snell
KIE
, et al
.
A guide to systematic review and meta-analysis of prediction model performance
.
BMJ
.
2017
;
356
:
i6460
23
Naike
W
.
How to conduct a meta- analysis of proportions in R: a comprehensive tutorial
.
24
Davies
S
,
Sutton
N
,
Blackstock
S
, et al
.
Predicting IVIG resistance in UK Kawasaki disease
.
Arch Dis Child
.
2015
;
100
(
4
):
366
368
25
Jakob
A
,
von Kries
R
,
Horstmann
J
, et al
.
Failure to predict high-risk Kawasaki disease patients in a population-based study cohort in Germany
.
Pediatr Infect Dis J
.
2018
;
37
(
9
):
850
855
26
Sánchez-Manubens
J
,
Antón
J
,
Bou
R
, et al;
Kawasaki Disease in Catalonia Working Group
.
Role of the Egami score to predict immunoglobulin resistance in Kawasaki disease among a Western Mediterranean population
.
Rheumatol Int
.
2016
;
36
(
7
):
905
910
27
Kobayashi
T
,
Inoue
Y
,
Tamura
K
, %
Morikawa
A
,
Kobayashi
T
.
External validation of a scoring system to predict resistance to intravenous immunoglobulin
.
J Pediatr
.
2007
;
150
(
4
):
e37
,
author reply e38
28
Kobayashi
T
,
Inoue
Y
,
Otani
T
, et al
.
Risk stratification in the decision to include prednisolone with intravenous immunoglobulin in primary therapy of Kawasaki disease
.
Pediatr Infect Dis J
.
2009
;
28
(
6
):
498
502
29
Jibiki
T
,
Kato
I
,
Shiohama
T
, et al
.
Intravenous immune globulin plus corticosteroids in refractory Kawasaki disease
.
Pediatr Int
.
2011
;
53
(
5
):
729
735
30
Nomura
Y
,
Arata
M
,
Masuda
K
, et al
.
Kawasaki disease patients with six principal symptoms have a high risk of being a non-responder
.
Pediatr Int
.
2012
;
54
(
1
):
14
18
31
Ogihara
Y
,
Ogata
S
,
Ebato
T
,
Sato
K
,
Honda
T
,
Ishii
M
.
Clinical utility of 3 scoring systems in predicting resistance to intravenous immunoglobulin: a comparison study
.
Pediatr Int
.
2012
;
54
(
suppl 1
):
102
32
Park
HM
,
Lee
DW
,
Hyun
MC
,
Lee
SB
.
Predictors of nonresponse to intravenous immunoglobulin therapy in Kawasaki disease
.
Korean J Pediatr
.
2013
;
56
(
2
):
75
79
33
Choi
MH
,
Park
CS
,
Kim
DS
,
Kim
KH
.
Prediction of intravenous immunoglobulin nonresponse Kawasaki disease in Korea
.
Korean J Pediatr Infect Dis
.
2014
;
21
(
1
):
29
36
34
Yamamoto
N
,
Sato
K
,
Hoshina
T
,
Kojiro
M
,
Kusuhara
K
.
Utility of ferritin as a predictor of the patients with Kawasaki disease refractory to intravenous immunoglobulin therapy
.
Mod Rheumatol
.
2015
;
25
(
6
):
898
902
35
Kanamitsu
K
,
Kakimoto
H
,
Shimada
A
, et al
.
Verification of risk scores to predict i.v. immunoglobulin resistance in incomplete Kawasaki disease
.
Pediatr Int
.
2016
;
58
(
2
):
146
151
36
Kim
BY
,
Kim
D
,
Kim
YH
, et al
.
Non- responders to intravenous immunoglobulin and coronary artery dilatation in Kawasaki disease: predictive parameters in Korean children
.
Korean Circ J
.
2016
;
46
(
4
):
542
549
37
Lin
MT
,
Chang
CH
,
Sun
LC
, et al
.
Risk factors and derived Formosa score for intravenous immunoglobulin unresponsiveness in Taiwanese children with Kawasaki disease
.
J Formos Med Assoc
.
2016
;
115
(
5
):
350
355
38
Loomba
RS
,
Raskin
A
,
Gudausky
TM
,
Kirkpatrick
E
.
Role of the Egami score in predicting intravenous immunoglobulin resistance in Kawasaki disease among different ethnicities
.
Am J Ther
.
2016
;
23
(
6
):
e1293
e1299
39
Okuma
Y
,
Suda
K
,
Nakaoka
H
, et al
.
Serum tenascin-C as a novel predictor for risk of coronary artery lesion and resistance to intravenous immunoglobulin in Kawasaki disease – a multicenter retrospective study
.
Circ J
.
2016
;
80
(
11
):
2376
2381
40
Tang
Y
,
Yan
W
,
Sun
L
, et al
.
Prediction of intravenous immunoglobulin resistance in Kawasaki disease in an East China population
.
Clin Rheumatol
.
2016
;
35
(
11
):
2771
2776
41
Basodan
D
,
Ramanakumar
AV
,
Scuccimarri
R
.
The performance of a new risk assessment scoring system in detecting IVIG resistance in Kawasaki disease as compared to the Kobayashi and Egami scores in a large single centre Canadian cohort
.
Arthritis Rheumatol
.
2017
;
69
:
153
154
42
Berdej-Szczot
E
,
Małecka-Tendera
E
,
Gawlik
T
,
Firek-Pędras
M
,
Szydłowski
L
,
Gawlik
A
.
Risk factors of immunoglobulin resistance and coronary complications in children with Kawasaki disease
.
Kardiol Pol
.
2017
;
75
(
3
):
261
266
43
Hua
W
,
Sun
Y
,
Wang
Y
, et al
.
A new model to predict intravenous immunoglobin-resistant Kawasaki disease
.
Oncotarget
.
2017
;
8
(
46
):
80722
80729
44
Song
R
,
Yao
W
,
Li
X
.
Efficacy of four scoring systems in predicting intravenous immunoglobulin resistance in children with Kawasaki disease in a Children’s Hospital in Beijing, North China
.
J Pediatr
.
2017
;
184
:
120
124
45
Takeuchi
M
,
Inuzuka
R
,
Hayashi
T
, et al
.
Novel risk assessment tool for immunoglobulin resistance in Kawasaki disease: application using a random forest classifier
.
Pediatr Infect Dis J
.
2017
;
36
(
9
):
821
826
46
Arane
K
,
Mendelsohn
K
,
Mimouni
M
, et al
.
Japanese scoring systems to predict resistance to intravenous immunoglobulin in Kawasaki disease were unreliable for Caucasian Israeli children
.
Acta Paediatr
.
2018
;
107
(
12
):
2179
2184
47
Bar-Meir
M
,
Kalisky
I
,
Schwartz
A
,
Somekh
E
,
Tasher
D
;
Israeli Kawasaki Group
.
Prediction of resistance to intravenous immunoglobulin in children with Kawasaki disease
.
J Pediatric Infect Dis Soc
.
2018
;
7
(
1
):
25
29
48
Chantasiriwan
N
,
Silvilairat
S
,
Makonkawkeyoon
K
,
Pongprot
Y
,
Sittiwangkul
R
.
Predictors of intravenous immunoglobulin resistance and coronary artery aneurysm in patients with Kawasaki disease
.
Paediatr Int Child Health
.
2018
;
38
(
3
):
209
212
49
Arslanoglu Aydin
E
,
Ertugrul
I
,
Bilginer
Y
, et al
.
The factors affecting the disease course in Kawasaki disease
.
Rheumatol Int
.
2019
;
39
(
8
):
1343
1349
50
Fabi
M
,
Andreozzi
L
,
Corinaldesi
E
, et al
.
Inability of Asian risk scoring systems to predict intravenous immunoglobulin resistance and coronary lesions in Kawasaki disease in an Italian cohort
.
Eur J Pediatr
.
2019
;
178
(
3
):
315
322
51
Grignani
R
,
Rajgor
DD
,
Leow
YG
,
Heng
CK
,
Quek
SC
.
A novel model for predicting non-responsiveness to intravenous immunoglobulins in Kawasaki disease: the Singapore experience
.
J Paediatr Child Health
.
2019
;
55
(
8
):
962
967
52
Shao
S
,
Luo
C
,
Zhou
K
, et al
.
The role of age-specific N-terminal pro-brain atriuretic peptide cutoff values in predicting intravenous immunoglobulin resistance in Kawasaki disease: a prospective cohort study
.
Pediatr Rheumatol Online J
.
2019
;
17
(
1
):
65
53
Wu
S
,
Long
Y
,
Chen
S
, et al
.
A new scoring system for prediction of intravenous immunoglobulin resistance of Kawasaki disease in infants under 1-year old
.
Front Pediatr
.
2019
;
7
:
514
54
Avramoska
T
,
Osterman
V
,
Bizjak
M
,
Markelj
G
,
Avcin
T
,
Pokorn
M
.
Japanese predictive scores in Slovenian population of children with Kawasaki disease
.
Pediatr Rheumatol
.
2020
;
18
(
suppl 2
):
82
55
Chen
L
,
Song
S
,
Ning
Q
, et al
.
Prediction for intravenous immunoglobulin resistance combining genetic risk loci identified from next generation sequencing and laboratory data in Kawasaki disease
.
Front Pediatr
.
2020
;
8
:
462367
56
Edraki
MR
,
Mohammadi
H
,
Mehdizadegan
N
, et al
.
Japanese Kawasaki disease scoring systems: are they applicable to the Iranian population?
Arch Iran Med
.
2020
;
23
(
1
):
31
36
57
Ha
KS
,
Lee
J
,
Lee
KC
.
Prediction of intravenous immunoglobulin resistance in patients with Kawasaki disease according to the duration of illness prior to treatment
.
Eur J Pediatr
.
2020
;
179
(
2
):
257
264
58
Kanai
T
,
Takeshita
S
,
Kawamura
Y
, et al
.
The combination of the neutrophil-to-lymphocyte and platelet-to-lymphocyte ratios as a novel predictor of intravenous immunoglobulin resistance in patients with Kawasaki disease: a multicenter study
.
Heart Vessels
.
2020
;
35
(
10
):
1463
1472
59
Kuniyoshi
Y
,
Tokutake
H
,
Takahashi
N
,
Kamura
A
,
Yasuda
S
,
Tashiro
M
.
Comparison of machine learning models for prediction of initial intravenous immunoglobulin resistance in children with Kawasaki disease
.
Front Pediatr
.
2020
;
8
:
570834
60
Liu
X
,
Zhou
K
,
Hua
Y
, et al
.
Prospective evaluation of neutrophil-to-lymphocyte ratio and platelet-to-lymphocyte ratio for intravenous immunoglobulin resistance in a large cohort of Kawasaki disease patients
.
Pediatr Infect Dis J
.
2020
;
39
(
3
):
229
231
61
Öztarhan
K
,
Varlı
YZ
,
Aktay Ayaz
N
.
Usefulness of Kawasaki disease risk scoring systems to the Turkish population
.
Anatol J Cardiol
.
2020
;
24
(
2
):
97
106
62
Wang
T
,
Liu
G
,
Lin
H
.
A machine learning approach to predict intravenous immunoglobulin resistance in Kawasaki disease patients: a study based on a Southeast China population
.
PLoS One
.
2020
;
15
(
8
):
e0237321
63
Wu
S
,
Liao
Y
,
Sun
Y
, et al
.
Prediction of intravenous immunoglobulin resistance in Kawasaki disease in children
.
World J Pediatr
.
2020
;
16
(
6
):
607
613
64
Jarutach
J
,
Roymanee
S
,
Wongwaitaweewong
K
.
Verification of “Japanese scoring systems” to predict IVIG resistance and identification of predictors for IVIG resistance in Thai children with Kawasaki disease
.
Pediatr Cardiol
.
2021
;
42
(
8
):
1799
1804
65
Kaya Akca
U
,
Arslanoglu Aydin
E
,
Aykan
HH
, et al
.
Comparison of IVIG resistance predictive models in Kawasaki disease
.
Pediatr Res
.
2022
;
91
(
3
):
621
626
66
Liu
X
,
Wang
L
,
Zhou
K
, et al
.
Predictive value of C-reactive protein to albumin ratio as a biomarker for initial and repeated intravenous immunoglobulin resistance in a large cohort of Kawasaki disease patients: a prospective cohort study
.
Pediatr Rheumatol Online J
.
2021
;
19
(
1
):
24
67
Harada
K
.
Intravenous γ-globulin treatment in Kawasaki disease
.
Acta Paediatr Jpn
.
1991
;
33
(
6
):
805
810
68
Tsoukas
P
,
Yeung
RSM
.
Kawasaki disease and MIS-C share a host immune response
.
Nat Rev Rheumatol
.
2022
;
18
(
10
):
555
556
69
Ghosh
P
,
Katkar
GD
,
Shimizu
C
, et al;
Pediatric Emergency Medicine Kawasaki Disease Research Group
.
An artificial intelligence-guided signature reveals the shared host immune response in MIS-C and Kawasaki disease
.
Nat Commun
.
2022
;
13
(
1
):
2687
70
Baek
JY
,
Song
MS
.
Meta-analysis of factors predicting resistance to intravenous immunoglobulin treatment in patients with Kawasaki disease
.
Korean J Pediatr
.
2016
;
59
(
2
):
80
90
71
Li
X
,
Chen
Y
,
Tang
Y
, et al
.
Predictors of intravenous immunoglobulin-resistant Kawasaki disease in children: a meta-analysis of 4442 cases
.
Eur J Pediatr
.
2018
;
177
(
8
):
1279
1292

Supplementary data