Approximately 10% to 20% of patients with Kawasaki disease (KD) are refractory to initial intravenous immunoglobulin (IVIG) therapy. KD is mainly associated with coronary artery abnormalities.
To identify and evaluate all developed prediction models for IVIG resistance in patients with KD and synthesize evidence from external validation studies that evaluated their predictive performances.
PubMed Medline, Dialog Embase, the Cochrane Central Register of Controlled Trials, the World Health Organization International Clinical Trials Registry Platform, and ClinicalTrials.gov were searched from inception until October 5, 2021.
All cohort studies that reported patients diagnosed with KD who underwent an initial IVIG of 2 g/kg were selected.
Study and patient characteristics and model performance measures. Two authors independently extracted data from the studies.
The Kobayashi, Egami, Sano, Formosa, and Harada scores were the only prediction models with 3 or more external validation of the161 model analyses in 48 studies. The summary C–statistics were 0.65 (95% confidence interval [CI]: 0.57–0.73), 0.63 (95% CI: 0.55–0.71), 0.58 (95% CI: 0.55–0.60), 0.50 (95% CI: 0.36–0.63), and 0.63 (95% CI: 0.44–0.78) for the Kobayashi, Egami, Sano, Formosa, and Harada models, respectively. All 5 models showed low positive predictive values (0.14–0.39) and high negative predictive values (0.85–0.92).
Potential differences in the characteristics of the target population among studies and lack of assessment of calibrations.
None of the 5 prediction models with external validation accurately distinguished between patients with and without IVIG resistance.
Kawasaki disease (KD) is an acute autoimmune systemic vasculitis that is typically self-limited. However, vasculitis occurs mainly in small- and medium-sized vessels, causing coronary artery abnormalities (CAAs) in 5% to 7% of children who receive standard treatment.1 This sometimes leads to life-threatening complications, such as myocardial infarction, arrhythmia, and heart failure.
High–dose intravenous immunoglobulin (IVIG) therapy is a well-established treatment of KD.2–4 However, approximately 10% to 20% of patients with KD are refractory to this treatment and develop persistent or recurrent fever associated with CAA development after initial IVIG therapy.2,5,6 Clinicians may consider adjuvant therapy with additional initial IVIG therapy for patients at high-risk for IVIG resistance and subsequent CAA.2,4 A systematic review and meta-analysis revealed that combining corticosteroids and initial IVIG therapy for such patients was more effective in preventing CAAs than IVIG alone.7 Thus, reliable IVIG resistance prediction could contribute to decision-making for initial treatment and help prevent CAA.
However, IVIG resistance prediction to reduce the CAA incidence is challenging. Three risk prediction models from Japan8–10 have been developed and externally validated. These scoring models are considered useful in Japan3 but were less beneficial in non-Japanese populations.11 There are other new prediction models besides these 3 models; however, their validity requires further examination.12,13 Therefore, this systematic review aimed to identify all available developed prediction models to predict IVIG resistance in patients with KD and synthesize available evidence from external validation studies evaluating their predictive performances.
Methods
Protocol and Registration
This systematic review and meta-analysis followed the Preferred Reporting Item for Systematic Review and Meta-analysis (PRISMA) guidelines.14 We developed a detailed protocol and registered our study with the International Prospective Register of Systematic Reviews (PROSPERO: CRD42021283276).
Criteria for Considering Studies for This Review
Types of Studies
We included all prospective and retrospective cohort studies. We did not apply language or country restrictions. We included all papers, including published articles, unpublished articles, conference abstracts, and letters. We did not include case-control studies (including nested case-control), case reports, and case series. We did not exclude studies based on the observation period or publication year.
Target Population
We included studies reporting any patients diagnosed with KD and undergoing initial IVIG therapy. Studies including participants with initial IVIG doses other than the standard treatment of 2 g/kg2–4 or those without reported IVIG doses and studies including patients using drugs other than aspirin concurrently with initial IVIG therapy were excluded.
Types of Prognostic and Predictive Models
We included studies aiming to develop, validate (internally or externally), or update prediction models for initial IVIG resistance in patients with KD and prediction models using available predictors in clinical practice before starting IVIG administration. We a priori decided to include models that were not originally developed to predict IVIG resistance but commonly used or validated to predict the outcome. We excluded studies aiming to establish independent prognostic factors associated with IVIG resistance and not to develop a prognostic model by combining 2 or more predictors.
Types of Outcomes to be Predicted
Search Methods for Study Identification
We searched the following databases: Medline (PubMed), the Cochrane Central Register of Controlled Trials (Cochrane Library) and Embase (Dialog) from inception to October 2021. For the assessment of reporting bias, we also searched the following databases for ongoing or unpublished trials: the World Health Organization International Clinical Trials Platform Search Portal and ClinicalTrials.gov. Supplemental Table 2 shows the details of search terms for each database. We checked the reference lists of studies, including international guidelines, such as American Heart Association,2 JCS/JSCS 2020,3 and SHARE recommendation,4 and the reference lists of eligible studies and articles that cite eligible studies. The authors of the original studies were contacted when a study was uncertain to be eligible.
Data Collection
Selection of Studies
Two of 6 independent reviewers (Y.K., M.B., S.T., T.A., N.T., and H.T.) screened titles and abstracts, followed by an eligibility assessment based on the full texts. We contacted the original authors if relevant data are missing. Disagreements between the 2 reviewers were resolved by discussion, with a third reviewer as an arbiter for failed discussion. Study selection was documented in a detailed flowchart based on the PRISMA guidelines.14
Data Extraction and Management
Two of 6 independent reviewers (Y.K., M.B., S.T., T.A., N.T., and H.T.) extracted data from the selected articles following the critical appraisal and data extraction for systematic reviews of prediction modeling studies checklist.15 We categorized the types of model analyses as follows: development model analysis with or without internal validation, validation analysis with a dataset that combines the original dataset for derivation and another dataset, internal-external validation, and external validation analysis.16 The internal-external validation analysis was defined as a validation using a dataset unused for model development in the same study. External validation analysis used datasets from a different study. Discrimination (C–statistic), calibration (observed–to–expected [OE] ratio, calibration–in–the–large and slope, calibration plot, and Hosmer–Lemeshow test), and classification (sensitivity, specificity, positive predictive value [PPV] and negative predictive value [NPV]) were separately extracted for each model if a study reported 2 or more models with different predictors or prediction rules. Cut-offs of 3,17 4,8 or 53,18 points were used for external validation of Kobayashi scores. Studies without the cutoff values were classified as 4 or 5 points based on cited articles.8,18
Risk of Bias and Applicability Assessment
We used the Prediction model Risk of Bias Assessment Tool (PROBAST) to assess the risk of bias and applicability of each model analysis.19
Two of 6 independent reviewers (Y.K., M.B., S.T., T.A., N.T., and H.T.) independently assessed these individual studies. The questions assessed the risk of bias for all analyses reported in the selected articles following the 4 domains, namely, participants, predictors, outcome, and analysis. The applicability was assessed through the questions following the 3 domains, namely, participants, predictors, and outcome. We evaluated each domain for risk of bias as “high-risk of bias,” “unclear risk of bias,” “low-risk of bias,” and for applicability as “high concern,” “unclear concern,” and “low concern.” An overall judgment was evaluated as “high-risk of bias” or “high concern” if at least 1 domain of risk of bias or applicability was “high.” Any disagreements were resolved by discussion, with a third reviewer as an arbiter for failed discussion.
Data Synthesis
Data Synthesis and Meta-analysis Approaches
We applied a random-effects model that pools logit transformations of C–statistics. We adopted the restricted maximum likelihood estimation and the Hartung–Knapp–Sidik–Jonkman.22 We used an inverse-variance-weighted random-effects model with the DerSimonian-Laird estimator to estimate the between-study variance and normal approximation intervals based on summary measures to calculate CIs for individual study results and estimate the pooled PPV and NPV.23 To stabilize variances, we transformed the data with the Freeman–Tukey double arcsine transformation.
We computed the 95% prediction interval, which provided a range for the potential performance of a model in a new validation study. We also computed I2 and τ2 statistics to quantify the extent of the heterogeneity. All statistical analyses were performed using R software (version 4·0·3) including “meta,” “metamisc,” and “metafor” packages.
Subgroup analysis and heterogeneity investigation
Study location (East Asia, Middle East Asia, Europe, or North America) or timing of outcome definition (recrudescent or persistent fever at least 24, 36, or 48 h after IVIG administration completion) subgroups were analyzed to explore the cause and extent of heterogeneity.
Results
Literature Search Results
Figure 1 displays a PRISMA 2020 flowchart detailing the study selection process, including the reason for exclusion. We screened 2043 records after removing duplicates and assessed 193 full texts. The clinical trial registry system screened 27 records, and no records were included. We excluded 25 studies having participants with initial IVIG doses other than 2 g/kg or those without reported IVIG doses,24–26 and 9 studies with patients using drugs other than aspirin concurrently with initial IVIG therapy (Supplemental Table 3). Finally, the analyses included 48 studies.8–13,17,18,27–66
Study Characteristics
Supplemental Table 4 shows the characteristics of the 48 included studies that evaluated 161 model analyses. Of which, 35, 6, 4, and 3 were reported from East Asia, Middle East Asia, North America, and Europe, respectively. Calibration performance was unavailable in most studies.
Risk of Bias and Applicability Assessment
We applied the PROBAST tool to the 161 model analyses (Supplemental Fig 5, Supplemental Table 5). All development model analyses were classified as an overall “high-risk of bias.” All analyses were rated “high-risk of bias” in the analysis domain because of (1) insufficient number of outcome events relative to the parameters among candidate predictors (low events per variable), (2) inappropriate missing data handling, (3) inappropriate selection, and (4) no calibration. Most development model analyses were classified as “low concerns of applicability” (94%; 48 of 51).
All external validation analyses were classified as “high-risk of bias” because of (1) inappropriate missing data handling and (2) no calibration. Approximately two-thirds (74 of 110) of external validation analyses were classified as “low concern of applicability.”
Individual Model Performance
Models with 3 or more external validation analyses in different studies are the Kobayashi, Egami, Sano, Formosa, and Harada scores (Supplemental Table 6). The Kobayashi,8 Egami,9 Sano,10 and Harada67 scores were developed in Japan, whereas the Formosa37 score was developed in Taiwan. The Harada Score67 was originally developed in 1961, not to predict IVIG resistance, but to predict CAA development and determine whether to implement IVIG therapy. However, several studies have evaluated the external validity of the Harada score as a model for predicting IVIG resistance; thus, it was included in the evaluation in this review. Table 1 presents a result summary of the external validation analyses of the 5 prediction models. The included analyses in the meta-analysis revealed a 14% (12% to 20%) median (interquartile range) prevalence of IVIG resistance.
Model . | Measure . | No of Studies (IVIG Resistance %, Median [IQR]) No of Patients . | Summary Measure . | Risk Group, No. of Patients IVIG Resistance, n (%) . | Pooled Result (95% CI) . | Overall Risk of Bias . | Comments and Predictors . |
---|---|---|---|---|---|---|---|
Kobayashi score | Discrimination | 13 studies | C–statistic | NA | 0.65 (0.57–0.73) | High | Age ≤ 12 months, 1 point Illness days ≤ 4 days, 2 points CRP ≥ 10 mg/dL, 1 point AST ≥ 100 IU/L, 2 points PLT < 300,000/mm3, 1 point Na ≤ 133 mmol/L, 2 points |
Classification; Cutoff, 5 points | 5 studies (22 [15–24]) 655 patients | PPV | High-risk (score, 5–11); 197 77 (39) | 0.39 (0.32–0.46) | High | ||
NPV | Low-risk (score, 0–4); 458 71 (16) | 0.85 (0.81–0.89) | High | ||||
Classification; Cutoff, 4 points | 18 studies (14 [12–17]) 6097 patients | PPV | High (score, 4–11); 1,561 483 (31) | 0.21 (0.12–0.31) | High | ||
NPV | Low (score, 0–3); 4,356 442 (10) | 0.89 (0.84–0.92) | High | ||||
Egami score | Discrimination | 12 studies | C–statistic | NA | 0.63 (0.55–0.71) | High | Age ≤ 6 months, 1 point Illness days ≤ 4 days, 1 point CRP ≥ 8 mg/dL, 1 point ALT ≥ 80 IU/L, 2 points PLT ≤ 300,000/mm3, 1 point |
Classification; Cutoff, 3 points | 20 studies (15 [12–21]) 6723 patients | PPV | High (score, 3–6); 1,580 485 (31) | 0.27 (0.18–0.37) | High | ||
NPV | Low (score, 0–2); 5,143 571 (11) | 0.88 (0.83–0.91) | High | ||||
Sano score | Discrimination | 7 studies | C–statistic | NA | 0.58 (0.55–0.60) | High | CRP ≥ 7.0 mg/dL, 1 point TB ≥ 0.9 mg/dL, 1 point AST ≥ 200 IU/L, 1 point |
Classification; Cutoff, 2 points | 15 studies (17 [12–24]) 3,875 patients | PPV | High (score, 2–3); 527 208 (39) | 0.35 (0.26–0.45) | High | ||
NPV | Low (score, 0–1); 3,348 594 (18) | 0.85 (0.77–0.91) | High | ||||
Formosa score | Discrimination | 5 studies | C–statistic | NA | 0.50 (0.36–0.63) | High | Lymphadenopathy, 1 point N% ≥ 60%, 2 points Alb < 35 g/L, 1 point |
Classification; Cutoff, 3 points | 6 studies (13 [7–15]) 2579 patients | PPV | High (score, 3–4); 1,244 117 (9.4) | 0.14 (0.07–0.23) | High | ||
NPV | Low (score, 0–2); 1,335 104 (7.8) | 0.92 (0.88–0.95) | High | ||||
Harada score | Discrimination | 4 studies | C–statistic | NA | 0.63 (0.44–0.78) | High | WBC ≥ 12,000/mm3, 1 point PLT < 350,000/mm3, 1 point CRP ≥ 4.0 mg/dL, 1 point Ht < 35%, 1 point Alb < 35 g/L, 1 point Age < 12 months, 1 point Male, 1 point |
Classification; Cutoff, 4 points | 2 studies (12% and 13%) 379 patients | PPV | High (score, 4–7); 198 33 (17) | 0.16 (0.11–0.22) | High | ||
NPV | Low (score, 0–3); 181 15 (8.3) | 0.91 (0.79–0.99) | High |
Model . | Measure . | No of Studies (IVIG Resistance %, Median [IQR]) No of Patients . | Summary Measure . | Risk Group, No. of Patients IVIG Resistance, n (%) . | Pooled Result (95% CI) . | Overall Risk of Bias . | Comments and Predictors . |
---|---|---|---|---|---|---|---|
Kobayashi score | Discrimination | 13 studies | C–statistic | NA | 0.65 (0.57–0.73) | High | Age ≤ 12 months, 1 point Illness days ≤ 4 days, 2 points CRP ≥ 10 mg/dL, 1 point AST ≥ 100 IU/L, 2 points PLT < 300,000/mm3, 1 point Na ≤ 133 mmol/L, 2 points |
Classification; Cutoff, 5 points | 5 studies (22 [15–24]) 655 patients | PPV | High-risk (score, 5–11); 197 77 (39) | 0.39 (0.32–0.46) | High | ||
NPV | Low-risk (score, 0–4); 458 71 (16) | 0.85 (0.81–0.89) | High | ||||
Classification; Cutoff, 4 points | 18 studies (14 [12–17]) 6097 patients | PPV | High (score, 4–11); 1,561 483 (31) | 0.21 (0.12–0.31) | High | ||
NPV | Low (score, 0–3); 4,356 442 (10) | 0.89 (0.84–0.92) | High | ||||
Egami score | Discrimination | 12 studies | C–statistic | NA | 0.63 (0.55–0.71) | High | Age ≤ 6 months, 1 point Illness days ≤ 4 days, 1 point CRP ≥ 8 mg/dL, 1 point ALT ≥ 80 IU/L, 2 points PLT ≤ 300,000/mm3, 1 point |
Classification; Cutoff, 3 points | 20 studies (15 [12–21]) 6723 patients | PPV | High (score, 3–6); 1,580 485 (31) | 0.27 (0.18–0.37) | High | ||
NPV | Low (score, 0–2); 5,143 571 (11) | 0.88 (0.83–0.91) | High | ||||
Sano score | Discrimination | 7 studies | C–statistic | NA | 0.58 (0.55–0.60) | High | CRP ≥ 7.0 mg/dL, 1 point TB ≥ 0.9 mg/dL, 1 point AST ≥ 200 IU/L, 1 point |
Classification; Cutoff, 2 points | 15 studies (17 [12–24]) 3,875 patients | PPV | High (score, 2–3); 527 208 (39) | 0.35 (0.26–0.45) | High | ||
NPV | Low (score, 0–1); 3,348 594 (18) | 0.85 (0.77–0.91) | High | ||||
Formosa score | Discrimination | 5 studies | C–statistic | NA | 0.50 (0.36–0.63) | High | Lymphadenopathy, 1 point N% ≥ 60%, 2 points Alb < 35 g/L, 1 point |
Classification; Cutoff, 3 points | 6 studies (13 [7–15]) 2579 patients | PPV | High (score, 3–4); 1,244 117 (9.4) | 0.14 (0.07–0.23) | High | ||
NPV | Low (score, 0–2); 1,335 104 (7.8) | 0.92 (0.88–0.95) | High | ||||
Harada score | Discrimination | 4 studies | C–statistic | NA | 0.63 (0.44–0.78) | High | WBC ≥ 12,000/mm3, 1 point PLT < 350,000/mm3, 1 point CRP ≥ 4.0 mg/dL, 1 point Ht < 35%, 1 point Alb < 35 g/L, 1 point Age < 12 months, 1 point Male, 1 point |
Classification; Cutoff, 4 points | 2 studies (12% and 13%) 379 patients | PPV | High (score, 4–7); 198 33 (17) | 0.16 (0.11–0.22) | High | ||
NPV | Low (score, 0–3); 181 15 (8.3) | 0.91 (0.79–0.99) | High |
Population targeted: Patients diagnosed with KD and undergoing an initial 2 g/kg of IVIG therapy. Index model: Multivariable prediction models with at least three external validation analyses. Outcome to be predicted: IVIG resistance. Timing: Timing of making the prediction is just before IVIG administration. Period of the prediction is approximately 24, 36, or 48 h after IVIG completion. Setting: Not specified. NA, not applicable.
Kobayashi Score3,8,18
This meta-analysis included 35 studies and 33 external validation analyses (Supplemental Table 6). Figure 2A shows low discrimination (C–statistic: 0.65; 95% CI: 0.57–0.73), without differences in the C–statistic between studies from East and Middle East Asia (Supplemental Table 7). Europe has only 1 and North America has no report. Additionally, the C–statistic between studies with different IVIG response determination timing has no difference (Supplemental Table 7). PPV was higher at the cutoff score of 5 points (0.39; 95% CI: 0.32–0.46) (Fig 3A), whereas NPV was higher at the cutoff score of 4 points (0.89; 95% CI: 0.84–0.92) (Fig 4B).
Egami Score3,9
This review included 31 studies and 30 external validation analyses (Supplemental Table 6). Figure 2B shows low discrimination (C–statistic: 0.63; 95% CI: 0.55–0.71), without differences in the C–statistic between studies from East and Middle East Asia (Supplemental Table 7). Europe has only 1 and North America has no report. Additionally, the C–statistic between studies with differentIVIG response determination timing has no difference (Supplemental Table 7). Figure 3C and Fig 4C shows low PPV (0.27; 95% CI: 0.18–0.37) and high NPV (0.88; 95% CI: 0.83–0.91).
Sano Score3,10
This review included 21 studies and 20 external validation analyses (Supplemental Table 6). Figure 2C shows low discrimination (C–statistic: 0.58; 95% CI: 0.55–0.60), without differences in the C–statistic between studies from East Asia and Middle East Asia (Supplemental Table 7). Europe and North America have no reports. Additionally, the C–statistic between studies with different IVIG response determination timing has no difference (Supplemental Table 7). Figure 3D and Fig 4D shows low PPV (0.35; 95% CI: 0.26–0.45) and high NPV (0.85; 95% CI: 0.77–0.91).
Formosa Score37
This review included 9 studies and eight external validation analyses (Supplemental Table 6). Figure 2D shows low discrimination (C–statistic: 0.50; 95% CI: 0.36–0.63), without differences in the C–statistic between studies from East Asia and Middle East Asia (Supplemental Table 7). Europe has only 1 and North America has no reports. In all studies, IVIG response was determined at 36 h after the end of administration (Supplemental Table 7). Figure 3E and Fig 4E shows low PPV (0.14; 95% CI: 0.07–0.23) and high NPV (0.92; 95% CI: 0.88–0.95).
Harada Score67
This review included 5 studies and 5 external validation analyses (Supplemental Table 6). Figure 2E shows low discrimination (C–statistic: 0.63; 95% CI: 0.44–0.78), without differences in the C–statistic between studies from East Asia and Middle East Asia (Supplemental Table 7). Europe and North America have no reports. Additionally, the C–statistic between studies with different IVIG response determination timing has no difference (Supplemental Table 7). Figure 3F and Fig 4F shows low PPV (0.16; 95% CI: 0.11–0.22) and high NPV (0.91; 95% CI: 0.79–0.99).
Discussion
This systematic review of 161 model analyses in 48 studies summarized the currently available evidence on performing prediction models for initial IVIG resistance in patients with KD. Our meta-analyses suggested that none of the prediction models have properly distinguished between patients with and without IVIG resistance.
None of the 5 models that were externally validated could not discriminate well. The summary C–statistics, which integrates the external validation results, was <0.75,22 although the Kobayashi score has the highest C–statistic of 0.65. The poor performance of the existing 5 predictive models may be attributed to the fact that KD is a heterogeneous syndrome. KD is speculated to occur as a result of immune responses to a variety of infectious etiologies, including viral, fungal, and bacterial agents, as well as environmental stimuli.68,69 Genetic factors and racial differences may also lead to variation in clinical symptoms and laboratory parameters, thereby making these predictive models difficult to apply to populations other than the development cohort.
The existing prediction models showed NPV of 85% to 92% and PPV of 14% to 39%, indicating that approximately 10% to 20% of patients classified as low-risk are IVIG resistant, whereas only 15% to 40% of patients classified as high-risk do need adjuvant therapy with additional initial IVIG therapy. The clinically acceptable NPV and PPV on this research topic have no established consensus. Thus, the optimum cutoff point for the existing models should be based on the clinical context in each case. For instance, higher NPV (ie, missing fewer patients with IVIG resistance) would be desirable in patients with a severe condition. A lower cutoff point in the existing prediction models might be more appropriate for such patients. However, the model performance in the studies included in the present review was generally assessed only using a cutoff point that was reported (and arbitrarily determined) in the original developmental studies. Therefore, the model’s performance could not be evaluated with different cutoff points.
In clinical settings, collectively considering the prediction model score and other independent predictors might be practical to make an overall decision,34 because a highly reliable prediction model remained unavailable. Each model had a higher rate of IVIG resistance in the high-risk group than in the low-risk group. The higher the point in the model, the higher the risk of IVIG resistance in the Kobayashi and Formosa scores. For example, a score of 8 on the Kobayashi score is associated with a higher-risk IVIG resistance than a score of 5. Rather than only focusing on the dichotomized judgment (ie, positive or negative) of the prediction models, it is more informative to refer to the score itself. Several predictors as a sole predictor have also been reported.70,71 Furthermore, the development and validation of IVIG adjuvant therapy as the standard first-line therapy, which is more effective in preventing CAA formation than conventional IVIG therapy, may also be warranted if developing a prediction model with good performance is difficult.
Model performance for studies reported from East Asia and Middle East Asia was comparable. Contrastingly, model performance from East Asia and Europe or North America is not comparable because of the limited studies incorporated in this review. This was inconsistent with the previous indications that models developed in Japan might have performance differences between Japanese and non-Japanese patients. Recent studies from the United Kingdom,24 Germany,25 and Spain26 demonstrated that the prediction models developed by the Japanese research group were not as effective for non-Japanese patients. However, these studies did not describe the IVIG dose and administration method. Therefore, further studies with well-defined IVIG doses and administration methods are needed to clarify differences in model performance by race or study area.
This was the first systematic review of prediction models’ performance for initial IVIG resistance in patients with KD. We included 48 studies and performed meta-analyses of the 5 different prediction models using only the external validation results in different studies. Referring to the critical appraisal and data extraction for systematic reviews of prediction modeling studies checklist and PROBAST tool, we could extract and assess detailed data on study characteristics and model performance metrics.
This study had some limitations. First, the effects of differences in inclusion criteria between studies, such as the timing of IVIG administration or complete or incomplete type, were not accounted for. Potential differences in the prediction model’s performance between different timings of initial treatment7 and between complete and incomplete KD have been suggested.35 Second, the calibration, such as OE ratios, could not be assessed because they are not reported in external validation analyses.
Conclusions
The external validity remains unknown for most existing prediction models to predict IVIG resistance in patients with KD. Additionally, externally validated models have no sufficient performance to distinguish between patients with and without IVIG resistance. Therefore, further studies are required to develop and validate prediction models with reliable performance.
Acknowledgments
We are very grateful to Dr. Jun Watanabe, Department of Surgery, Division of Gastroenterological, General and Transplant Surgery, Jichi Medical University, for providing support to our research; and Enago (www.enago.jp) for their English language review.
COMPANION PAPER: A companion to this article can be found online at http://www.pediatrics.org/cgi/doi/10.1542/peds.2022-060423.
Dr Kuniyoshi contributed to the conceptualization, study design, search strategy development, screening, data extraction, analysis, and writing (original draft preparation, review, and editing); Drs Tsujimoto and Takada contributed to the study design, search strategy development, supervision, and writing (review and editing); Drs Banno, Taito, and Ariie contributed to the study design, search strategy development, screening, data extraction, and writing (review and editing); Drs Takahashi and Tokutake contributed to screening, data extraction, and writing (review and editing); and all authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work.
FUNDING: No external funding.
CONFLICT OF INTEREST DISCLOSURES: The authors have indicated they have no conflicts of interest relevant to this article to disclose.
Comments