BACKGROUND AND OBJECTIVES

Acute hematogenous musculoskeletal infections (MSKI) are medical emergencies with the potential for life-altering complications in afflicted children. Leveraging administrative data to study pediatric MSKI is difficult as many infections are chronic, nonhematogenous, or occur in children with significant comorbidities. The objective of this study was to validate a case-finding algorithm to accurately identify children hospitalized with acute hematogenous MSKI using administrative billing codes.

METHODS

This was a multicenter validation study using the Pediatric Health Information System (PHIS) database. Hospital admissions for MSKI were identified from 6 PHIS hospitals using discharge diagnosis codes. A random subset of admissions underwent manual chart review at each site using predefined criteria to categorize each admission as either “acute hematogenous MSKI” (AH-MSKI) or “not acute hematogenous MSKI.” Ten unique coding algorithms were developed using billing data. The sensitivity and specificity of each algorithm to identify AH-MSKI were calculated using chart review categorizations as the reference standard.

RESULTS

Of the 492 admissions randomly selected for manual review, 244 (49.6%) were classified as AH-MSKI and 248 (50.4%) as not acute hematogenous MSKI. Individual algorithm performance varied widely (sensitivity 31% to 91%; specificity 52% to 98%). Four algorithms demonstrated potential for future use with receiver operating characteristic area under the curve greater than 80%.

CONCLUSIONS

Identifying children with acute hematogenous MSKI based on discharge diagnosis alone is challenging as half have chronic or nonhematogenous infections. We validated several case-finding algorithms using administrative billing codes and detail them here for future use in pediatric MSKI outcomes.

Musculoskeletal infections (MSKI), including osteomyelitis, septic arthritis, and pyomyositis, are commonly encountered resource-intensive infections with the potential for life-altering sequelae in children, including limb-length discrepancy and angular deformity.16  MSKI account for more than 1% of all pediatric hospitalizations and are the most common indication for prolonged antimicrobial therapy in children.2,7  Children with MSKI often require extensive hospital resources, including advanced imaging studies, surgical interventions, and subspecialist consultations; therefore, MSKIs are an important target for studies to optimize clinical outcomes and improve care value.1,712 

Large, multisite prospective trials are needed to identify the ideal diagnostic and therapeutic strategies for these patients, but such studies are costly, slow, and logistically challenging to complete. National administrative databases are an efficient alternative means to study the care of millions of children hospitalized with various conditions. However, prior attempts to study MSKI using administrative codes were limited by the need for time-intensive manual chart reviews to prune the dataset of relevant exclusions and identify only those children with acute hematogenous MSKI.13  An accurate and reliable case-finding algorithm for acute hematogenous MSKI based on readily accessible administrative data could leverage these large pediatric databases to describe variability in clinical approaches and patient outcomes across a large number of pediatric inpatient admissions in the United States. Previous validation studies developed similar administrative coding algorithms for other infections, including urinary tract infections, Clostridioides difficile colitis, peritonsillar abscesses, and community acquired pneumonia.1417  However, no previous study has developed a validated case-finding tool using administrative billing data for pediatric MSKI. The objective of this study was to design and validate an automated case-finding algorithm using the Pediatric Health Information System (PHIS) database to reliably identify children with acute hematogenous MSKI while excluding infections that are subacute, chronic, nonhematogenous, or associated with significant contributing comorbidities.

Data for this study were obtained from PHIS, an administrative database that contains inpatient, emergency department, ambulatory surgery, and observation encounter-level data from over 50 not- for-profit, tertiary care pediatric hospitals in the United States. These hospitals are affiliated with the Children’s Hospital Association (Lenexa, KS). Data quality and reliability are assured through a joint effort between the Children’s Hospital Association and participating hospitals. For the purposes of external benchmarking, participating hospitals provide discharge and encounter data, including demographics, diagnoses, and procedures. Nearly all of these hospitals also submit resource utilization data (eg, pharmaceuticals, imaging, and laboratory) into PHIS. Data are deidentified at the time of data submission and are subjected to a number of reliability and validity checks before being included in the database.18  The PHIS database was queried for inpatient and observation encounters for patients between 6 months and 18 years of age who were admitted between April 1, 2016, and March 31, 2020. Patient encounters were included in the primary cohort if they were assigned a primary or a secondary discharge diagnosis of osteomyelitis, septic arthritis, and/or pyomyositis based on International Classification of Diseases, Tenth Revision (ICD-10) diagnostic codes (Supplemental Table 4). All ICD-10 Clinical Modification codes within diagnostic categories A and B (Infectious and Parasitic Diseases), L (Skin and Subcutaneous Tissue), and M (Musculoskeletal and Connective Tissue) were reviewed for any possible codes associated with infectious osteomyelitis, septic arthritis, and pyomyositis to ensure the list of included patients was comprehensive.19  Six PHIS-participating hospitals were included in this study (Children’s Hospital Colorado, Aurora, Colorado; Seattle Children’s Hospital, Seattle, Washington; Monroe Carell Jr Children’s Hospital at Vanderbilt, Nashville, Tennessee; Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania; Children’s Hospital at St. Louis, St. Louis, Missouri, and Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio). Sites were chosen to include a diverse mix of geographic regions in the United States. Any encounter in which the patient had been hospitalized for an MSKI within the previous 180 days was excluded from the PHIS query.

To ensure our list of included MSKI ICD-10 diagnostic codes was comprehensive and that no MSKI patients were missed because of coding errors, an additional, separate PHIS query (secondary cohort) was also performed to identify patient encounters with diagnostic and therapeutic characteristics similar to MSKI encounters but without any MSKI ICD-10 discharge diagnoses assigned (Supplemental Table 4).

For the purposes of this study, an acute hematogenous MSKI without contributing comorbidities (AH-MSKI) was defined as a child between 6 months and 18 years of age diagnosed and treated for an acute onset (<2 weeks symptoms) of bacterial osteomyelitis, septic arthritis, and/or pyomyositis without any of the following comorbidities that would contribute to disease pathogenesis: decubitus ulcers; neurologic comorbidity leading to significant decrease in mobility; sickle cell anemia; indwelling musculoskeletal hardware at site of infection; central venous catheter in place before onset of MSKI; surgery to affected extremity within 30 days of symptom onset; infection of the head, orbits, or central nervous system; immunocompromising conditions as defined by Infectious Diseases Society of America Immunocompromised Host guideline20 ; open fracture; penetrating trauma to site of infection; or atypical infections caused by Borrelia burgdorferi, Neisseria gonorrheae, Brucella species, mycobacteria, endemic fungi, Coxiella burnetii, Actinomyces species, or Nocardia species. The above comorbidities and atypical pathogens routinely undergo notably different medical and surgical treatment plans because of differences in disease pathogenesis, complication rates, and expected antibiogram. Therefore, accurately labeling patients with these comorbidities is needed to exclude outliers and identify the population of interest for future MSKI research. Patients who did not meet the above criteria were classified as having a not acute hematogenous MSKI (NAH-MSKI).

Available PHIS data extracted included all primary and secondary discharge diagnoses, demographic data, hospital length of stay, surgical procedures performed, dates and classes of antimicrobial agents received, and time spent in an ICU.

We performed a sample size calculation to determine the number of encounters needed to review for algorithm validation. Assuming a 1:1 ratio of cases to controls (ie, AH-MSKI to NAH-MSKI) would be included via reference standard chart review, to estimate a sensitivity of 80% for the proposed algorithms with a margin of error of 5% and 95% confidence level, we determined we needed to review 492 total patient encounters. Each of the participating PHIS sites reviewed 82 randomly selected encounters from their center. In addition, each site reviewed all encounters from their site-specific “secondary cohort” to ensure there were no missed MSKI.

Each study site underwent a 1-hour virtual training session on chart review instructions for consistency. Each site was provided with a training document to reference the described criteria for what constituted an AH-MSKI versus a NAH-MSKI. Reviewers at each site evaluated provider documentation for encounters using their site-specific electronic medical record system and labeled each encounter as either an AH-MSKI or NAH-MSKI. Each site was instructed to review discharge summaries, history and physical documentation, and infectious diseases or orthopedic consultation notes to determine whether a patient was diagnosed and treated for a bacterial musculoskeletal infection. Reviewers used the diagnostic decisions of the original treating team as clinical truth and did not retrospectively alter discharge diagnoses. AH-MSKI encounters had data extracted via manual chart review for causative bacterial pathogens based on all microbiological data available in the electronic medical record.

If an encounter was ambiguous for determination of AH-MSKI versus NAH-MSKI, reviewers flagged the encounter for additional adjudication. All flagged charts underwent a collaborative discussion (without sharing protected health information) with the coordinating PHIS site to determine whether the encounter should be classified as AH-MSKI or NAH-MSKI. After manual chart reviews were completed, all 492 randomly reviewed encounters underwent additional review by the primary PHIS site leader to identify potential misclassifications, including unusual hospital lengths of stay, isolation of a pathogen rarely associated with AH-MSKI, or discharge diagnoses not typical of AH-MSKI. Any patient encounters flagged as possible misclassifications then underwent a second manual chart review by site leads to ensure accurate categorization for AH-MSKI versus NAH-MSKI for the reference standard. These manual chart review classifications for each encounter were used as the reference standard for algorithm comparison.

All collected information was stored in the coordinating PHIS site’s institutional Research Electronic Data Capture (REDCap) database.21  Data access groups were used within REDCap to ensure site privacy, and no decrypted protected health information was stored in the REDCap database. Each site received approval for this retrospective cohort study through their respective institutional review board.

Nineteen categories of medical conditions or procedures typically assigned to NAH-MSKI were developed. A list of relevant ICD-10 discharge diagnoses or Current Procedural Terminology codes was identified for each of these nineteen categories (Supplemental Table 5). A binary assignment (presence or absence) for each of these 19 indicator categories was given to each PHIS encounter. In addition to ICD-10 discharge diagnoses and Current Procedural Terminology codes assigned to each encounter, hospital length of stay, antimicrobial classes administered, complex chronic condition (CCC) classification flag,22  immunomodulating medications received during admission, disposition location, and encounter type (inpatient versus observation) were used to develop each algorithm (Supplemental Table 6). Ten diagnostic algorithms were developed using clinical reasoning and were further informed by the preliminary relative performance (a basic ratio of false negatives to true negatives) of individual categories using a small subset of the sample. Individual algorithms were designed to maximize sensitivity, maximize specificity, or balance both, and 10 algorithms were developed to offer several comparable options for each of those 3 purposes.

Characteristics of the total study cohort queried from PHIS and the random subset sampled for chart review were described using frequencies and proportions for categorical variables and median and interquartile range (IQR) for continuous variables. Using manual chart review designation as the reference standard, the performance of each of the 10 algorithms was determined by calculating its sensitivity, specificity, and receiver operating characteristic (ROC) area under the curve (AUC), which for a binary predictor is equivalent to the average between sensitivity and specificity when calculated using a strategy accounting for ties. Variation across sites in chart review practices and performance of the algorithms was assessed by calculating their sensitivity, specificity, and ROC AUC for each site and comparing the 95% confidence intervals (CIs). Sensitivity and specificity for each algorithm were also compared across sites using χ2 or Fisher’s exact tests of association. Analyses were performed using R (version 4.1.1, 2021).

A total of 3012 encounters (2757 inpatient admissions and 255 observation encounters) from the 6 participating sites were identified for the primary cohort (Fig 1A). Patient demographics, MSKI diagnoses assigned, and clinical characteristics are shown in Tables 1 and 2. Thirty eight percent of patients had osteomyelitis alone, 16% had septic arthritis alone, 26% had pyomyositis alone, and 20% of patients had more than one MKSI diagnosis assigned (Table 1). A total of 492 randomly selected hospital admissions (82 per site) were assigned reference standard of AH-MSKI or NAH-MSKI status via manual chart review. Of these, 244 (49.6%) were cases of acute hematogenous MSKI without any contributing comorbidities (AH-MSKI) and 248 (50.4%) were either nonbacterial MSKI, non-MSKI, subacute, chronic, or nonhematogenous MSKI, or were associated with significant contributing comorbidities (NAH-MSKI). Most AH-MSKI (84.4%) had osteomyelitis, septic arthritis, or pyomyositis as the primary discharge diagnosis (as opposed to a secondary diagnosis), whereas only 39.9% of NAH-MSKI encounters had MSKI as a primary discharge diagnosis (Table 2). Classification as either nonbacterial or non-MSKI (eg, viral myositis, superficial cellulitis only) was the most common reason to be classified as NAH-MSKI, followed by chronic infections, decubitus ulcers, and severe neurologic comorbidity (Fig 1A).

FIGURE 1A

Flowchart of pediatric health information system encounters reviewed (primary cohort).

FIGURE 1A

Flowchart of pediatric health information system encounters reviewed (primary cohort).

Close modal
FIGURE 1B

Flowchart of pediatric health information system encounters reviewed (secondary cohort).

FIGURE 1B

Flowchart of pediatric health information system encounters reviewed (secondary cohort).

Close modal
TABLE 1

Patient Demographics and Diagnoses

Patient CharacteristicTotal Cohort (N = 3012)Validation Subset (N = 492)
Median age, years (IQR) 7 (9) 7 (9) 
Male sex 1837 (61.0) 298 (60.6) 
Race   
 American Indian, Alaska Native, or Pacific Islander 17 (0.6) 0 (0) 
 Asian 112 (3.7) 13 (2.6) 
 Black 464 (15.4) 71 (14.4) 
 Other or mixed race 495 (16.4) 80 (16.3) 
 White 1877 (62.3) 320 (65.0) 
 Hispanic or Latino ethnicity 363 (12.1) 53 (10.8) 
MSKI diagnoses assigned   
 Osteomyelitis alone (OM) 1142 (37.9) 185 (37.6) 
 Septic arthritis alone (SA) 486 (16.1) 81 (16.5) 
 Pyomyositis alone (PM) 783 (26.0) 133 (27.0) 
More than 1 of the above diagnoses assigned 601 (20.0) 93 (18.9) 
 OM+PM 251 (8.3) 38 (7.7) 
 OM+SA 176 (5.8) 29 (5.9) 
 PM+SA 62 (2.1) 11 (2.2) 
 OM+SA+PM 112 (3.7) 15 (3.0) 
Patient CharacteristicTotal Cohort (N = 3012)Validation Subset (N = 492)
Median age, years (IQR) 7 (9) 7 (9) 
Male sex 1837 (61.0) 298 (60.6) 
Race   
 American Indian, Alaska Native, or Pacific Islander 17 (0.6) 0 (0) 
 Asian 112 (3.7) 13 (2.6) 
 Black 464 (15.4) 71 (14.4) 
 Other or mixed race 495 (16.4) 80 (16.3) 
 White 1877 (62.3) 320 (65.0) 
 Hispanic or Latino ethnicity 363 (12.1) 53 (10.8) 
MSKI diagnoses assigned   
 Osteomyelitis alone (OM) 1142 (37.9) 185 (37.6) 
 Septic arthritis alone (SA) 486 (16.1) 81 (16.5) 
 Pyomyositis alone (PM) 783 (26.0) 133 (27.0) 
More than 1 of the above diagnoses assigned 601 (20.0) 93 (18.9) 
 OM+PM 251 (8.3) 38 (7.7) 
 OM+SA 176 (5.8) 29 (5.9) 
 PM+SA 62 (2.1) 11 (2.2) 
 OM+SA+PM 112 (3.7) 15 (3.0) 

Data presented at n (%) unless noted otherwise.

TABLE 2

Clinical Characteristics by Site for Manually Reviewed Encounters

Characteristic, N (%)All Sites (N = 492)Site 1 (N = 82)Site 2 (N = 82)Site 3 (N = 82)Site 4 (N = 82)Site 5 (N = 82)Site 6 (N = 82)
AH-MSKI 244 (49.6) 39 (47.6) 43 (52.4) 36 (43.9) 42 (51.2) 40 (48.8) 44 (53.7) 
Among AH-MSKI (N = 244) (N = 39) (N = 43) (N = 36) (N = 42) (N = 40) (N = 44) 
Median hospital length of stay, days (IQR) 4 (2) 4 (2.5) 4 (2) 4 (3) 4 (2.8) 3.5 (2.3) 4 (2) 
Pediatric intensive care 12 (4.9) 5 (12.8) 1 (2.3) 3 (8.3) 0 (0) 3 (7.5) 0 (0) 
Surgical procedure during admission 143 (58.6) 24 (61.5) 24 (55.8) 26 (72.2) 25 (59.5) 23 (57.5) 21 (47.7) 
AH-MSKI microbiology results        
 No pathogen identified 98 (40.2) 11 (28.2) 20 (46.5) 15 (41.7) 18 (42.9) 14 (35.0) 20 (45.5) 
 Methicillin-susceptible Staphylococcus aureus (MSSA) 79 (32.4) 20 (51.3) 14 (32.6) 10 (27.8) 11 (26.2) 10 (25.0) 14 (31.8) 
 Methicillin-resistant Staphylococcus aureus (MRSA) 27 (11.1) 3 (7.7) 1 (2.3) 6 (16.7) 7 (16.7) 7 (17.5) 3 (6.8) 
 Kingella kingae 14 (5.7) 2 (5.1) 5 (11.6) 2 (5.6) 2 (4.8) 2 (5.0) 1 (2.3) 
 Streptococcus pyogenes 13 (5.3) 2 (5.1) 1 (2.3) 1 (2.8) 3 (7.1) 3 (7.5) 3 (6.8) 
 Streptococcus pneumoniae 5 (2.0) 1 (2.6) 0 (0) 1 (2.8) 0 (0) 1 (2.5) 2 (4.5) 
 Other or polymicrobial 8 (3.3) 0 (0) 2 (4.7) 1 (2.8) 1 (2.4) 3 (7.5) 1 (2.3) 
MSKI as primary ICD-10 discharge diagnosis        
 Among AH-MSKI (N = 244), 206 (84.4) (N = 39), 32 (82.1) (N = 43), 36 (83.7) (N = 36), 32 (88.9) (N = 42), 31 (73.8) (N = 40), 36 (90.0) (N = 44), 39 (88.6) 
 Among NAH-MSKI (N = 248), 99 (39.9) (N = 43), 18 (41.9) (N = 39), 16 (41.0) (N = 46), 20 (43.5) (N = 40), 17 (42.5) (N = 42), 16 (38.1) (N = 38), 12 (31.6) 
Characteristic, N (%)All Sites (N = 492)Site 1 (N = 82)Site 2 (N = 82)Site 3 (N = 82)Site 4 (N = 82)Site 5 (N = 82)Site 6 (N = 82)
AH-MSKI 244 (49.6) 39 (47.6) 43 (52.4) 36 (43.9) 42 (51.2) 40 (48.8) 44 (53.7) 
Among AH-MSKI (N = 244) (N = 39) (N = 43) (N = 36) (N = 42) (N = 40) (N = 44) 
Median hospital length of stay, days (IQR) 4 (2) 4 (2.5) 4 (2) 4 (3) 4 (2.8) 3.5 (2.3) 4 (2) 
Pediatric intensive care 12 (4.9) 5 (12.8) 1 (2.3) 3 (8.3) 0 (0) 3 (7.5) 0 (0) 
Surgical procedure during admission 143 (58.6) 24 (61.5) 24 (55.8) 26 (72.2) 25 (59.5) 23 (57.5) 21 (47.7) 
AH-MSKI microbiology results        
 No pathogen identified 98 (40.2) 11 (28.2) 20 (46.5) 15 (41.7) 18 (42.9) 14 (35.0) 20 (45.5) 
 Methicillin-susceptible Staphylococcus aureus (MSSA) 79 (32.4) 20 (51.3) 14 (32.6) 10 (27.8) 11 (26.2) 10 (25.0) 14 (31.8) 
 Methicillin-resistant Staphylococcus aureus (MRSA) 27 (11.1) 3 (7.7) 1 (2.3) 6 (16.7) 7 (16.7) 7 (17.5) 3 (6.8) 
 Kingella kingae 14 (5.7) 2 (5.1) 5 (11.6) 2 (5.6) 2 (4.8) 2 (5.0) 1 (2.3) 
 Streptococcus pyogenes 13 (5.3) 2 (5.1) 1 (2.3) 1 (2.8) 3 (7.1) 3 (7.5) 3 (6.8) 
 Streptococcus pneumoniae 5 (2.0) 1 (2.6) 0 (0) 1 (2.8) 0 (0) 1 (2.5) 2 (4.5) 
 Other or polymicrobial 8 (3.3) 0 (0) 2 (4.7) 1 (2.8) 1 (2.4) 3 (7.5) 1 (2.3) 
MSKI as primary ICD-10 discharge diagnosis        
 Among AH-MSKI (N = 244), 206 (84.4) (N = 39), 32 (82.1) (N = 43), 36 (83.7) (N = 36), 32 (88.9) (N = 42), 31 (73.8) (N = 40), 36 (90.0) (N = 44), 39 (88.6) 
 Among NAH-MSKI (N = 248), 99 (39.9) (N = 43), 18 (41.9) (N = 39), 16 (41.0) (N = 46), 20 (43.5) (N = 40), 17 (42.5) (N = 42), 16 (38.1) (N = 38), 12 (31.6) 

Of the 244 AH-MSKI encounters, a pathogen was identified in 146 (59.8%); institution-specific rates of pathogen identification ranged from 53.5% to 71.8%. Among patients with a pathogen identified, Staphylococcus aureus was the most common bacterium (72.6%) and rates of methicillin resistance among S. aureus isolates ranged from 6.7% to 41.2% at the 6 PHIS hospitals (Table 2). Median hospital length of stay was 4 days, and 4.9% of patients required intensive care. A majority of AH-MSKI patients (59%) underwent at least 1 surgical procedure during admission, though there was variability in rate of surgical procedures across PHIS sites (47.7% to 72.2%).

There was variable performance of the 10 algorithms with sensitivity ranging from 31.1% to 91% and 199 specificity ranging from 52.4% to 97.6% (Table 3 and Fig 2). Four algorithms demonstrated potential for future use with ROC AUC greater than 80% (Algorithms 1, 2, 3, and 10). Balancing sensitivity and specificity equally, the highest performing algorithm (Algorithm 3) had a sensitivity of 79.9% and specificity of 86.7% (Table 3). Algorithm 8 had the highest specificity (97.6%) but was associated with the lowest sensitivity (31.1%).

FIGURE 2

Sensitivity (true positive rate) and 1 – specificity (false positive rate) of ten case-finding algorithms for children hospitalized with acute hematogenous musculoskeletal infections.

FIGURE 2

Sensitivity (true positive rate) and 1 – specificity (false positive rate) of ten case-finding algorithms for children hospitalized with acute hematogenous musculoskeletal infections.

Close modal
TABLE 3

Individual Algorithm Performance

Algorithm GoalOptimize Sensitivity and SpecificityOptimize SensitivityOptimize Specificity
Algorithm number 10 
ROC AUC (95% CI) 82.1 (78.4–85.5) 82.5 (78.8–86.2) 83.3 (79.7–86.9) 70.8 (66.2–75.4) 73.1 (67.2–76.2) 71.7 (67.2–76.2) 78.5 (74.5–82.6) 64.4 (59.5–69.2) 76.7 (72.5–80.9) 82.5 (78.8–86.2) 
Sensitivity (95% CI) 81.6 (76.1–86.2) 82.8 (77.5–87.3) 79.9 (74.3–84.8) 54.1 (47.6–60.5) 90.2 (85.7–93.6) 91.0 (86.7–94.3) 88.9 (84.3–92.6) 31.1 (25.4–37.4) 60.7 (54.2–66.8) 80.3 (74.8–85.1) 
Specificity (95% CI) 82.7 (77.4–87.2) 82.3 (76.9–86.8) 86.7 (81.8–90.7) 87.5 (82.7–91.3) 56.0 (49.6–62.3) 52.4 (46.0–58.8) 68.1 (62.0–73.9) 97.6 (94.8–99.1) 92.7 (88.8–95.6) 84.7 (79.6–88.9) 
Algorithm GoalOptimize Sensitivity and SpecificityOptimize SensitivityOptimize Specificity
Algorithm number 10 
ROC AUC (95% CI) 82.1 (78.4–85.5) 82.5 (78.8–86.2) 83.3 (79.7–86.9) 70.8 (66.2–75.4) 73.1 (67.2–76.2) 71.7 (67.2–76.2) 78.5 (74.5–82.6) 64.4 (59.5–69.2) 76.7 (72.5–80.9) 82.5 (78.8–86.2) 
Sensitivity (95% CI) 81.6 (76.1–86.2) 82.8 (77.5–87.3) 79.9 (74.3–84.8) 54.1 (47.6–60.5) 90.2 (85.7–93.6) 91.0 (86.7–94.3) 88.9 (84.3–92.6) 31.1 (25.4–37.4) 60.7 (54.2–66.8) 80.3 (74.8–85.1) 
Specificity (95% CI) 82.7 (77.4–87.2) 82.3 (76.9–86.8) 86.7 (81.8–90.7) 87.5 (82.7–91.3) 56.0 (49.6–62.3) 52.4 (46.0–58.8) 68.1 (62.0–73.9) 97.6 (94.8–99.1) 92.7 (88.8–95.6) 84.7 (79.6–88.9) 

Individual algorithm ROC AUC was comparable across all PHIS sites with overlapping CIs. Sensitivity and specificity were also generally similar across sites. However, several differences in algorithm performance were noted with Site 5 demonstrating a lower sensitivity for Algorithms 1 (P < .001), 3 (P = .002), and 9 (P = .002) compared with the other sites and Site 1 having a lower specificity for Algorithm 8 (P = .04) compared with the other sites. There were no other significant differences in any algorithm sensitivity or specificity among the 6 PHIS sites (Supplemental Fig 3).

A total of 208 encounters (190 inpatient and 18 observation) were identified for the secondary cohort that had characteristics of an AH-MSKI (eg, fever, elevated C-reactive protein, imaging of an extremity, receipt of an antimicrobial) but without any MSKI ICD-10 discharge diagnoses assigned (Fig 1B, Supplemental Table 3). All 208 hospital admissions in the secondary cohort underwent manual chart review at their respective PHIS site. Nearly all secondary cohort encounters were classified as NAH-MSKI (97.6%). There were 5 encounters in the secondary cohort that were classified via manual chart review as AH-MSKI and were missed using our primary cohort MSKI discharge diagnostic codes that found 3012 MSKI encounters.

To date, harnessing administrative data to study acute hematogenous musculoskeletal infections in children has been challenging because of limited specificity of discharge diagnoses for this population. This study is the first to develop and evaluate the performance of several case-finding algorithms designed to reliably identify children hospitalized for acute hematogenous MSKI using administrative data alone. Several algorithms showed promise with sensitivities and specificities greater than 80% and consistent performance across multiple, geographically diverse PHIS hospitals. Nearly half (50.4%) of those patients assigned a discharge diagnosis of osteomyelitis, septic arthritis, and/or pyomyositis were found in manual chart review to have chronic infections, nonhematogenous sources, or significant contributing comorbidities. Therefore, when used in isolation, ICD-10 discharge diagnosis codes are an inaccurate method to identify children hospitalized with acute hematogenous MSKI, and case-finding algorithms like ours are needed to study acute hematogenous MSKI among hospitalized children in PHIS to prevent erroneous conclusions.

When equally balancing sensitivity and specificity, Algorithms 1, 2, 3, and 10 were the highest performing algorithms with Algorithm 3 demonstrating the highest ROC AUC (83.3%). These 4 algorithms shared several similar features that might offer insight into their performance. First, each algorithm excluded patients with short hospitalizations (1 day) and those with unusually long hospitalizations of more than 30 days (Algorithms 1–3) or more than 14 days (Algorithm 10). Second, each used a robust list of excludable discharge diagnoses likely to be assigned to patients without acute hematogenous MSKI, including chronic recurrent multifocal osteomyelitis and decubitus ulcers. Lastly, these algorithms excluded patients who never received an antimicrobial and who had a CCC flag in PHIS.

When comparing algorithm performance, we found use of this CCC flag was a high-yield addition to identify patients with likely excludable conditions. When using a narrowed list of acute MSKI discharge diagnoses (Algorithms 4 and 8), algorithm specificity was greatly increased (up to 97.6%), but at the cost of a reduced sensitivity (down to 31.1%). This suggests that PHIS-based studies limited to a narrow list of ICD-10 discharge diagnoses will fail to identify a significant proportion of inpatient acute hematogenous MSKI encounters, leading to biased results. Depending on the clinical outcomes and population of interest, administrative database studies of acute hematogenous MSKI could select an algorithm tailored to the population of interest. For instance, a study hoping to prioritize exclusion of patients with chronic or nonhematogenous infections would opt for a highly specific algorithm, and therefore Algorithm 8 may be ideal in such a scenario. Similarly, a study interested in only certain MSKI processes, such as osteomyelitis alone for instance, could easily tailor the included ICD-10 codes in these algorithms to capture only those patients of interest while still excluding nonacute or nonhematogenous infections.

When comparing algorithm performance across institutions, the ROC AUC for each algorithm was similar with overlapping CIs across all PHIS sites. Sensitivity and specificity for each algorithm were also mostly similar across sites. However, Site 5 had slightly lower sensitivity observed for Algorithms 1, 3, and 9 and Site 1 had slightly lower specificity for Algorithm 8. It is possible that inconsistency in manual chart review contributed to these findings, though efforts were extensive to ensure consistency at each site. There is also the potential for variable coding practices among institutions which would lead to variable algorithm performance. However, given the large number of intersite comparisons tested for both sensitivity and specificity of 10 algorithms, differences of at least 1 algorithm may be likely across sites because of random sampling error alone. In addition, the lower specificity for Algorithm 8 at Site 1 was because of 4 false positives compared with 0 or 1 at the other sites, a minor difference. In addition, these algorithms are based upon readily available administrative data that are used in many different healthcare databases (eg, discharge diagnoses, length of stay, and administered medications). Therefore, these algorithms are not limited to PHIS studies alone but can now be leveraged for future research to study acute musculoskeletal infections in children using many readily available pediatric administrative databases. These algorithms can now be harnessed to describe the national variability in pediatric MSKI treatment and address lingering questions relevant to this population, including the impact of early operative procedures on clinical outcomes and the burden of MSKI on hospital resources.

There are several limitations to this study. First, algorithms were developed using data from 6 of 52 participating centers in PHIS and may not be applicable to the entire PHIS dataset or other nonfreestanding pediatric hospitals. However, because we included a wide range of geographic regions across the United States, we feel our results are robust and representative. The overall consistent algorithm performance across sites supports the generalizability of these algorithms to other PHIS sites. Furthermore, it is possible that a patient treated for an acute hematogenous MSKI might be missed by these algorithms because of coding errors or misclassification of discharge diagnoses. To evaluate this, we performed manual chart reviews on a secondary cohort of inpatient encounters that met billing code criteria typical for patients with AH-MSKI, but that were not assigned a discharge diagnosis for MSKI. Among the 208 patients in this secondary cohort, only 5 additional AH-MSKI were identified via manual chart review. Compared with the validation subset of the primary cohort where 49.6% of identified encounters were AH-MSKI, only 2.5% of those in the secondary cohort were AH-MSKI. This suggests that the included ICD-10 diagnostic codes for MSKI in these PHIS algorithms are comprehensive and unlikely to miss a significant number of acute hematogenous MSKI encounters. Another potential limitation is that only a subset of patient charts was manually reviewed to develop our reference standard. It was not feasible to review all patient charts for children with MSKI diagnostic codes at each participating center; however, a large randomly selected sample of patients was used for algorithm validation.

This study successfully developed and validated several case-finding algorithms to accurately identify patients with acute hematogenous MSKI in PHIS while excluding those with complicating comorbidities. Future studies will now leverage these algorithms to identify acute hematogenous MSKI encounters based on administrative billing data to compare variable diagnostic and therapeutic strategies for children hospitalized with acute hematogenous MSKI across PHIS hospitals. In addition, our approach offers a methodologic example for how to validate other case-finding algorithms using administrative billing codes for specific pediatric diagnoses.

FUNDING: Salary support for Dr Searns to lead this project was supported by the Research Scholar Award from the Children’s Hospital Colorado Research Institute. The other authors received no additional funding.

CONFLICT OF INTEREST DISCLOSURES: The authors have indicated they have no conflicts of interest to disclose.

Dr Searns conceptualized and designed the study, designed the algorithms, performed site-specific retrospective chart review, analyzed the final dataset, and drafted, reviewed, and critically revised the manuscript for important intellectual content; Dr Rice contributed to study design, including strategies for algorithm comparison and statistical analysis; Ms Bertin contributed to study design, including strategies for algorithm comparison and statistical analysis; Ms Birkholz contributed to study design, helped conceptualize algorithm development, and collected all patient encounters in the Pediatric Health Information System Database; Ms Barganier and Drs Creech, Downes, Hubbell, Kronman, Rolsma, Sydney, O’Leary, Parker, and Dominguez conceptualized and designed the study, contributed to algorithm development, and interpreted the significant findings from the final analysis; and all authors critically reviewed and revised the manuscript for important intellectual content, approved the final manuscript as submitted, and agree to be accountable for all aspects of the work.

1.
Arnold
JC
,
Bradley
JS
.
Osteoarticular infections in children
.
Infect Dis Clin North Am
.
2015
;
29
(
3
):
557
574
2.
Donaldson
N
,
Sanders
J
,
Child
J
,
Parker
S
.
Acute hematogenous bacterial osteoarticular infections in children
.
Pediatr Rev
.
2020
;
41
(
3
):
120
136
3.
Gafur
OA
,
Copley
LA
,
Hollmig
ST
,
Browne
RH
,
Thornton
LA
,
Crawford
SE
.
The impact of the current epidemiology of pediatric musculoskeletal infection on evaluation and treatment guidelines
.
J Pediatr Orthop
.
2008
;
28
(
7
):
777
785
4.
Robinette
ED
,
Brower
L
,
Schaffzin
JK
,
Whitlock
P
,
Shah
SS
,
Connelly
B
;
AHO Care Algorithm Team
.
Use of a clinical care algorithm to improve care for children with hematogenous osteomyelitis
.
Pediatrics
.
2019
;
143
(
1
):
e20180387
5.
Okubo
Y
,
Nochioka
K
,
Testa
M
.
Nationwide survey of pediatric acute osteomyelitis in the USA
.
J Pediatr Orthop B
.
2017
;
26
(
6
):
501
506
6.
Lee
MC
.
To culture or not to culture: commentary on an article by Jarren Section, MS, et al.: “Microbiological culture methods for pediatric musculoskeletal infection. a guideline for optimal use”
.
J Bone Joint Surg Am
.
2015
;
97
(
6
):
e32
7.
Thomsen
I
,
Creech
CB
.
Advances in the diagnosis and management of pediatric osteomyelitis
.
Curr Infect Dis Rep
.
2011
;
13
(
5
):
451
460
8.
Woods
CR
,
Bradley
JS
,
Chatterjee
A
, et al
.
Clinical practice guideline by the Pediatric Infectious Diseases Society and the Infectious Diseases Society of America: 2021 guideline on diagnosis and management of acute hematogenous osteomyelitis in pediatrics
.
J Pediatric Infect Dis Soc
.
2021
;
10
(
8
):
801
844
9.
Copley
LA
.
Pediatric musculoskeletal infection: trends and antibiotic recommendations
.
J Am Acad Orthop Surg
.
2009
;
17
(
10
):
618
626
10.
Funk
SS
,
Copley
LA
.
Acute hematogenous osteomyelitis in children: pathogenesis, diagnosis, and treatment
.
Orthop Clin North Am
.
2017
;
48
(
2
):
199
208
11.
Gornitzky
AL
,
Kim
AE
,
O’Donnell
JM
,
Swarup
I
.
Diagnosis and management of osteomyelitis in children: a critical analysis review
.
JBJS Rev
.
2020
;
8
(
6
):
e1900202
12.
Zaoutis
T
,
Localio
AR
,
Leckerman
K
,
Saddlemire
S
,
Bertoch
D
,
Keren
R
.
Prolonged intravenous therapy versus early transition to oral antimicrobial therapy for acute osteomyelitis in children
.
Pediatrics
.
2009
;
123
(
2
):
636
642
13.
Keren
R
,
Shah
SS
,
Srivastava
R
, et al
;
Pediatric Research in Inpatient Settings Network
.
Comparative effectiveness of intravenous vs oral antibiotics for postdischarge treatment of acute osteomyelitis in children
.
JAMA Pediatr
.
2015
;
169
(
2
):
120
128
14.
Shaklee
J
,
Zerr
DM
,
Elward
A
, et al
.
Improving surveillance for pediatric Clostridium difficile infection: derivation and validation of an accurate case-finding tool
.
Pediatr Infect Dis J
.
2011
;
30
(
3
):
e38
e40
15.
Williams
DJ
,
Shah
SS
,
Myers
A
, et al
.
Identifying pediatric community-acquired pneumonia hospitalizations: accuracy of administrative billing codes
.
JAMA Pediatr
.
2013
;
167
(
9
):
851
858
16.
Chisholm
AG
,
Little
BD
,
Johnson
RF
.
Validating peritonsillar abscess drainage rates using the pediatric hospital information system data
.
Laryngoscope
.
2020
;
130
(
1
):
238
241
17.
Tieder
JS
,
Hall
M
,
Auger
KA
, et al
.
Accuracy of administrative billing codes to detect urinary tract infection hospitalizations
.
Pediatrics
.
2011
;
128
(
2
):
323
330
18.
Mongelluzzo
J
,
Mohamad
Z
,
Ten Have
TR
,
Shah
SS
.
Corticosteroids and mortality in children with bacterial meningitis
.
JAMA
.
2008
;
299
(
17
):
2048
2055
19.
National Center for Health Statistics
.
International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM)
.
Atlanta, GA
:
Centers for Disease Control and Prevention
;
2015
20.
Rubin
LG
,
Levin
MJ
,
Ljungman
P
, et al
;
Infectious Diseases Society of America
.
2013 IDSA clinical practice guideline for vaccination of the immunocompromised host
.
Clin Infect Dis
.
2014
;
58
(
3
):
309
318
21.
Harris
PA
,
Taylor
R
,
Thielke
R
,
Payne
J
,
Gonzalez
N
,
Conde
JG
.
Research electronic data capture (REDCap)–a metadata-driven methodology and workflow process for providing translational research informatics support
.
J Biomed Inform
.
2009
;
42
(
2
):
377
381
22.
Feudtner
C
,
Feinstein
JA
,
Zhong
W
,
Hall
M
,
Dai
D
.
Pediatric complex chronic conditions classification system version 2: updated for ICD-10 and complex medical technology dependence and transplantation
.
BMC Pediatr
.
2014
;
14
:
199

Supplementary data