Video Abstract
Genetic testing is expanding among ill neonates, yet the influence of genetic results on medical decision-making is not clear. With this study, we sought to determine how different types of genetic information with uncertain implications for prognosis influence clinicians’ decisions to recommend intensive versus palliative care.
We conducted a national study of neonatologists using a split sample experimental design. The questionnaire contained 4 clinical vignettes. Participants were randomly assigned to see one of 2 versions that varied only regarding whether they included the following genetic findings: (1) a variant of uncertain significance; (2) a genetic diagnosis that affects neurodevelopment but not acute survival; (3) a genetic versus nongenetic etiology of equally severe pathology; (4) a pending genetic testing result. Physicians answered questions about recommendations they would make for the patient described in each vignette.
Vignette versions that included a variant of uncertain significance, a diagnosis foreshadowing neurodevelopmental impairment, or a genetic etiology of disease were all associated with an increased likelihood of recommending palliative rather than intensive care. A pending genetic test result did not have a significant effect on care recommendations.
Findings from this study of hypothetical cases suggest neonatologists apply uncertain genetic findings or those that herald neurodevelopmental disability in problematic ways. As genetic testing expands, understanding how it is used in decision-making and educating clinicians regarding appropriate use are paramount.
Genetic testing is expanding among ill neonates. The effect of genetic information on neonatologists’ decision-making is, however, poorly understood.
On the basis of responses to our split sample psychological experiment, genetic findings had a significant impact on critical decisions for ill neonates. Neonatologists applied uncertain genetic findings or those heralding neurodevelopmental impairment in problematic ways.
Genetic information is appealing to neonatologists who must make critical decisions with insufficient and uncertain prognostic data.1 Neonates are afflicted by pathologies with wide-ranging outcomes, and severity of disease may be difficult to gauge clinically at this early point in development. In this context, genetic testing is an attractive way to predict what a future may look like. The NICU has also been a natural launch point for genetic technology because of the high prevalence of genetic disease, with 25% to 60% of critically ill newborns having abnormal findings on whole exome sequencing.2–4
Although use of genetic testing is expanding rapidly in the NICU, the effects of genetic information on clinical decision-making are not fully understood.5–7 Although genetic testing may end diagnostic odysseys, aid in prognostication, and help clinicians tailor treatments, testing also frequently generates ambiguous information.6,8–10 A genetic variant may be unrelated to a patient’s phenotype or have unclear implications for treatment. Even when genetic diagnoses are definitive, the implications for doctors’ decision-making may be less clear.5,6 Genetic findings may have unanticipated consequences for how doctors perceive risk and counsel families.6,11 A genetic diagnosis may be interpreted or applied inappropriately. To our knowledge, no empirical studies examine how neonatologists use genetic findings in their clinical decision-making and whether their application is clinically accurate or ethically sound.
In this study, we sought to evaluate how neonatologists use different types of genetic findings in their thinking about clinical decisions. We used a questionnaire with an experimental design of alternative clinical vignettes, a validated method to obtain information about how physicians practice.12 We sought to test the broad hypothesis that, in some clinical cases, neonatologists apply genetic findings in ways that are unsupported by data or are based on ethically questionable assumptions. Each case was designed to test how neonatologists apply a specific type of genetic information. In the first 3 cases, we evaluated whether inclusion of the following genetic information would affect clinicians’ recommendations regarding invasive and palliative care options: (1) a variant of uncertain significance (VUS), a genetic variant whose effect on health is unknown; (2) a genetic diagnosis affecting neurodevelopment but not acute survival; and (3) a genetic versus nongenetic etiology of equally severe disease. In the final case, we evaluated the hypothesis that a genetic etiology is a key factor in a physician’s recommendations about goals of care and that a pending result from whole exome sequencing would make neonatologists feel less ready to have a goals of care conversation with a patient’s family.
Methods
Study Participants and Data Collection
In the fall of 2020, we emailed a 22-question questionnaire entitled “Making Difficult Decisions in the Neonatal Intensive Care Unit” to roughly 3600 neonatologists. The questionnaire was distributed by e-mail through the listservs of the American Academy of Pediatrics Section on Neonatal and Perinatal Medicine and the Children’s Hospital Neonatal Consortium. The latter was included to increase sampling of those working in level IV NICUs in which genetic testing is most extensively used. In accordance with the tailored design method and listserv regulations, we sent 1 reminder to the former group and 2 to the latter group and offered a $10 Amazon gift card to all who completed the questionnaire.13 Study data were collected and managed by using Research Electronic Data Capture electronic data capture tools hosted at the Children’s Hospital of Philadelphia.14 This study was reviewed and deemed exempt by the Children’s Hospital of Philadelphia Institutional Review Board.
Instrument Development and Design
We developed a case-based psychological experiment after review of the literature and consultation with geneticists and neonatologists at level III and IV NICUs. The 4 fictionalized case vignettes described acutely ill infants. We then pilot-tested the cases for clarity with 10 experienced neonatologists and revised them on the basis of feedback. Case 1 is included as an example in Fig 1, and the full questionnaire is available as Supplemental Information. Hypotheses were tested by using a randomized split-block design with alternative versions of the case vignettes, a method used successfully in other studies.15,16 Each fictionalized case had 2 versions that varied only by the genetic information provided, henceforth referred to as the genetic and nongenetic versions. Participants were randomly assigned to see only one version of each case; they were randomly assigned independently for each case, so a participant might be presented with any combination of the 4 vignettes. Participants were subsequently asked questions about their recommendations for the patient. All responses were recorded on a 5-point Likert scale.
Sample case vignette. This provides the full case of a patient with a VUS that participants saw. Half of participants saw the genetic version of the case, which includes the VUS. The other half saw the nongenetic version of the case, which includes negative genetic testing.
Sample case vignette. This provides the full case of a patient with a VUS that participants saw. Half of participants saw the genetic version of the case, which includes the VUS. The other half saw the nongenetic version of the case, which includes negative genetic testing.
Case 1 described a patient with severe chronic lung disease and asked about provision of tracheostomy and gastrostomy tubes. In the genetic version, the patient had a VUS in a surfactant protein; in the nongenetic version, she had negative genetic testing results. Case 2 described a preterm infant with sepsis and asked about provision of a central line to continue antibiotic therapy. In the genetic version, the patient was incidentally diagnosed with Williams syndrome; in the nongenetic version, he had negative genetic testing results. Williams syndrome is a gene deletion associated with intellectual disability (typically mild) as well as unique personality characteristics and endocrine and cardiac abnormalities.17 Case 3 described a patient with pulmonary hypoplasia and participants were asked about extracorporeal membrane oxygenation (ECMO) candidacy. In the genetic version, the hypoplasia resulted from a mutation with a broad spectrum of outcomes; in the nongenetic version, the etiology was oligohydramnios. For case 3, the following prognostic statement was included after both versions to mitigate uncertainty about disease severity: “Expert consult suggests that outcomes may range from a severe asthma-like syndrome to lethal pulmonary hypoplasia, but prognosis is difficult to assess.” After the first 3 cases, participants were asked whether they would recommend the stated invasive intervention and whether they would recommend palliative care. They were also asked to rate the importance of each of the following factors in making their recommendations: likelihood of short-term survival, likelihood of long-term survival, pain and suffering for the patient, expected long-term neurologic outcome, effect on the child’s family, and allocation of limited resources.
In case 4, the patient was a term male with progressive muscle weakness and ventilator dependence. The genetic version included pending whole exome sequencing, whereas the nongenetic version had no mention of genetic testing. Participants were asked about readiness for a goals of care conversation. T hey were also asked to what degree the following information would inform their recommendations for goals of care: need for long-term intravenous nutrition, brain MRI results, social work assessment of caretakers’ ability to manage a child requiring complex medical care, otolaryngology consult regarding candidacy for tracheostomy, and whether a genetic etiology was found. Finally, respondents completed additional questions about demographic details and work environment.
Statistical Analysis
Responses were analyzed by using Stata version 16.1 (StataCorp, College Station, TX). Statistical inferences were based on two-tailed tests, with significance set at P < .05. Descriptive statistics for physician characteristics and practice settings are presented for the full sample as well as for randomization group by case and are reported as number of participants and percentages. The recommendations for invasive and palliative interventions served as our primary outcome for each vignette. Because this outcome was ordinal output from a 5-point Likert scale, we used the Wilcoxon–Mann-Whitney test to assess for differences in responses between the groups that saw genetic and nongenetic versions of each case. We calculated Cohen’s d effect size for outcomes in which the genetic and nongenetic group had significantly different responses because this is a commonly used measure of the importance of an effect in decision-making.18,19 In a secondary analysis, we used ordered logistic regression to correct for any failures of randomization that had led to unequal distribution of respondent characteristics between the genetic and nongenetic groups.
Results
A total of 551 respondents completed the entire questionnaire. This represents a response rate of 40% (18 of 45) for the Children’s Hospital Neonatal Consortium group, 21% (533 of 2550) for the American Academy of Pediatrics group, and 21% for the cohort overall. The characteristics of respondents are shown in Table 1.
Respondent Characteristics for Overall Sample and by Case Randomization Group
. | Overall (N = 551), . | Case 1 . | Case 2 . | Case 3 . | Case 4 . | ||||
---|---|---|---|---|---|---|---|---|---|
Genetic (n = 268) . | Nongenetic (n = 283) . | Genetic (n = 270) . | Nongenetic (n = 281) . | Genetic (n = 264) . | Nongenetic, (n = 287) . | Genetic (n = 273) . | Nongenetic, (n = 278) . | ||
Years in practice | |||||||||
In fellowship | 66 (12%) | 12% | 12% | 14% | 10% | 16% | 9% | 11% | 13% |
<5 y | 106 (19%) | 20% | 19% | 19% | 19% | 18% | 21% | 18% | 21% |
5–10 y | 88 (16%) | 17% | 15% | 15% | 17% | 16% | 16% | 17% | 15% |
11–20 y | 100 (18%) | 21% | 16% | 19% | 18% | 21% | 16% | 18% | 18% |
>20 y | 191 (35%) | 31% | 38% | 33% | 36% | 30% | 39% | 36% | 34% |
Sex | |||||||||
Female | 342 (62%) | 62% | 62% | 61% | 63% | 63% | 62% | 62% | 62% |
Male | 208 (38%) | 38% | 38% | 39% | 37% | 38% | 38% | 37% | 38% |
Other | 1 (0%) | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
Level NICU | |||||||||
Level II | 16 (3%) | 5% | 3% | 3% | 3% | 4% | 2% | 3% | 3% |
Level III | 201 (37%) | 33% | 40% | 34% | 38% | 35% | 38% | 39% | 35% |
Level IV | 334 (61%) | 63% | 58% | 63% | 58% | 61% | 60% | 59% | 62% |
. | Overall (N = 551), . | Case 1 . | Case 2 . | Case 3 . | Case 4 . | ||||
---|---|---|---|---|---|---|---|---|---|
Genetic (n = 268) . | Nongenetic (n = 283) . | Genetic (n = 270) . | Nongenetic (n = 281) . | Genetic (n = 264) . | Nongenetic, (n = 287) . | Genetic (n = 273) . | Nongenetic, (n = 278) . | ||
Years in practice | |||||||||
In fellowship | 66 (12%) | 12% | 12% | 14% | 10% | 16% | 9% | 11% | 13% |
<5 y | 106 (19%) | 20% | 19% | 19% | 19% | 18% | 21% | 18% | 21% |
5–10 y | 88 (16%) | 17% | 15% | 15% | 17% | 16% | 16% | 17% | 15% |
11–20 y | 100 (18%) | 21% | 16% | 19% | 18% | 21% | 16% | 18% | 18% |
>20 y | 191 (35%) | 31% | 38% | 33% | 36% | 30% | 39% | 36% | 34% |
Sex | |||||||||
Female | 342 (62%) | 62% | 62% | 61% | 63% | 63% | 62% | 62% | 62% |
Male | 208 (38%) | 38% | 38% | 39% | 37% | 38% | 38% | 37% | 38% |
Other | 1 (0%) | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% |
Level NICU | |||||||||
Level II | 16 (3%) | 5% | 3% | 3% | 3% | 4% | 2% | 3% | 3% |
Level III | 201 (37%) | 33% | 40% | 34% | 38% | 35% | 38% | 39% | 35% |
Level IV | 334 (61%) | 63% | 58% | 63% | 58% | 61% | 60% | 59% | 62% |
Descriptive statistics for physician characteristics and practice settings are presented for the full sample as well as for randomization group by case and are reported as number of participants and percentages.
Figure 2 displays responses for the primary outcome questions from the 4 clinical cases. In case 1, the finding of a VUS was associated with participants being less likely to recommend a tracheostomy and gastrostomy tube (P < .001; Cohen’s d: 0.55) and more likely to recommend transitioning to palliative care (P < .001; Cohen’s d: 0.46). This confirmed our hypothesis that presence of a VUS would be associated with neonatologists being less likely to recommend invasive care and more likely to recommend palliative care. Pain and suffering were ranked as more important decision-making factors for those who saw the case with the VUS (P = .01).
Primary outcome measures by case. This matrix displays means and confidence intervals for the primary outcome measures. For each case, responses are stratified by whether participants saw the genetic or nongenetic version of the case and whether they were responding to questions about palliative or invasive care. A, Case 1: VUS. B, Case 2: neurodevelopment. C, Case 3: genetic etiology. D, Case 4, Whole exome sequencing pending.
Primary outcome measures by case. This matrix displays means and confidence intervals for the primary outcome measures. For each case, responses are stratified by whether participants saw the genetic or nongenetic version of the case and whether they were responding to questions about palliative or invasive care. A, Case 1: VUS. B, Case 2: neurodevelopment. C, Case 3: genetic etiology. D, Case 4, Whole exome sequencing pending.
In case 2, an incidental finding of Williams syndrome was associated with participants being less likely to recommend central line placement (P < .001; Cohen’s d: 0.35) and more likely to recommend transitioning to palliative care (P < .001; Cohen’s d: 0.37). This confirms that including an incidental genetic finding with neurodevelopmental implications (even mild ones) would be associated with favoring palliative over invasive interventions. Here, pain and suffering (P = .01) and long-term survival (P = .03) were cited as more important factors in the recommendations of participants who saw the version with Williams syndrome.
In case 3, providers were more likely to recommend transitioning to palliative care if the patient had a genetic etiology of disease (P = .004; Cohen’s d: 0.24). There was no significant difference in recommendations about ECMO between the genetic and nongenetic groups (P = .09). This partially supported our hypothesis in that a genetic etiology of disease meant providers were more likely to recommend palliative care and less likely to recommend ECMO. Rationales for decisions did not differ significantly between groups for case 3. Secondary analysis to account for an imbalance in years in practice in case 3 had a minimal effect on the results.
In the final case, reported readiness for a goals of care meeting did not differ significantly if whole exome sequencing was pending (P = .06). Providers ranked finding a genetic etiology as the most important factor informing their recommendations to the family (mean: 2.75). MRI results were second most important in informing recommendations (mean: 2.63), followed by whether the patient could tolerate oral feeds (mean: 2.47), social work assessment (mean: 2.34), and tracheostomy candidacy (mean: 2.17).
Discussion
We used an experimental design to examine how genetic information affects neonatologists’ clinical decisions regarding case vignettes. To capture the inherent complexity and ambiguity associated with genetic test results, we chose results that either have uncertain prognostic value or are associated with a range of clinical outcomes. We studied the impact of 4 types of genetic results on care recommendations: (1) a VUS; (2) a genetic diagnosis with implications for neurodevelopment; (3) a genetic versus nongenetic etiology of disease; and (4) pending genetic testing results. Overall, the presence of genetic findings was associated with providers being less likely to recommend invasive intervention and more likely to recommend a transition to palliative care in 3 of 4 cases.
In the first case, the influence of a VUS on care is, by definition, not supported by prognostically informative evidence. Clinical geneticists agree that VUSs should not be used in medical decision-making.20 This constitutes experimental evidence that ambiguous genetic information inappropriately influences how providers reach clinical decisions. For this case, respondents cited pain and suffering as more important considerations if they saw the genetic version of the case and were thus more likely to recommend palliative care. There is no reason to believe that a VUS would be associated with increased pain and suffering. This pattern is in line with previous work documenting that physicians cite quality of life more often when recommending palliative, rather than invasive, interventions.21 It is also consistent with reported use of suffering to represent the general utility of an intervention and justify a wide variety of decisions.21,22 The reason that a VUS biased respondents to recommend palliative care may be that they suspected this genetic finding would worsen prognosis, compounded by desperation for information when faced with a difficult clinical decision.
In case 2, we selected Williams syndrome to isolate the effect of intellectual disability from survival likelihood. This diagnosis is associated with intellectual disability (usually mild) but should not affect likelihood of acute survival in the absence of structural anomalies. The altered responses in the genetic version of this case reveal that genetic testing heralding future disabilities changes care decisions. Because we did not directly evaluate whether participants thought Williams syndrome affected survival, inaccurate presumptions about survivability may have informed recommendations in addition to predicted intellectual disability. In case 2, pain and suffering as well as long-term survival were cited as a more significant factor for those considering a patient with an incidental Williams syndrome diagnosis. Current life expectancy estimates are lacking for patients with Williams syndrome, yet survival well into older adulthood in most cases makes this a questionable rationale. The effect of Williams syndrome on care recommendations may have been due to misinformation about prognosis or resulted from generalized concerns about quality of life for a patient with intellectual disability.
In case 3, a genetic etiology of disease was associated with an increased likelihood of recommending palliative care despite seeing the same direct prognostic statement in both versions of the vignette. This suggests that having a genetic diagnosis is perceived as an inherent marker of a bad outcome even if available evidence does not support a change in prognosis. This further supports the overall conclusion that genetic findings foster generalized pessimism for neonatologists facing difficult decisions.
In the final case, pending whole exome sequencing was not related to perceived readiness for a goals of care meeting. However, a genetic etiology of disease was regarded as the most important factor informing goals of care, above factors with clearer implications for management needs such as tracheostomy candidacy. This reinforces the general theme of our study: genetic information affects how neonatologists think about difficult care decisions. Effect sizes throughout our study also support this conclusion. Effect sizes are most meaningfully compared with estimates from other similar studies; however, there are no comparable data in this case. Instead, we compare our effect sizes to reference values. They are in the moderate range, at 0.3–0.5, showing that changes in genetic information have a substantial effect on neonatologists’ decisions.
Our study design, a randomized controlled psychological experiment, creates both strengths and limitations. Randomization and control enhance internal validity. In each scenario, we were able to isolate one aspect of genetic information (the independent variable) and evaluate the effect of experimentally controlled variations of that aspect on neonatologists’ care recommendations (the dependent variable), enabling us to conclude that the observed effects were due to our manipulation of aspects of the genetic information.23–25 Doing anything remotely similar to this in a direct clinical study of patients would be not only difficult but, likely, also unethical. An additional strength of our design was that we oversampled neonatologists who work at level IV NICUs, 61% of our sample. Thus, we capture a population of neonatologists who most often order genetic tests and use the results to inform clinical decisions.
Our study design also has limitations. First, we do not know the mechanism underlying the observed results. Genetic findings could affect clinicians’ thinking about care decisions (in practice or in response to this study) in at least two ways: because of the specific genetic content (which presumably is the major influence) or because a genetic finding causes information overload by adding to the overall complexity of the case. Information overload has been widely recognized to impede clear medical decision-making and may lead clinicians to perceive illness as more severe.26 Second, we cannot determine the extent to which the physicians’ responses in this experiment reflect their actions in clinical practice.12 Substantial evidence exists, however, that clinical vignettes accurately capture how physicians practice.15,16 Third, the response rate to the current study, slightly >20%, may limit the generalizability of our results. Although this response rate exceeds the typical response rate of 10% to 15% reported for the Section on Neonatal and Perinatal Medicine listserv, our findings may not generalize to neonatologists who declined to participate.27 Notably, this generalization failure would most likely occur if respondents were both more likely to participate and less likely to use genetic information appropriately, but this seems unlikely, particularly given the high percentage of respondents who work at level IV NICUs. Nonetheless, because this is the first study to look at biases in neonatologists’ decisions as they relate to genetic information, true external validity will arise only from replication of such findings in different contexts and by using different methodologies.28
If the biases demonstrated in our study extend to clinical practice, they have a number of troubling implications. Our findings suggest that ambiguous genetic information may be more than unhelpful; it may be detrimental to clear and rational decision-making by physicians. Also troubling, diagnoses portending intellectual disability may be used to redirect care. If such use occurs in practice, it sends a hurtful message about the value of disabled persons. Disability advocates have also raised concerns that using genetic findings in this way may stem from misunderstandings about what it is like to live with or care for someone with a disability.5
Conclusions
We found that, in responding to clinical vignettes, neonatologists make treatment decisions that are influenced by genetic information. As the prevalence of genetic testing expands in the NICU, recognizing biases associated with genetic diagnoses and facilitating appropriate use of genetic findings by neonatologists, in collaboration with geneticists, is paramount. We hope that these results may raise neonatologists’ awareness of their biases with regard to both genomic information and intellectual disability. Future work should evaluate the effectiveness of educating clinicians on interpretation of genetic information and creating decision-making frameworks to support neonatologists’ use of this ever-expanding and inherently complex information.
Acknowledgments
We acknowledge Erik Brandsma and Karen Puopolo, who both provided thoughtful feedback on early drafts of our clinical vignettes.
Dr Callahan conceptualized and designed the study, created the study instrument, collected and analyzed the data, drafted the initial manuscript, and reviewed and revised the manuscript; Dr Feudtner assisted in designing the study, created the study instrument, analyzed the data, and reviewed and revised the manuscript; Drs Joffe, Munson, Wild, and Skraban assisted in creating the study instrument, interpreting data, and reviewing and revising the manuscript; and all authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work.
FUNDING: Supported by T32 Training Grant No. HG009496 from the National Human Genome Research Institute (KPC). The funder and sponsor had no role in the reported work. Funded by the National Institutes of Health (NIH).
References
Competing Interests
CONFLICT OF INTEREST DISCLOSURES: Dr Joffe received research funding from Pfizer through the University of Pennsylvania until May 2020. All other authors have no conflicts of interest to disclose.
Comments