OBJECTIVES

Despite their overrepresentation, female physicians continue to have lower rates of promotion compared with male physicians. Teaching evaluations play a role in physician advancement. Few studies have investigated gender disparity in resident evaluations of pediatric faculty. We hypothesized that gender disparities in resident evaluations of faculty exist and vary across subspecialties and primary work environments.

METHODS

Pediatric faculty institution-specific evaluations completed by residents from January 1, 2015, to March 9, 2020, were obtained from a single academic center. Mean ratings of faculty performance were compared by gender using a Wilcoxon 2-sample test.

RESULTS

Fifteen-thousand one-hundred and forty-two evaluations (5091 of male faculty and 10 051 of female faculty) were included. Female faculty were rated higher in overall teaching ability (female = 4.67 versus male = 4.65; P = .004). There was no statistical difference in the mean ratings of male and female faculty in the inpatient setting, whereas outpatient female faculty were rated higher in overall teaching ability (female = 4.79 versus male = 4.73; P = .005). For general pediatric faculty, females received higher ratings for overall teaching ability (female = 4.75 versus male = 4.70; P < .001). By contrast, there was no difference in ratings of subspecialty pediatric faculty.

CONCLUSIONS

Pediatric female faculty were statistically rated higher than male faculty in overall teaching ability, although these findings may not be educationally significant. The difference was driven by evaluations in the outpatient setting and for general pediatricians. This study is one of the first in pediatrics adding to the continued investigation of gender disparities in academic medicine.

Females in academic medicine are promoted at lower rates than males and are less likely to hold the ranks of associate or full professor.1  It has been suggested that the underrepresentation of females in leadership positions is linked to their slowed promotional rate.2,3  Although pediatrics is a field in medicine where females make up the majority of physicians, gender disparities in leadership positions and promotion still exist. For example, in 2017, females within pediatrics made up 72.3% of residents, 63.3% of physicians in practice, and 57.4% of academicians, yet only accounted for 34.8% of full-time professors and 26.2% of pediatric department chairs.4  The reasons behind these differences are not fully understood and are thought to be multifactorial.

Teaching evaluations are 1 factor used to make decisions about promotion.5,6  The inherent subjective nature of evaluations allows for potential explicit or implicit biases. For example, a study of a university’s introductory-level online course showed that learners rate instructors labeled as males significantly higher regardless of the instructor’s actual gender.7  Despite greater awareness of disparities in recent years, gender bias has been shown at all levels of medical training811  and continues to impact female faculty. One study noted that medical students gave male faculty significantly higher ratings than female faculty on “overall quality of teaching” in 4 clerkships including pediatrics.12  In male-dominated surgical specialties, female faculty received lower median evaluation scores than their male counterparts.13  In another recent study across departments with relatively equal representation of male and female faculty, residents and fellows rated female faculty lower than male faculty in both overall teaching effectiveness and role modeling.14  Although there have been numerous studies evaluating gender bias in learner evaluations of faculty, there have been few studies within a female predominant field such as pediatrics.

The objective of this study was to evaluate whether gender disparities exist in the teaching evaluations of pediatric faculty completed by residents and whether these differences vary across subspecialties (general pediatrics versus subspecialty) and primary work environment (inpatient versus outpatient). We hypothesized that, despite female physicians outnumbering male physicians in many pediatrics departments, gender disparities in resident teaching evaluations of pediatric faculty exist, particularly when stereotypical gender norms contrast with physician gender. Specifically, we predict that male faculty within pediatrics would be more favorably evaluated as compared with their female counterparts.

Participants included pediatric faculty who served as teaching attending physicians on inpatient pediatrics services (general pediatric inpatient services, well newborn service, pediatric subspecialty inpatient services, PICU, NICU, pediatric emergency department [ED]), and pediatric faculty who supervised outpatient continuity clinic and subspecialty outpatient clinics at a single Midwestern academic tertiary care children’s hospital between January 1, 2015, and March 9, 2020. Inpatient teams were variably composed of medical students, interns (pediatric and/or off-service), residents (pediatric and/or off-service), fellows, and an attending. In the ED, residents worked directly with a faculty member or fellow–faculty dyad. Trainees rotated monthly and faculty rotated every half month on general pediatric services and weekly on inpatient subspecialty services, PICU, and NICU. Outpatient continuity clinics occurred at 1 of 9 university-affiliated, community-based pediatric ambulatory care clinics. Residents averaged 1 half day per week in continuity clinic, working with the same 1 or 2 faculty members over the 3 years of training. Subspecialty outpatient clinics occurred either at the main children’s hospital or at a university-affiliated, community-based pediatric subspecialty clinic. Residents averaged 20 half days per year for up to 2 years of residency.

Inpatient general pediatrics and subspecialty faculty were evaluated by residents at the end of a weeklong or half-month rotation, whereas faculty supervising primary care continuity clinics were evaluated by trainees twice annually. Outpatient subspecialty faculty were evaluated at the end of a half-month rotation. Completion of evaluations is mandatory; however, residents have the option to select out of completing an evaluation should they have “insufficient contact to evaluate.” Evaluations were distributed and collected using MedHub (Minneapolis, MN).

The faculty evaluation tool was developed internally by our residency program graduate medical education committee in 2006. The form utilizes a 5-point Likert scale to answer 17 prompts, including a global assessment of overall teaching ability (Fig 1). The faculty evaluation tool is used for all faculty evaluations across settings and pediatric divisions. Faculty evaluation scores are reviewed during each faculty’s annual review and are included in our institutional promotion packets.

FIGURE 1

Evaluation form of pediatric faculty members completed by residents.

FIGURE 1

Evaluation form of pediatric faculty members completed by residents.

Close modal

The results of trainee evaluations of faculty members were compiled by pediatric residency program administrative staff. The faculty evaluations were deidentified before analysis and only included variables of interest including date of evaluation, rotation for which the evaluation was completed, gender of the trainee who completed the evaluation (evaluator) and of the faculty member (evaluatee), and evaluatee program (general pediatrics versus subspecialty). Overall pediatric faculty demographics and academic rank were collected from a departmental database; these were not linked to individual evaluation scores.

Our primary outcomes of interest were the mean rating of the faculty’s overall teaching ability and the mean rating of each prompt on the evaluation form. Secondary outcomes included association between evaluator–evaluatee gender concordance, teaching setting (inpatient versus outpatient), and faculty specialization status (general pediatrics versus subspecialist). Mean ratings were compared by faculty gender.

Univariate statistics were used to determine frequencies and to check the distributions of evaluation ratings scores. These faculty ratings were all left skewed, with faculty receiving a larger proportion of high ratings compared with lower ratings. The Wilcoxon Mann-Whitney test was used to look for rating differences between 2 groups. Given the large number of statistical comparisons, a more restrictive P value of .01 was used to determine statistical significance to limit those differences that could have resulted from chance. All analyses were completed using SAS 9.4 software and Microsoft Excel.

The study was determined to be exempt by the University of Michigan institutional review board.

In total, 16 845 faculty teaching evaluations were completed by trainees. One-thousand seven-hundred and three evaluations were excluded (1345 [8%] had incomplete data and 358 [2%] were of nonpediatric faculty). Of the final 15 142 evaluations included, 5091 evaluations were of male faculty (33.6%) and 10 051 were of female faculty (66.4%) (Fig 2). Most evaluations were completed by female trainees (n = 11 552, 76.3%). Two-thirds of faculty were subspecialists (67.3%). There was more even distribution among subspecialist faculty (female = 57.8% versus male = 42.4%) compared with general pediatrics, which was predominantly female (female = 86.6% versus male = 13.4%). A higher percentage of male compared with female faculty held higher academic ranks: 26.3% of males held the rank of associate professor versus only 15.3% of females, and 23.2% of male faculty held the rank of professor versus only 11.4% of female faculty (Table 1).

FIGURE 2

Evaluations of pediatrics faculty inclusion and exclusion criteria flowchart.

FIGURE 2

Evaluations of pediatrics faculty inclusion and exclusion criteria flowchart.

Close modal
TABLE 1

Faculty (Evaluatee) and Resident (Evaluator) Characteristics

Total n (%)Female n (%)Male n (%)P
Faculty 301 (N/A) 202 (67.1%) 99 (32.9%) <.001a 
 Rank    <.001b 
  Professor 46 (15.3%) 23 (11.4%) 23 (23.2%)  
  Associate professor 57 (18.9%) 31 (15.3%) 26 (26.3%)  
  Assistant professor 142 (47.2%) 106 (52.5%) 36 (36.4%)  
  Instructor 56 (18.6%) 42 (20.8%) 14 (14.1%)  
 Specialty    <.001b 
  General pediatrics 97 (32.2%) 84 (86.6%) 13 (13.4%)  
  Subspecialist 204 (67.3%) 118 (57.8%) 86 (42.4%)  
Resident 230 170 (73.9%) 60 (26.1%) <.001a 
 Specialty    .56b 
  Categorical pediatrics 149 112 (75.2%) 37 (24.8%)  
  Noncategorical 81 58 (71.6%) 23 (28.4%)  
Total n (%)Female n (%)Male n (%)P
Faculty 301 (N/A) 202 (67.1%) 99 (32.9%) <.001a 
 Rank    <.001b 
  Professor 46 (15.3%) 23 (11.4%) 23 (23.2%)  
  Associate professor 57 (18.9%) 31 (15.3%) 26 (26.3%)  
  Assistant professor 142 (47.2%) 106 (52.5%) 36 (36.4%)  
  Instructor 56 (18.6%) 42 (20.8%) 14 (14.1%)  
 Specialty    <.001b 
  General pediatrics 97 (32.2%) 84 (86.6%) 13 (13.4%)  
  Subspecialist 204 (67.3%) 118 (57.8%) 86 (42.4%)  
Resident 230 170 (73.9%) 60 (26.1%) <.001a 
 Specialty    .56b 
  Categorical pediatrics 149 112 (75.2%) 37 (24.8%)  
  Noncategorical 81 58 (71.6%) 23 (28.4%)  

N/A, not applicable.

a

Z score.

b

χ2.

Faculty of both genders were rated among trainees as having excellent clinical performance and teaching ability. Female faculty were rated as having higher overall teaching ability compared with their male colleagues (female = 4.67 versus male = 4.65; P = .004) and were rated significantly higher than male faculty in 11 of the remaining 16 question prompts (Fig 3). When examining the role of gender concordance between the evaluator and evaluatee, we found no statistically significant differences for any of the evaluation tool prompts (Supplemental Table 2).

FIGURE 3

Mean male and female pediatric faculty evaluations by residents combined inpatient and outpatient setting and general pediatrics and subspecialty. All skills are evaluated on a 5-point Likert scale (1 = strongly disagree, 2 = disagree, 3 = uncertain, 4 = agree, 5 = strongly agree). * P < .01.

FIGURE 3

Mean male and female pediatric faculty evaluations by residents combined inpatient and outpatient setting and general pediatrics and subspecialty. All skills are evaluated on a 5-point Likert scale (1 = strongly disagree, 2 = disagree, 3 = uncertain, 4 = agree, 5 = strongly agree). * P < .01.

Close modal

A total of 11 444 evaluations were completed in the inpatient setting (63.5% female versus 36.5% male faculty evaluated) and 3698 were completed in the outpatient setting (75.4% female versus 24.6% male faculty evaluated). In the inpatient setting, there was no statistical difference in overall teaching performance or in the mean ratings of male and female faculty for any of the evaluation tool prompts. In the outpatient setting, female faculty were rated as having higher overall teaching ability compared with their male colleagues (female = 4.79 versus male = 4.73; P = .005). They were also rated significantly higher than male faculty in 3 of the remaining 16 question prompts (Fig 4 and Supplemental Table 3).

FIGURE 4

Mean outpatient pediatric male and female faculty evaluations by residents. All skills are evaluated on a 5-point Likert scale (1 = strongly disagree, 2 = disagree, 3 = uncertain, 4 = agree, 5 = strongly agree). * P < .01.

FIGURE 4

Mean outpatient pediatric male and female faculty evaluations by residents. All skills are evaluated on a 5-point Likert scale (1 = strongly disagree, 2 = disagree, 3 = uncertain, 4 = agree, 5 = strongly agree). * P < .01.

Close modal

A total of 5281 evaluations were completed of general pediatrics faculty (72% female versus 28% male faculty evaluated), which includes faculty who attended on the inpatient general pediatrics services and outpatient continuity clinic. A total of 10 409 evaluations were completed of subspecialty pediatrics faculty (63% female versus 37% male faculty evaluated), which includes faculty who attended on inpatient subspecialty pediatric services, PICU, NICU, ED, and outpatient subspecialty pediatric clinics. For general pediatric faculty, females received higher ratings than male faculty for overall teaching ability (female = 4.75 versus male = 4.70; P < .0001) and for 15 of the remaining 16 questions on the evaluation tool (Fig 5). By contrast, for subspecialty faculty, there was no statistically significant difference by gender in overall teaching ability or for any of the evaluation tool prompts (Supplemental Table 4).

FIGURE 5

Mean general pediatrics male and female pediatric faculty evaluations by residents. All skills are evaluated on a 5-point Likert scale (1 = strongly disagree, 2 = disagree, 3 = uncertain, 4 = agree, 5 = strongly agree). * P < .01.

FIGURE 5

Mean general pediatrics male and female pediatric faculty evaluations by residents. All skills are evaluated on a 5-point Likert scale (1 = strongly disagree, 2 = disagree, 3 = uncertain, 4 = agree, 5 = strongly agree). * P < .01.

Close modal

This study adds to the growing body of literature looking at gender disparities in academic medicine, specifically in the female-dominant field of pediatrics. Our study found that Pediatric faculty were rated highly with similar ratings by gender. Unexpectedly, female faculty were rated slightly higher than male faculty in overall teaching ability, as well as in a majority of prompts on our evaluation tool. Although the statistically significant differences in evaluation scores were small in this large sample and may not be educationally significant, over time, statistically small effects related to bias can compound and have real-life downstream effects.15,16 

Our findings contrast with previous studies in male-dominant fields of internal medicine and surgery where male faculty were ranked higher in overall teaching evaluations.17,18  One possible explanation for our contrasting results is rooted in gender-based stereotypes. Social role theory is the idea that different descriptions of males and females arise from historical distribution of roles within society.19  For example, females may be more associated with caregiver roles and “communal” traits (ie, compassion, empathy) versus males associated with leadership roles and “agentic” traits (ie, decisiveness, assertiveness).19,20  Research supports that known historical cultural stereotypes disadvantage women in agentic domains, such as medicine and leadership.2123  Additionally, gender-biased language continues to be found in faculty evaluations of trainees, trainee evaluations of faculty, and faculty letters of recommendation.2427  These findings are exacerbated in male-dominated and procedurally focused specialties,28  where female physicians face misalignment between their professional role and preconceived gender-based normative behaviors.13,29 

Our study in pediatrics differs from other studies because of the high representation of female faculty. Representation of females in a field may impact the extent of gender bias, and pediatrics’ high proportion of females may mitigate societal stereotypes explaining our findings of female faculty being scored slightly higher than male faculty. A previous study found that, across all specialties, gender bias was attenuated when there was a higher proportion of females in a field.30  Our findings also align with a recent study in pediatrics that found no difference in the odds of female faculty receiving a score in the top quartile compared with male faculty.31  These findings have been replicated in other female-dominant health fields such as nursing, where gender bias favoring female over male nursing students have been found.32 

Given that pediatrics is a specialty that cares for children and societal expectations of those who work with children to be warm and nurturing, there is potential alignment of expected sex- and role-expectations of pediatricians. As such, trainees may perceive communal traits to align with role expectations of a pediatrician, leading to higher evaluations seen in our female faculty. This is consistent with studies finding pediatric faculty are more likely to describe female residents with communal terms,33  as well as patients identifying communal traits in female physicians.34  Future investigation, such as qualitative assessment, is needed to explore this possible explanation further. Although our study was female predominant for both faculty and residents, gender congruence or discordance in the evaluator–evaluatee relationship was not found to statistically impact total teaching score or individual evaluation prompts. Other studies have found mixed results in terms of impact of gender dyad and evaluations in male predominant fields.35,36 

Differences were found between teaching setting (inpatient versus outpatient) and role (general versus subspecialty faculty). Female faculty practicing in the outpatient setting and in general pediatrics had slightly higher educator ratings, but no significant difference was seen in the inpatient setting and for subspecialty faculty. Interestingly, the slightly higher evaluation of female faculty overall appears to be driven by outpatient and general pediatric faculty evaluations despite only accounting for one-fourth and one-third, respectively, of total evaluations. In both the outpatient setting and general pediatrics, there were significantly more evaluations for female faculty compared with the inpatient setting and subspecialty pediatrics (Table 1). It is known that the percentage of males and females in a field influences the perception of whether gender-based stereotypes are required for success in a particular occupation.37  Given this, it is possible that the differences in gender representation may have influenced our findings to both increase resident perception of teaching in predominantly female work areas and level ratings in subspecialty areas.

Another explanation for the leveling of evaluations in the inpatient setting and for subspecialty faculty lies in social role theory. We hypothesize that the discordance between expected gender roles for female physicians and the clinical requirements of the inpatient setting, where assertiveness, decisiveness, and urgency are necessary, and of many pediatric subspecialties that remain procedurally heavy, may contradict gender-based expectations of female faculty, leading to male and female faculty being rated similarly. Previous studies have described female faculty intentionally working to navigate the more “masculine” expectations of inpatient medicine,38  and showed general internal medicine female faculty were rated lower by trainees in the inpatient setting.17  Conversely, congruence between preconceived gender stereotypes and role expectations in the outpatient setting where communal behaviors, such as compassion and communication, are favored is a possible explanation for female faculty being rated slightly higher in this clinical setting. It is also possible that male faculty in the outpatient setting face incongruence between expected gender norms and role expectations, leading to lower evaluations in this setting.

It is not known if those practicing in general pediatrics or in the outpatient setting had more experience teaching trainees. Similarly, given fellowship training would not be required, it is possible those practicing in general pediatrics were more junior faculty, near peers, or able to relate to trainees in a way subspecialists were not. In addition to being early in career and more time spent teaching compared with conducting research, an area subspecialists may be more engaged in, could impact resident evaluations of faculty.39  The teaching environment and time available for teaching also vary in the inpatient and outpatient setting. Each of these unmeasured factors may have influenced our findings.

Overall, these findings highlight the complex relationship between gender stereotypes and the impact of practice setting and subspecialty field on faculty teaching evaluations. Although evaluation prompts included in the institutional evaluation tool provide some information on factors residents considered when giving numeric scores to faculty members, more work is needed to understand reasons behind residents’ evaluations. For example, although the same overall trend by gender was seen for outpatient and general pediatricians, they differed significantly in responses to the 16 other evaluation questions (Figs 4 and 5). Further work, such as qualitative analysis of narrative evaluations, is needed to better understand how factors such as gender impact resident evaluations of faculty.

Despite female faculty at our institution being rated similarly to males on faculty evaluations, there continue to be lower percentages of female faculty members at the ranks of professor and associate professor, suggesting other factors contribute to the “leaky pipeline.”4  Future studies should seek to evaluate other factors impacting faculty promotion and advancement leading to the disparate advancement of females, like academic productivity, leadership roles, and delegation.

Our study had several limitations. First, this is the experience of a single institution with an internally created faculty evaluation tool, and thus may not be generalizable to other pediatric programs or other specialties. Evaluations at baseline in medical education are subjective and are dependent on many factors that were not measured. Overall, there is a lack of validity and reliability in evaluation tools used for faculty evaluations. Although a single evaluation tool across all faculty decreases variability in questions, some questions may be more or less applicable in different settings or interpreted differently on the basis of the clinical context. In addition, there was no rigorous evaluation of the language of the evaluation form, which could be playing a role in these gender differences. Although the sample size from our single site is quite large, this is also a limitation given that it increases the likelihood that small differences are statistically significant without true meaningful educational differences. Similarly, overall, the majority of faculty received evaluation scores on the high end, making it difficult to determine what is a meaningful difference in teaching ability. Additionally, many other variables that may impact a resident’s evaluation of a faculty member, such as length of clinical relationship and crossover between inpatient and outpatient exposures, were not controlled for. Similarly, we do not know how many faculty members were evaluated in both the inpatient and outpatient setting, or if a resident evaluated a faculty member more than once. Lastly, our study examined only quantitative and not qualitative feedback, so information is limited regarding why a trainee gave a particular response.

In conclusion, our findings show that, in the female-predominant field of pediatrics, at a single institution in the Midwest United States, female faculty are rated slightly higher by trainees than male faculty overall, in the outpatient setting, and for faculty practicing general pediatrics, although it remains unknown if these small findings are educationally significant. This work illustrates the influence of gender balance (and gender predominance in a field), practice setting, and subspecialty on faculty teaching evaluations. Culture-based gender stereotypes could potentially affect faculty evaluations; however, the role of faculty evaluations in promotion and advancement should be investigated further. Future work should continue to explore the extent of these findings across other institutions and specialties, and use qualitative methodology to better understand the potential reasons behind these findings, as well as develop evidence-based methods for mitigating gender bias.:

This study was approved with exemption by the University of Michigan institutional review board, and was presented as an abstract at the 2022 annual Pediatric Hospital Medicine Conference in Orlando, Florida.

Dr Shaw conceptualized and designed the study, supervised data collection, drafted the initial manuscript, and reviewed and revised the manuscript; Dr Rappaport conceptualized and designed the study, collected data, and critically reviewed the manuscript for important intellectual content; Ms Sturza conducted the initial analyses, and reviewed and revised the manuscript; Drs Foo, Blumer, and Jacobson conceptualized and designed the study, supervised data collection, and critically reviewed the manuscript for important intellectual content; Drs Hartley, Lukela, and Sheffield conceptualized and designed the study and critically reviewed the manuscript for important intellectual content; and all authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work.

FUNDING: Statistical support for this project was provided by the Charles Woodson Fund for Clinical Research.

CONFLICT OF INTEREST DISCLOSURES: The authors have indicated they have no conflicts of interest relevant to this article to disclose.

1
Nocco
SE
,
Larson
AR
.
Promotion of women physicians in academic medicine
.
J Womens Health 2002
.
2021
;
30
(
6
):
864
871
2
Murphy
M
,
Callander
JK
,
Dohan
D
,
Grandis
JR
.
Women’s experiences of promotion and tenure in academic medicine and potential implications for gender disparities in career advancement: a qualitative analysis
.
JAMA Netw Open
.
2021
;
4
(
9
):
e2125843
3
Hartmark-Hill
J
,
Hale
T
,
Gallitano
A
.
Women physicians and promotion in academic medicine
.
N Engl J Med
.
2021
;
384
(
7
):
680
4
Spector
ND
,
Asante
PA
,
Marcelin
JR
, et al
.
Women in pediatrics: progress, barriers, and opportunities for equity, diversity, and inclusion
.
Pediatrics
.
2019
;
144
(
5
):
e20192149
5
Atasoylu
AA
,
Wright
SM
,
Beasley
BW
, et al
.
Promotion criteria for clinician-educators
.
J Gen Intern Med
.
2003
;
18
(
9
):
711
716
6
Beasley
BW
,
Wright
SM
,
Cofrancesco
J
Jr
,
Babbott
SF
,
Thomas
PA
,
Bass
EB
.
Promotion criteria for clinician-educators in the United States and Canada. A survey of promotion committee chairpersons
.
JAMA
.
1997
;
278
(
9
):
723
728
7
MacNell
L
,
Driscoll
A
,
Hunt
AN
.
What’s in a name: exposing gender bias in student ratings of teaching
.
Innovative High Educ
.
2015
;
40
(
4
):
291
303
8
Mitchell
KMW
,
Martin
J
.
Gender bias in student evaluations
.
PS Polit Sci Polit
.
2018
;
51
(
03
):
648
652
9
Sprague
J
,
Massoni
K
.
Student evaluations and gendered expectations: what we can’t count can hurt us
.
Sex Roles
.
2005
;
53
(
11-12
):
779
793
10
Klein
R
,
Julian
KA
,
Snyder
ED
, et al
.
Gender Equity in Medicine workgroup
.
Gender bias in resident assessment in graduate medical education: review of the literature
.
J Gen Intern Med
.
2019
;
34
(
5
):
712
719
11
Klein
R
,
Ufere
NN
,
Rao
SR
, et al
.
Gender Equity in Medicine workgroup
.
Association of gender with learner assessment in graduate medical education
.
JAMA Netw Open
.
2020
;
3
(
7
):
e2010888
12
Morgan
HK
,
Purkiss
JA
,
Porter
AC
, et al
.
Student evaluation of faculty physicians: gender differences in teaching evaluations
.
J Womens Health 2002
.
2016
;
25
(
5
):
453
456
13
Fassiotto
M
,
Li
J
,
Maldonado
Y
,
Kothary
N
.
Female surgeons as counter stereotype: the impact of gender perceptions on trainee evaluations of physician faculty
.
J Surg Educ
.
2018
;
75
(
5
):
1140
1148
14
Cullen
MJ
,
Zhou
Y
,
Sackett
PR
,
Mustapha
T
,
Hane
J
,
Culican
SM
.
Differences in trainee evaluations of faculty by rater and ratee gender. [Published online ahead of print April 26, 2023]
Acad Med
.
2023
.
15
Heilman
ME
.
Description and prescription: how gender stereotypes prevent women’s ascent up the organizational ladder
.
J Soc Issues
.
2001
;
57
(
4
):
657
674
16
Greenwald
AG
,
Banaji
MR
,
Nosek
BA
.
Statistically small effects of the Implicit Association Test can have societally large effects
.
J Pers Soc Psychol
.
2015
;
108
(
4
):
553
561
17
Sheffield
V
,
Hartley
S
,
Stansfield
RB
, et al
.
Gendered expectations: the impact of gender, evaluation language, and clinical setting on resident trainee assessment of faculty performance
.
J Gen Intern Med
.
2022
;
37
(
4
):
714
722
18
Angelo
JL
,
Moazzez
A
,
Neville
A
,
Dauphine
C
,
Lona
Y
,
de Virgilio
C
.
Investigating gender differences in faculty evaluations by trainees in a gender-balanced general surgery program
.
J Surg Educ
.
2019
;
76
(
6
):
e132
e137
19
Eagly
AH
,
Wood
W
.
Social role theory of sex differences
. In:
Wong
A
,
Wickramasinghe
M
,
Hoogland
RC
,
Naples
NA
, eds.
The Wiley Blackwell Encyclopedia of Gender and Sexuality Studies
.
John Wiley & Sons, Ltd
;
2016
:
1
3
20
Templeton
K
.
Impact of gender on teaching evaluations of faculty: another example of unconscious bias?
J Womens Health 2002
.
2016
;
25
(
5
):
420
421
21
Carnes
M
,
Bland
C
.
Viewpoint: a challenge to academic health centers and the national institutes of health to prevent unintended gender bias in the selection of clinical and translational science award leaders
.
Acad Med
.
2007
;
82
(
2
):
202
206
22
Moss-Racusin
CA
,
Dovidio
JF
,
Brescoll
VL
,
Graham
MJ
,
Handelsman
J
.
Science faculty’s subtle gender biases favor male students
.
Proc Natl Acad Sci USA
.
2012
;
109
(
41
):
16474
16479
23
Kaatz
A
,
Gutierrez
B
,
Carnes
M
.
Threats to objectivity in peer review: the case of gender
.
Trends Pharmacol Sci
.
2014
;
35
(
8
):
371
373
24
Salles
A
,
Awad
M
,
Goldin
L
, et al
.
Estimating implicit and explicit gender bias among health care professionals and surgeons
.
JAMA Netw Open
.
2019
;
2
(
7
):
e196545
25
Siegelman
JN
,
Lall
M
,
Lee
L
,
Moran
TP
,
Wallenstein
J
,
Shah
B
.
Gender bias in simulation-based assessments of emergency medicine residents
.
J Grad Med Educ
.
2018
;
10
(
4
):
411
415
26
Trix
F
,
Psenka
C
.
Exploring the color of glass: letters of recommendation for female and male medical faculty
.
Discourse Soc
.
2003
;
14
(
2
):
191
220
27
Webber
S
,
Nackers
K
,
Kelly
MM
, et al
.
University of Wisconsin-Madison Collaborative for Research and Education Accelerating Team Empowerment (CREATE)
.
Gender-based linguistic analysis of pediatric clinical faculty evaluations
.
Acad Pediatr
.
2022
;
22
(
2
):
324
331
28
Salles
A
,
Mueller
CM
,
Cohen
GL
.
Exploring the relationship between stereotype perception and residents’ well-being
.
J Am Coll Surg
.
2016
;
222
(
1
):
52
58
29
Carnes
M
,
Bartels
CM
,
Kaatz
A
,
Kolehmainen
C
.
Why is John more likely to become department chair than Jennifer?
Trans Am Clin Climatol Assoc
.
2015
;
126
:
197
214
30
Fassiotto
M
,
Hamel
EO
,
Ku
M
, et al
.
Women in academic medicine: measuring stereotype threat among junior faculty
.
J Womens Health 2002
.
2016
;
25
(
3
):
292
298
31
Tiedt
K
,
Webber
S
,
Babal
J
, et al
.
Gender difference in teaching evaluation scores of pediatric faculty
.
Acad Pediatr
.
2023
;
23
(
3
):
564
568
32
Kiekkas
P
,
Igoumenidis
M
,
Stefanopoulos
N
,
Bakalis
N
,
Kefaliakos
A
,
Aretha
D
.
Gender bias favors female nursing students in the written examination evaluation: crossover study
.
Nurse Educ Today
.
2016
;
45
:
57
62
33
Gold
JM
,
Yemane
L
,
Keppler
H
,
Balasubramanian
V
,
Rassbach
CE
.
Words matter: examining gender differences in the language used to evaluate pediatrics residents
.
Acad Pediatr
.
2022
;
22
(
4
):
698
704
34
Chen
H
,
Pierson
E
,
Schmer-Galunder
S
, et al
.
Gender differences in patient perceptions of physicians’ communal traits and the impact on physician evaluations
.
J Womens Health (Larchmt)
.
2021
;
30
(
4
):
551
556
35
Thackeray
EW
,
Halvorsen
AJ
,
Ficalora
RD
,
Engstler
GJ
,
McDonald
FS
,
Oxentenko
AS
.
The effects of gender and age on evaluation of trainees and faculty in gastroenterology
.
Am J Gastroenterol
.
2012
;
107
(
11
):
1610
1614
36
Heath
JK
,
Clancy
CB
,
Carillo-Perez
A
,
Dine
CJ
.
Assessment of gender-based qualitative differences within trainee evaluations of faculty
.
Ann Am Thorac Soc
.
2020
;
17
(
5
):
621
626
37
Cejka
MA
,
Eagly
AH
.
Gender-stereotypic images of occupations correspond to the sex segregation of employment
.
Pers Soc Psychol Bull
.
1999
;
25
(
4
):
413
423
38
Houchens
N
,
Quinn
M
,
Harrod
M
,
Cronin
DT
,
Hartley
S
,
Saint
S
.
Strategies of female teaching attending physicians to navigate gender-based challenges: an exploratory qualitative study
.
J Hosp Med
.
2020
;
15
(
8
):
454
460
39
Arah
OA
,
Heineman
MJ
,
Lombarts
KMJMH
.
Factors influencing residents’ evaluations of clinical faculty member teaching qualities and role model status
.
Med Educ
.
2012
;
46
(
4
):
381
389

Supplementary data