Skip to main content

Gender gaps in grades versus grade penalties: why grade anomalies may be more detrimental for women aspiring for careers in biological sciences

Abstract

Background

In the US, bioscience programs now often have more women than men who earn degrees at all levels. Despite this, evidence still exists for gender inequity in bioscience and medical fields. For example, women with careers in these fields tend to get paid less and leave these fields more. Here, we present research investigating grade penalties. Grade penalties describe the difference between a students’ grade point average and grade in a particular course when a students’ grade in a class is lower than their grade point average (GPA). For example, a student has a grade penalty of 1 in a course if their GPA is 3.0 (B) and they receive a 2.0 (C) in the course. We hypothesize that grade penalties in these STEM courses can affect students’ STEM-related self-concept negatively with potentially long-term consequences, e.g., in students’ career choices. We investigated grade penalties in science courses for undergraduate students studying biosciences. We propose a framework that posits that students who receive grade penalties in a course may view themselves as less capable in that domain and that gender differences in grade penalties may be a useful measure of gender inequity in the classroom.

Results

In this study, we examined grade penalties of 2445 students majoring in biological science or closely related fields across many commonly taken courses, many of which were mandatory. We find that on average, students received grade penalties in the 12 most commonly taken science courses for biosciences students at our institution. We also find that women had more extreme grade penalties than men in seven of the 12 science classes we investigated.

Conclusions

The pattern of grade penalties found in this study across a range of courses may result in long-term consequences for students’ STEM self-concept. Furthermore, women’s decisions to pursue STEM careers may be affected more by the grade penalty received in required science courses than men’s at least partly, because their grade penalties are often larger. Finally, the grade penalty measure can be easily computed by institutions concerned with equity.

Introduction

Research on gendered performance and persistence differences in science, technology, engineering, and mathematics (STEM) fields is important and has been conducted in the fields in which women are the least likely to earn degrees, such as physics or engineering (Cavallo et al., 2004; Gonzalez & Kuenzi, 2012; Henderson et al., 2020; Maries et al., 2022; Marshman et al., 2018; Nissen, 2019; Sawtelle et al., 2012; Van Dusen & Nissen, 2020; Whitcomb & Singh, 2020; Whitcomb et al., 2020). However, less research has been conducted on gender differences in fields in which women are not currently underrepresented, such as biology (Aguillon et al., 2020; Ballen, et al., 2017a, b; Haak et al., 2011). Concerns about gender equity in STEM education are not limited to considerations of the percentage of women in a STEM field, but includes the experiences of female students when they do participate (Cwik & Singh, 2022; Eaton et al., 2020; Good et al., 2012; Raelin et al., 2014). For example, past research has raised concerns about equity in the fields of biological science and medicine due to gender differences in classroom experiences (Ballen, et al., 2017a; Elliott, 2016; Grunspan et al., 2016), authorship over all career stages (Holman et al., 2018; Huang et al., 2020), and likelihood to stay in the field (Elliott, 2016; Maries et al., 2022; Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019). One particularly concerning trend for educators relates to this last point: women studying STEM, including bioscience majors, often leave their disciplines with higher grades than the men who remain in the program (Maries et al., 2022; Seymour & Hewitt, 1997; Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019). In this regard, men and women in the same programs have different outcomes, and these outcomes are not explained by grade differences.

In this research, we introduce a potential measure of gender inequities in the classroom: grade anomalies (Koester et al., 2016; Malespina & Singh, 2022a, b; Matz et al., 2017; Witteveen & Attewell, 2020). A grade anomaly (GA) is the difference between a student’s grade point average (GPA) and their grade in a specific course. We investigated grade penalties, which describe grade anomalies in which a students’ grade in a specific course is lower than their GPA. In this study, with equity and inclusion at the center, we investigate the following research questions regarding grade anomalies in the backdrop of gender inequities in biological sciences despite women outnumbering men.

  1. RQ1.

    For which of their courses do students majoring in biological sciences receive a “grade penalty” and for which courses do they receive a “grade bonus”?

  2. RQ2.

    To what extent do men and women have different grade anomalies in their STEM courses?

  3. RQ3.

    To what extent do gender differences in grade anomalies follow the same trends as gender differences in grades?

Theoretical background

Gender and biological sciences

Undergraduate students pursuing biological science and related majors often go on to pursue either graduate programs in biological sciences or medical school in the future. We discuss an investigation of the grade penalties for both groups of students here. Women earn undergraduate and graduate biosciences degrees at higher rates than men (Bachelor’s Degrees in Physics & STEM Earned by Women, 2020). Moreover, because biosciences degree recipients make up over half of medical school applicants and matriculants in the United States (56% for the 2021–2022 academic year), there are more women from 2018 to 2021 who entered medical school than men (2021 FACTS: Applicants and Matriculants Data, 2021). However, gender equity is not the same as having an equal number of women present in a field. Researchers have investigated many ways in which women who pursue and earn degrees in biological science and related fields do not receive equitable treatment (for example, during careers in biological sciences or medicine or in biology classrooms).

After graduation, women who earn biological sciences degrees are less likely than men to work as scientists after receiving graduate degrees (Elliott, 2016). In addition, one recent study (Huang et al., 2020) found that in biological sciences, women tend to have shorter publishing careers (due to leaving the field) than men, and during their careers, men had higher annual publishing rates than women. The annual publishing rate for biological sciences had a larger gender difference favoring men than any other domain studied, including those in which women are underrepresented at all levels, such as physics and computer science (Huang et al., 2020). Researchers have estimated that biological science authorship in journal articles will not reach gender parity for another 25 years (Holman et al., 2018). In addition, despite their lower publishing rates, women in biological sciences also report feeling more stress arising from the pressure to publish than men do (Husemann et al., 2017). Thus, despite the representation of women in biological sciences, these trends hint at the inequitable culture in sciences that affect the professional advancement of these women. Gender inequities are not just experienced by women who pursue advanced degrees in biology, but those who pursue medical careers as well.

There are gender disparities in compensation and time to promotion for all academic medical specialties (Dandar & Lautenberger, 2021) as well as for physicians (Ly et al., 2016).

While they are pursuing their degrees, undergraduate women are less likely to participate in class discussions in biology classrooms (Eddy et al., 2014), are less likely to be viewed as knowledgeable by their peers (Grunspan et al., 2016), and tend to have lower exam grades than men in their introductory biology classes (Ballen, et al., 2017a). In addition, when women leave STEM disciplines, they often do so with higher grades than the men who remain in the program (Maries et al., 2022; Seymour & Hewitt, 1997; Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019).

Gender differences in outcomes cannot be explained by a lack of representation in the classroom for students pursuing biological science and health-related majors, because women make up more than half of these populations. Some researchers offer potential explanations such as societal stereotypes and biases about who can excel in these disciplines (Bian et al., 2017; Bleeker & Jacobs, 2004; Eaton et al., 2020; Ganley et al., 2018; Leslie et al., 2015) and gender discrimination in hiring (Moss-Racusin et al., 2012). Others posit that differences in STEM motivational beliefs may explain these differences. One such motivational belief is self-efficacy. Self-efficacy is a student’s belief in their ability to succeed in a domain, and often predicts learning outcomes (Bandura, 1997; Cavallo et al., 2004; Marshman et al., 2018; Nissen & Shemwell, 2016; Pintrich & De Groot, 1990; Raelin et al., 2014; Sawtelle et al., 2012; Van Dusen & Nissen, 2020). Another such belief is academic self-concept, which describes a long-term expectation of success that students hold regarding their academic abilities that depends on outside feedback, such as grades (Eccles & Wigfield, 2020; Gniewosz et al., 2015; Spence, 1983). Low academic self-concept from negative grade feedback may lead to lower future achievement and persistence, because it discourages student engagement in a domain (Spence, 1983). Gender differences in either self-efficacy or academic self-concept may account for some differences in outcomes, because women may opt out of majors or careers if they do not believe they can excel.

Broadly, researchers have proposed many partial explanations of gender differences in classroom experiences and professional trajectories of students pursuing biological sciences. Some of these, such as cultural biases, are not limited to our classrooms, but may also exist within them (Bian et al., 2017; Bleeker & Jacobs, 2004; Eaton et al., 2020; Ganley et al., 2018; Leslie et al., 2015). Other partial explanations, such as motivational beliefs about learning, may be primarily relevant in the classroom (Cavallo et al., 2004; Marshman et al., 2018; Nissen & Shemwell, 2016; Pintrich & De Groot, 1990; Raelin et al., 2014; Sawtelle et al., 2012; Van Dusen & Nissen, 2020). Regardless, to improve outcomes for women, instructors should aim to eliminate these inequities by taking steps to create an inclusive classroom culture and equitable learning environment.

Grade anomalies

One aspect of creating such a learning environment may be to reassure students that they can excel in any domain in which they choose to major. For example, students often have a fairly fixed view of what “kind” of student they are, e.g., students may endorse the idea that ``If I get As, I must be an A kind of person. If I get a C, I am a C kind of person” (Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019). A GA, which is the difference between a student’s GPA and their grade in a specific course, may challenge or reinforce students’ ideas about what kind of student they are, and if they are capable of succeeding in their chosen domain. Many students who leave STEM explicitly cite lower grades than they are used to as a reason for doing so (Seymour & Hewitt, 1997; Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019). While students' raw grades are a useful measure, because they allow for direct comparison between students and because they are used by institutions to award scholarships and track student academic standing, we propose that using grade penalty in addition to raw grades gives researchers and instructors more insight into students' view of what “kind” of student they are.

Matz et al. (2017) conducted the first study we know of that focuses on average grade anomaly (AGA). A GA is the difference between a students’ grade point average and grade in a particular course, and AGA is the mean grade anomaly for students who took a course. They used AGA, because it was perceived to be a better measure of how students view their comparative performance than their raw grades across different courses. They found that, at their institution, grade penalties were greater for STEM than non-STEM courses. A student who receives lower grades in their science courses than their humanities courses may take this as a sign that they are not capable of excelling in the sciences, even if the grades they earn are high enough for them to continue in their major (Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019). Furthermore, within STEM courses, grade penalties were smaller for men than women. In particular, they found that physics courses had the largest grade penalty and largest gender difference in AGA. The Matz et al. (2017) study had similar findings but with a larger student sample across multiple institutions. Across five universities, STEM courses had larger grade penalties and larger gender differences in AGA that usually favored men.

Past work provides evidence that grade penalties are more common and extreme in STEM disciplines than in humanities or social science departments (Koester et al., 2016; Matz et al., 2017; Rask, 2010; Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019; Tomkin & West, 2022), and women tend to have larger grade penalties than men in STEM subjects, especially in courses students take early in their time at university (Malespina & Singh, 2022a, b; Matz et al., 2017). Here, we present an investigation that focuses on GA in various courses for biological sciences majors in which we analyze data to study if these trends hold in a more homogeneous population of largely pre-health and pre-medical students at a single large university in the US, rather than combining students across institutions and many majors as in prior studies.

Materials and methods

Participants

Participants in this study were enrolled in biosciences majors or students with interest in health professions at a large research university in the US, which is a large, public, and urban institution. The student major breakdown was as follows: 37% Biological Sciences, 1% Bioinformatics/Computational Biology, 3% Ecology and Evolution, 6% Microbiology, 6% Molecular Biology, 30% Neuroscience, 4% Pharmacy, and 13% Rehabilitation Science. All major except for Neuroscience, Pharmacy, and Rehabilitation Science are offered through the Department of Biological Sciences. These students were chosen because of their similar course requirements, especially for large introductory science courses.

Grade data collected over 13 years. and we excluded courses that were taken during the summer semester. We excluded summer courses, because they are not a typical representation of courses at our institution. For example, many summer students do not primarily attend this institution, but are local students visiting home for the summer. In addition, summer courses are typically taught by graduate students, the class sizes are an order of magnitude smaller than those in the Fall and Spring semesters, many students work full time while taking an introductory course which is not as common during the Fall and Spring semesters. In addition, the class sizes are an order of magnitude smaller than those in the Fall and Spring semesters. In total, we had 2445 students who took a total of 89,560 courses. Of these courses, 33,543 were instances of courses of interest (for example, Biochemistry or Physics 1) and 56,017 were instances of other courses (for example, Medical Sociology). The student sample was 58.1% women and 41.9% men. Less than 0.1% did not list their gender, so they were excluded from the study due to small sample size. Students identified with the following races/ethnicities: 68% White, 17% Asian, 3% Hispanic/Latinx, 3% multiracial, 7% African American/Black, and 1% unknown or unspecified. This research was carried out in accordance with the principles outlined in the institution’s Institutional Review Board (IRB) ethical policy, and de-identified demographic data were provided through university records.

Course selection

We chose to study courses that were taken by the largest number of students, excluding non-major electives (for example, "Introduction to Piano" or "Public Speaking") and courses that make up general education requirements. Thus, many courses were mandatory for students in the majors we focus on. The courses we chose are listed in Table 1, along with information about the year in which the students typically take the course. Not all courses were required for students in all the majors in our sample. Information about if a course was required, optional (i.e., an elective that count toward the major), or not required is included in Table 2. We would like to note that, though it is not required for most majors studied, Human Physiology met our criteria, because it is a commonly chosen elective for both Biology and Rehabilitation Science Students. In addition, students in this sample may take either algebra-based or calculus-based physics to fulfil their two-semester physics course requirement, but so few (N = 61) students chose calculus-based physics that they were excluded from this study.

Table 1 Courses studied by year
Table 2 Course requirements by major

Measures

Course grade

Course grades were based on the 0–4 scale used at our university, with A = 4, B = 3, C = 2, D = 1, F = 0 or W (late withdrawal); the suffixes ‘ + ' and ‘−', respectively, add or subtract 0.25 grade points (e.g., B− = 2.75 and B +  = 3.25), except for the A + , which is reported as 4. We are unable to report grading schemes of each instructor, type of course (i.e., traditional lectures or active learning), or any other detailed course-level information due to the large number of courses sampled.

Demographic information

Students provide demographic information as part of university enrollment. Students were given the binary options “male” and “female” to identify their gender upon entering the university, although this conflates gender and sex (Schudson, 2021). This approach also marginalizes gender minority students (Traxler et al., 2016; Van Dusen & Nissen, 2020). We acknowledge the harm that such data collection practices cause (Traxler et al., 2016; Van Dusen & Nissen, 2020), and we are pleased to report that our university has recently switched to collecting gender information using more than binary options.

Grade anomaly

GA was found by first finding each student’s grade point average excluding the course of interest (\({\mathrm{GPA}}_{\mathrm{exc}}\)). This was accomplished using the equation:

$${\mathrm{GPA}}_{\mathrm{exc}}=\frac{{\mathrm{GPA}}_{c}\times {\mathrm{Units}}_{c}-\mathrm{Grade}\times \mathrm{Units}}{{\mathrm{Units}}_{c}-\mathrm{Units}},$$

where \({\mathrm{GPA}}_{c}\) is the student's cumulative GPA, \({\mathrm{Units}}_{c}\) is the cumulative number of units the student has taken (including the semester the course of interest is taken), \(\mathrm{Grade}\) is the grade the student received in an individual course, and \(\mathrm{Units}\) is the number of units associated with an individual course. After finding \({\mathrm{GPA}}_{\mathrm{exc}}\) we can calculate (GA by finding the difference between a student's \({\mathrm{GPA}}_{\mathrm{exc}}\) and the grade received in that class:

$$\mathrm{GA}=\mathrm{Grade}-{\mathrm{GPA}}_{\mathrm{exc}}.$$

A negative GA corresponds to a course grade lower than a students' GPA in other classes and we call this a “grade penalty”. A positive GA corresponds to a course grade higher than a students' GPA in other classes and we call this a “grade bonus''. AGA is the mean of GAs for each course, and is the metric by which we compare courses.

Analyses

To characterize both AGA and grades, we found the sample size, mean, standard deviation, and standard error of each measurement for each course of interest. We calculated these statistics for women and men separately, and then for all students combined. We also compared the effect size of gender on both grade and GA, using Cohen's d to describe the size of the mean differences and unpaired t tests to evaluate the statistical robustness of the differences. Cohen's \(d\) is calculated as follows:

$$d=\frac{{\mu }_{1}-{\mu }_{2}}{\sqrt{{\sigma }_{1}^{2}-{\sigma }_{2}^{2}}},$$

where \({\mu }_{1}\) and \({\mu }_{2}\) are the means of the two groups, \({\sigma }_{1}\) and \({\sigma }_{2}\) are the standard deviations (Frey, 2018) and Cohen's d is considered small if \(d\sim 0.2\), medium if \(d\sim 0.5\), and large if \(d\sim 0.8\) (Cohen, 1988). All analysis was conducted using R (R Core Team, 2020), using the package plotrix (Lemon, 2006) for descriptive statistics, lsr (Navarro, 2015) for effect sizes, and ggplot2 (Wickham, 2016) to create plots.

Results

RQ1: For which of their courses do biosciences students receive a ``grade penalty'' and for which courses do they receive a ``grade bonus''?

To answer RQ1, we calculated AGA for the most popular courses taken by biological sciences students and students in related majors focusing on future careers in health professions. We show the descriptive statistics for both grades and AGA in Table 3 and Fig. 1. We find that students generally received grade penalties in all STEM courses we studied. One course, Human Physiology, has a smaller grade penalty than other courses, while Organic Chemistry 1 and 2 have much larger grade penalties than all other courses. For Organic Chemistry courses, students have an AGA of approximately 1, meaning that on average, students receive one full letter grade lower in these courses than in their other courses.

Table 3 Average grade anomalies (AGAs) and grades
Fig. 1
figure 1

Average grade anomalies (AGAs) for each course of interest. The error bars represent standard errors

There are no grade bonuses in the table, because not all courses that students take are included. For example, students may receive grade bonuses in general education courses, e.g., in social sciences and humanities as well as in laboratory courses.

RQ2: Do men and women have different “grade anomalies'' in their STEM courses?

To investigate if there are differences in grade anomalies between men and women, we grouped students by their self-reported gender and calculated the AGA for both groups for each course of interest. We then calculated Cohen’s d as a measure of effect size between the two groups (Frey, 2018), which can be seen in Table 4. Group differences can also be seen in Fig. 2. Women had statistically indistinguishable AGA outcomes to men in Calculus, Human Physiology, Genetics, Biochemistry, and Chemistry 2. By indistinguishable, we mean there is no statistically significant difference between men and women's AGAs in Table 4. Women did not have favorable AGA outcomes (e.g., smaller grade penalties) compared to men in any course. Men had favorable AGA outcomes compared to women in several courses, meaning there was a statistically significant difference in AGA between man and women, and men tended to have smaller grade penalties than women. These courses included Biology 1, Biology 2, Organic Chemistry 1, Organic Chemistry 2, Chemistry 1, Physics 1, and Physics 2.

Table 4 Average grade anomalies (AGAs), grades, and between-gender effect sizes for each course of interest
Fig. 2
figure 2

Average grade anomalies (AGAs) for each course of interest, by gender. The error bars represent standard errors

RQ3: Do gender differences in “grade anomalies'' follow the same trends as gender differences in grades?

There are many courses for which there was no statistically significant gender differences in grades or grade anomalies, such as Human Physiology, Genetics, Biochemistry, Chemistry 2, and Physics 2, which can be seen in Table 4. With the exception of Chemistry 2, these are all courses that students tend to take in their second year of university or later. There was one course in which the gender difference was both statistically significant and similar for grades and AGA: Organic Chemistry 1. In this course, Cohen's d between genders was similar for Grade and AGA, as shown in Table 4. These similar but significant gaps favoring men can also be seen in Figs. 2 and 3. For the aforementioned six courses, AGA and grades provide similar information.

Fig. 3
figure 3

Average grade anomalies (AGAs) for each course of interest, by gender. The error bars represent standard errors

There are also courses that show difference trends in AGA versus raw grades. For example, Table 4 reveals that Biology 1, Biology 2, Organic Chemistry 2, and Chemistry 1 have no statistically significant gender difference in grades, but all have a statistically significant difference between men and women when in AGA. Similarly, Table 4 also shows that the gender difference in AGA (d = 0.26) is larger than in raw grades (d = 0.11) for Physics 1, even if the gender difference is statistically significant for both measures. Comparing Figs. 2 and 3, it is clear that the gender differences are larger for AGA than raw grades for all of these courses.

There is one course that has a larger gender difference in grades than in AGA: Calculus 1, which can be seen in Table 4. In this course, women tend to have indistinguishable AGAs to men, but statistically significantly higher grades than men.

Discussion

Our primary finding is that there are average grade penalties in all courses studied for undergraduate biosciences majors or those with interest in health-related professions. First, we discuss why these grade penalties can be harmful. According to some studies by Seymour (1997; Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019), lower than expected grades, even in a single course, can be a catalyst for students to leave STEM majors. They found that this does not just include D and F grades or withdrawal from the course, but grades that were high enough to continue the program that did not meet a student’s personal expectations (Seymour & Hewitt, 1997; Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019). They also found that this was a particular issue among high-achieving students, who were more likely to endorse perfectionism and feeling like their identity as “good STEM students” was threatened by B’s and C’s, or even a low grade on a single exam (Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019). At out institution, 16.4% (the portion was very similar for men and women) of students studying Biosciences or Neuroscience leave their major (Maries et al., 2022). Thus, we believe that the courses that have the largest grade penalties, in this case Organic Chemistry 1 and 2, are the courses highly likely to push students out of these disciplines or cause them to question their abilities.

In addition to finding evidence of average grade penalties in some courses overall for all students taken together, we also find evidence of gender differences in average grade penalties in over half of the courses studied, particularly Biology 1 and 2, Organic Chemistry 1 and 2, Chemistry 1, and Physics 1 and 2. In particular, we find Organic Chemistry 1 and 2 concerning, because these courses have the largest average grade penalty women receive, so it is likely to stand out as a unique anomaly, and may impact students’ academic self-concept (e.g., their long-term expectation of academic success) more than other courses.

In prior studies, women report feeling more demoralized than men when they receive low grades, and cite more worry over not understanding material even if they receive A’s, B’s, or C’s (all of which are grades that allow students to continue in most programs) (Goodman, 2002; Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019). This trend has been found to be particularly strong among high-achieving women (Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019). Research shows that women often tend to earn higher grades than men with the same standardized test scores (Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019; Voyer & Voyer, 2014). Women in this sample tended to have higher GPAs than men, especially in non-science courses, so a low grade in a science course may be a larger deviation from women’s personal norms. Because women are often more accustomed to higher grades, they may have more concern about grades that are lower than what they are accustomed to, or they may compare their relatively low STEM grades and view themselves as less able to succeed in biology than a subject that gives them the recognition for their work that they are accustomed to (Seymour & Hewitt, 1997; Talking about Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education, 2019).

We find that grade anomalies and raw grade data do not always reveal the same trends. Some courses have larger gender differences in AGA than in grades, such as Biology 1 and 2, Chemistry 1, and Physics 1. This speaks to the usefulness of tracking both AGA and grades of the students. An instructor may think that a small or negligible grade difference by gender in their course implies that there is gender equity in their classroom based upon grade outcomes alone, but without knowing the gender differences in AGA, the instructor will not recognize the grade penalty and how those grades are perceived by female and male students. Understanding both grades and AGA differences may allow instructors to understand both of these classroom-level inequities. In the study discussed here, there is one course that has larger gender differences in grades than in AGA: Calculus 1. We find this course concerning because of the average grade penalty for all students, particularly because this course is most often taken by first-year students, who are more likely to have an academic self-concept that is fragile (Eccles & Wigfield, 2020).

These findings are similar to past work on GA in a variety of ways. For example, past work which included multiple universities in their sample also found that most STEM lecture courses had grade penalties (Koester et al., 2016; Matz et al., 2017). Math and chemistry courses often had the larger grade penalties than physics and biology, which is similar to our study (Matz et al., 2017). Measuring AGA in addition to grades may be a useful way to find inequities in the learning environment. Measuring grades and gendered grade differences is both valuable for and accessible to individual instructors, but grade anomalies may be useful to departments concerned about students’ retention over longer periods and finding which courses may be particularly discouraging to students.

Conclusion

In this investigation, we found that grade penalties exist for all of the courses we studied for undergraduate biosciences majors or those with interest in health-related professions. Furthermore, seven courses had an AGA (larger grade bonuses or smaller grade penalties) that favored men over women, while five courses had an AGA that did not favor either gender. This gender inequity in AGA raises particular concern about the need for an equitable learning environment.

These findings are also valuable, because they provide evidence that courses in STEM departments in general tend to have grade penalties. This supports prior research on grade anomalies in a more homogeneous population, bioscience majors at one university.

Limitations and future research

Although we have evidence of grade penalties in the studied courses as well as gendered GA differences, we did not have access to syllabi or other information about individual courses offered over the 13-year period of data collection. Therefore, we are not able to pinpoint specific practices that may lead to grade penalties, grade bonuses, or gender inequities at our institution. However, based upon the structure of most courses currently offered that were discussed here, it is reasonable to assume that most of these courses have focused on teaching in a traditional, lecture-based, and exam-reliant format.

In addition, this research is based at a primarily white, large, public university in the US. While our results are likely to generalize to similar institutions, we do not know what patterns of grade anomalies and potential gender differences in them exist at smaller liberal arts colleges, minority-serving institutions, or community colleges in the US. In addition, conducting research at a diverse range of institutions in different countries, as well as a focus on how GA affects students from a variety of underrepresented groups, will help us more fully understand how grade anomalies differ for a range of students.

Finally, grade anomalies have the potential to be an easily accessible and useful tool for universities to pinpoint courses that may be particularly discouraging to students or effect retention. One way to realize this potential may be future studies that focus on institutional awareness and potential amelioration of grade penalties.

Availability of data and materials

The data sets used and/or analyzed during the current study are available from the corresponding author on reasonable request and upon institutional approval.

Abbreviations

AGA:

Average grade anomaly

EVT:

Expectancy value theory

GA:

Grade anomaly

SD:

Standard deviation

SEVT:

Situated expectancy value theory

STEM:

Science, technology, engineering, and mathematics

References

Download references

Acknowledgements

The authors would like to thank Robert Devaty  for critical feedback on the manuscript.

Funding

This research was funded by National Science Foundation Award No. DUE- 1524575.

Author information

Authors and Affiliations

Authors

Contributions

AM contributed to analysis and interpretation of data, as well as writing and revision of manuscript. CS contributed to the conception and design of research, acquisition and interpretation of data, and revision of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Alysa Malespina.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Malespina, A., Singh, C. Gender gaps in grades versus grade penalties: why grade anomalies may be more detrimental for women aspiring for careers in biological sciences. IJ STEM Ed 10, 13 (2023). https://doi.org/10.1186/s40594-023-00399-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40594-023-00399-7

Keywords