Skip to main content

Gender gaps in the performance of Norwegian biology students: the roles of test anxiety and science confidence



Understanding student motivational factors such as test anxiety and science confidence is important for increasing retention in science, technology, engineering, and math (STEM), especially for underrepresented students, such as women. We investigated motivational metrics in over 400 introductory biology students in Norway, a country lauded for its gender equality. Specifically, we measured test anxiety and science confidence and combined students’ survey responses with their performance in the class.


We found that female students expressed more test anxiety than did their male counterparts, and the anxiety they experienced negatively predicted their performance in class. By contrast, the anxiety male students experienced did not predict their performance. Conversely, men had higher confidence than women, and confidence interacted with gender, so that the difference between its impact on men’s and women’s performance was marginally significant.


Our findings have implications for STEM instructors, in Norway and beyond: specifically, to counter gender-based performance gaps in STEM courses, minimize the effects of test anxiety.


Barriers to full participation in STEM

Students enter higher education with different abilities, aspirations, and motivations (Hidi and Harackiewicz, 2000; Wren and Wren, 2003). This variation is not random, but can be predicted in part from a variety of personal, socioeconomic, cultural, and biological factors. In particular, a student’s gender has been shown to have a strong influence on that student’s educational and career aspirations, motivation, retention, and success (Hyde and Durik, 2005; Meece, Glienke and Askew, 2009). In the STEM disciplines (science, technology, engineering, and mathematics), men outnumber women at all career stages (UNESCO, 2015), men exhibit higher levels of retention throughout the career path, and the research output of men is greater (Gibney, 2016; Larivière, Chaoqun Ni, Cronin and Sugimoto, 2013). The Organization for Economic Co-operation and Development (OECD) has voiced concern for the resulting gender gaps in educational choices and in the workforce (OECD, 2014). In the USA, several recent reports have focused on STEM disparities where subtle, or implicit, gender biases can have practical implications; for example, men disproportionately enjoy high leadership positions and prestige (Beede et al., 2011; Grunspan et al., 2016; Isbell, Young and Harcourt, 2012; Moss-Racusin, Dovidio, Brescoll, Graham and Handelsman, 2012; National Science Board, 2015).

Norway is a country known for gender equity (Gulbrandsen, 2007; Teigen and Wängnerud, 2009) and as of this writing has a female prime minister leading a cabinet of 45% women, including ministers of finance, foreign affairs, and higher education. Yet, as women move through the academic and career trajectory, they become less represented due to myriad barriers to retention. Females outnumber males in almost all college-level subjects in Norway, except STEM subjects (other than biology), in which almost 70% of the students are males (Ministry of Education and Research, 2015). Even in disciplines that have relatively high female enrollment at the undergraduate level (e.g., non-STEM, biology), women are still underrepresented at the higher levels (e.g., professors, top administrators), and this phenomenon was implicated in a recent national survey of biology students and teachers (Hole et al., 2016). Given the global demand for STEM professionals (e.g., Caprile, Palmén, Sans and Dente, 2015; National Science Board, 2015), these disparities can cause concern. The uneven female-male ratio (especially in high-status positions) is in itself a barrier to recruitment, and to equalize the field, it is important to first identify mechanisms that hinder or prevent female participation and retention in STEM and then develop instructional interventions for overcoming these. A relatively gender-equal society such as Norway provides an interesting test case for identifying and investigating the underlying causes for the less obvious and therefore more implicit barriers to progression in STEM.

There are several reasons why there could be gender differences in STEM fields. Suggested causes often include social reasons (Rogoff, 2003; West and Zimmerman, 1987; Ceci, Williams, and Barnett, 2009) such as interest in course content (Jones, Howe and Rua, 2000; Hulleman and Harackiewicz, 2009; Diekman, Clark, Johnston, Brown and Steinberg, 2011), science identity (Cundiff, Vescio, Loken and Lo, 2013; Hazari, Sadler and Sonnert, 2013; Robnett, Chemers and Zurbriggen, 2015), and sense of social belonging (Hausmann et al., 2007; Cohen and Garcia, 2008; Walton and Cohen, 2007; Stout, Dasgupta, Hunsinger and McManus, 2011; Eddy and Hogan, 2014). Another important line of research has focused on motivational factors, including gender differences in science confidence (Kitchen, Reeve, Bell, Sudweeks and Bradshaw, 2007; Cotner, Ballen, Brooks and Moore, 2011; Trujillo and Tanner, 2014; Robnett et al., 2015; Ballen, Wieman, Salehi, Searle and Zamudio, 2017; Bussey and Bandura, 1999; Dix, 1987; Fenollar, Román and Cuestas, 2007) and test anxiety (Owens, Stevenson, Hadwin and Norgate, 2014; Ballen, Salehi and Cotner, 2017). For the remainder of this discussion, our focus is on these two motivational constructs. Both have been implicated in numerous discussions of STEM performance and retention, but they have not been—to our knowledge—explored for how they may impact these phenomena in Norwegian higher education.

Science confidence (Bussey & Bandura, 1999; Dix, 1987; Fenollar et al., 2007; Cotner et al., 2011; Nissen and Shemwell, 2016; Sawtelle, Brewe and Kramer, 2012) refers to a student’s perception of their own abilities to execute specific scientific tasks and is closely related to self-regulatory learning and self-efficacy (Stankov, Lee, Luo and Hogan, 2012; Ainscough et al., 2016). Confidence plays a vital part in females’ persistence, retention, and performance in STEM subjects (Macphee, Farro and Canetto, 2013; Lundeberg, Fox and LeCount, 1992), and in general, studies find that females tend to have less science confidence than males (Cotner et al., 2011; Trujillo & Tanner, 2014; Robnett et al., 2015; Ballen, Wieman, Salehi, Searle and Zamudio, 2017). Several theoretical explanations for framing the relationship between confidence, performance, and retention have been suggested, including stereotype threat (Steele 1997; Wheeler and Petty 2001; Cohen and Garcia, 2008)—whereby an awareness of a negative stereotype is subconsciously felt and operationalized—and social cognitive career theory (Bandura 1986; Lent et al., 1994)—whereby a perceived lack of belonging in a discipline informs an individual’s self-evaluation and sense of a future in that discipline.

Test anxiety is defined as “the set of phenomenological, physiological, and behavioral responses that accompany concern about possible negative consequences or failure on an exam or similar evaluative situation” (Zeidner 1998). Due to performance pressure, social pressure, and time constraints, higher levels of test anxiety may reduce performance (Lundeberg et al., 1992; von der Embse, Jester, Roy and Post, 2018). Several theoretical perspectives have been advanced for framing studies of test anxiety (Zeider, 2010; Sommer and Arendasy 2014), for example a cognitive-interference approach to this phenomenon. According to cognitive-interference theory, the experience of test anxiety diverts mental resources (e.g., short-term memory, cognitive processing, problem solving) that are otherwise needed for test-taking (Zeidner 2010; Eysenck et al 2007; Sarason 1984). Significantly, test anxiety may not be felt equally by all students, and its impacts may vary by student characteristics. Studies in the USA indicate that underrepresented minority and female students in STEM courses exhibit more test anxiety than do their non-minority or male counterparts (Payne, Smith and Payne, 1983; Hembree, 1988; Cassady and Johnson, 2002; Chapell et al., 2005; Ballen, Wieman, Salehi, Searle and Zamudio, 2017; von der Embse et al., 2018; Harris et al., 2019). Further, Ballen, Salehi and Cotner (2017) and Salehi et al. (2019) have demonstrated that test anxiety in women—but not in men—is negatively and significantly associated with performance on exams, possibly explaining some of the performance gaps that have been documented in STEM fields (e.g., Koester, Grom and McKay, 2016; Matz et al., 2017). Harris et al. (2019) found nominal gender differences in reported test anxiety and no gender-specific effect of test anxiety on performance in a large biology class, but there was no gender gap in performance in the class under study, and hence no problem to be solved.

In this study, we draw on survey, demographic, and performance data from 3 years of an introductory-biology course at a large university in Norway to explore the possible gender-specific impacts of—and interactions between—test anxiety and science confidence. Our specific research questions were:

  1. 1.

    In this sample of biology students, are there gender differences in this sample of biology students in test anxiety, science confidence, and performance?

  2. 2.

    If performance differences exist, does test anxiety or science confidence predict performance in ways that can explain these differences?

It is especially important to understand these effects because confidence and test anxiety are at least potentially responsive to interventions while other student characteristics (e.g., gender) are less so.


Participants and procedure

The present study is part of a larger project including video recordings of lectures, assessment of teachers, and student surveys initiated by the bioCEED Centre of Excellence in Biology Education (bioCEED, 2013) at the University of Bergen (UiB). The present study reports data collected in three sections of an introductory-biology course taught by the same instructor in Fall 2016, Fall 2017, and Fall 2018. Participants were over 400 undergraduate students in biology. All students were asked to provide gender information. We acknowledge that gender is a complex social and biological construct, and thus the students were given the possibility to specify their gender identity if it did not fit into the category of male or female. However, none of the participants identified themselves as other than male or female, and thus the sample was collapsed into a dichotomous variable. Gender distribution was 36% males and 64% females. The instructor of the course is male.

Critically, the focal course is taught by an acclaimed professor who typically implements evidence-based pedagogies in class. Students have multiple opportunities to contribute in class, via small-group and large-group discussion and an electronic classroom-response system, and tests employ a variety of assessment techniques.

Participants were recruited in class. The students completed a pre-course survey in the first week of the term. Students were informed about the general purpose of the study—without any reference to gender—and that their participation was voluntary. Students also consented to having their survey responses matched, by a third party not involved in the research, with their performance in the course and their overall high-school score (overall high school score refers to the average grade derived from final assessment in each of the students’ subjects, in addition to grades on the oral and written exams; the maximum score is 60). The final year the survey was administered online, but students were given time in class to complete the surveys on their web-enabled devices (computer, tablet, or phone).

Our study design was approved by The Norwegian Centre for Research Data. Specifically, students were informed that the data would be treated confidentially and anonymized in any publications and after the end of the project. Lastly, student participants had the opportunity to withdraw from the study at any time. No rewards were given for participation.


Test anxiety

We employed the 4-item measure for test anxiety retrieved from the short version of the Motivated Strategies for Learning Questionnaire (MSLQ: Duncan and McKeachie, 2005; Pintrich, Smith, Garcia and McKeachie, 1991). An item example is “I am so nervous during a test that I cannot remember facts I have learned.” The participants answered on a 7-point Likert scale ranging from 1 (not at all true of me) to 7 (very true of me). Cronbach’s alpha level for the composite scale was acceptable (0.841)—a finding consistent with prior work (Ballen et al 2017, Salehi et al., 2019). Since this measure was not proximal to any course assessment, we consider it a measurement of trait rather than state anxiety (von der Embse et al., 2018). Sixteen other items from the abbreviated MSLQ were included in the survey; however, responses to those items are not included in the current analysis or discussion.

Science confidence

We used a 13-item scale to measure students’ confidence in comprehending, critically assessing, and communicating scientific concepts. The items of the scale are drawn and adapted from previous studies investigating students’ science confidence (Lopatto, 2004; Seymour, Hunter, Laursen and Deantoni, 2004), though the validity of the scale was not separately evaluated for this population. The scale used in the present study has been employed among biology students and found reliable (Cotner et al., 2011; Cotner, Thompson and Wright, 2017). Participants answered on a 5-point Likert scale including: 1 (not confident), 2 (a little confident), 3 (somewhat confident), 4 (highly confident), and 5 (extremely confident). An example item is “presently, I am confident I can make an argument using scientific evidence.” The 13-item scale produced a satisfactory alpha level (0.872). The science-confidence items are included in Supplemental File 1.

Academic performance

Student academic performance was measured by total points earned in the class, on a 0–100 scale. Point totals are a combination of performance on four exams distributed throughout the semester: (i) multiple choice and writing definitions, (ii) numerical competence with graphical visualization and interpretation of results, (iii) an oral five-minute presentation on a self-elected topic, and (iv) an essay plus short written explanations and definitions. Assessment, and hence the score, emphasizes communication skills, mainly writing and logic, in addition to disciplinary knowledge. Evaluation criteria and assignment types were identical across the 3 years of this study.

Analytical strategy

Our analysis explored the relationships between three predictor variables (gender, test anxiety, and confidence) and academic performance. Because the data in this study were nested in semesters, we used multilevel regression modeling, with class as a random effect, to control for within-semester correlation. For all Likert scale variables, we transformed the categories into numeric values and treated the dependent variables as continuous to facilitate interpretation. Non-parametric tests have yielded similar results to those we report (Murray, 2013; Norman, 2010). The threshold for statistical significance was set at p = 0.05, with p values between 0.05 and 0.10 regarded as marginally significant. Overall high-school score was our only measure of incoming aptitude and preparation, but reporting of this measure was too unreliable to allow us to include it in our statistical models (only about 1/8 of students in this study reported a high-school score).

Because our models lacked a measure of student incoming preparation (analogous to ACT or SAT scores, or GPA in previous classes), we did not expect the models to predict a great deal of the overall variation in total points. Instead, our interest was in sorting out gender-specific effects of particular covariates.


Descriptive statistics showed that female students began class with significantly higher levels of test anxiety, but nearly identical levels of confidence, when compared to male students (see Table 1.)

Table 1 Average confidence and test anxiety, by gender

An independent-sample t test indicated that on average, female students in this class earned significantly more total points than male students did (female mean = 61.09, male mean = 57.37, p = 0.009).

Our initial mixed models produced a Hessian matrix error, indicating that the amount of variation in the outcome associated with the random variable “year” was very small, so that the random variable was not needed in the model. Accordingly, we proceeded with the analysis using ordinary least squares (OLS) regression. Because our main interest was in the differential effects of confidence and test anxiety for male and female students, we estimated separate OLS models for the genders.

Results indicated that pre-class test anxiety was negatively predictive of class performance for female students, with an effect size of about ¼ of a standard deviation, but test anxiety had no discernible predictive power for male students (Fig. 1). For women, each one-point increase in test anxiety was associated with a 2.136 point decrease in total points (Table 2).

Fig. 1
figure 1

Differential impact, by gender, of test anxiety on total points in the course. Note. For women, but not for men, test anxiety was a significant negative predictor of performance

Table 2 Ordinary least squares regressions for predicting student class performance based on test anxiety and confidence

By contrast, pre-class confidence nominally predicted class performance for male students in a negative direction, with a marginally significant effect size of about 1/6 of a standard deviation, while confidence had no predictive power for female students. For men, each one-point increase in confidence was associated with a 3.535 point decrease in total points (Table 2).

To assess the significance of the different ways in which test anxiety and confidence affected the performance of male and female students, we estimated a model for both genders combined, which included interaction variables. This model showed that the interaction between gender and test anxiety was significant at the p ≤ 0.05 level, with female students disadvantaged relative to male students by the anxiety they reported. The interaction between gender and confidence was marginally significant (p = 0.051), with female students possibly gaining an advantage relative to male students through the confidence they reported (see Table 3).

Table 3 Results of a model illustrating the impact of test anxiety, confidence, gender, and interactions on performance

Although low N prevented us from including high-school points as a predictor in our regression models, we did examine the association between high-school points and our predictor variables of interest, namely test anxiety and confidence. These bivariate correlations suggest similar patterns as in the main models—opposite effects of both test anxiety and confidence on the performance of females vs. males but should be interpreted with caution since they are based on a much smaller sample than our other analyses—a sample which may differ from the larger group in unknown ways (Table 4).

Table 4 Correlations among high-school points, test anxiety, and confidence


The present study has been a first step toward investigating motivational differences across gender in a Norwegian sample in higher education. The primary aim of this study was to test whether there are gender differences in two STEM-related motivational constructs—science confidence and test anxiety—in a relatively gender-equal society. We found significant gender differences in test anxiety but not in science confidence, and we found differences in how these constructs predicted learning outcomes for the two genders. While the scope of our study—a single instructor, for a single course, at a single institution in Norway—prohibits extrapolation to Norwegian higher education in general, our findings can serve as an initial exploration into factors that may influence gender-based attrition in STEM. These findings also serve to undermine the hypothesis that the connection between test anxiety and gendered performance differences do not exist outside of the United States.

First, female students started class with more test anxiety than male students did, and the anxiety they experienced negatively predicted their performance in class. By contrast, male students experienced less test anxiety than female students, and the anxiety they did experience seems unrelated to their class performance. These findings echo those of Ballen, Salehi and Cotner (2017) and Salehi et al. (2019), which suggest that female students may be subject to interference by test anxiety, “which explains depressed performance by identifying factors that disturb the process of information recall and utilization during testing situations” (von der Embse et al., 2018, p. 484). The ultimate impact of test anxiety in this sample of students did not contribute to a performance gap between men and women. Rather, in contrast to prior studies in the USA (e.g., Salehi et al., 2019), women outperformed their male peers, in spite of their higher test anxiety and its relationship to performance. The fact that women in this course did not underperform relative to their male peers may be a function of their sheer numbers (with more women than men, and many of these women having below-average test anxiety), the discipline (biology in Norway is not associated with the same gender-based challenges as some other STEM disciplines; e.g., physics, computer science), or the evidence-based pedagogy of the instructor (e.g., using diverse strategies to assess students). Further studies in STEM fields beyond biology, with faculty employing more traditional pedagogies, will shed light on the merits of these possible explanations.

Our data do not allow us to exclude entirely the deficit model, however, which proposes that test anxiety is the result of perceived deficits in preparation, skills, etc. on the part of students (von der Embse et al., 2018). The fact that anxiety was negatively correlated with high-school points for female students is some indication that a deficit model may explain some of the association of anxiety with class performance for female students, consistent with the findings of Salehi et al. (2019).

Second, male students started class with more confidence than female students did, and the confidence they reported was negatively (though not significantly) associated with their performance in class. By contrast, the confidence female students reported was irrelevant to their class performance. And confidence interacted with gender, so that the difference between its effects on the two genders was marginally significant. These data suggest that male students may be subject to an overconfidence effect, whereby attention and motivation are undermined by misplaced confidence in their own abilities (Marshman, Kalender, Nokes-Malach, Schunn and Singh, 2018). The fact that confidence was not correlated at all with high-school points for male students lends some credence to this supposition.

These findings are similar to the gender differences in confidence (Cotner et al., 2011; Nissen & Shemwell, 2016; Sawtelle et al., 2012) and certain motivational constructs (Glynn, Brickman, Armstrong and Taasoobshirazi, 2011) found in college students in the USA. These similarities are surprising; while there are certainly many cultural similarities between the USA and Norway, the status of women is different between the two countries according to a number of indicators (e.g., UNESCO, 2015) and we would have expected those gender differences to impact links between motivational factors, gender and academic performance. The fact that gender differences remain, and are similarly predictive, across different cultures, may suggest some biological basis to these differences. For example, men tend to be more confident with regard to almost everything; this phenomenon may be mediated by testosterone, a steroid hormone that is expressed far more in men than it is in women. Several studies have suggested a link between risk-taking (itself a proxy for confidence) and testosterone levels in both men (Booth, Johnson and Granger, 1999; Coates and Herbert, 2008; Sapienza, Zingales and Maestripier, 2009) and women (van Honk et al., 2004).

However, the literature (discussed above) documenting tractable impacts of the environment on performance—and gaps in performance—is extensive, and we hesitate to invoke biological explanations without ruling out environmental ones. Specifically, the classroom environment may foster the gender differences we have documented here. For example, instructors may harbor biases (e.g., implicit bias; Staats, 2015) and anxieties that lead to subtle behaviors impacting their students. Canning et al (2019) recently documented how the courses of STEM faculty with a “fixed” mindset respecting intelligence demonstrate greater performance gaps between underrepresented students and their well-represented counterparts. And Beilock, Gunderson, Ramirez, and Levine (2010) has illustrated that K-12 teachers’ math anxiety negatively predicts their female students’ math performance. Others have attested to the positive power of simply revealing one’s own biases (Staats, 2015, Moss-Racusin et al., 2016; but see Kalev, Dobbin and Kelly, 2006). For example, Chang et al. (2019) documented attitudinal and behavioral changes associated with bias training, but their work suggests that meaningful change likely requires more than the one-off diversity-training sessions offered at many universities. Given the critical role of awareness, and the general perception of Norway as a gender-equal society, sustained bias training at places like University of Bergen may be warranted.

Further, classroom environments vary with respect to gender-equitable participation, which may be a proxy for confidence and/or sense of inclusion (Caspi, Chajut and Saporta, 2008; Eddy, Brownell and Wenderoth, 2014; Ballen et al., 2019; Neill, Cotner, Driessen and Ballen, 2018). Ballen et al. (2019), in a multi-institutional study including biology courses in Norway, illustrated that smaller class sizes and diverse teaching methods were associated with gender-equitable in-class discussions. Thus, class size and pedagogy may also be associated with confidence and test anxiety, further impacting the performance and participation of women in STEM courses.


There are several limitations worth mentioning when interpreting our findings, in addition to the single-instructor focus of this work discussed above. First, due to a lack of randomization and experimental data, we cannot infer causation. Future studies should investigate if females, compared to males, experience test anxiety in performance situations and how this manifests itself in performance and affect. Moreover, triangulation of the data (e.g., observational data, mixed-method) could have further accounted for some of the unexplained variance in the data. Second, our model is rather simple; future studies could elaborate on our model and include more motivational constructs. Third, given the low response rate on high-school entry grades, we were unable to investigate how prior achievement impacts test anxiety and science confidence. Last, we acknowledge that other unmeasured factors (e.g., cognitive differences, socio-economic status, and personality differences) could have served as mediators or predictors in our model.


Despite the limitations, the present study reveals some interesting relationships between science-related gender differences and motivational variables in a population that has thus far been unexplored along these dimensions. While in this particular course, the impact of test anxiety was not manifest in lower grades among women, that may not always be the case. Different courses, in different STEM disciplines, implementing different pedagogies, may yield different outcomes. Our future work aims to address this possibility. The fact that the instructor of the sampled courses is an award-winning educator implementing several evidence-based teaching strategies—group discussion, polling for formative assessment, and diverse testing strategies—may also limit the ability to extrapolate from our findings.

In light of our results, some practical implications can be suggested—especially in contexts in which the ultimate outcome of these interactions leads to a gender-based grade difference. Gender difference is a factor that biology teachers can be aware of, and, based on our regression analysis, we suggest implementing strategies to enhance students’ science confidence and reduce test anxiety. Prior work has suggested that strategic use of role models, either in the class or as embedded examples, can reduce the gaps in confidence (Cotner et al., 2011) and retention (Bettinger and Long, 2005; Hoffmann and Oreopoulos, 2009) in STEM disciplines. Also, implementing active-learning techniques in the classroom may be especially beneficial for women and underrepresented minority students (Haak et al., 2011, b; Lorenzo, Crouch, & Mazur, 2006). However, because the interaction between gender and confidence was relatively weak compared to that between gender and test anxiety, an emphasis on test anxiety may deliver more positive results. Mitigating the impacts of test anxiety might increase students’ performance (Ballen, Salehi and Cotner, 2017) and, consequently, their science confidence. Strategies could include allowing exam re-takes to reduce perceived risk, setting realistic standards on tests and examination grades, implementing writing exercises targeting testing (Ramirez and Beilock 2011), having several low-stakes tests (rather than a few high-stakes exams; Cotner and Ballen 2017), and helping students focus on intrinsic aspects of learning, as opposed to extrinsic aspects (Deci & Ryan, 1985; Hill & Wigfield, 1984).

Assuming these gender differences with respect to science confidence and test anxiety are consistent in future studies, for example in STEM disciplines beyond biology, the next steps are to implement strategic interventions explicitly targeting known deficiencies. While it may be relatively straightforward to investigate any relationship between variation in affective traits (such as self-beliefs, engagement, and motivation) and performance and retention, designing effective interventions is more challenging. Also, interventions that show promise in one context may not apply to others. Cross-cultural comparisons may help clarify which interventions are broadly applicable, as opposed to those that are restricted to certain populations.

Availability of data and materials

The datasets generated and/or analyzed during the current study are not publicly available due to the restrictions established by the NSD – Norwegian Centre for Research Data, but are available, in aggregate, from the corresponding author on reasonable request.



Science, technology, engineering, and math


University of Bergen, Norway


Centre for Excellence in Biology Education


Ordinary least squares


  • Ainscough, L., Foulis, E., Colthorpe, K., Zimbardi, K., Robertson-Dean, M., Chunduri, P., & Lluka, L. (2016). Changes in biology self-efficacy during a first-year university course. CBE Life Sciences Education, 15, 1–12

    Article  Google Scholar 

  • Ballen, C. J., Aguillon, S. M., Awwad, A., Bjune, A. E., Challou, D., Grace, A., … Cotner, S. (2019). Smaller classes promote equitable student participation in STEM. BioScience, XX(X), 1–12

    Google Scholar 

  • Ballen, C. J., Salehi, S., & Cotner, S. (2017). Exams disadvantage women in introductory biology. PLoS One, 1–14.

  • Ballen, C. J., Wieman, C., Salehi, S., Searle, J. B., & Zamudio, K. R. (2017). Enhancing diversity in undergraduate science: Self-efficacy drives performance gains with active learning. CBE Life Sciences Education, 16(4), ar56.

    Article  Google Scholar 

  • Bandura, A. (1986). Social foundations of thought and action: A social cognitive theory. Prentice-Hall, Inc Retrieved from

  • Beede, D., Julian, T., Langdon, D., McKittrick, G., Khan, B., & Doms, M. (2011). Women in STEM: A gender gap to innovation. Economics and Statistics Administration Issue Brief, 4(11), 1–11.

    Google Scholar 

  • Beilock, S. L., Gunderson, E. A., Ramirez, G., & Levine, S. C. (2010). Female teachers’ math anxiety affects girls’ math achievement. Proceedings of the National Academy of Sciences of the United States of America, 107(5), 1860–1863

    Article  Google Scholar 

  • Bettinger, E. P., & Long, B. T. (2005). Remediation at the community college: Student participation and outcomes. New Directions for Community Colleges, 129, 17–26.

    Article  Google Scholar 

  • bioCEED. (2013). Senter for fremragende utdanning - bioCEED. Retrieved from

    Google Scholar 

  • Booth, A., Johnson, D. R., & Granger, D. A. (1999). Testosterone and men’s depression: The role of social behavior. Journal of Health and Social Behavior, 40(2), 130–140.

    Article  Google Scholar 

  • Bussey, K., & Bandura, A. (1999). Social Cognitive Theory of gender development and differentiation. Psychological Review, 106(4), 676–713.

    Article  Google Scholar 

  • Canning, E. A., Muenks, K., Green, D. J., & Murphy, M. C. (2019). STEM faculty who believe ability is fixed have larger racial achievement gaps and inspire less student motivation in their classes. Science Advances, 5(2).

  • Caprile, M., Palmén, R., Sans, P., & Dente, G. (2015). Encouraging STEM studies for the labour market. Brussels: European Union.

    Google Scholar 

  • Caspi, A., Chajut, E., & Saporta, K. (2008). Participation in class and in online discussions: Gender differences. Computers & Education, 50(3), 718–724

    Article  Google Scholar 

  • Cassady, J. C., & Johnson, R. E. (2002). Cognitive test anxiety and academic performance. Contemporary Educational Psychology

  • Ceci, S. J., Williams, W. M., & Barnett, S. M. (2009). Women's underrepresentation in science: sociocultural and biological considerations. Psychological Bulletin, 135(2), 218.

    Article  Google Scholar 

  • Chang, E. H., Milkman, K. L., Gromet, D. M., Rebele, R. W., Massey, C., Duckworth, A. L., & Grant, A. M. (2019). The mixed effects of online diversity training. Proceedings of the National Academy of Sciences, 116(16), 7778–7783.

    Article  Google Scholar 

  • Chapell, M. S., Benjamin Blanding, Z., Takahashi, M., Silverstein, M. E., Newman, B., Gubi, A., & McCann, N. (2005). Test anxiety and academic performance in undergraduate and graduate students. Journal of Educational Psychology, 97(2), 268–274

    Article  Google Scholar 

  • Coates, J. M., & Herbert, J. (2008). Endogenous steroids and financial risk taking on a London trading floor. Proceedings of the National Academy of Sciences, 105(16), 6167–6172.

    Article  Google Scholar 

  • Cohen, G. L., & Garcia, J. (2008). Identity, belonging, and achievement: A model, interventions, implications. Current Directions in Psychological Science, 17(6), 365–369

    Article  Google Scholar 

  • Cotner, S., Ballen, C., Brooks, D. C., & Moore, R. (2011). Instructor gender and student confidence in the sciences: A need for more role models? Journal of College Science Teaching, 40(5), 96–101.

    Google Scholar 

  • Cotner, S., Thompson, S., & Wright, R. (2017). Do biology majors really differ from non–STEM majors? Cell Biology Education, 16(3), ar48

    Google Scholar 

  • Cotner, S., & Ballen, C. J. (2017). Can mixed assessment methods make biology classes more equitable?. PLoS One, 12(12), e0189610.

    Article  Google Scholar 

  • Cundiff, J. L., Vescio, T. K., Loken, E., & Lo, L. (2013). Do gender–science stereotypes predict science identification and science career aspirations among undergraduate science majors? Social Psychology of Education, 16(4), 541–554.

    Article  Google Scholar 

  • Deci, E. L., & Ryan, R. M. (1985). Intrinsic motivation and self-determination in human behavior. New York: Plenum Press.

    Book  Google Scholar 

  • Diekman, A. B., Clark, E. K., Johnston, A. M., Brown, E. R., & Steinberg, M. (2011). Malleability in communal goals and beliefs influences attraction to stem careers: Evidence for a goal congruity perspective. Journal of Personality and Social Psychology, 101(5), 902.

    Article  Google Scholar 

  • Dix, L. S. (Ed.) (1987). Women: Their underrepresentation and career differentials in science and engineering. Washington, D.C.: National Academy Press.

    Google Scholar 

  • Duncan, T. G., & McKeachie, W. J. (2005). The making of the motivated strategies for learning questionnaire. Educational Psychologist, 40(2), 117–128

    Article  Google Scholar 

  • Eddy, S. L., Brownell, S. E., & Wenderoth, M. P. (2014). Gender gaps in achievement and participation in multiple introductory biology classrooms. CBE Life Sciences Education, 13(3), 478–492

    Article  Google Scholar 

  • Eddy, S. L., & Hogan, K. A. (2014). Getting under the hood: How and for whom does increasing course structure work? CBE Life Sciences Education

  • Eysenck, M. W., Derakshan, N., Santos, R., & Calvo, M. G. (2007). Anxiety and cognitive performance: Attentional control theory. Emotion, 7(2), 336–353

    Google Scholar 

  • Fenollar, P., Román, S., & Cuestas, P. J. (2007). University students’ academic performance: An integrative conceptual framework and empirical analysis. The British Psychological Society, 77, 873–891

    Google Scholar 

  • Gibney, E. (2016). Women under-represented in world’s science academies. Nature Retrieved from website:

  • Glynn, S. M., Brickman, P., Armstrong, N., & Taasoobshirazi, G. (2011). Science motivation questionnaire II: Validation with science majors and nonscience majors. Journal of Research in Science Teaching, 48(10), 1–18

    Article  Google Scholar 

  • Grunspan, D. Z., Eddy, S. L., Brownell, S. E., Wiggins, B. L., Crowe, A. J., & Goodreau, S. M. (2016). Males under-estimate academic performance of their female peers in undergraduate biology classrooms. PLoS One, 11(2), 1–16

    Article  Google Scholar 

  • Gulbrandsen, T. (2007). Elite integration and institutional trust in Norway. Comparative Sociology, 6, 190.214

    Article  Google Scholar 

  • Haak, D. C., HilleRisLambers, J., Pitre, E., & Freeman, S. (2011). Increased structure and active learning reduce the achievement gap in introductory biology. Science.

  • Harris, R. B., Grunspan, D. Z., Pelch, M. A., Fernandes, G., Ramirez, G., & Freeman, S. (2019). Can test anxiety interventions alleviate a gender gap in an undergraduate STEM course? CBE Life Sciences Education, 18(35), 1–9.

    Google Scholar 

  • Hausmann, L. R., Schofield, J. W., & Woods, R. L. (2007). Sense of belonging as a predictor of intentions to persist among African American and White first-year college students. Research in Higher Education, 48(7), 803–839.

    Article  Google Scholar 

  • Hazari, Z., Sadler, P. M., & Sonnert, G. (2013). The science identity of college students: Exploring the intersection of gender, race, and ethnicity. Journal of College Science Teaching, 42(5), 82–91.

    Google Scholar 

  • Hembree, R. (1988). Correlates, Causes, Effects, and Treatment of Test Anxiety (Vol. 58). Retrieved from

    Google Scholar 

  • Hidi, S., & Harackiewicz, J. M. (2000). Motivating the academically unmotivated: A critical issue for the 21st century. Review of Educational Research, 70(2), 151–179

    Article  Google Scholar 

  • Hill, K. T., & Wigfield, A. (1984). Test anxiety: A major educational problem and what can be done about it. The Elementary School Journal, 85(1), 105–126.

    Article  Google Scholar 

  • Hoffmann, F., & Oreopoulos, P. (2009). A professor like me. The influence of instructor gender on college achievement. The Journal of Human Resources, 44(2), 479–494.

    Article  Google Scholar 

  • Hole, T. N., Jeno, L. M., Holtermann, K., Raaheim, A., Velle, G., Simonelli, A. L., & Vandvik, V. (2016). bioCEED Survey 2015. Retrieved from University of Bergen, Bora - Bergen Open Research Archive:

    Google Scholar 

  • Hulleman, C. S., & Harackiewicz, J. M. (2009). Promoting interest and performance in high school science classes. Science, 326(5958), 1410–1412.

    Article  Google Scholar 

  • Hyde, J. S., & Durik, A. M. (2005). Gender, competence, and motivation. In A. J. Elliot, & C. S. Dweck (Eds.), Handbook of Competence and Motivation, (pp. 375–391). New York: The Guilford Press.

    Google Scholar 

  • Isbell, L. A., Young, T. P., & Harcourt, A. H. (2012). Stag parties linger: Continued gender bias in a female-rich scientific discipline. PLoS One, 7(11), e49682.

    Article  Google Scholar 

  • Jones, M. G., Howe, A., & Rua, M. J. (2000). Gender differences in students’ experiences, interests, and attitudes toward science and scientists. Science Education, 84(2), 180–192.

    Article  Google Scholar 

  • Kalev, A., Dobbin, F., & Kelly, E. (2006). Best practices or best guesses? Assessing the efficacy of corporate affirmative action and diversity policies. American Sociological Review, 71(4), 589–617

    Article  Google Scholar 

  • Kitchen, E., Reeve, S., Bell, J. D., Sudweeks, R. R., & Bradshaw, W. S. (2007). The development and application of affective assessment in an upper-level cell biology course. Journal of Research in Science Teaching: The Official Journal of the National Association for Research in Science Teaching, 44(8), 1057–1087.

    Article  Google Scholar 

  • Koester, B. P., Grom, G., & McKay, T. A. (2016). Patterns of gendered performance difference in introductory STEM courses, 1–9. Retrieved from

    Google Scholar 

  • Larivière, V., Chaoqun Ni, Y. G., Cronin, B., & Sugimoto, C. R. (2013). Global gender disparities in science. Nature, 504(7479), 211–213.

    Article  Google Scholar 

  • Lent, R. W., Brown, S. D., & Hackett, G. (1994). Toward a Unifying Social Cognitive Theory of Career and Academic Interest, Choice, and Performance. Journal of Vocational Behavior, 45(1), 79–122

    Article  Google Scholar 

  • Lopatto, D. (2004). Survey of Undergraduate Research Experiences (SURE): First Findings. Cell Biology Education, 3(4), 270–277

    Article  Google Scholar 

  • Lorenzo, M., Crouch, C. H., & Mazur, E. (2006). Reducing the gender gap in the physics classroom. American Journal of Physics, 74(2), 118–122

    Article  Google Scholar 

  • Lundeberg, M. A., Fox, P. W., & LeCount, J. (1992). Highly confident, but wrong: Gender differences and similarities in confidence judgments. Paper presented at the AERA. San Francisco: CA.

    Google Scholar 

  • Macphee, D., Farro, S., & Canetto, S. S. (2013). Academic self-efficacy and performance of underrepresented STEM majors: Gender, Ethnic, and Social Class Patterns 2013 The Society for the Psychological Study of Social Issues. Analyses of Social Issues and Public Policy, 13(1), 347–369

    Article  Google Scholar 

  • Marshman, E. M., Kalender, Z. Y., Nokes-Malach, T., Schunn, C., & Singh, C. (2018). Female students with A’s have similar physics self-efficacy as male students with C’s in introductory courses: A cause for alarm? Physical Review Physics Education Research, 14(2), 020123

    Article  Google Scholar 

  • Matz, R. L., Koester, B. P., Fiorini, S., Grom, G., Shepard, L., Stangor, C. G., … McKay, T. A. (2017). Patterns of Gendered Performance Differences in Large Introductory Courses at Five Research Universities. AERA Open, 3(4), 233285841774375

    Article  Google Scholar 

  • Meece, J. L., Glienke, B. B., & Askew, K. (2009). Gender and motivation. In K. R. Wentzel, & A. Wigfield (Eds.), Handbook of Motivation at School, (pp. 411–431). New York: Routledge.

    Google Scholar 

  • Ministry of Education and Research (2015). Tilstandsrapport. Høyere utdanning 2015. Oslo: Ministry of Education and Research.

    Google Scholar 

  • Moss-Racusin, C. A., Dovidio, J. F., Brescoll, V. L., Graham, M. J., & Handelsman, J. (2012). Science faculty’s subtle gender biases favor male students. Proceedings of the National Academy of Sciences, 109(41), 16474–16479

    Article  Google Scholar 

  • Moss-Racusin, C. A., van der Toorn, J., Dovidio, J. F., Brescoll, V. L., Graham, M. J., & Handelsman, J. (2016). A “scientific diversity” intervention to reduce gender bias in a sample of life scientists. CBE Life Sciences Education, 15(3), ar29.

    Article  Google Scholar 

  • Murray, J. (2013). Likert data: What to use, parametric or non-parametric? International Journal of Business and Social Science, 4(11).

  • National Science Board (2015). Revisiting the STEM workforce: A companion to science and engineering indicators 2014. Virginia, US: National Science Board.

    Google Scholar 

  • Neill, C., Cotner, S., Driessen, M., & Ballen, C. J. (2018). Structured learning environments are required to promote equitable participation. Chemistry Education Research and Practice

  • Nissen, J. M., & Shemwell, J. T. (2016). Gender, experience, and self-efficacy in introductory physics. Physical Review Physics Education Research, 12, 020101-020101-020101-020116

    Article  Google Scholar 

  • Norman, G. (2010). Likert scales, levels of measurement and the “laws” of statistics. Advances in Health Sciences Education, 15(5), 625–632.

    Article  Google Scholar 

  • OECD (2014). OECD skills strategy diagnostic report: Norway. Paris: OECD Publications.

    Google Scholar 

  • Owens, M., Stevenson, J., Hadwin, J. A., & Norgate, R. (2014). When does anxiety help or hinder cognitive test performance? The role of working memory capacity. British Journal of Psychology, 105(1), 92–101.

    Article  Google Scholar 

  • Payne, B. D., Smith, J. E., & Payne, D. A. (1983). Grade, sex, and race differences in test anxiety. Psychological Reports

  • Pintrich, P. R., Smith, D. A. F., Garcia, T., & McKeachie, W. J. (1991). A manual for the use of the Motivated Strategies for Learning Questionnaire (MSLQ) Ann Arbor: University of Michigan, National Center for Research to Improve Postsecondary Teaching and Learning. MI: Ann Arbor: University of Michigan, National Center for Research to Improve Postsecondary Teaching and Learning.

    Google Scholar 

  • Ramirez, G., & Beilock, S. L. (2011). Writing about testing worries boosts exam performance in the classroom. Science, 331(6014), 211–213.

    Article  Google Scholar 

  • Robnett, R. D., Chemers, M. M., & Zurbriggen, E. L. (2015). Longitudinal associations among undergraduates’ research experience, self-efficacy, and identity. Journal of Research in Science Teaching, 52(6), 847–867.

    Article  Google Scholar 

  • Rogoff, B. (2003). The cultural nature of human development. New York: Oxford University Press, Inc.

    Google Scholar 

  • Salehi, S., Cotner, S., Azarin, S. M., Carlson, E. E., Driessen, M., Ferry, V. E., … Ballen, C. J. (2019). Gender performance gaps across different assessment methods and the underlying mechanisms: The case of incoming preparation and test anxiety. Frontiers in Education, 4(107), 1–14

    Google Scholar 

  • Sapienza, P., Zingales, L., & Maestripier, D. (2009). Gender differences in financial risk aversion and career choices are affected by testosterone. Proceedings of the National Academy of Sciences, 106(36), 15268–15273.

    Article  Google Scholar 

  • Sarason, I. G. (1984). Stress, anxiety, and cognitive interference: Reactions to tests. Journal of Personality and Social Psychology, 46(4), 929–938

    Article  Google Scholar 

  • Sawtelle, V., Brewe, E., & Kramer, L. H. (2012). Exploring the relationship between self-efficacy and retention in introductory physics. Journal of Research in Science Teaching, 49(9), 1096–1121

    Article  Google Scholar 

  • Seymour, E., Hunter, A.-B., Laursen, S. L., & Deantoni, T. (2004). Establishing the benefits of research experiences for undergraduates in the sciences: First findings from a three-year study. Science Education, 88, 493–534

    Article  Google Scholar 

  • Sommer, M., & Arendasy, M. E. (2014). Comparing different explanations of the effect of test anxiety on respondents’ test scores. Intelligence, 42, 115–127

    Article  Google Scholar 

  • Staats, C. (2015). Understanding implicit bias what educators should know. American Educator, 29–43 Retrieved from

  • Stankov, L., Lee, J., Luo, W., & Hogan, D. J. (2012). Confidence: A better predictor of academic achievement than self-efficacy, self-concept and anxiety? Learning and Individual Differences

  • Steele, C. M. (1997). A Threat in the Air: How Stereotypes Shape Intellectual Identity and Performance. American Psychologist, 52(6), 613–629.

    Article  Google Scholar 

  • Stout, J. G., Dasgupta, N., Hunsinger, M., & McManus, M. A. (2011). STEMing the tide: Using ingroup experts to inoculate women’s self-concept in science, technology, engineering, and mathematics (STEM). Journal of Personality and Social Psychology, 100(2), 255–270

    Article  Google Scholar 

  • Teigen, M., & Wängnerud, L. (2009). Tracing gender equality cultures: Elite perceptions of gender equality in Norway and Sweden. Politics & Gender, 5, 21–44

    Article  Google Scholar 

  • Trujillo, G., & Tanner, K. D. (2014). Considering the role of affect in learning: Monitoring students’ self-efficacy, sense of belonging, and science identity. CBE Life Sciences Education, 13(1), 6–15.

    Article  Google Scholar 

  • UNESCO (2015). Gender and EFA 2000-2015: achievements and challenges. Paris, France: United Nations Educational, Scientific and Cultural Organization.

    Google Scholar 

  • van Honk, J., Schutter, D. J. L. G., Hermans, E. J., Putma, P., Tuiten, A., & Koppeschaar, H. (2004). Testosterone shifts the balance between sensitivity for punishment and reward in healthy young women. Psychoneuroendocrinology, 29, 937–943

    Article  Google Scholar 

  • von der Embse, N., Jester, D., Roy, D., & Post, J. (2018). Test anxiety effects, predictors, and correlates: A 30-year meta-analytic review. Journal of Affective Disorders, 227, 483–493.

    Article  Google Scholar 

  • Walton, G. M., & Cohen, G. L. (2007). A question of belonging: Race, social fit, and achievement. Journal of Personality and Social Psychology, 92(1), 82–96

    Article  Google Scholar 

  • West, C., & Zimmerman, D. H. (1987). Doing gender. Gender and Society, 1(2), 125–151.

    Article  Google Scholar 

  • Wheeler, S. C., & Petty, R. E. (2001). The Effects of Stereotype Activation on Behavior: A Review of Possible Mechanisms. Psychological Bulletin, 127(6), 797–826

  • Wren, C., & Wren, T. (2003). The capacity to learn. In R. Curren (Ed.), A Companion to the Philosophy of Education, (pp. 246–259). Oxford: Blackwell Publishing Ltd..

    Chapter  Google Scholar 

  • Zeidner, M. (1998). Test anxiety: The state of the art (Perspectives on individual differences). Plenum Press

  • Zeidner, M. (2010). Test anxiety. In The Corsini encyclopedia of psychology, (pp. 1–3)

    Google Scholar 

Download references


We thank Jonathan Soulé and Oddfrid T. Kårstad Førland at bioCEED – Centre of Excellence in Biology Education, University of Bergen, for their help in collecting the data for this study.


The study was funded by a grant by NOKUT/DIKU under the Centres for Excellence in Higher Education Initiative to bioCEED – Centre of Excellence in Biology Education [2014–2024].

Author information

Authors and Affiliations



All authors were involved in project conception, data interpretation, and writing the manuscript. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Sehoya Cotner.

Ethics declarations

Competing interests

The authors are aware of no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cotner, S., Jeno, L.M., Walker, J.D. et al. Gender gaps in the performance of Norwegian biology students: the roles of test anxiety and science confidence. IJ STEM Ed 7, 55 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: