Foundations and Trends® in Information Retrieval > Vol 19 > Issue 3

Understanding and Mitigating Gender Bias in Information Retrieval Systems

By Shirin Seyedsalehi, Toronto Metropolitan University, Canada | Amin Bigdeli, University of Waterloo, Canada | Negar Arabzadeh, University of Waterloo, Canada | Batool AlMousawi, University of Calgary, Canada | Zack Marshall, University of Calgary, Canada | Morteza Zihayat, Toronto Metropolitan University, Canada | Ebrahim Bagheri, University of Toronto, Canada, ebrahim.bagheri@utoronto.ca

 
Suggested Citation
Shirin Seyedsalehi, Amin Bigdeli, Negar Arabzadeh, Batool AlMousawi, Zack Marshall, Morteza Zihayat and Ebrahim Bagheri (2025), "Understanding and Mitigating Gender Bias in Information Retrieval Systems", Foundations and TrendsĀ® in Information Retrieval: Vol. 19: No. 3, pp 191-364. http://dx.doi.org/10.1561/1500000103

Publication Date: 10 Feb 2025
© 2025 S. Seyedsalehi et al.
 
Subjects
Web search,  Performance issues for IR systems,  Applications of IR,  Ethics,  Interdisciplinary influence,  Privacy and social implications
 

Free Preview:

Download extract

Share

Download article
In this article:
1. Introduction
2. Framing Sex, Gender, and Gender Diversity
3. Gendered Information Retrieval Systems: Metrics and Measurements
4. Understanding the Sources of Gender Bias in IR Systems
5. Data-driven Debiasing Methods
6. Debiasing of Neural Embeddings
7. Method-level Debiasing
8. Challenges, Limitations, and Future Directions
References

Abstract

Gender bias is a pervasive issue that continues to influence various aspects of society, including the outcomes of information retrieval (IR) systems. As these systems become increasingly integral to accessing and navigating the vast amounts of information available today, the need to understand and mitigate gender bias within them is paramount. This monograph provides a comprehensive examination of the origins, manifestations, and consequences of gender bias in IR systems, as well as the current methodologies employed to address these biases.

Theoretical frameworks surrounding gender and its representation in artificial intelligence (AI) systems are explored, particularly focusing on how traditional gender binaries are perpetuated and reinforced through data and algorithmic processes. Metrics and methodologies used to identify and measure gender bias within IR systems are then analyzed, offering a detailed evaluation of existing approaches and their limitations.

Subsequent sections address the sources of gender bias, including biased input queries, retrieval methods, and gold standard datasets. Various data-driven and method-level debiasing strategies are presented, including techniques for debiasing neural embeddings and algorithmic approaches aimed at reducing bias in IR system outputs. The monograph concludes with a discussion of the challenges and limitations faced by current debiasing efforts and provides insights into future research directions that could lead to more equitable and inclusive IR systems.

This monograph serves as a valuable resource for researchers, practitioners, and students in the fields of information retrieval, artificial intelligence, and data science, providing the knowledge and tools needed to address gender bias and contribute to the development of fair and unbiased information systems.

DOI:10.1561/1500000103
ISBN: 978-1-63828-518-2
186 pp. $99.00
Buy book (pb)
 
ISBN: 978-1-63828-519-9
186 pp. $160.00
Buy E-book (.pdf)
Table of contents:
1. Introduction
2. Framing Sex, Gender, and Gender Diversity
3. Gendered Information Retrieval Systems: Metrics and Measurements
4. Understanding the Sources of Gender Bias in IR Systems
5. Data-driven Debiasing Methods
6. Debiasing of Neural Embeddings
7. Method-level Debiasing
8. Challenges, Limitations, and Future Directions
References

Understanding and Mitigating Gender Bias in Information Retrieval Systems

Gender bias is a pervasive issue that continues to influence various aspects of society, including the outcomes of information retrieval (IR) systems. As these systems become increasingly integral to accessing and navigating the vast amounts of information available today, the need to understand and mitigate gender bias within them is paramount. This monograph provides a comprehensive examination of the origins, manifestations, and consequences of gender bias in IR systems, as well as the current methodologies employed to address these biases.

Theoretical frameworks surrounding gender and its representation in artificial intelligence (AI) systems are explored, particularly focusing on how traditional gender binaries are perpetuated and reinforced through data and algorithmic processes. Metrics and methodologies used to identify and measure gender bias within IR systems are then analyzed, offering a detailed evaluation of existing approaches and their limitations. Subsequent sections address the sources of gender bias, including biased input queries, retrieval methods, and gold standard datasets. Various data-driven and method-level debiasing strategies are presented, including techniques for debiasing neural embeddings and algorithmic approaches aimed at reducing bias in IR system outputs.

The monograph concludes with a discussion of the challenges and limitations faced by current debiasing efforts and provides insights into future research directions that could lead to more equitable and inclusive IR systems. This monograph serves as a valuable resource for researchers, practitioners, and students in the fields of information retrieval, artificial intelligence, and data science, providing the knowledge and tools needed to address gender bias and contribute to the development of fair and unbiased information systems.

 
INR-103