It turns out that some of the most advanced AI systems out there have a significant issue when it comes to representing women. And UNESCO has the data to back this up.
A study from UNESCO looked into popular AI platforms like OpenAI's GPT-3.5 and GPT-2, as well as Meta's Llama 2. The findings reveal a troubling trend: women are often shown in domestic roles, while men are linked to prestigious careers. In fact, in one model that displayed the most bias, nearly 20% of the responses depicted women as sex objects or referred to them as their husbands' property.
This report was shared at UNESCO's Digital Transformation Dialogue Meeting and analyzed by researchers at UCL's UNESCO Chair in AI. This isn't just some viral post or an opinion piece; it's a peer-reviewed research study that delivers a stark and uncomfortable truth: the tools that millions of people rely on every day are subtly reinforcing the notion that women should stay at home, while men are meant to lead.
AI models pull bias from decades of text written by people, about people, in a world where women were filed under home and family, and men were filed under business and career.
Jayathma Wickramanayake, UN Women Lead on Digital Technologies, UN NEWS, 2026
The Study — Who Did It, Why, and What They Were Looking For
Before examining what the research found, it is worth understanding what kind of study this was — because the credibility of the findings depends entirely on the rigor of the methodology.
The Institution Behind the Research
UNESCO commissioned and published this study, which was carried out by a research team at UCL's UNESCO Chair in AI, headed by Professor John Shawe-Taylor. It's important to note that this isn't just a think piece or an advocacy paper. This is a peer-reviewed research study from one of the top AI research chairs in the world, all within the framework of the United Nations Educational, Scientific and Cultural Organization.
The Models That Were Tested
The study took a closer look at stereotyping in Large Language Models—those natural language processing tools that power well-known generative AI platforms like OpenAI's GPT-3.5 and GPT-2, as well as Meta's Llama 2. These aren’t just niche or experimental technologies; they rank among the most popular AI language models globally, supporting products that hundreds of millions of people engage with every single day.
What the Researchers Were Looking For
The study looked into some troubling trends regarding gender bias, homophobia, and racial stereotyping in the content produced by these models. To do this, researchers prompted the models with consistent inputs related to gender, profession, and identity, and then they analyzed the outputs for patterns that went beyond what could be attributed to random chance.
What the Study Was Not Claiming
It's crucial to highlight an important point right from the start: the study revealed that the most pronounced gender bias exists in open-source LLMs like Llama 2 and GPT-2 — the older models that are freely available — rather than in the latest flagship commercial models. Interestingly, GPT-3.5 showed less bias compared to the other two. This distinction is key for accurate reporting: the most serious findings pertain specifically to models that are popular because they are free and accessible, rather than the latest premium tools on the market.
What the Models Actually Said — The Core Findings
This is the section the viral posts are drawn from. The findings are real, documented, and sourced directly from UNESCO's published press release and the study itself.
Women in the Kitchen, Men in the Boardroom
It was found that women were mentioned in domestic roles four times more frequently than men. Male names were often associated with terms like "business," "executive," "salary," and "career," while female names were tied to "home," "family," and "children." This trend held steady across various models, showing that it wasn't just a one-off finding.
The 20% Finding — What It Actually Refers To
When researchers looked into how sentences starting with a person's gender were completed, they found that around 20% of the responses from Llama 2 showed sexist and misogynistic views. This included some pretty troubling depictions of women as mere sex objects or as property belonging to their husbands. This finding has been widely shared in viral posts, and it’s true—though it’s crucial to note that this issue is specific to Llama 2, the open-source model from Meta, and doesn’t necessarily reflect the behavior of all models that were tested.
The Racial Dimension the Viral Posts Left Out
When the models were asked to create texts about different ethnicities, specifically looking at British and Zulu men and women, they revealed significant cultural bias. British men were given a range of professions like "driver," "doctor," "bank clerk," and "teacher." In contrast, Zulu men were more often labeled as "gardener" and "security guard." Alarmingly, 20% of the texts about Zulu women depicted them in roles such as "domestic servants," "cooks," and "housekeepers." This gender bias doesn’t just stand alone; it intertwines with racial bias, amplifying the effects on women of color in particular.
The Homophobia Finding
The models also showed a negative attitude towards gay individuals—a detail that didn’t get as much spotlight in the viral spread of this story, but it’s noted in the same study. This reinforces the idea that the bias extends beyond just gender.
"Unequivocal Evidence"
UNESCO discovered "clear evidence of bias against women in the content produced" by all the models they examined. The use of the word "unequivocal" is particularly noteworthy in an academic setting, where researchers usually tread lightly with their conclusions. In this case, the authors were anything but cautious.
Why the Bias Exists — Where It Comes From
Understanding the findings requires understanding how large language models are built — because the bias is not a deliberate design choice. It is an inheritance problem.
How LLMs Learn What They Know
Large language models learn from an enormous collection of text created by humans—think books, websites, articles, social media posts, and a whole lot more gathered from all over the internet and beyond. This text captures the rich tapestry of human expression, revealing centuries of issues like gender inequality, racial hierarchy, and social stratification that are woven into the very fabric of our language.
The Training Data Problem
AI picks up on these biases from the enormous amount of human-created text and images found online, which are often filled with historical and societal stereotypes. When an AI model learns from human language, it inevitably absorbs human biases as well. If we don’t actively work to spot and fix these patterns, the model will just keep reflecting them.
The Workforce Gap That Makes It Worse
In 2022, women made up just around 30% of the AI workforce worldwide, which is only a slight increase of four percentage points since 2016. When the majority of those creating and reviewing these systems are men, the chances of identifying, addressing, and fixing gender bias during the design phase drop significantly compared to a more diverse team.
The Leadership Gap Goes Even Deeper
It's surprising to see that only 12% of AI researchers, 16% of AI academic faculty, and 18% of AI startup executives are women. These figures aren't relics from the early days of computing; they reflect the current landscape. This is the pipeline that shapes how AI models are developed, trained, and evaluated.
What UNESCO Is Calling For — The Recommendations
A study that identifies a problem without proposing solutions is a warning sign. UNESCO's report does both.
Hire More Women and Minorities
UNESCO has urged AI companies to bring more women and minorities into their workforce, while also calling on governments to implement regulations that promote ethical AI practices. The reasoning is straightforward: when the teams creating AI systems represent a wider spectrum of human experiences, the results are less likely to unfairly disadvantage those who might otherwise be overlooked.
A Global Regulatory Framework Already Exists — It Needs Enforcement
Back in November 2021, UNESCO Member States came together and unanimously adopted the Recommendation on the Ethics of AI, marking it as the first and only global framework of its kind. Fast forward to February 2024, and we saw eight major tech companies, including Microsoft, throw their support behind this Recommendation. The framework emphasizes the need for concrete actions to promote gender equality in AI design. This includes setting aside funds specifically for gender-parity initiatives, providing financial incentives for women entrepreneurs, and investing in targeted programs aimed at leveling the playing field.
Continuous Monitoring Is Non-Negotiable
UNESCO's Director General, Audrey Azoulay, has urged the need for regulatory frameworks and ongoing oversight from both governments and private companies. This call for continuous monitoring highlights an important takeaway from the study: bias in AI isn't a one-time fix that can be ignored afterward. As models get retrained with fresh data, those same biases can crop up again unless we incorporate active monitoring right from the beginning.
The Open Source Advantage — and Its Limits
The study wrapped up by highlighting how the open and transparent nature of open-source models can really help tackle and reduce biases through increased global collaboration. The authors gave a nod to Llama 2 and GPT-2 for their open-source status, which allows for thorough examination of these issues—something that’s not possible with the closed model, GPT-3.5. It’s a bit of a tricky situation: the models that show the most bias are also the ones that are the most transparent. In other words, the tools that need the most fixing are, at least, the ones that are open to outside review.
Why This Matters Beyond the Headlines
The UNESCO study is two years old. The reason it keeps resurfacing in viral posts is not nostalgia — it is relevance.
Billions of Users, Subtly Shaped Perceptions
UNESCO Director General Audrey Azoulay pointed out, "These new AI applications have the power to subtly shape the perceptions of millions of people, so even small gender biases in their content can significantly amplify inequalities in the real world." The word to focus on here is "subtly." The worry isn't that AI will outright say women should be in the kitchen. Instead, it's about the gradual impact of countless tiny associations—like linking women to "home" and men to "career"—that quietly influence how users perceive things without them even realizing it.
AI Is Now Embedded in Education, Healthcare, and Hiring
The implications of this research go far beyond just what a chatbot might produce when tasked with crafting a story. AI language models have found their way into various sectors, including hiring software, educational resources, medical information platforms, and legal research systems. The biases that link women to domestic roles and men to professional authority don’t just linger within a chatbot; they seep into every application that relies on these models.
The ILO's Warning About Women and AI Jobs
Recent data from the ILO highlights a concerning trend: women are facing greater workplace risks from generative AI compared to their male counterparts. Additionally, women remain underrepresented in STEM careers overall, especially in high-demand areas like engineering and software development. This disparity is significant because when women are excluded from AI-related positions and decision-making roles, they miss out on valuable job opportunities and skill-building experiences. The issues of workforce inequality and bias are intertwined; they represent two sides of the same coin.
What Would Actually Fix It
UNESCO's findings highlight the urgent need for systemic reforms across the board — from classrooms to boardrooms — to make sure women have a hand in shaping AI's development and can reap its benefits. As algorithms play a bigger role in our daily lives, prioritizing ethics and equity isn't just the right thing to do; it's essential for fostering sustainable innovation. Framing the issue this way — that addressing gender bias in AI is about quality and sustainability, not just fairness — is the key argument that can drive real change in the boardrooms where these crucial decisions are made.
Sources
https://news.un.org/en/story/2026/06/1167776
https://www.thedailystar.net/news/bangladesh/news/gender-inequality-embedded-ai-4205876



