Bias in AI

Simple Definition

AI bias refers to systematic errors or unfairness in AI outputs — where the model consistently produces results that favor or disadvantage certain groups of people, or that reflect inaccurate stereotypes.

Bias doesn’t come from malicious intent. It typically enters through training data that reflects existing human biases, societal inequalities, or underrepresentation of certain groups.

How Bias Enters AI Systems

1. Biased training data If training data over-represents certain demographics, industries, or viewpoints, the model learns those patterns. Historical data about who got loans or jobs encodes past discrimination.

2. Labeling bias Humans label training data with their own unconscious biases. What one person considers “professional” writing may reflect cultural assumptions.

3. Sampling bias If the training dataset doesn’t represent the full diversity of users and use cases, the model performs worse for underrepresented groups.

4. Feedback loops If biased outputs are used as training data for the next model version, bias compounds over time.

Real-World Examples

  • Facial recognition systems performing significantly worse on darker skin tones
  • Resume screening tools downranking women’s applications
  • Language models generating more negative associations with certain ethnic names
  • Medical AI trained on data that underrepresents women or non-white patients

Why It Matters

AI is making real decisions about credit, hiring, medical care, and content moderation. Biased AI can scale discrimination far faster than individual human bias.

Reducing Bias

  • Diverse, representative training datasets
  • Bias audits and testing across demographic groups
  • Human review of AI decisions in high-stakes contexts
  • Ongoing monitoring after deployment
  • Training Data — the primary source of bias in AI systems
  • AI Safety — fairness and bias reduction are core AI safety concerns
  • AI Ethics — the broader field addressing fairness, accountability, and bias
  • Alignment — ensuring AI behavior matches intended values, including fairness

See AI terms in action

Browse practical AI workflows that use the concepts in this glossary.

Last updated: