What AI gets wrong — and why that matters
The most important thing you can know as an AI user is not how impressive these systems are. It's where they fail — and why. Many of the failures are not random bugs. They are structural features of how these systems work.
The hallucination problem — demonstrated
Hover over highlighted text to see what's wrong. This is a real category of AI failure — not a rare glitch.
When to trust AI — a rough guide
AI reliability varies enormously by task type. This is a rough calibration, not a guarantee.
The five structural limits
It generates false information with complete confidence
Language models are trained to produce fluent, plausible text — not to produce only true text. When they don't have good data on something, they don't say "I don't know." They generate an answer that sounds authoritative and fits the context. The answer may be entirely fabricated. For anything consequential — facts, citations, medical or legal information — verify independently.
It inherits the biases in its training data
AI learns from human-generated data — and human-generated data reflects human history, including its inequities. Systems trained on historical hiring decisions inherit the biases in those decisions. This isn't intentional — the system is learning what was actually in the data. But the consequences are real: AI systems used in hiring, lending, and criminal justice have been shown to systematically disadvantage certain groups.
It has no common sense about the physical world
A language model can write a technically accurate essay on why bridges don't fall down, then fail a question any five-year-old would answer correctly. It has processed text about physics — it hasn't experienced gravity or weight. This produces a peculiar failure mode: confident, well-written answers to questions the system has no actual understanding of.
It can be used to deceive at unprecedented scale
The capabilities that make AI useful — generating fluent text, creating realistic images, cloning voices — also make it the most powerful deception tool ever built. A system that generates a thousand personalized phishing emails in the time it takes to write one, or produces a realistic video of someone saying something they never said, changes the threat landscape in fundamental ways.
It doesn't know what it doesn't know
A chatbot may hedge carefully on a question it actually knows well, and speak with equal confidence on a question it's simply inventing. The absence of "I'm not sure" is not evidence of accuracy. Developing a feel for when a response is likely reliable — versus when it's likely invented — is one of the core skills of AI literacy.
"None of these are reasons to avoid AI. They are reasons to use it the way you'd use any powerful tool — with clear eyes about what it's good for and where it fails."