Most content doesn't fail AI training pipelines because it's wrong — it fails because it lacks the structural signals that pipeline filters are calibrated to detect. Here is the five-stage process that decides what AI systems know, and what they don't.