autoresearch-criteria-health-check-at-experiment-10
At experiment 10, check criteria health: if >95% of outputs pass, the criterion is too easy and scores are meaningless; if <10% pass, it is too hard and the agent will thrash. Flag and recalibrate criteria before continuing. Criteria rot silently invalidates entire experiment runs without this checkpoint.