Monitor retrieval accuracy, answer groundedness, failed queries, and knowledge base health across all collections.