Make some questions and copy pastes of questions/situations from Discord.
Evaluate how AI reads the document. Use cheap thinking models.
Evaluate with another prompt on if the answer or answer traces matches the expected answer.
Extremely experimental and curious.