PQR: A Framework to Generate Diverse and Realistic User Queries that Elicit QA Agent Failures
PQR generates realistic user queries to expose QA agent failures
PQR identifies QA agent failures from real user scenarios, not adversarial ones. It automates testing with realistic queries, reducing manual design efforts. Prior methods focused on adversarial cases, but PQR surfaces failures from genuine user intents.