Qintong Li
Qintong Li
Home
Publications
Projects
Recent News
Leyang Cui
Latest
Exploring the Reliability of Large Language Models as Customized Evaluators for Diverse NLP Tasks
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
MAGE: Machine-generated Text Detection in the Wild
Cite
×