A Comprehensive Guide to Explainable AI: From Classical Models to LLMs Paper • 2412.00800 • Published Dec 1, 2024 • 1
KokushiMD-10: Benchmark for Evaluating Large Language Models on Ten Japanese National Healthcare Licensing Examinations Paper • 2506.11114 • Published Jun 9, 2025
Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models Paper • 2508.12461 • Published Aug 17, 2025 • 2
Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models Paper • 2508.17184 • Published Aug 24, 2025