CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data Paper • 2601.18026 • Published Jan 25
UniSkill: A Dataset for Matching University Curricula to Professional Competencies Paper • 2603.03134 • Published Mar 3
WorkRB: A Community-Driven Evaluation Framework for AI in the Work Domain Paper • 2604.13055 • Published Mar 17
CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations Paper • 2605.26293 • Published 5 days ago • 2
Scaling Reasoning can Improve Factuality in Large Language Models Paper • 2505.11140 • Published May 16, 2025 • 7
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Paper • 2504.07072 • Published Apr 9, 2025 • 9
How Do Hackathons Foster Creativity? Towards AI Collaborative Evaluation of Creativity at Scale Paper • 2503.04290 • Published Mar 6, 2025 • 1
HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings Paper • 2502.15411 • Published Feb 21, 2025 • 3
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18, 2025 • 19
SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems Paper • 2502.12927 • Published Feb 18, 2025 • 1
On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation Paper • 2502.12923 • Published Feb 18, 2025
SnakModel: Lessons Learned from Training an Open Danish Large Language Model Paper • 2412.12956 • Published Dec 17, 2024 • 2
Leveraging Large Language Models for Actionable Course Evaluation Student Feedback to Lecturers Paper • 2407.01274 • Published Jul 1, 2024 • 1
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge Paper • 2411.19799 • Published Nov 29, 2024 • 17
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Paper • 2411.16508 • Published Nov 25, 2024 • 10
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020) Paper • 2006.07235 • Published Jun 12, 2020
Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming Paper • 2311.06237 • Published Nov 10, 2023 • 1
Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research Paper • 2306.16900 • Published Jun 29, 2023