Activity Feed

AI & ML interests

Model evaluation, Benchmark analysis, Generative language models, Measurement theories

Recent Activity

Salomeee updated a dataset about 1 month ago
human-centered-eval/OpenEval
Salomeee updated a dataset about 1 month ago
human-centered-eval/OpenEval
Salomeee published a dataset about 2 months ago
human-centered-eval/OpenEval