Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Suchir Salhan
suchirsalhan
1
13
2
Follow
21world's profile picture
Gargaz's profile picture
dianags's profile picture
14 followers
·
35 following
https://www.suchirsalhan.com/
suchirsalhan
suchirsalhan
ssalhan
AI & ML interests
Multilinguality and Cognitively-Inspired AI. Tokenization, Pretraining, Interpretability & Alignment.
Recent Activity
updated
a model
2 days ago
Beetle-FineWeb-100M/beetle-bilingual-l2-50-sequential-33-67-b3-fineweb-100m-isl-eng-1xa100
published
a model
2 days ago
Beetle-FineWeb-100M/beetle-bilingual-l2-50-sequential-33-67-b3-fineweb-100m-isl-eng-1xa100
updated
a model
2 days ago
Beetle-FineWeb-100M/beetle-bilingual-l2-50-simultaneous-b2-fineweb-100m-isl-eng-1xa100
View all activity
Organizations
suchirsalhan
's datasets
9
Sort: Recently updated
suchirsalhan/kidalign-llama-filterable
Viewer
•
Updated
Apr 14
•
97.6k
•
15
suchirsalhan/kidalign-llama-3.1-8B-Instruct
Updated
Apr 14
•
136
suchirsalhan/babylm-detox
Viewer
•
Updated
Apr 8
•
11.6M
•
32
suchirsalhan/gptbert-tokenised
Updated
Jul 24, 2025
•
2
suchirsalhan/Phonemized-UD
Viewer
•
Updated
May 30, 2025
•
1.19M
•
342
suchirsalhan/BabyLM-Pretokenised
Viewer
•
Updated
Jan 31, 2025
•
1.64M
•
15
suchirsalhan/MAO-CHILDES
Viewer
•
Updated
Apr 11, 2024
•
3.81M
•
28
suchirsalhan/CLiMP
Preview
•
Updated
Apr 2, 2024
•
36
•
1
suchirsalhan/SLING
Viewer
•
Updated
Apr 2, 2024
•
40k
•
97