Aleksei Dorkin PRO
adorkin
AI & ML interests
Computational Linguistics
Recent Activity
liked a model about 16 hours ago
moonshotai/MoonViT-SO-400M liked a model about 16 hours ago
LCO-Embedding/LCO-Embedding-Omni-7B upvoted an article 3 days ago
Introducing BERTopic Integration with the Hugging Face HubOrganizations
Multilingual Text Embedding Models
-
tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • 12B • Updated • 85k • 96 -
nvidia/llama-embed-nemotron-8b
Feature Extraction • 8B • Updated • 44.7k • 165 -
Qwen/Qwen3-Embedding-8B
Feature Extraction • 8B • Updated • 1.98M • • 694 -
Qwen/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 2.76M • 275
Multilingual Text Encoders
Multilingual Text Embedding Models
-
tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • 12B • Updated • 85k • 96 -
nvidia/llama-embed-nemotron-8b
Feature Extraction • 8B • Updated • 44.7k • 165 -
Qwen/Qwen3-Embedding-8B
Feature Extraction • 8B • Updated • 1.98M • • 694 -
Qwen/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 2.76M • 275
spaces 6
Sleeping
Agents
1
NLI Zero Shot Classification
🔍
Zero-shot classification based on natural language inference
Sleeping
Agents
2
GliLem
🤓
Lemmatization disambiguation for Estonian with GliNER
Running
Agents
SigLIP2 + Clothes
🤔
Text-to-image clothing search using SigLIP2
Sleeping
Agents
1
M-CLIP + Clothes
🦀
Text-to-image clothing search using multilingual CLIP
Sleeping
Agents
1
Tweet Emoji Predictor
🧐
Predict an emoji for your tweet (...your X?)
Sleeping
Agents
Sõnajaht Demo
🐠
Keeltevaheline pöördsõnastik
datasets 30
adorkin/replay-mix-10B
Viewer • Updated • 5.99M • 136
adorkin/olmocr_science_pdfs-software
Viewer • Updated • 734k • 1.15k
adorkin/olmocr_science_pdfs-travel_and_tourism
Viewer • Updated • 350k • 1.11k
adorkin/olmocr_science_pdfs-art_and_design
Viewer • Updated • 873k • 878
adorkin/olmocr_science_pdfs-history_and_geography
Viewer • Updated • 1.67M • 2.06k
adorkin/olmocr_science_pdfs-literature
Viewer • Updated • 1.85M • 2.19k
adorkin/olmocr_science_pdfs-software_development
Viewer • Updated • 2.21M • 1.68k
adorkin/olmocr_science_pdfs-games
Viewer • Updated • 157k • 259
adorkin/scientific-summaries-pubmed-open-access
Viewer • Updated • 270k • 5.29k
adorkin/nemotron-code-student-teacher-10M
Viewer • Updated • 10M • 1.16k