Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 9 days ago • 164
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16, 2025 • 128
Sirreajohn/whisper-tiny-minds14-en Automatic Speech Recognition • 37.8M • Updated Sep 10, 2025 • 1
Sirreajohn/whisper-tiny-minds14-en Automatic Speech Recognition • 37.8M • Updated Sep 10, 2025 • 1
view article Article Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers Narsil • Feb 1, 2022 • 16
Running on Zero Agents 67 OCR Time Machine 📚 67 Extract text from images and XML files using OCR models
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale Paper • 2010.11929 • Published Oct 22, 2020 • 20