Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
petermaAI 's Collections
sentiment_analysis
Text-to-SQL
LLM-Papers
LLM
Routing
tts
Embedding_Similarity_Rerank
Q&A
Vision
Job-CV-Match

Vision

updated May 1, 2025
Upvote
-

  • Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

    Paper • 2412.05271 • Published Dec 6, 2024 • 161

  • naver-clova-ix/cord-v2

    Viewer • Updated Jul 19, 2022 • 1k • 9.88k • 117

  • naver-clova-ix/synthdog-en

    Viewer • Updated Jan 31, 2024 • 66k • 1.32k • 27

  • impira/layoutlm-invoices

    Document Question Answering • 0.1B • Updated Mar 25, 2023 • 5.54k • 225

  • SWHL/RapidOCR

    Updated Aug 28, 2024 • 30

  • SWHL/ChineseOCRBench

    Viewer • Updated Apr 30, 2024 • 3.41k • 264 • 24
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs