ViT-Walkability

This repository contains a fine-tuned Vision Transformer model for walkability-related image classification tasks.

Checkpoint details

  • Source file: best.pt
  • Top-level type: dict
  • Base model: google/vit-base-patch16-224
  • Num labels: 5
  • Labels: ['LABEL_0', 'LABEL_1', 'LABEL_2', 'LABEL_3', 'LABEL_4']
  • Keys: ['model_state_dict', 'epoch']

Loading notes

The uploaded files are in standard Hugging Face Transformers format and can be loaded directly with from_pretrained.

Downloads last month
51
Safetensors
Model size
85.8M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support