Instructions to use 4ntoine/Qwen2.5-Coder-3B-Instruct-LiteRTLM with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- LiteRT-LM
How to use 4ntoine/Qwen2.5-Coder-3B-Instruct-LiteRTLM with LiteRT-LM:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
The model is converted from the original Qwen/Qwen2.5-Coder-3B-Instruct using:
litert-torch export_hf \
--model=Qwen/Qwen2.5-Coder-3B-Instruct \
--output_dir="./dynamic_wi8_afp32" \
--quantization_recipe="dynamic_wi8_afp32" \
--bundle_litert_lm=true
- Downloads last month
- 175
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support