Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
parrishcorcoran
/
MedusaBitNet-2B-4T
like
0
Text Generation
GGUF
English
bitnet
speculative-decoding
medusa
ternary-weights
efficient-inference
cpu-inference
arxiv:
2401.10774
License:
mit
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
MedusaBitNet-2B-4T
1.31 GB
Ctrl+K
Ctrl+K
2 contributors
History:
8 commits
Parrish Corcoran
Add trained Medusa heads and merged GGUF model
734ed40
about 2 months ago
figures
Upload folder using huggingface_hub
about 2 months ago
.gitattributes
Safe
2.29 kB
Add trained Medusa heads and merged GGUF model
about 2 months ago
README.md
Safe
5.25 kB
Upload README.md with huggingface_hub
about 2 months ago
benchmark_headtohead.json
Safe
6.64 kB
Upload benchmark_headtohead.json with huggingface_hub
about 2 months ago
benchmark_medusa_real.json
Safe
504 Bytes
Upload benchmark_medusa_real.json with huggingface_hub
about 2 months ago
benchmark_results.json
Safe
3.08 kB
Upload benchmark_results.json with huggingface_hub
about 2 months ago
ggml-model-i2_s-medusa.gguf
1.2 GB
xet
Add trained Medusa heads and merged GGUF model
about 2 months ago
medusa_heads_step2000.pt
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
105 MB
xet
Add trained Medusa heads and merged GGUF model
about 2 months ago