How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="afrideva/phi-2-uncensored-GGUF",
	filename="",
)
output = llm(
	"Once upon a time,",
	max_tokens=512,
	echo=True
)
print(output)

Walmart-the-bag/phi-2-uncensored-GGUF

Quantized GGUF model files for phi-2-uncensored from Walmart-the-bag

Name Quant method Size
phi-2-uncensored.fp16.gguf fp16 5.56 GB
phi-2-uncensored.q2_k.gguf q2_k 1.17 GB
phi-2-uncensored.q3_k_m.gguf q3_k_m 1.48 GB
phi-2-uncensored.q4_k_m.gguf q4_k_m 1.79 GB
phi-2-uncensored.q5_k_m.gguf q5_k_m 2.07 GB
phi-2-uncensored.q6_k.gguf q6_k 2.29 GB
phi-2-uncensored.q8_0.gguf q8_0 2.96 GB

Original Model Card:

Downloads last month
434
GGUF
Model size
3B params
Architecture
phi2
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train afrideva/phi-2-uncensored-GGUF