Min VRAM?
#3
by scraper01 - opened
I tried to load the model onto a 4060 Mobile with 8 GB of VRAM.
It is not up to it: inference takes well over 25 minutes. Flash attention is disabled because I am on Windows.
If I want to run this on Windows, how much VRAM do I need to get reasonable inference times, around 15-20 s?
Regards,
Andy.
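
For a rough sense of scale, a back-of-envelope lower bound on VRAM can be sketched from the weight sizes alone: parameter count times bytes per parameter (2 for bfloat16). The helper name `weight_memory_gib` and the parameter counts below are hypothetical placeholders, not the actual sizes of this model's components; substitute the real counts from the model card. Activations, attention buffers, and the VAE come on top, so the real requirement is higher than this figure.

```python
def weight_memory_gib(num_params: int, bytes_per_param: int = 2) -> float:
    """Approximate memory needed for model weights alone, in GiB.

    bytes_per_param: 2 for bfloat16/float16, 4 for float32.
    """
    return num_params * bytes_per_param / 1024**3


# Hypothetical parameter counts, for illustration only (assumption, not
# taken from the model card). Replace with the real component sizes.
components = {
    "transformer": 2_000_000_000,
    "text_encoder": 2_000_000_000,
}

total_gib = sum(weight_memory_gib(n) for n in components.values())
print(f"Weights alone in bfloat16: about {total_gib:.2f} GiB")
```

If the estimate already sits near the card's total VRAM, the weights are unlikely to stay fully resident on the GPU during inference, which would be consistent with very long generation times.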