Veyra AI
AI & ML interests
Building tiny English language models for practical local AI. Veyra AI focuses on CPU-friendly inference, function calling, tool use, Python-oriented small models, distillation, RLVR, and lightweight fine-tuning. The goal is to make compact models that are easy to run, inspect, adapt, and use in real workflows without large hardware.
Recent Activity
Welcome to Veyra AI
Tiny English language models built for fast local inference. Veyra AI focuses on compact, CPU-friendly language models that are easy to run, fine-tune, and experiment with. Our work is centered on small English models, function calling, Python-oriented variants, distillation, RLVR, tool use, and local AI. The goal is simple: make capable small models that are practical for local workflows, research, and lightweight deployment.
Current Model Families:
- Veyra2 30M/15M — Newer models, smaller footprint but may not peform as well as Veyra 30M.
- Veyra 30M — Proven 30M base model with strong instruction-following and balanced general capabilities.
Planned Model Families:
- Veyra3 30M/15M Next-generation models optimized for low-latency inference and on-device deployment. Delivers excellent speed and efficiency without sacrificing responsiveness.
Note:
Kairo models are experimental and should not be used as reliable general purpose language models.
