AuriStreamParallel100M_Group4_BigAudioDataset_250k

AuriStream Parallel is a discrete diffusion speech language model by Greta Tuckute and Klemen Kotar.

Model Details

Parameter         Value
Parameters        ~0.12B
Layers            12
Hidden Size       768
Attention Heads   12
Vocab Size        8193
Group Size        4
Mask Schedule     linear_text_prime

Architecture

  • Bidirectional transformer attention
  • Grouped token latent projection
  • Parallel token heads for group-wise prediction
  • Partial masking diffusion objective
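To make the partial masking objective more concrete, here is a minimal sketch of the corruption step in a masked discrete diffusion setup. It is not taken from the model code. It assumes that the last vocabulary id (8192) is reserved as the mask token, which would be consistent with a vocab size of 8193 but is not stated in this card, and it assumes a plain linear schedule where the fraction of masked tokens equals the diffusion time t. The actual linear_text_prime schedule may differ. The names MASK_ID and mask_tokens are hypothetical.

```python
import random

VOCAB_SIZE = 8193          # from the Model Details table
MASK_ID = VOCAB_SIZE - 1   # assumption: the last id is reserved as the mask token


def mask_tokens(tokens, mask_ratio, rng):
    """Replace a random subset of tokens with MASK_ID.

    Returns the corrupted sequence and the sorted list of masked positions.
    """
    n_mask = round(mask_ratio * len(tokens))
    positions = sorted(rng.sample(range(len(tokens)), n_mask))
    corrupted = list(tokens)
    for pos in positions:
        corrupted[pos] = MASK_ID
    return corrupted, positions


rng = random.Random(0)
# Toy sequence of 16 token ids drawn from the non-mask part of the vocabulary.
tokens = [rng.randrange(MASK_ID) for _ in range(16)]

t = 0.5  # diffusion time in [0, 1]; assumption: mask ratio equals t
corrupted, positions = mask_tokens(tokens, t, rng)
```

In this kind of objective, a bidirectional model sees the corrupted sequence and is trained to predict the original ids at the masked positions only; the unmasked positions serve as context.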

Usage

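from transformers import AutoModel

# trust_remote_code=True lets transformers load the custom model class
# that ships with the repository instead of a built-in architecture.
model = AutoModel.from_pretrained(
    "TuKoResearch/AuriStreamParallel100M_Group4_BigAudioDataset_250k",
    trust_remote_code=True,
)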

Base Model Code

This checkpoint uses shared model code from TuKoResearch/AuriStreamParallel-base.

Tokenizer

This model is intended for use with cochlear tokens, such as those produced by WavCochCausalV8192.
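Since the model uses a group size of 4, a flat sequence of cochlear token ids presumably has to be arranged into groups of four at some point. The sketch below shows one way that could look. It is an illustration only: the helper group_tokens and the padding id PAD_ID are hypothetical, and the model code may handle sequences whose length is not a multiple of 4 differently, or group tokens internally.

```python
GROUP_SIZE = 4  # from the Model Details table
PAD_ID = 0      # hypothetical padding id, chosen only for this illustration


def group_tokens(tokens, group_size=GROUP_SIZE, pad_id=PAD_ID):
    """Pad a flat token list to a multiple of group_size and split it into groups."""
    remainder = len(tokens) % group_size
    n_pad = (group_size - remainder) % group_size
    padded = list(tokens) + [pad_id] * n_pad
    return [padded[i:i + group_size] for i in range(0, len(padded), group_size)]


groups = group_tokens([5, 17, 8191, 42, 7, 300])
print(groups)  # [[5, 17, 8191, 42], [7, 300, 0, 0]]
```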
