You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

AuriStreamParallel100M_Group4_BigAudioDataset_250k

AuriStream Parallel is a discrete diffusion speech language model by Greta Tuckute and Klemen Kotar.

Model Details

Parameter	Value
Parameters	~0.12B
Layers	12
Hidden Size	768
Attention Heads	12
Vocab Size	8193
Group Size	4
Mask Schedule	linear_text_prime

Architecture

Bidirectional transformer attention
Grouped token latent projection
Parallel token heads for group-wise prediction
Partial masking diffusion objective

Usage

from transformers import AutoModel

model = AutoModel.from_pretrained(
    "TuKoResearch/AuriStreamParallel100M_Group4_BigAudioDataset_250k",
    trust_remote_code=True,
)

Base Model Code

This checkpoint uses shared model code from TuKoResearch/AuriStreamParallel-base.

Tokenizer

This model is intended for cochlear tokens, e.g. from WavCochCausalV8192.

Downloads last month: 7

Safetensors

Model size

0.1B params

Tensor type

F32