Kristian Schwethelm
KristianS7
AI & ML interests
Large Language Models
Recent Activity
updated a model 2 days ago
KristianS7/Ouro-1.4B
new activity 2 days ago
KristianS7/Ouro-1.4B: Update tied weight metadata for Transformers 5
liked a model 6 days ago
KristianS7/Ouro-1.4B
Organizations
Update tied weight metadata for Transformers 5
#3 opened 2 days ago by KristianS7
Update Ouro remote code for Transformers 5.9
1
#2 opened 6 days ago by KristianS7
Fix default RoPE initialization for Transformers 5.9
#1 opened 6 days ago by KristianS7
Lower evaluation results
1
#2 opened 6 months ago by MianchuWang
Differences in the results of the reproduction test on lm-evaluation-harness
3
#8 opened 4 months ago by ThreeGold116
Fix bos/eos token IDs (config.json + tokenizer_config.json)
#5 opened 3 months ago by KristianS7
Fix UniversalTransformerCache.get_mask_sizes for batched generation
#4 opened 3 months ago by KristianS7
Fix bos/eos token IDs (config.json + tokenizer_config.json)
#11 opened 3 months ago by KristianS7
Fix UniversalTransformerCache.get_mask_sizes for batched generation
#10 opened 3 months ago by KristianS7
Fix UniversalTransformerCache.get_mask_sizes for batched generation
1
#5 opened 3 months ago by KristianS7
Fix UniversalTransformerCache.get_mask_sizes for batched generation
1
#8 opened 3 months ago by KristianS7
Batched generation (batch_size > 1) produces incorrect outputs — possible causal mask issue?
1
#9 opened 4 months ago by vconchel
Fix bos/eos token IDs + add enable_thinking to chat template
2
#7 opened 3 months ago by KristianS7
Fix bos/eos token IDs + add enable_thinking to chat template
2
#4 opened 3 months ago by KristianS7
Fix bos/eos token IDs + add enable_thinking to chat template
2
#7 opened 3 months ago by KristianS7
Fix bos/eos token IDs + add enable_thinking to chat template
2
#4 opened 3 months ago by KristianS7