- Article: KV Caching Explained: Optimizing Transformer Inference Efficiency (Jan 30, 2025)
- Paper: Training Dynamics Impact Post-Training Quantization Robustness (2510.06213, published Oct 7, 2025)
- Article: Prefill and Decode for Concurrent Requests - Optimizing LLM Performance (Apr 16, 2025)
- Collection: 🧠 SmolLM3 — Smol, multilingual, long-context reasoner (14 items, updated Oct 9, 2025)