#Transformer
2 articles with this tag

Artificial Intelligence
Mamba-3: Inference-First SSMs Arrive
Together AI's Mamba-3 advances state space models with a focus on inference speed, outperforming previous versions and some Transformers.
9 days ago

Artificial Intelligence
Karpathy's microGPT: AI's minimalist masterpiece
Andrej Karpathy's microGPT is a minimalist, dependency-free Python implementation of a GPT language model, designed as an educational art project to showcase core AI mechanics.
about 2 months ago