#AI

11 total

2026-03-19
Attention Residuals: Making Residual Connections Attention-Like
A reading of Kimi Team's Attention Residuals technical report: why residual connections should become attention-like too, and how Full AttnRes / Block AttnRes turn that idea into a trainable, deployable system

#technical-report-reading #residual-connections #transformer #AI #LLM #python
2026-03-11
Training Compute-Optimal Large Language Models: What Chinchilla Changed
The Chinchilla paper — why most large models were undertrained, and how to spend your compute budget wisely, with real Python code examples

#paper-reading #chinchilla #scaling-laws #AI #LLM #python
2026-03-01
Scaling Laws for Neural Language Models: The Mathematics of Scale
The mathematics of scale — why bigger models are predictably better, with real Python code examples

#paper-reading #scaling-laws #AI #LLM #python
2026-02-24
OpenClaw Architecture: Inside a Self-Hosted AI Assistant 🦞
Dissecting the engineering skeleton of a self-hosted AI assistant, based on v2026.3.8 source code

#AI #open-source #openclaw
2026-02-11
Language Models are Few-Shot Learners: GPT-3 and In-Context Learning
Larger models, better at eliciting abilities from context, with real Python code examples

#paper-reading #gpt-3 #AI #LLM #python
2026-02-03
OpenClaw Ecosystem: From Open-Source Project to AI Assistant Platform 🦞
A sharper look at OpenClaw: stars are not an ecosystem; structures that reduce supply and usage friction are.

#AI #open-source #openclaw
2026-01-31
BERT: The Pre-Training Blueprint for Language Understanding
Establishing the pre-training paradigm, with real Python code examples

#paper-reading #bert #AI #LLM #python
2026-01-24
Sequence to Sequence Learning: The Encoder-Decoder Blueprint
Establishing the encoder-decoder paradigm, with real Python code examples

#paper-reading #seq2seq #AI #LLM #python
2026-01-16
Clawdbot: An Early Sample of Personal AI Sovereignty
Clawdbot shows that the hard question for self-hosted AI agents is not how smart the model is, but who owns control, context, and execution rights.

#AI #open-source
2026-01-11
Neural Machine Translation by Jointly Learning to Align and Translate: Attention Before Transformers
The origin of attention mechanism, with real Python code examples

#paper-reading #attention #AI #LLM #python
2026-01-06
Attention Is All You Need: The Transformer Blueprint
A study note on the Transformer paper, with real Python code examples

#paper-reading #transformer #AI #LLM #python