Blog

Research, engineering, and insights from Kroonen AI.

Featured Run Finished 8 min read

April 9, 2026

Genesis 1B: Run 2 Extended - 60,000 Steps

Run 2 extended to 60,000 steps (~31.5B tokens). Step 42,292/60,000 (70.5%), average loss ~1.94. ETA April 16, 2026.

Genesis Run 2 pretraining running

🧬 Genesis 1B

March 24, 2026 5 min read

Genesis 1B, Run 2: Deeper, Faster, Smarter

Same 1B parameters, 3x throughput. How torch.compile, real-valued RoPE, a deeper architecture (32 layers vs 20), and batch tuning tripled training speed on the same 2x RTX 4090 hardware.

Genesis Run 2 architecture torch.compile

March 21, 2026 10 min read

The Genesis Manifesto: Sovereign Intelligence for the Post-Generative Era

Data sovereignty, constitutional alignment, and the case for training language models on consumer hardware. Why the future of AI is local, private, and personality-first.

Genesis philosophy alignment

🔧 Postmortems

March 23, 2026 8 min read

The Optimizer State Bug: A Silent Failure in DCP Resume

An AdamW optimizer-state bug during Run 1 that produced a false recovery on poisoned weights. The load path didn't crash or hang; it just silently ruined the model.

Genesis Run 1 postmortem optimizer dcp

March 18, 2026 8 min read

Fixing FSDP Checkpoint Deadlocks on 2x RTX 4090

How we fixed FSDP checkpoint deadlocks during Run 1 on consumer GPUs without NVLink, using DCP sharded checkpoints and decoupled evaluation.

Genesis Run 1 postmortem fsdp dcp

🔬 Research

March 15, 2026 5 min read

Mapping the Mind of Qwen 3.5 9B: A Sparse Autoencoder for Mechanistic Interpretability

A sparse autoencoder trained on the internal activations of Qwen 3.5 9B. Zero dead features, 16,384 interpretable dimensions, trained on a single RTX 4090.

interpretability sparse-autoencoder research