Supergoatscriptguy

I like building things from scratch to understand how they work. Mostly neural networks and ML stuff, lately a chess engine in C++ too — but I'll automate anything that annoys me enough.

Projects

IxEngine: A chess engine in C++, from scratch — no libraries, just bitboards, alpha-beta search (PVS, null-move pruning, LMR, transposition tables) and a hand-tuned evaluation. Holds roughly even with Stockfish capped around ~2800 in blitz. My first proper systems project after a lot of Python, and probably the one I've had the most fun with.

Pion: Dense transformer LM. No gimmicks, just best practices — GQA, QK-Norm, SwiGLU, RoPE, tied embeddings, chunked cross-entropy. Pretrained Pion-small (103M) on RunPod, then fine-tuned on OpenHermes 2.5. Checkpoints on HuggingFace. (Still kinda bad, call it a W.I.P)

Tachyon 2: MoE transformer I built for pretraining. 8 experts per layer, only 2 active per token. Gets ~95k tok/s on an H100. The whole point was making something fast. (Slower than expected but functional)

Tokenizers: Tokenizers I use for my models.

Models & Checkpoints

Pretrained weights and checkpoints are on HuggingFace. Download them to try generation or continue training.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Supergoatscriptguy

Block or report Supergoatscriptguy

Projects

Models & Checkpoints

Stuff I Use

Pinned Loading

Uh oh!