I BUILT MY FIRST MODEL FROM SCRATCH
Sup, I'm Crownelius, I made that popular opus distill dataset. TODAY YOU ARE INTRODUCED TO SHARD a 40m parameter mal-formed LLM. Right now I'm working on a series of tiny LLM's, with a goal to run a coherent model for IoT tasks. I've …