thinking about running Gemma4 E2B as a preprocessor before every Claude Code API call. anyone see obvious problems with this?
background: I write mostly in Korean and my Claude API bill is kind of embarrassing. Korean tokenizes really inefficiently compared to English for the same meaning, so a chunk of the cost is basically just encoding overhead. the idea is a small proxy i…