BS Software Engineering at Virtual University | The Strategic Choice
By: Sohaib Malik, Software Engineer & Certified AI EngineerContinue reading on Medium »
By: Sohaib Malik, Software Engineer & Certified AI EngineerContinue reading on Medium »
microsoft/VibeVoice
VibeVoice is Microsoft’s Whisper-style audio model for speech-to-text, MIT licensed and with speaker diarization built into the model.
Microsoft released it on January 21st, 2026 but I hadn’t tried it until today. Here’s a one…
If you write Python long enough, you eventually accept a weird ritual as normal.Continue reading on Towards AI »
Thanks to a tip from Rahim Nathwani, here’s a uv run recipe for transcribing an audio file on macOS using the 10.28 GB Gemma 4 E2B model with MLX and mlx-vlm:
uv run –python 3.13 –with mlx_vlm –with torchvision –with gradio \
mlx_vlm.generat…
Trip Venturella released Mr. Chatterbox, a language model trained entirely on out-of-copyright text from the British Library. Here’s how he describes it in the model card:
Mr. Chatterbox is a language model trained entirely from scratch on a corp…