EXAONE 4.5 released
https://huggingface.co/LGAI-EXAONE/EXAONE-4.5-33B https://huggingface.co/LGAI-EXAONE/EXAONE-4.5-33B-FP8 https://huggingface.co/LGAI-EXAONE/EXAONE-4.5-33B-GGUF submitted by /u/Secure_Smoke_4280
I love reading benchmark / eval papers. It's one of the best ways to stay up-to-date with progress in Vision Language Models and to understand where they fall short. Vision tasks vary quite a lot from one to another. For example: vision tasks t…
Hello, why do companies create open-source models? They must allocate lots of resources toward this, but for what profit? If anything, doesn't it just pull users away from their paid/proprietary models? submitted by /u/Excelle…
Three years ago this sub was full of llama2 distillation discussions, then llama3.2 and phi3. What happened to them? The last thing I remember about llama was llama4 scout or something, which didn't beat gemma, and then I saw it no more 🙁 submitted by …
I've been playing with Gemma 4 31B for coding tasks since it came out and have been genuinely impressed with how capable it is. With the benchmarks putting it a little behind Qwen3.5, I didn't have high expectations, but it's honestly been perfor…
I recently had to process ~940,000 PDFs. I started with the standard OCR tools, but the bottlenecks were frustrating. Even on an RTX 5090, throughput was low. The Problem: PaddleOCR (the most popular open-source OCR): Maxed out at ~15 img/s. GPU …
VoxCPM2 — Three Modes of Speech Generation: 🎨 Voice Design — Create a brand-new voice 🎛️ Controllable Cloning — Clone a voice with optional style guidance 🎙️ Ultimate Cloning — Reproduce every vocal nuance through audio continuation Demo https://hug…
I abliterated Sarvam-30B and 105B – India's first multilingual MoE reasoning models – and found something interesting along the way! Reasoning models have 2 refusal circuits, not one. The <think> block and the final answer can disagree: the m…
Guys, I'm a Windows user and have been for ages. For my rig I thought, hell, I'll give Linux a try, and a few months back I started on the software side with Win11 and WSL, since all recommendations were pointing towards Linux. Fast forward 4 months of slu…
We recently purchased a DGX Spark with 128 GB RAM to run multimodal LLMs. I'd like to hear how people are getting the most out of this kind of hardware. submitted by /u/gymho69