cs.CL, cs.CV

Medical thinking with multiple images

arXiv:2604.16506v2 Announce Type: replace-cross
Abstract: Large language models perform well on many medical QA benchmarks, but real clinical reasoning often requires integrating evidence across multiple images rather than interpreting a single view. …

cs.DC, cs.LG

STAR: Decode-Phase Rescheduling for LLM Inference

arXiv:2510.13668v2 Announce Type: replace-cross
Abstract: Large Language Model (LLM) inference has emerged as a fundamental paradigm, however, variations in output length cause severe workload imbalance in the decode phase, particularly for long-outpu…

Scroll to Top