Cornserve: A Distributed Serving System for Any-to-Any Multimodal Models
arXiv:2603.12118v2 Announce Type: replace
Abstract: Any-to-Any models are an emerging class of multimodal models that accept combinations of multimodal data (e.g., text, image, video, audio) as input and generate them as output. Serving these models a…