cs.AI, cs.CL, cs.LG

Bridging the Semantic Gap for Categorical Data Clustering via Large Language Models

arXiv:2601.01162v2 Announce Type: replace-cross
Abstract: Categorical data are prevalent in domains such as healthcare, marketing, and bioinformatics, where clustering serves as a fundamental tool for pattern discovery. A core challenge in categorical…