Bridging the Semantic Gap for Categorical Data Clustering via Large Language Models
arXiv:2601.01162v2
Abstract: Categorical data are prevalent in domains such as healthcare, marketing, and bioinformatics, where clustering serves as a fundamental tool for pattern discovery. A core challenge in categorical…