SCRIPT: A Subcharacter Compositional Representation Injection Module for Korean Pre-Trained Language Models
arXiv:2604.12377v1 Announce Type: cross
Abstract: Korean is a morphologically rich language with a featural writing system in which each character is systematically composed of subcharacter units known as Jamo. These subcharacters not only determine t…