cs.AI, cs.SE

Do not copy and paste! Rewriting strategies for code retrieval

arXiv:2605.08299v1 Announce Type: cross
Abstract: Embedding-based code retrieval often suffers when encoders overfit to surface syntax. Prior work mitigates this by using LLMs to rephrase queries and corpora into a normalized style, but leaves two que…