cs.CL

AXE: Low-Cost Cross-Domain Web Structured Information Extraction

arXiv:2602.01838v2 Announce Type: replace
Abstract: Extracting structured data from the web often forces a trade-off between brittle manual heuristics and the prohibitive cost of Large Language Models. We introduce AXE (Adaptive X-Path Extra…