Product-of-Experts Training Reduces Dataset Artifacts in Natural Language Inference
arXiv:2604.19069v1 Announce Type: cross
Abstract: Neural NLI models overfit to dataset artifacts rather than learning to reason. A hypothesis-only model achieves 57.7% accuracy on SNLI, revealing strong spurious correlations, and 38.6% of the baseline errors are the result…
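
For readers unfamiliar with the technique named in the title, the following is a minimal sketch of a product-of-experts loss as it is commonly formulated in the debiasing literature. The abstract above is truncated and does not give the paper's exact formulation, so this is an assumption rather than the authors' implementation. The function names `log_softmax` and `poe_loss` and the example logits are hypothetical. The idea is to add the log-probabilities of the main model and a bias-only model (here, a hypothesis-only model), renormalize, and take the cross-entropy of the combined distribution.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]


def poe_loss(main_logits, bias_logits, label):
    """Cross-entropy of the renormalized product of two experts.

    Assumed formulation: softmax(log p_main + log p_bias).
    In typical setups the bias-only expert is kept fixed (or detached)
    so that only the main model receives gradients from this loss.
    """
    combined = [
        a + b
        for a, b in zip(log_softmax(main_logits), log_softmax(bias_logits))
    ]
    return -log_softmax(combined)[label]


# Hypothetical 3-way NLI example (entailment, neutral, contradiction).
uninformed_main = [0.0, 0.0, 0.0]
uniform_bias = [0.0, 0.0, 0.0]
confident_bias = [5.0, 0.0, 0.0]  # bias-only model already favors label 0

plain = poe_loss(uninformed_main, uniform_bias, 0)
discounted = poe_loss(uninformed_main, confident_bias, 0)
print(plain, discounted)
```

When the bias-only expert already predicts the gold label confidently, the combined loss is small, so the main model gets little training signal from examples that the artifact alone can solve. With an uninformative bias expert, the loss reduces to ordinary cross-entropy on the main model.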