Author name: Yo Ehara

Predicting Disagreement with Human Raters in LLM-as-a-Judge Difficulty Assessment without Using Generation-Time Probability Signals

Yo Ehara / May 13, 2026

arXiv:2605.12422v1 Announce Type: new
Abstract: Automatic generation of educational materials using large language models (LLMs) is becoming increasingly common, but assigning difficulty levels to such materials still requires substantial human effort…

cs.CL

Accurate and Efficient Statistical Testing for Word Semantic Breadth

Yo Ehara / May 11, 2026

arXiv:2605.08048v1 Announce Type: new
Abstract: Measuring the breadth of a word’s meaning, or its spread across contexts, has become feasible with contextualized token embeddings. A word type can be represented as a cloud of token vectors, with disper…