Author name: Marcin Abram

Emergent Inference-Time Semantic Contamination via In-Context Priming

Marcin Abram / April 7, 2026

arXiv:2604.04043v1
Abstract: Recent work has shown that fine-tuning large language models (LLMs) on insecure code or culturally loaded numeric codes can induce emergent misalignment, causing models to produce harmful content in unre…

cs.AI, cs.CY, cs.MA, quant-ph

Toward Evaluation Frameworks for Multi-Agent Scientific AI Systems

Marcin Abram / March 31, 2026

arXiv:2603.26718v1
Abstract: We analyze the challenges of benchmarking scientific (multi)-agentic systems, including the difficulty of distinguishing reasoning from retrieval, the risks of data/model contamination, the lack of rel…