prompts - Provide.ai

Arize Agent Skills, claude-skills, Evals, evaluation framework, Evaluations, llm-evaluation, LLMs, model-evaluation, prompts, system prompt assembly

Models got an order of magnitude better at following instructions in one year

Laurie Voss / May 12, 2026

A year ago, frontier models started losing track of instructions somewhere around 200–300 simultaneous constraints. With 2026 models, that ceiling is closer to 2,000 — an order-of-magnitude jump. We re-ran IFScale to see how, and how each model fails.

The post Models got an order of magnitude better at following instructions in one year appeared first on Arize AI.

Researchers Say ‘Natural Decision-Making’ Prompt Strategy Boosts AI Accuracy in Healthcare Advice

Insider Brief Researchers at Technische Universität Berlin report finding that prompting large language models to reason more like humans significantly improved their ability to provide medical care-seeking advice, according to a study published in JMIR Biomedical Engineering. The study focused on a growing problem surrounding AI health tools such as ChatGPT: the tendency to recommend […]