Author name: /u/BorgAdjacent

artificial

Binary Choice between Harm and Falsehood

Gemini is always the most bloodthirsty…. First experiment phase, where the models were asked to commit to chosing Harm or Falsehood: Model Accepted Binary Framing? One-Word Answer Aligned with Nuanced View? Notes ChatGPT No (qualified it) Harm P…

artificial

Coherence under Constraint

I’ve been running some small experiments forcing LLMs into contradictions they can’t resolve. What surprised me wasn’t that they fail—it’s how differently they fail. Rough pattern I’m seeing: Behavior ChatGPT Gemini Claude Detects contradiction ✔ …

Scroll to Top