https://techcrunch.com/2025/06/18/openai-found-features-in-ai-models-that-correspond-to-different-personas/
By looking at an AI model's internal representations — the numbers that dictate how an AI model responds, which often seem completely incoherent to humans — OpenAI researchers were able to find patterns that lit up when a model misbehaved.
Create an account or login to join the discussion