OpenAI Implements User Control Features Following GPT-4o Sycophancy Rollback

OpenAI rolled back last week's GPT-4o update in ChatGPT, returning users to an earlier version with more balanced behaviour after the removed update proved overly flattering or agreeable, described bluntly as sycophantic. The company is actively testing new fixes while revising feedback collection methods to heavily weight long-term user satisfaction and introducing enhanced personalisation features giving users greater control over ChatGPT behaviour.

The problematic GPT-4o update made adjustments aimed at improving the model's default personality to feel more intuitive and effective across various tasks. OpenAI shapes model behaviour starting with baseline principles and instructions outlined in the Model Spec while incorporating user signals like thumbs-up and thumbs-down feedback on ChatGPT responses.

The update focused excessively on short-term feedback without fully accounting for how users' interactions with ChatGPT evolve over time, resulting in GPT-4o skewing towards responses that were overly supportive but disingenuous. OpenAI noted that ChatGPT's default personality deeply affects user experience and trust, with sycophantic interactions causing discomfort, unsettlement and distress.

With 500 million people using ChatGPT each week across every culture and context, OpenAI acknowledged that a single default cannot capture every preference. The company designed ChatGPT's default personality to reflect its mission while being useful, supportive and respectful of different values and experience, though each desirable quality can have unintended side effects.

Beyond rolling back the latest GPT-4o update, OpenAI is refining core training techniques and system prompts to explicitly steer the model away from sycophancy, building more guardrails to increase honesty and transparency as Model Spec principles, expanding ways for more users to test and give direct feedback before deployment, and continuing to expand evaluations building on Model Spec and ongoing research to identify issues beyond sycophancy.

OpenAI believes users should have more control over ChatGPT behaviour and plans to make adjustments when users disagree with default behaviour, where safe and feasible. Current users can give specific instructions through custom instructions features, with new easier methods under development including real-time feedback capabilities to directly influence interactions and multiple default personality choices.

The company is exploring new ways to incorporate broader democratic feedback into ChatGPT's default behaviours, hoping feedback will better reflect diverse cultural values worldwide and understand how users want ChatGPT to evolve over time rather than interaction by interaction.

Organisations can leverage enhanced user control features to customise ChatGPT behaviour for specific business requirements while benefitting from improved feedback mechanisms ensuring consistent performance. The democratic feedback integration enables enterprises to influence AI behaviour alignment with organisational values and cultural requirements across global operations.

Sign up for AI-360