Anthropic built a democratic AI chatbot by letting users choose its principles.
Anthropic, an artificial intelligence (AI) company, has tuned a large language model (LLM) to reflect values defined by its users. The study gathered input from 1,000 participants and used their collective judgments to fine-tune the LLM’s responses. Conventional LLMs come with predefined guardrails that constrain certain outputs; Anthropic’s approach instead emphasizes user agency. Models such as Anthropic’s Claude and OpenAI’s ChatGPT often fall back on preset safety responses, especially on sensitive topics. Critics argue that such interventions can compromise user autonomy, because what counts as acceptable is subjective and varies across cultures and time periods. One potential solution is to let users shape the value alignment of AI models themselves. Anthropic embarked on the “Collective Constitutional AI” experiment in collaboration with Polis and the C...