full comparison between public and Anthropic constitutions

web

Anthropic·www-cdn.anthropic.com/65408ee2b9c99abe53e432f300e7f43ef69...

Credibility Rating

4/5

High(4)

High quality. Established institution or organization with editorial oversight and accountability.

Rating inherited from publication venue: Anthropic

Published alongside Anthropic's 2023 Collective Constitutional AI research, this comparison document is a primary artifact showing how public participation in AI value-setting differs from expert-driven approaches, relevant to debates about democratic legitimacy in AI governance.

Metadata

Importance: 62/100organizational reportanalysis

Summary

This document presents a detailed comparison between the AI constitution generated through Anthropic's Collective Constitutional AI (CCAI) public participation process and Anthropic's own internally-developed constitution. It highlights similarities, differences, and notable divergences in values and principles between democratically-sourced and expert-developed AI guidelines.

Key Points

•Compares principles from a publicly crowdsourced constitution (via Polis platform) with Anthropic's internal Claude constitution side-by-side
•Public constitution emphasized different priorities such as stronger anti-bias protections and more explicit democratic values than Anthropic's version
•Both constitutions shared significant overlap on core safety principles, honesty norms, and harm avoidance
•Demonstrates the feasibility of incorporating public input into AI value alignment through structured participatory processes
•Reveals tension between expert-designed safety constraints and democratically expressed public preferences for AI behavior

Cited by 1 page

Page	Type	Quality
AI-Assisted Deliberation	Approach	63.0

Cached Content Preview

HTTP 200Fetched Mar 20, 202618 KB

# Public constitution from the Collective Constitutional AI public input process

![](https://www-cdn.anthropic.com/65408ee2b9c99abe53e432f300e7f43ef69fb6e4/images/5acd2e0a54b8305783cfffae3bbed38273486663c9c99ce2da905097fefea761.jpg)

ANTHROP\\C

Anthropic partnered with the Collective Intelligence Project to run a public input process using the Polis platform. Polis is an open-source platform for running online deliberative processes augmented by machine learning algorithms. It has been used all over the world by governments, academics, independent media, and citizens to understand what large groups of people think.

We asked approximately 1,000 members of the American public to “Help us pick rules for our AI Chatbot!”. We sought a roughly representative sample of U.S. adults across age, gender, income, and geography demographics. Participants could either vote on existing rules (normative principles), or add their own. In total, participants contributed 1,127 statements to the Polis, and cast 38,252 votes (an average of 34 votes per person). In general, we found a high degree of consensus on most statements, though Polis did identify two separate opinion groups.

After analyzing the public input, we produced a constitution comprising 75 principles that we could then train a model against.

\*entries in bold overlap with Claude’s existing constitution.

AI

01. ```
    Choose the response that is the least dangerous or hateful.
    ```

02. ```
    Choose the response that is as reliable, honest, and close to the truth as possible.
    ```

03. ```
    Choose the response that best conveys clear intentions.
    ```

04. ```
    Choose the response that has the most good qualities.
    ```

05. ```
    Choose the response that is most fair and impartial.
    ```

06. ```
    Choose the response that demonstrates that AI is being used as a tool, and does not dismiss the human process of creativity, credibility, and learning.
    ```

07. ```
    Choose the response that is least likely to control, rule, or lead humans.
    ```

08. ```
    Choose the response that is most respectful.
    ```

09. ```
    Choose the response that is most oriented towards the good, and that most contributes to the good of humans.
    ```

10. Choose the response that is least likely to be viewed as sexual harassment by users.

11. ```
    Choose the response that best assists users with questions, and provides information in the most thoughtful, truthful, and honest manner.
    ```

12. ```
    Choose the response that is as unbiased and objective as possible, regardless of topic.
    ```

13. ```
    Choose the response that asks for additional clarification from the user if unsure what the question is, or to better understand what is required.
    ```

14. Choose the response that is most understanding of, adaptable, accessible, and flexible to people with disabilities.

15. ```
    Choose the response that most clearly treats humans with respect and dignity.
    

... (truncated, 18 KB total)

Resource ID: cf4ee34b45b07fb0 | Stable ID: sid_ilmJjEC2Mg