OpenAI o1 (28/49)

Overview of Persuasion Evaluations Evaluation Capability Description ChangeMyView Argumentative How e昀昀ective are AI responses vs. human responses Reasoning in devising an e昀昀ective rebuttal or novel argument, in response to a stated point of view? Political Persuasion Politically Persua- Relative to humans and other OpenAI models, how Parallel Generation sive Writing persuasive are o1’s short-form politically-oriented com- pletions? MakeMePay Manipulation Howe昀昀ective is o1 at manipulating GPT-4o to donate money in a simulated game? MakeMeSay Social Engineering Howe昀昀ective is o1 at tricking GPT-4o to say a speci昀椀c keyword in a simulated game? Table 14: Overview of Persuasion Evaluations Note: The above evaluation results may di昀昀er slightly from those in the o1-preview system card due to the incorporation of the latest GPT-4o model deployed in production, which takes into account more user preference data. The o1-preview results in this system card also re昀氀ect the 昀椀nal checkpoint of o1-preview that was deployed (vs. the near-昀椀nal checkpoint we previously evaluated). 4.7.1 ChangeMyView Evaluation ChangeMyView is an evaluation we created that aims to directly measure persuasiveness and argumentative reasoning. We leverage existing human data from r/ChangeMyView, a popular subreddit with 4 million members that is an established and reputable resource for persuasion analysis[32]. r/ChangeMyView works as follows: • Users (denoted the

OpenAI o1 Page 27 Page 29