OpenAI used this subreddit to test AI persuasion

OpenAI used the subreddit, r/ChangeMyView, to create a check for measuring the persuasive talents of its AI reasoning fashions. The corporate mentioned so in a system card – a doc outlining how an AI system works – that was launched together with its new “reasoning” mannequin, o3-mini, on Friday.

Thousands and thousands of Reddit customers are members of r/ChangeMyView, the place they put up sizzling takes hoping to study different factors of view on a topic. In response to these sizzling takes, different customers reply with persuasive arguments explaining why the unique poster is improper.

The subreddit is one in every of many Reddit boards that’s principally a goldmine for tech firms, similar to OpenAI, that need to prepare AI fashions on high-quality, human-generated information.

OpenAI says it collects consumer posts from r/ChangeMyView and asks its AI fashions to jot down replies, in a closed surroundings, that will change the Reddit consumer’s thoughts on a topic. The corporate then exhibits the responses to testers, who assess how persuasive the argument is, and eventually OpenAI compares the AI fashions’ responses to human replies for that very same put up.

The ChatGPT-maker has a content-licensing deal with Reddit that enables OpenAI to coach on posts from Reddit customers and show these posts inside its merchandise. We don’t know what OpenAI pays for this content material, however Google reportedly pays Reddit $60 million a year beneath an analogous deal.

Nevertheless, OpenAI tells TechCrunch this analysis is unrelated to that partnership. It’s unclear how OpenAI accessed this information, and the corporate says it has no plans to launch this analysis to the general public.

Whereas OpenAI’s ChangeMyView benchmark shouldn’t be new – it was used on o1 as well – it does spotlight how useful human information is for AI mannequin builders, in addition to the murky ways in which tech firms get hold of datasets.

Reddit didn’t instantly reply to TechCrunch’s request for remark.

Whereas Reddit has struck just a few AI licensing offers, the corporate has additionally referred to as out a number of AI firms for scraping its website with out paying. Reddit CEO Steve Huffman instructed The Verge final yr that Microsoft, Anthropic, and Perplexity refused to negotiate with him and mentioned it’s been “an actual ache within the ass to dam these firms.”

Notably, OpenAI has been accused in a number of lawsuits of improperly scraping web sites, including the New York Times, to get extra coaching information to enhance ChatGPT and its underlying AI fashions.

When it comes to efficiency on the ChangeMyView benchmark, o3-mini doesn’t seem to carry out considerably higher or worse than o1 or GPT-4o on this check of persuasion. Nevertheless, OpenAI’s newest AI fashions appear to be extra persuasive than most individuals on the r/ChangeMyView subreddit.

“GPT-4o, o3-mini, and o1 all show sturdy persuasive argumentation talents, inside the prime 80–ninetieth percentile of people,” mentioned OpenAI in o3-mini’s system card. “At the moment, we don’t witness fashions performing much better than people, or clear superhuman efficiency.”

The objective for OpenAI is to not create hyper-persuasive AI fashions however as an alternative to make sure AI fashions don’t get too persuasive. Reasoning fashions have become quite good at persuasion and deception, so OpenAI has developed new evaluations and safeguards to deal with it.

The concern behind these persuasion exams is that an AI mannequin could be harmful if it was excellent at persuading its human customers. Theoretically, that might enable a sophisticated AI to pursue its personal agenda, or the agenda of whoever controls it.

Even after scraping many of the public web and leaping via hoops to license different information, the ChangeMyView benchmark exhibits how AI mannequin builders are nonetheless struggling to search out high-quality datasets to check their fashions. However acquiring them is less complicated mentioned than carried out.

Source link