Everything We Know About the Mysterious New ‘Gpt2-chatbot’

3 weeks ago

May 1, 2024 at 10:30 am

Everything We Know About the Mysterious New ‘Gpt2-chatbot’

A mysterious new AI chatbot called “gpt2-chatbot” is turning heads this week after it became available on a major large language model benchmarking site, LMSYS Org. No one knows where it came from, but many consider it to have roughly the same capabilities as OpenAI’s GPT-4. This puts gpt2-chatbot in a rare class of AI models that only a handful of developers worldwide have been able to achieve.

“No one knows who made it or what it is, but I have been playing with it a little and it appears to be in the same rough ability level as GPT-4,” Ethan Mollick, a Professor researching artificial intelligence at the Wharton School of the University of Pennsylvania, said in a tweet on Monday.

Online AI communities have gone wild about the anonymous gpt2-chatbot. One X user claims that gpt2-chatbot nearly coded a perfect clone of the mobile game Flappy Bird. Another X user says it solved an International Math Olympiad problem in one shot. On long Reddit threads, users are speculating wildly about the origins of the gpt2-chatbot and arguing over whether it’s from OpenAI, Google, or Anthropic. There’s no evidence for these claims, but tweets from OpenAI CEO Sam Altman and other executives have just added fuel to the fire.

i do have a soft spot for gpt2
— Sam Altman (@sama) April 30, 2024

when gpt-2
— Steven Heidel (@stevenheidel) April 30, 2024

You can try out the gpt2-chatbot yourself at LMSYS Org’s website. Navigate to “Direct Chat” or “Arena (side-by-side)” and select it from the dropdown menu. LMSYS Org says in its policy blog that certain AI model developers can test anonymous unreleased models before a broader release. This has led many to believe that gpt2-chatbot is an anonymous model from a major AI developer.

“Just to clarify, following our policy, we’ve partnered with several model developers to bring their new models to our platform for community preview testing,” said LMSYS Org in a tweet on Monday, responding to a thread about gpt2-chatbot. “These models are strictly for testing and won’t be listed on the leaderboard until they go public.”

LMYSYS Org and OpenAI did not immediately respond to Gizmodo’s request for comment.

In Gizmodo’s limited testing, we found the gpt2-chatbot has capabilities that are similar to leading AI models from Anthropic and OpenAI. It exhibited behavior exclusive to advanced large language models, reasoning well and outlining detailed plans for complicated tasks. Here are some of our examples comparing gpt2-chatbot (left) and Anthropic’s Claude Opus model (right).

Instruction prompt: gpt2-chatbot (left) vs. Claude 3 Opus (right)

Reasoning prompt: gpt2-chatbot (left) vs. Claude 3 Opus (right)

A computer engineering professor at the University of Wisconsin found that gpt2-chatbot could perform a task that other leading AI models could not. Dimitris Papailiopoulos asked gpt2-chatbot to solve a math riddle that involves learning some inexplicable rules. AI largely struggles to answer questions like this.

I found one task that gpt2-chatbot is better than all other models, and it’s completely useless.
Early but rapid ascent on the A+B-1 question by @Kangwook_Lee pic.twitter.com/xwOfnB1r03
— Dimitris Papailiopoulos (@DimitrisPapail) April 29, 2024

Ultimately, there’s very little information available about the gpt2-chatbot just yet. However, it seems clear that a power player is behind this AI model. In the coming weeks, the creator and origins of the gpt2-chatbot will likely become public. This could mean a new AI model is on the horizon or maybe there’s a new AI developer on the scene.

Sonos Has Finally Released the Product We’ve Been Hoping for

Microsoft Is Offering Some Interesting Perks for Buying a PC With Copilot+

Neon Picks Up Osgood Perkins & James Wan’s The Monkey

I Tried the Dyson’s New Supersonic Nural and These Are My Three Fave Features

Ghost of Tsushima Gave Sucker Punch the Spotlight It Always Deserved

Superloop Is Offering One of the Fastest NBN 1000 Plans for Under $100

Today’s Best Australian Tech Deals

All of the Best Tech Sales Still Available for Click Frenzy Mayhem 2024

Get a Full Year of Discounts for Aussie Broadband’s Fastest NBN Plans

Clean Up Nicely With These Sales for Dyson Vacuum Cleaners and More

Everything We Know About the Mysterious New ‘Gpt2-chatbot’