【Peggy Markoff】
Last month,Peggy Markoff the $61.5 billion-valuated AI startup Anthropic set up a gaming livestream on Twitch. Gaming livestreams are nothing new on Twitch, but this one is a little different: Claude, Anthropic's AI model, is attempting to beat Pokémon Red.
We are now one month in,and the livestream is still going. However, Claude has not progressedall that much. And, at this rate, Anthropic's AI agent may possibly never be the very best, like no one ever was.
According to Anthropic, when it first launched the "Claude Plays Pokémon" project, previous versions of its AI agent Claude failed at some very basic tasks. For example, according to Anthropic, Claude 3.5 would try to run away from almost every battle in June 2024.
You May Also Like
SEE ALSO: In 2024, Pokémania is evolving
A few months and a few versions of Claude later, Anthropic said there was a stark change. In February 2025, Anthropic gave Claude 3.7 Sonnet a whirl at playing Pokémon.
"Within hours, Claude defeated Brock. Days later, it trounced Misty," Anthropic said. "Progress that older models had little hope of achieving."
Anthropic said that Claude 3.7 Sonnet could plan ahead, remember objectives, and learn from its mistakes, unlike previous versions of the AI agent. It also built a knowledge base, saw the screen, and simulated button presses.
However, the progress Claude 3.7 Sonnet originally made in the game seems to have stalled.
For example, livestream viewers watchedas Clause 3.7 took 78 hoursto get through Mt. Moon in the game. On Reddit, gamers estimatedthat it would typically take a child just a few hours to advance through the same stage.
SEE ALSO: Hands-on with the Claude AI app: It's pleasant to use, but jankyClaude can be seen going in circles, stumbling around the same paths, and often knocking into walls as it tries to get around the game.
The livestream is engaging, especially as a text box lays out Claude's "thinking" as the AI agent tries to figure out what moves to make next.
According to Anthropic engineers in an interview with Ars Technica, Claude has an easier time with aspects of the game which involve text, such as Pokémon battles. However, it struggles with the more visual aspects of the game, such as moving around from town to town on the map.
Claude 3.7 Sonnet has gone much further in the game than previous Claude models, so there's been progress. However, for those warning that AI will soon be able to take over the world, we're nowhere close to that being a reality yet. Claude still has 151 Pokémon to catch.
Topics Artificial Intelligence Gaming Pokemon Twitch Streaming
Search
Categories
Latest Posts
FreeSync 2 Explained
2025-06-26 14:59The 'Ant
2025-06-26 14:04This viral tweet proves that love conquers all, even bad jokes
2025-06-26 14:04Best iPad deal: Save $100 on 13
2025-06-26 13:07Popular Posts
Best Sony deal: Save $100 on WH
2025-06-26 15:1521 cats that are really, really thicc
2025-06-26 14:30Marvel viewing order: What to watch before 'Ant
2025-06-26 14:28This viral tweet proves that love conquers all, even bad jokes
2025-06-26 14:08Featured Posts
NYT Strands hints, answers for May 2
2025-06-26 12:57Popular Articles
Best tablet deal: Get the Google Pixel Tablet for $120 off at Amazon
2025-06-26 14:48Apple's iOS and macOS have a nasty vulnerability, so update now
2025-06-26 14:17The best tweets of 2019
2025-06-26 13:43Amazon Big Spring Sale 2025: Best deals under $50
2025-06-26 13:13Newsletter
Subscribe to our newsletter for the latest updates.
Comments (774)
Transmission Information Network
Things Intel Needs to Fix
2025-06-26 15:29Evergreen Information Network
The viral Mike Bloomberg dance is fake, but you can still love/hate it
2025-06-26 15:22Transmission Information Network
How 'Broker' and 'Return to Seoul' reveal hard truths about Korean adoption
2025-06-26 14:29Reality Information Network
Twitter is making even less from Twitter Blue than previously known
2025-06-26 13:27Impression Information Network
Amazon Big Spring Sale 2025: Best deals under $50
2025-06-26 13:25