2 Sources
[1]
GPT-5 just completed Pokémon Red in a new world-record time - Claude, Gemini, and ChatGPT o3 aren't even close
There's a new champion - for now - in the contest to be the fastest AI to complete the original Pokémon Red video game, with ChatGPT-5 completely eclipsing its rivals. GPT-5, the latest model from OpenAI, took just 6,470 steps to complete the 1998 Game Boy classic, eclipsing the previous record of 18,184 steps by ChatGPT-o3. If it's hard to comprehend how many steps the main character, Red, had to make to defeat the Elite Four - 6,470 equates to around seven days of gameplay, compared to over 15 days for o3, the next best Pokémon model. It's a stark contrast to earlier this year, when Gemini 2.5 and Claude 3.7 Sonnet were in a race to even get to the end of the game, let alone do so in a fast time. Now, just a few months later, AI models are able to complete these classic games faster and faster - and they're only going to improve. Earlier this year, Anthropic used Pokémon as a benchmark to showcase the prowess of its latest Claude model. The company combined it with a YouTube video in which developers discussed why GameFreak's iconic franchise was the perfect way to assess an AI model's problem-solving capabilities. All of AI's attempts to complete Pokémon have been livestreamed on platforms like Twitch, where channels like GPT_Plays_Pokemon have regular viewers and subscribers. Having destroyed the previous record time for completing Pokémon Red, GPT-5 is now going to take on the sequel, Pokémon Crystal. The game, which was released in 2000, has double the amount of content to conquer, as you can venture back to the world of Kanto following your adventure in the Johto region. GPT-5 Just Finished Pokemon Red! from r/singularity GPT-5's Pokémon Red journey highlights a tactic young kids have used in the game for years: leveling up one Pokémon and neglecting the other five creatures in your party. In the Reddit thread highlighting the AI model's accomplishment, the top comment from u/Ok_Business84 reads, "Learned that sticking to one Pokémon and hard tanking everybody is the easier way." After nearly 30 years, I finally feel like my younger self has been validated. Back in 1999, I completed Pokémon Yellow with an overpowered Pikachu, and nothing else to show for it. It would be cool to see GPT-5 play Pokémon less like 6-year-old me, and more like an accomplished player, building a varied team of creatures that can take on any battle in the game. This run feels like the AI brute-forced its way to victory, and while it achieved the goal it set out to achieve, it's not entirely viable in a regular playthrough.
[2]
ChatGPT-5 just beat Pokémon Red in record time for an AI - 3x faster than GPT-o3 managed
TL;DR: OpenAI's ChatGPT-5 set a new record for a Pokémon Red speed run by an AI, completing the game in 6,470 steps, which is three times faster than GPT-o3. That's a huge leap and it's apparently down to GPT5 having fewer hallucinations, improved spatial skills, and better overall planning. There's a new fastest AI when it comes to Pokémon Red speed runs and it's GPT-5, beating out the previous record by a long way. OpenAI's most recent model took just 6,470 steps to complete Pokémon Red, compared to ChatGPT-o3 which beat the game in 18,184 steps. As TechRadar, which spotted this feat, points out, 6,500 steps translates to about a full week of gaming (non-stop) - well, just under a week - so it's not exactly fast. Well, not compared to a human speed runner (apparently something like 2,000 steps is possible, and the record time is for completion is 1 hour and 44 minutes). Still, GPT-5's time is still three times as quick as GPT-o3, which is a remarkable leap - and an impressive feat overall. Next up, GPT-5 is being tested with Pokémon Crystal (the sequel to the original). Apparently, there are a few reasons why this new OpenAI model outperformed GPT-o3 convincingly, and the main one is simple: GPT-5 hallucinates far less. Seemingly GPT-5 also has improved spatial skills, so it's able to navigate the game a lot more competently. Finally, it's just better at planning objectives and pursuing them, so there are some major steps forward across multiple fronts here. It's well worth a quick glimpse at those Pokémon Red record runs on YouTube by the way, with the level of accuracy, and ultra-swift navigation of menus, across a period of the best part of two hours being truly mind-boggling, frankly. What also boggles the mind is growth ChatGPT has witnessed this year - it now fields some 2.5 billion queries on a daily basis, an increase of 2.5x since the end of last year.
Share
Copy Link
OpenAI's GPT-5 has set a new record for AI completion of Pokémon Red, finishing the game in just 6,470 steps. This achievement demonstrates significant improvements in AI capabilities, including reduced hallucinations and enhanced spatial skills.
OpenAI's latest language model, GPT-5, has achieved a remarkable feat by completing the classic Game Boy game Pokémon Red in a record-breaking 6,470 steps. This accomplishment significantly outpaces the previous record of 18,184 steps set by ChatGPT-o3, showcasing the rapid advancements in AI capabilities 1.
The achievement of GPT-5 is particularly noteworthy when compared to other AI models:
This dramatic improvement demonstrates the significant strides made in AI technology within a short timeframe. The speed at which these models are advancing suggests that we can expect even more impressive performances in the future.
Source: TechRadar
Several key improvements in GPT-5 have contributed to its superior performance:
Interestingly, GPT-5's approach to the game mirrors a strategy often employed by young players – focusing on leveling up a single Pokémon while neglecting the rest of the team. This tactic, while effective for speed runs, may not represent the most balanced or skilled approach to the game 1.
It's worth noting that while GPT-5's performance is impressive for an AI, it still falls short of human speed runners. The current human record for completing Pokémon Red stands at 1 hour and 44 minutes, with top players able to finish the game in around 2,000 steps 2.
Following its success with Pokémon Red, GPT-5 is set to take on Pokémon Crystal, a more complex sequel with double the content. This progression highlights the ongoing use of video games as benchmarks for assessing AI problem-solving capabilities 1.
The achievements of AI in gaming have garnered significant public interest, with platforms like Twitch hosting channels dedicated to AI gameplay. These streams, such as GPT_Plays_Pokemon, have regular viewers and subscribers, indicating a growing fascination with AI's capabilities in familiar gaming environments 1.
The rapid progress demonstrated by GPT-5 in mastering Pokémon Red reflects the broader trend of accelerated AI development. This advancement is not limited to gaming; ChatGPT has seen a 2.5x increase in daily queries since the end of last year, now handling approximately 2.5 billion queries daily 2.
As AI continues to evolve at this pace, it raises questions about the potential applications and impacts of these technologies beyond gaming and into more complex real-world scenarios. The ability of AI to quickly learn, adapt, and optimize strategies in virtual environments could have far-reaching implications for problem-solving in various fields.
As nations compete for dominance in space, the risk of satellite hijacking and space-based weapons escalates, transforming outer space into a potential battlefield with far-reaching consequences for global security and economy.
7 Sources
Technology
11 hrs ago
7 Sources
Technology
11 hrs ago
Anthropic has updated its Claude Opus 4 and 4.1 AI models with the ability to terminate conversations in extreme cases of persistent harm or abuse, as part of its AI welfare research.
6 Sources
Technology
19 hrs ago
6 Sources
Technology
19 hrs ago
A pro-Russian propaganda group, Storm-1679, is using AI-generated content and impersonating legitimate news outlets to spread disinformation, raising concerns about the growing threat of AI-powered fake news.
2 Sources
Technology
11 hrs ago
2 Sources
Technology
11 hrs ago
OpenAI has made subtle changes to GPT-5's personality, aiming to make it more approachable after users complained about its formal tone. The company is also working on allowing greater customization of ChatGPT's style.
4 Sources
Technology
3 hrs ago
4 Sources
Technology
3 hrs ago
SoftBank has purchased Foxconn's Ohio plant for $375 million to produce AI servers for the Stargate project. Foxconn will continue to operate the facility, which will be retrofitted for AI server production.
5 Sources
Technology
2 hrs ago
5 Sources
Technology
2 hrs ago