GPT-5 Launch: Mixed Reviews on Coding and Creative Writing Capabilities

Reviewed byNidhi Govil

4 Sources

OpenAI's release of GPT-5 has generated mixed reactions, with impressive benchmark scores but disappointing performance in real-world coding and creative writing tasks. The AI community is divided on its effectiveness compared to previous models.

GPT-5 Launch and Initial Reception

OpenAI has officially released GPT-5, touting it as their "smartest, fastest, most useful model yet" 1. The company highlighted impressive benchmark scores, with GPT-5 achieving 94% on math tests and 74% on real-world coding tasks. OpenAI CEO Sam Altman compared the model to having a team of PhD-level experts on call 1.

However, the initial reception has been mixed, with the tech community split on GPT-5's performance. Within hours of launch, social media platforms were flooded with negative feedback, with users describing the model as "horrible," "awful," and "underwhelming" 1. The backlash was so significant that OpenAI had to promise to reinstate the older GPT-4o model after a petition garnered over 3,000 signatures 1.

Coding Capabilities: A Step Back?

Source: ZDNet

Source: ZDNet

Independent tests of GPT-5's coding abilities have yielded inconsistent results. In one test, GPT-5 initially failed to produce a working plugin for a simple randomization task, a problem that previous versions of ChatGPT had consistently solved 2. While the AI eventually corrected the issue after prompting, this regression in performance is noteworthy.

GPT-5 did pass some coding tests, such as rewriting a string function to handle dollars and cents and understanding a complex WordPress filter issue 3. However, it stumbled on a test involving Mac scripting tools and AppleScript, confidently presenting incorrect information 3.

Creative Writing: Lacking Soul and Depth

When tasked with creative writing, GPT-5's performance fell short of expectations. Outputs were described as technically correct but "devoid of soul," maintaining trademark AI writing patterns such as overuse of em dashes and formulaic paragraph structures 4. In a time-travel paradox story test, GPT-5's narrative lacked emotional depth and failed to fully address the prompt's core concept 4.

Comparatively, other AI models like Claude 4.0 Opus demonstrated superior creative writing abilities, providing richer descriptions, more coherent narratives, and better integration of cultural elements 4. GPT-5 struggled with dialogue, generating an entire story without a single line of character speech 4.

Benchmark Performance vs. Real-World Application

Source: Decrypt

Source: Decrypt

While GPT-5 boasts impressive benchmark scores, its performance in practical applications has been inconsistent. This discrepancy highlights the ongoing challenge in AI development: creating models that excel not only in controlled test environments but also in diverse, real-world scenarios 123.

Market Reaction and Future Outlook

The mixed reception of GPT-5 has had immediate market implications. On prediction markets, OpenAI's odds of having the best AI model by the end of August plummeted from 75% to 12% shortly after GPT-5's debut, with Google overtaking OpenAI at an 80% chance 1.

Despite the initial setbacks, it's important to note that GPT-5 is still a work in progress. OpenAI is likely to iterate and improve the model through updates, addressing the issues identified in these early tests and user feedback 4.

Conclusion

The launch of GPT-5 serves as a reminder of the complex nature of AI development and the challenges in meeting diverse user expectations. While the model shows promise in certain areas, its inconsistent performance across various tasks suggests that there is still significant room for improvement in large language models.

Explore today's top stories

AI-Designed Antibiotics Show Promise in Fighting Drug-Resistant Superbugs

MIT researchers use generative AI to create novel antibiotics effective against drug-resistant bacteria, including gonorrhea and MRSA, potentially ushering in a new era of antibiotic discovery.

IEEE Spectrum logoMassachusetts Institute of Technology logoBBC logo

8 Sources

Science and Research

19 hrs ago

AI-Designed Antibiotics Show Promise in Fighting

Cohere Raises $500 Million, Hires Meta's AI Research Head in Bid to Challenge AI Giants

Canadian AI startup Cohere secures $500 million in funding, reaching a $6.8 billion valuation, and appoints former Meta AI research head Joelle Pineau as Chief AI Officer, positioning itself as a secure enterprise AI solution provider.

TechCrunch logoFinancial Times News logoReuters logo

13 Sources

Business and Economy

19 hrs ago

Cohere Raises $500 Million, Hires Meta's AI Research Head

Brain Implant Decodes Inner Speech with Password Protection, Advancing AI-Assisted Communication

Scientists have developed a brain-computer interface that can decode inner speech with up to 74% accuracy, using a password system to protect user privacy. This breakthrough could revolutionize communication for people with severe speech impairments.

Nature logoNew Scientist logoNews-Medical logo

9 Sources

Science and Research

19 hrs ago

Brain Implant Decodes Inner Speech with Password

AI-Generated Errors in Australian Murder Case Highlight Legal Risks of Artificial Intelligence

A senior Australian lawyer apologizes for submitting AI-generated fake quotes and non-existent case judgments in a murder trial, causing a 24-hour delay and raising concerns about AI use in legal proceedings.

AP NEWS logoeuronews logoCBS News logo

9 Sources

Technology

3 hrs ago

AI-Generated Errors in Australian Murder Case Highlight

TeraWulf Secures $3.7B AI Hosting Deal Backed by Google, Pivoting from Bitcoin Mining

TeraWulf, a Bitcoin mining company, has signed a major AI infrastructure hosting deal with Fluidstack, backed by Google. This pivot could significantly boost the company's revenue and marks a shift in strategy for cryptocurrency miners facing challenges.

Cointelegraph logoEconomic Times logoBenzinga logo

7 Sources

Business and Economy

19 hrs ago

TeraWulf Secures $3.7B AI Hosting Deal Backed by Google,
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo