AI Struggles with Sarcasm and Sentiment in Non-American English Varieties

New research reveals that large language models have difficulty detecting sarcasm and sentiment in Australian, Indian, and British English, highlighting the need for more diverse language training in AI.

New Benchmark Reveals AI's Struggle with Non-American English

Researchers have developed BESSTIE (Benchmark for Sentiment and Sarcasm in Three International English varieties), a benchmark for evaluating how well large language models (LLMs) detect sentiment and sarcasm across different English varieties. The study, published in the Findings of the Association for Computational Linguistics 2025, highlights significant challenges that AI faces in understanding non-American English [1][2].

The Challenge of Language Varieties

Dr. Siddharth Srivastava, the lead researcher, shares a personal anecdote that illustrates the complexity of language varieties. Despite studying English for over two decades, he found himself confused by Australian English upon moving to Australia. This experience mirrors the challenges faced by AI models, which are predominantly trained and tested on Standard American English [1].

Source: The Conversation

BESSTIE: A Novel Benchmark Tool

BESSTIE is the first benchmark of its kind, focusing on three English varieties: Australian, Indian, and British. The researchers collected data from Google Maps reviews and Reddit posts, then used language variety predictors to check that each text had a high probability of belonging to the intended variety. The benchmark evaluates nine powerful, freely usable large language models, including RoBERTa, mBERT, Mistral, Gemma, and Qwen [1][2].
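
The filtering step described above can be sketched roughly as follows. This is a minimal illustration, not the researchers' pipeline: the cue-word scorer `toy_variety_probs`, the cue lists, and the 0.5 threshold are all assumptions made up for this example, standing in for a trained variety predictor whose details the article does not give.

```python
from typing import Callable, Dict, List, Set

# Hypothetical cue words per variety (illustrative only, not from the study).
CUES: Dict[str, Set[str]] = {
    "en-AU": {"arvo", "mate", "servo"},
    "en-IN": {"lakh", "crore", "prepone"},
    "en-GB": {"queue", "lorry", "chuffed"},
}


def toy_variety_probs(text: str) -> Dict[str, float]:
    """Toy stand-in for a trained language variety predictor.

    Scores each variety by counting cue words in the text, with add-one
    smoothing, and normalises the scores so they sum to 1.
    """
    tokens = set(text.lower().split())
    scores = {v: 1 + len(tokens & cues) for v, cues in CUES.items()}
    total = sum(scores.values())
    return {v: s / total for v, s in scores.items()}


def filter_by_variety(
    texts: List[str],
    target: str,
    predictor: Callable[[str], Dict[str, float]] = toy_variety_probs,
    threshold: float = 0.5,  # assumed cut-off, not a value from the paper
) -> List[str]:
    """Keep only texts whose predicted probability for `target` meets the threshold."""
    return [t for t in texts if predictor(t)[target] >= threshold]


reviews = ["see you this arvo mate", "the food was fine"]
kept = filter_by_variety(reviews, "en-AU")
```

With a real predictor, only the `predictor` argument would change; the thresholding logic stays the same.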

Key Findings

The study revealed several important insights:

  1. Performance disparity: LLMs performed better on Australian and British English (native varieties) than on Indian English (non-native variety) [1][2].

  2. Sentiment vs. Sarcasm: AI models were more adept at detecting sentiment than sarcasm across all varieties [1][2].

  3. Sarcasm detection challenges: The models struggled significantly with sarcasm, achieving only 62% accuracy for Australian English and about 57% for Indian and British English [1][2].

Source: Tech Xplore

  4. Inflated performance claims: The study's findings contradict the high performance metrics often reported by tech companies. For instance, while the GLUE leaderboard shows 97.5% accuracy for sentiment classification in American English, the actual performance on other English varieties was notably lower [1][2].
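
To make the per-variety comparison in the findings above concrete, here is a minimal sketch of how accuracy could be broken down by variety and task. The record layout and the toy predictions are assumptions invented for illustration; they are not data or code from the study.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

# Each record: (variety, task, gold_label, predicted_label)
Record = Tuple[str, str, int, int]


def accuracy_by_group(records: Iterable[Record]) -> Dict[Tuple[str, str], float]:
    """Return accuracy for every (variety, task) pair found in the records."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for variety, task, gold, pred in records:
        key = (variety, task)
        total[key] += 1
        correct[key] += int(gold == pred)
    return {key: correct[key] / total[key] for key in total}


# Made-up toy predictions, NOT results from the BESSTIE study.
toy_records = [
    ("en-AU", "sarcasm", 1, 1),
    ("en-AU", "sarcasm", 0, 1),
    ("en-IN", "sentiment", 1, 1),
    ("en-IN", "sentiment", 0, 0),
]

scores = accuracy_by_group(toy_records)
for (variety, task), acc in sorted(scores.items()):
    print(f"{variety} {task}: {acc:.2f}")
```

Reporting one number per variety and task, rather than a single pooled score, is what lets gaps such as the one between sentiment and sarcasm detection show up at all.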

Implications and Future Directions

The research underscores the importance of evaluating AI models in specific national contexts. As LLMs become increasingly prevalent worldwide, there's a growing recognition of the need to adapt these tools for diverse language varieties [1][2].

Dr. Srivastava and his team are currently working on a project to implement LLMs in hospital emergency departments to assist patients with varying levels of English proficiency. Additionally, initiatives such as a joint project between the University of Western Australia and Google to improve LLM efficacy for Aboriginal English demonstrate the increasing focus on language diversity in AI development [1][2].

Conclusion

The BESSTIE benchmark represents a significant step towards more inclusive and accurate AI language models. By highlighting the current limitations in processing non-American English varieties, this research paves the way for future improvements in AI's ability to understand and interpret diverse language patterns, ultimately leading to more effective and equitable AI applications across different cultures and regions.
