Chile launches Latam-GPT, an open-source AI model trained on Latin American data and culture

Chile has unveiled Latam-GPT, the first open-source artificial intelligence language model built specifically for Latin America. Developed for just $550,000 and trained on more than eight terabytes of regional data, the model aims to counter US-centric bias in AI and better reflect the region's cultural diversity, slang, and local realities.

Chile Introduces Latam-GPT to Address Regional Underrepresentation

Chile on Tuesday launched Latam-GPT, marking a significant step for Latin America in the global artificial intelligence race [1]. The open-source model is the first AI language model trained specifically on the diverse cultures of Latin America, addressing a critical gap in how AI systems understand and represent the region [2]. Led by the Chilean National Center for Artificial Intelligence (CENIA), the project emerged from a two-year regional collaboration involving more than 30 institutions across eight Latin American countries: Argentina, Brazil, Chile, Colombia, Ecuador, Mexico, Peru, and Uruguay [1].

Source: France 24

Combating Linguistic Biases and US-Centric Bias in AI

The initiative directly addresses the linguistic biases of AI models trained primarily on English-language data from US-centric sources [1]. "Latam-GPT is trained with a proportion of Latin American data that previously did not exist online and was not included in existing models," explained Rodrigo Durán, executive director at CENIA [1]. This allows the model to perform more accurately on tasks specific to Latin America and the Caribbean. Chilean President Gabriel Boric emphasized the project's strategic importance, saying that "we're at the table -- we're not on the menu," a statement of the region's determination to be an active participant rather than a passive consumer in the AI economy [2].

Training Data and Cultural Diversity Captured

Developing Latam-GPT required collecting more than eight terabytes of data, equivalent to millions of books [1]. The training data incorporated information from private sources obtained through strategic partnerships across the region, supplemented by synthetic data to address underrepresentation in specific areas [1]. Gabriela Arriagada, a researcher at CENIA and head of the project's ethics team, explained that incorporating Latin American culture means "a training approach designed to address data that reflects cultural realities, identifying where gaps exist in other models" [1]. The model's ability to recognize regional slang, idioms, and speech patterns represents a major advance in cultural relevance [2].

Foundational AI Infrastructure for Regional AI Applications

Unlike consumer-facing tools such as ChatGPT or Google's Gemini, Latam-GPT functions as foundational AI infrastructure for future regional AI applications [1]. Because it is open source, programmers can customize the software for specific needs [2]. CENIA director Alvaro Soto noted potential applications for hospitals "with logistical problems or issues with the use of medical resources" [2]. Chilean entrepreneur Roberto Musso's company Digevo plans to use Latam-GPT to develop customer service programs for airlines and retailers, with clients expressing strong interest in letting users communicate in their local language with proper cultural context [2].

Technical Capacity and AI Regulation Implications

The project demonstrates that Latin America now has the technical capacity to build AI models independently. Durán emphasized that "Latin America has come together to form a collaborative group," showing that the region can develop and understand this technology, which carries "important implications for AI regulation, because you cannot regulate something you do not understand" [1]. Luis Chiruzzo, an engineering professor at the University of the Republic in Uruguay, called it a "very important milestone for Latin America" that ensures everyone is included in the data training process [1].

Budget Constraints and Future Development Plans

Latam-GPT was developed with just $550,000 in funding from CENIA's budget and the Development Bank of Latin America [1]. The team used Amazon Web Services' cloud to develop the first version, launching at the end of February. Subsequent versions will be trained, starting in the first half of 2026, on a roughly $4.5 million supercomputer at the University of Tarapacá in northern Chile [3]. While experts acknowledge the model has "no chance" of competing against major AI corporations with vastly greater resources, it represents a critical step toward giving Latin America its own voice in the world of language models [2].

Language Support and Indigenous Languages Integration

For now, the project operates primarily in Spanish and Portuguese, with plans to incorporate Indigenous languages in later stages [1]. This phased approach mirrors similar efforts in other regions, such as Singapore's SEA-LION model for Southeast Asian languages and Kenya's UlizaLLama for Swahili-speaking populations [2]. Expanding to Indigenous languages would further strengthen the model's ability to preserve and represent the full spectrum of Latin American cultural diversity, addressing concerns that the region could lose significant parts of its traditions if it remains merely a passive recipient of AI systems developed elsewhere [2].

TheOutpost.ai

© 2026 Triveous Technologies Private Limited