2 Sources
[1]
Why Do Chatbots Keep Telling Stories About Someone Named 'Elias Thorne'?
Who in the world is Elias Thorne? He's a regular fixture in stories told by chatbots, as first spotted by software engineer Daniel May, but no one knows why... until now. According to a new preprint research paper first reported by 404 Media, the proliferation of the legend of Elias might be related to guardrails put in place for AI models during safety and alignment training. If you need to catch up on the Elias Thorne of it all, the paper published by researchers at Cornell University is a good place to start. They gave several AI models, including OpenAI's GPT-5.4 Mini, Anthropic's Claude Haiku 4.5, and Google's Gemini 3.1 Flash-Lite, five different prompts to generate stories. They looked at about 20,000 stories generated by the models and found a shocking amount of repetition: 11 words -- Lighthouse, Keeper, Baker, Mayor, Clockmaker, Fisherman, Librarian, Conductor, and the names Mara, Elias, and Elara -- appeared in a whopping 88% of all stories. No combination of that incredibly narrow pool of nouns for storytelling purposes appears more often than Elias the lighthouse keeper, which showed up in two-thirds of all stories generated. That's pretty much in line with the anecdotal examples provided by May, who also prompted multiple different models to write stories and found the same Elias the lighthouse keeper pop up over and over again. So what exactly is the deal? The researchers posited that it might have something to do with the pre-training data fed into these models, but quickly ruled that out when they couldn't find anything to suggest "Elias the lighthouse keeper" appears with excess frequency in pre-training data or literature used in training. Instead, they attribute the issue to the use of specific datasets that have become commonly used by AI labs. They cited WildChat, an open-source dataset of millions of conversations between people and a GPT-3.5-powered chatbot, as a possible example. The dataset was created to help researchers understand how people communicate with bots, but has since been used to train many different models. They theorize that alignment training meant to steer models away from copyrighted characters and adult content may have inadvertently given "safe" alternatives, such as "Elias the lighthouse keeper," unusual prominence, causing them to appear repeatedly when users ask the model to generate a story. Elias Thorne, the lighthouse keeper, might be fine for a children's bedtime story, but 404 Media reported that it seems the character name is spreading. The publication found examples of the name as the protagonist in fantasy books, as well as the "artist" listed on ambient music tracks available on Amazon. May also discovered examples of Elias Thorne as the author of books, including a handbook that claims to provide information on alternative cancer treatments. So, that's not great. If nothing else, the strange quirk of LLM storytelling is a good reminder that AI is not creative. A study published last year found that image generation models repeatedly produce images that fall into one of just 12 specific motifs, no matter how out-there the given prompts. Basically, give AI a creative task, and it'll give you the equivalent of elevator music.
[2]
Chatbots Keep Telling Stories About Lighthouse Keeper 'Elias Thorne'. We Might Know Why
LLMs including ChatGPT, Gemini and Claude are obsessed with telling stories about lighthouse keepers and clockmakers, and one character named 'Elias Thorne' has made his way from chatbots to Amazon books. Researchers are trying to discover why. Depending on which chatbot you ask, Elias Thorne might be a clockmaker, a lighthouse keeper, or a librarian. But if you ask ChatGPT or any of the other popular large language models to tell you a story, there's a good chance he'll appear, unbidden. And Elias's stories are flooding the self-published AI generated book market, Youtube, and fake news sites. Software engineer Daniel May first noticed the Elias takeover earlier this year; he found that on Google Trends, people weren't searching for "Elias Thorne" until late 2025. Searches for the name really spiked in early 2026, while the related query "lighthouse keeper" also started trending upward in the last few years. He tested a few chatbots, including Grok, Deepseek, and Gemini, with the prompt "tell me a story," and the chatbots frequently started with similar stories about lighthouses, clockmakers, or explorers. In late May, researchers Sil Hamilton and David Mimno at Cornell University's Department of Information Science published their paper, "Elias in the Lighthouse, Again?" on the preprint repository arXiv. They sampled 20,000 total stories from OpenAI's ChatGPT, Anthropic's Claude, and Google's Gemini, and the Allen Institute for AI's chatbot using five prompts, and found that the same 11 words -- names like Elias, Mara, and Elara, and occupations like lighthouse keeper, clockmaker, and librarian -- appear in more than 88% of generated stories, with little difference between models. Unite.ai covered the study shortly after it was published. The researchers posit in their paper that these themes show up so often in part because of the models' safety and alignment tuning. "Model development today is like a big family tree. Most models are related to each other because developers synthesize a lot of training data with models even from different companies," Hamilton told me in an email. He, Mimno, and their colleague Rebecca M. M. Hicke found this in a 2025 paper where they looked at specific words used across models. OpenAI's first ChatGPT model, GPT-3.5, is the root of the family tree because it was used to make WildChat, a training set that's since been used to make other training sets. "WildChat contains 1 million real conversations with ChatGPT, and 166 of these contain the name 'Elias' like here and here," Hamilton added. "These are written in that familiar 'lighthouse' style. Models trained on WildChat copied this style, and developers unwittingly replicated it when using those models to generate newer datasets. It's like a virus." Elias has since escaped chatbot containment. May noticed Elias Thorne popping up on Amazon as an author of alt-medicine cancer handbooks, a 2026 YouTube-algorithm guide, a book on Greek mythology, and a psychological thriller novella. "No human writes all of those," May wrote in his blog post. "The first one sits in territory where bad advice causes real harm. The mode-collapsed name from the chat window is now a byline appearing across genres." When I searched Elias Thorne on Amazon, I found Elias as the protagonist in fantasy books and producing music, too: he's "a brilliant but cynical archaeologist with a knack for unearthing what powerful institutions want to keep hidden" in one fantasy series, or a musical artist making ambient listening albums of birds and nature sounds. Fittingly, one Elias Thorne with an AI-generated author photo is also churning out AI grift books. In the last few years, AI-generated books have flooded Amazon's self-publishing offerings, especially, with books containing dangerous misinformation and messy errors taking over the platform. AI-generated books are also making librarians' jobs hell. Elias has also escaped to the Youtube slop world: in one video from the channel Moments That Moved the World, a slop-illustrated story features the plight of "83-year-old Sergeant Major Elias Thorne." On the AI slop site Wonderful Museums, "Snake Museum Owner Shot By Wife: Unpacking the Tragic Incident at Thorne's Reptile Sanctuary" spins Elias Thorne's story as a man shot by his wife. On another slop site called Tatticle, the "wealthiest man in Ohio," Elias Thorne, died "with exactly twelve dollars in his pocket." In these stories, Elias is usually a tragic figure, an aggrieved and unfairly-treated old man. He's a similar character in a short story published by the BBC as a finalist in its 2024/2025 children's writing competition -- but Elias is a real name, and could feasibly still be the subject of a human-written story (and there have been no accusations of the BBC's children's writing competition being infiltrated by AI slop). But with all the world's literature as its training data, why do LLMs seem to default so often to the lighthouse? It comes down to how model makers try to safety-align and sanitize their outputs. "We found many stories in WildChat are not safe for work. This led us to hypothesize that models going through alignment are preferring a small slice of WildChat stories, like a bottleneck," Hamilton said. "It isn't that Elias stories are frequent, but that they're just so safe." He said the researchers plan to explore this theory further in future research. As for Elias, there is one example I've found of him existing pre-generative AI, as a time traveling mad scientist in the 1980's trading card series Dinosaurs Attack!. And a real-life Elias that comes close to the stories told by LLMs did actually exist, Hamilton found -- Elias Allen was a 16th century clockmaker in London.
Share
Copy Link
AI chatbots from OpenAI, Anthropic, and Google are obsessed with one character: Elias Thorne, a lighthouse keeper. Cornell University researchers found that 11 specific words appear in 88% of AI-generated stories, with Elias the lighthouse keeper showing up in two-thirds of them. The culprit? Safety alignment training that inadvertently amplified 'safe' character names, spreading them across models and now flooding Amazon with AI-generated books.
Chatbots have developed an unusual obsession with a character named Elias Thorne. Software engineer Daniel May first spotted this peculiar pattern when he noticed that large language models from OpenAI, Anthropic, and Google Gemini consistently generated stories featuring this lighthouse keeper character
2
. Google Trends data revealed that searches for "Elias Thorne" didn't exist until late 2025, with a significant spike in early 20262
. When May tested chatbots including Grok, Deepseek, and Gemini with the simple prompt "tell me a story," the same lighthouse keeper narratives appeared repeatedly2
.
Source: Gizmodo
Researchers Sil Hamilton and David Mimno at Cornell University's Department of Information Science published a preprint paper titled "Elias in the Lighthouse, Again?" that examined this phenomenon systematically
1
. They analyzed approximately 20,000 AI-generated stories from OpenAI's GPT-5.4 Mini, Anthropic's Claude Haiku 4.5, Google's Gemini 3.1 Flash-Lite, and the Allen Institute for AI's chatbot using five different prompts1
2
. The findings were striking: 11 words—Lighthouse, Keeper, Baker, Mayor, Clockmaker, Fisherman, Librarian, Conductor, and the names Mara, Elias, and Elara—appeared in a staggering 88% of all stories, with little variation between models1
. The combination of Elias as a lighthouse keeper showed up in two-thirds of all generated stories1
.The researchers initially considered whether pre-training data might explain the pattern but found no evidence that "Elias the lighthouse keeper" appears with unusual frequency in literature or training data
1
. Instead, they attribute the issue to alignment training meant to steer AI models away from copyrighted content and adult material. Hamilton explained that model alignment today resembles "a big family tree" where developers synthesize training data across companies, with OpenAI's GPT-3.5 serving as the root2
.The WildChat dataset, an open-source collection of millions of conversations between people and a GPT-3.5-powered chatbot, appears to be a key culprit
1
. Created to help researchers understand human-bot communication, WildChat contains 166 conversations featuring the name "Elias" written in the familiar lighthouse style2
. This dataset has since been used to train numerous other models, with Hamilton describing it as "like a virus" spreading across the AI ecosystem2
. Safety guardrails inadvertently gave "safe" alternatives unusual prominence, causing them to appear repeatedly when users request stories1
.Related Stories
Elias Thorne has escaped chatbot containment and now appears across digital platforms. May discovered the character listed as the author of Amazon books spanning wildly different genres: an alternative cancer treatment handbook, a 2026 YouTube-algorithm guide, Greek mythology books, and psychological thriller novellas
2
. "No human writes all of those," May observed, noting that "the mode-collapsed name from the chat window is now a byline appearing across genres"2
.On Amazon, Elias appears as a protagonist in fantasy series—described as "a brilliant but cynical archaeologist"—and even as a musical artist producing ambient nature sounds
2
. The character has infiltrated YouTube slop content, appearing in videos from channels like "Moments That Moved the World" featuring "83-year-old Sergeant Major Elias Thorne"2
. AI slop sites have also adopted the name, with fabricated stories about snake museum owners and Ohio's wealthiest man2
.The proliferation of Elias Thorne highlights serious concerns about AI-generated content flooding digital platforms. Books containing dangerous misinformation and messy errors are taking over Amazon's self-publishing platform, while AI-generated books are making librarians' jobs increasingly difficult
2
. The cancer treatment handbook attributed to Elias Thorne represents a particularly troubling example where repetitive AI-generated content enters territory where bad advice causes real harm2
.This phenomenon serves as a stark reminder that AI lacks genuine creativity. A previous study found that image generation models repeatedly produce images falling into just 12 specific motifs, regardless of prompt variety
1
. As Hamilton and Mimno's research demonstrates, when given creative tasks, AI delivers the equivalent of elevator music—safe, repetitive, and devoid of originality1
. For anyone relying on these tools for content creation, the Elias Thorne case signals the need to scrutinize outputs more carefully and understand the limitations built into these systems through their training processes.Summarized by
Navi
03 Dec 2024•Technology

11 Dec 2025•Entertainment and Society

29 Apr 2026•Technology

1
Technology

2
Business and Economy

3
Health
