Curated by THEOUTPOST
On Sat, 9 Nov, 12:02 AM UTC
5 Sources
[1]
Google drops new Gemini model and it goes straight to the top of the LLM leaderboard
Google is constantly updating Gemini, releasing new versions of its AI model family every few weeks. The latest is so good it went straight to the top of the Imarena Chatbot Arena leaderboard -- toppling the latest version of OpenAI's GPT-4o. Previously known as the LMSys arena, it is a platform that lets AI labs pit their best models against one another in a blind head-to-head. The users vote but don't know which model is which until after they've voted. The new model from Google DeepMind has the catchy name Gemini-Exp-1114 and has matched the latest version of GPT-4o and exceeded the capabilities of the o1-preview reasoning model from OpenAI. The top 5 models in the arena are all versions of OpenAI or Google models. The first model on the leaderboard not made by either of those companies is xAI's Grok 2. The success of this new model comes as Google finally releases a Gemini app for iPhone, which beat the ChatGPT app in our Gemini vs. ChatGPT 7-round face-off. The latest Gemini model seems to perform particularly well at math and vision tasks, which makes sense as they are areas in which all Gemini models excel. Gemini-Exp-1114 isn't currently available in the Gemini app or website. You can only access it by signing up for a free Google AI Studio account (the platform aimed at developers wanting to try new ideas). I'm also not sure whether this is a version of Gemini 1.5 or whether its an early insight into Gemini 2, expected next month. If it is the latter then the improvement over the previous generation might not be as extreme as some expected. However, it is doing well in technical and creative areas according to benchmarks. This would tie in to the idea its going to be useful for reasoning and managing agents. It first in math, solving hard problems, creative writing and vision. Unlike other benchmarks the Chatbot Arena is based on human perceptions of performance and output quality, rather than rigid testing against data. Whether this is just a new version of Gemini 1.5 Pro or an early insight into the capabilities of Gemini 2, its going to be an interesting few months in AI land.
[2]
Google Gemini 2 Likely to Dethrone OpenAI o1
Will Google wait for the full release of o1 to steal the show from OpenAI? The buzz around the imminent launch of Google Gemini 2 is heating up. According to a recent leak on X, Google is preparing to launch a new model: Gemini-2.0-Pro-Exp-0111. On X, Google's senior product manager Logan Kilpatrick posted, "AI is cool I guess," in what seems like a subtle nod to OpenAI chief Sam Altman. The new model is expected to appear under the Advanced section, though it's unclear if it's intended for an internal testing group or a public launch. The user tried prompting it and received responses. According to them, it seems to work quite fast, but they're still unsure if the responses are truly from version 2.0. AIM had previously explored 'why Google will make a better model than OpenAI's o1', and it looks like that prediction is coming true. "An unknown Gemini model is available in the LMSYS Arena (battle). While it's unclear if this is Gemini 2.0, the "Gemini-test" outperformed one of my tests with OpenAI o1-mini," posted a user on X. Meanwhile, AI insider Jimmy Apples shared a scoop on Gemini 2, posting, "Someone may have gotten too drunk and said Gemini 2.0 has already been deployed to select B2B customers..." Similar to Gemini 1.5, Gemini 2 will continue to generate images and perform web searches -- features likely included to help Google compete with OpenAI's Search GPT and Perplexity AI. Meta is also expected to join the search race. Interestingly, Google AI Studio and the Gemini API recently introduced 'Grounding with Google Search', allowing developers to improve response accuracy by incorporating real-time data from Google Search. With this update, Gemini 1.5 models can pull live information from Google Search, increasing accuracy and transparency. Developers can access grounding features directly through the "Tools" section in Google AI Studio or by enabling the 'google_search_retrieval' tool in the Gemini API. Gemini 2 and its APIs will likely have this feature. A user on X who attended Kilpatrick's session in San Francisco revealed that Gemini 2 will be a larger model with multi-turn capabilities, vision, audio, embeddings, and more. Google plans to release a new feature that can take control of a user's web browser to perform tasks like gathering research, purchasing products, or booking flights. This feature will also be integrated into Gemini 2. Code-named 'Jarvis', the product was recently leaked, according to a report that said it was briefly available for download through Google's Chrome web browser extension store and described itself as 'a helpful companion that surfs the web with you'. This is quite similar to Anthropic's 'Computer Use' feature, which can take control of a user's screen to perform actions such as viewing the screen, moving the cursor, clicking buttons, and typing text. Similarly, Microsoft is testing Copilot Vision, a feature that enables its AI to understand and interact with content on web pages. With Copilot Vision, the AI can interpret what users are viewing in Microsoft Edge, answer questions about the content, and suggest next steps based on what's displayed. Google has recently seen success with its latest products, with NotebookLM as a prime example. It has been widely praised and even called Google's "ChatGPT Moment." Furthermore, during the company's recent earnings call, Google chief Sundar Pichai revealed that Google Gemini API calls have increased 14x times in the past six months. GitHub recently partnered with Google to bring Gemini 1.5 Pro to GitHub Copilot. Gemini 1.5 is known for its two-million-token context window and ability to process code, images, video, and text simultaneously. Gemini's reasoning capabilities are expected to be better than OpenAI's o1. A recent report reveals that Google is working on AI with reasoning capabilities similar to those of humans, most likely for its Gemini platform. Kilpatrick said in an exclusive interview with AIM that Google plans to release Gemini 2, which will feature better reasoning quality and a longer context window -- potentially up to billions or trillions of tokens. According to Kilpatrick, the model will be fully multimodal, with the capability to understand large videos as well. Recently, Apples shared a document on X, dated last year, revealing that Google is planning to integrate the 'PLANNING' piece in the LLM. Moreover, in an old Wired article, Google's Demis Hassabis also said that his team will combine the technologies used in AlphaGo to give the system new capabilities such as planning and solving new problems. Notably, Google recently published a paper titled 'Training Language Models to Self-Correct via Reinforcement Learning'. Google DeepMind has developed a multi-turn online reinforcement learning approach to improve an LLM's capabilities to self-correct. With further improvements to Google DeepMind's RL techniques and their integration with Chain of Thought in Gemini, Google could easily create a model that outperforms OpenAI's o1. Kilpatrick told AIM that Google Gemini and Google DeepMind collaborate closely, with Google DeepMind focused on making AI accessible to developers and the public. Google DeepMind's recent models AlphaProof and AlphaGeometry 2 won a silver medal at the International Mathematical Olympiad (IMO), while OpenAI's o1-preview achieved 83% in a similar test. Meanwhile, OpenAI is preparing to launch o1. According to a recent Reddit thread, Altman appears more confident about the imminence of AGI, likely due to their latest model, o1. He even said that they have achieved human-level reasoning and will now move on to level 3 in their AGI roadmap. Many are now suggesting that OpenAI's o1 may be regarded as the first successful commercial launch of a System 2 LLM. The battle is on, and it seems like Google is finally ready to steal the spotlight from OpenAI. As one X post put it, "We will finally see Gemini 2.0 Pro soon, long overdue. But they will probably wait for the release of full o1 to steal the show from OpenAI, so to speak - just as OpenAI has done to Google every time."
[3]
Gemini 2.0 is leaked, now we wait for the launch
Gemini 2.0 leaked this week, sparking anticipation for Google's latest AI model release. TestingCatalog identified a model titled Gemini-2.0-Pro-Exp-0111 on the Gemini web app, available only to select users under the Gemini Advanced section. This discovery has heightened speculation about Gemini 2.0's potential capabilities and suggests Google may be gearing up for a public launch soon. The Gemini-2.0-Pro-Exp-0111 model reportedly appears as an option for paid Google One AI Premium subscribers, who already enjoy exclusive access to advanced tools such as Gemini Advanced. Free users still have access to Gemini 1.5 Flash, but it's rumored that Google may introduce a Gemini 2.0 Flash for them as well. TestingCatalog noted that the experimental model responds quickly to prompts and includes capabilities such as image generation and web search. However, the model is not yet available for general use, indicating it might still be undergoing internal testing. The current model is labeled as "our experimental model," according to Google. However, it remains uncertain whether this experimental tag hints at its readiness for full public release or if it's merely part of Google's preliminary tests. Adding to the intrigue, Google seems to be working on a standalone Gemini app specifically for iPhone users. Though iPhone users have been able to access Gemini AI through the Google app, TestingCatalog reported a sighting of a dedicated Gemini app briefly available on the App Store. This app, which includes voice search, text generation, and image creation, promises a more direct experience with Google's AI features for iOS users. Voice search, showcased at Google I/O 2024, was one of the standout features, offering a conversational and advanced AI interaction. If launched, this standalone app would be a significant addition for iPhone users who may not have access to Apple's own advanced AI features. This move aligns with Google's strategy of enhancing cross-platform accessibility, providing a viable alternative to Apple's native AI tools. The timing of the Gemini 2.0 leak aligns with expectations of an official release by the end of 2024, as Google aims to keep pace with OpenAI's anticipated launch of its Orion model. Both companies have been preparing to unveil their next-generation AI models, setting the stage for intense competition in the AI space. With both Google and OpenAI in a race to dominate the next wave of AI technology, the imminent arrival of Gemini 2.0 could significantly bolster Google's presence. The added advantage of a dedicated iOS app positions Google to reach users across multiple platforms, a strategic advantage in this competitive landscape. Gemini Live can now speak French, German, Portuguese, Hindi, and Spanish The leak of Gemini 2.0 and the possible release of a Gemini app for iPhone underscore Google's commitment to delivering cutting-edge AI to a broader audience. As these developments unfold, they could redefine how users engage with AI across devices, setting a new standard for accessible, high-performance AI tools. Google's careful rollouts, combined with strategic platform inclusivity, signal a promising future for Gemini 2.0. With potential enhancements like faster response times, image generation, and voice-activated capabilities, the model stands to offer significant advancements over its predecessors.
[4]
Google could release its Gemini 2 AI model sooner than we thought -- here's why
Google is expected to release the next version of its Gemini family of AI models early next month, but according to Testing Catalog, it could be coming sooner than expected. Gemini was first announced in December 2023, replacing the Bard chatbot and becoming the default model across all Google products. Since then, version 1.5 has been released, and multiple smaller upgrades have been made, but it looks like Gemini 2 is just around the corner. Alexey Shabanov of Testing Catalog reports finding a new "experimental model" listed under the model menu in the Gemini web app. It was under the Gemini Advanced options and labeled as "With 2.0-Pro-Exp-0111." It isn't clear if this is actually Gemini 2 or a mistake in labeling. In his testing, Shabanov reports that it is "lightning fast" but may not be "fully baked" as it failed some basic tests including identifying the number of times "r" appears in the word strawberry. We know very little about Gemini 2 as AI labs, including those funded by large tech companies, tend to hold those details close to their chests until all training and testing is complete. However, we're getting a brief insight thanks to leaks and some early external testing gone awry. Gemini 2 is likely going to have some reasoning capabilities, similar to OpenAI's o1 family of models. It will also power "Project Jarvis", a new agent system set to be built into Chrome that will allow you to perform common web tasks by using natural language prompts. Previous reports suggested it would be released in the first week of December, a year after the release of Gemini 1. This makes sense and would be around the time we're expecting a major ChatGPT upgrade (to coincide with its second anniversary). If Gemini 2 follows recent trends from Google it will come in two versions initially, Gemini 2 Flash for very fast, lower-level responses and then Gemini 2 Pro for more advanced output. If the leak is anything to go by, it looks like even Pro will get a speed upgrade.
[5]
Leaked Screenshots Suggest That Google Might Launch Gemini 2.0 Closer To OpenAI's Upcoming o1 Model Release Window
AI seems to be all over the news, and the tech community is aggressively focusing on initiatives on the artificial intelligence front and working on ways to keep evolving the technology. OpenAI has recently shared its plans for an o1 model that is centered on bringing logical reasoning and problem-solving capabilities. Google would not be left behind, as we have now leaked screenshots hinting at the tech giant gearing up to launch its Gemini 2.0. The tech community is hooked on OpenAI's next big AI model, o1, as it is said to be quite promising and far more advanced than any of the existing tools, especially with its ability to reason and offer human-like intelligence. Google is not slowing down and is working vigorously on its next big AI innovation. A new report has surfaced that hints at the imminent release of Google's Gemini 2.0, and it might come out sooner than anticipated. The rumor mill seems to be churning in full swing, and recently leaked screenshots via Android Headlines have started doing the rounds for hinting about the upcoming model. One of the screenshots shared on X is of the Gemini website that expands on the model selector option, and we see the Gemini Advanced section "with 2.0-Pro-Exp-0111" listed as one of the options and is titled as being in the experimental phase. Another screenshot shared leaked reports regarding an unnamed Gemini model that has allegedly outed OpenAI's o1 mini model in performance in the early online testing phase. While details on this are scarce, the experimental model hints at the next Gemini model being in the works. With reports and speculations emerging every now and then, we are given the likelihood of Google rolling out its Gemini 2.0 model soon. While there is no confirmation on the company's end, such information should be taken with a grain of salt till more clarity is gained. Given the increasing competition in scaling AI capabilities, OpenAI and Google are working aggressively towards their next big model. If things are to be seen from a strategic point of view, Google might plan its release closer to OpenAI's launch of the o1 model, given how this pattern was previously observed with OpenAI launching GPT-4o sometime after Google brought forth its Gemini 1.5 Pro. Whether Google opts for a similar launch window or waits for OpenAI to make its launch move is still ambiguous, but one thing is certain both companies are refining their AI models and trying to keep up with each other's advances and working on leaving their mark on the AI front.
Share
Share
Copy Link
Recent leaks suggest Google is preparing to launch Gemini 2.0, a powerful AI model that could rival OpenAI's upcoming o1. The new model promises enhanced capabilities in reasoning, multimodal processing, and faster performance.
Recent leaks and reports suggest that Google is on the verge of releasing Gemini 2.0, its next-generation AI model. This development comes as the tech giant aims to maintain its competitive edge against OpenAI's upcoming o1 model 12.
A model titled "Gemini-2.0-Pro-Exp-0111" was spotted in the Gemini web app, available to select users under the Gemini Advanced section 3. This experimental model reportedly responds quickly to prompts and includes capabilities such as image generation and web search 3. While not yet available for general use, its presence indicates that Google may be in the final stages of testing before a public launch 4.
Gemini 2.0 is expected to bring significant improvements over its predecessors:
The timing of Gemini 2.0's development aligns closely with OpenAI's anticipated launch of its o1 model. Both companies are in a race to dominate the next wave of AI technology 5. Google's senior product manager, Logan Kilpatrick, hinted at the model's capabilities, suggesting it might outperform OpenAI's offerings in areas such as math, vision tasks, and creative writing 12.
Gemini 2.0 is expected to have wide-ranging applications:
The imminent release of Gemini 2.0 could significantly bolster Google's presence in the AI space. With both Google and OpenAI pushing the boundaries of AI technology, the competition is likely to drive rapid advancements in the field 5. This development also highlights the increasing focus on AI that can perform complex reasoning tasks and solve problems in a manner similar to human intelligence 2.
As the AI landscape continues to evolve, the release of Gemini 2.0 could mark a significant milestone in the development of more advanced, capable, and accessible AI systems. The tech community eagerly awaits official announcements from Google to confirm these leaked details and showcase the full potential of Gemini 2.0 5.
Reference
[2]
[3]
Google's experimental AI model Gemini-Exp-1121 has tied with OpenAI's GPT-4o for the top spot in AI chatbot rankings, showcasing rapid advancements in AI capabilities. However, this development also raises questions about the effectiveness of current AI evaluation methods.
5 Sources
5 Sources
Google is expected to release Gemini 2.0, the next generation of its AI model, in December 2024. This launch comes amid intense competition in the AI industry and may bring new capabilities and advancements to the field.
8 Sources
8 Sources
Google has announced the release of new Gemini models, showcasing advancements in AI technology. These models promise improved performance and capabilities across various applications.
2 Sources
2 Sources
Google hints at upcoming features for Gemini Advanced, including video generation tools, AI agents, and improved language models, signaling a significant leap in AI capabilities and user experience.
13 Sources
13 Sources
Google has released an experimental version of Gemini 2.0 Advanced, offering improved performance in math, coding, and reasoning. The new model is available to Gemini Advanced subscribers and represents a significant step in AI development.
11 Sources
11 Sources
The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.
© 2025 TheOutpost.AI All rights reserved