Curated by THEOUTPOST
On Wed, 16 Apr, 4:01 PM UTC
3 Sources
[1]
Cohere launches Embed 4: New multimodal search model processes 200-page documents
Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Enterprise retrieval augmented generation (RAG) remains integral to the current agentic AI craze. Taking advantage of the continued interest in agents, Cohere released the latest version of its embeddings model with longer context windows and more multimodality. Cohere's Embed 4 builds on the multimodal updates of Embed 3 and adds more capabilities around unstructured data. Thanks to a 128,000 token context window, organizations can generate embeddings for documents with around 200 pages. "Existing embedding models fail to natively understand complex multimodal business materials,‬†leading companies to develop cumbersome data pre-processing pipelines that only slightly‬†improve accuracy," Cohere said in a blog post. "Embed 4 solves this problem, allowing enterprises and their employees to‬†efficiently surface insights that are hidden within mountains of unsearchable information.‬" Enterprises can deploy Embed 4 on virtual private clouds or on-premise technology stacks for added data security. Companies can generate embeddings to transform their documents or other data into numerical representations for RAG use cases. Agents can then reference these embeddings to answer prompts. Domain-specific knowledge Embed 4 "excels in regulated industries" like finance, healthcare and manufacturing, the company said. Cohere, which mainly focuses on enterprise AI use cases, said its models consider the security needs of regulated sectors and have a strong understanding of businesses. The company trained Embed 4 "to be robust against noisy real-world data" in that it remains accurate despite the "imperfections" of enterprise data, such as spelling mistakes and formatting issues. "It is also performant at searching over scanned documents and‬ handwriting. These formats are common in legal paperwork, insurance invoices, and expense‬ receipts. This capability eliminates the need for complex data preparations or pre-processing†pipelines, saving businesses time and operational costs," Cohere said. Organizations can use Embed 4 for investor presentations, due diligence files, clinical trial reports, repair guides and product documents. †The model supports more than 100 languages, just like the previous version of the model. Agora, a customer of Cohere, used Embed 4 for its AI search engine and found that the model could surface relevant products. "E-commerce data is complex, containing images and multifaceted text descriptions. Being able to‬ represent our products in a unified embedding makes our search faster and our internal tooling more‬ efficient," said Param Jaggi, Founder of Agora‬, in the blog post. Agent use cases Cohere argues that models like Embed 4 would improve agentic use cases and claims it can be "the optimal search engine" for agents and AI assistants across an enterprise. "In addition to‬†strong accuracy across data types, the model delivers enterprise-grade efficiency," Cohere said. "This allows it‬ to scale to meet the demands of large organizations." Cohere added that Embed 4 creates compressed data embeddings to cut high storage costs. Embeddings and RAG-based searches let the agent reference specific documents to fulfill request-related tasks. Many believe these provide more accurate results, ensuring the agents do not respond with incorrect or hallucinated answers.
[2]
Cohere releases Embed 4: a multimodal AI model designed for agentic search - SiliconANGLE
Cohere releases Embed 4: a multimodal AI model designed for agentic search Artificial intelligence startup Cohere Inc. today launched Embed 4, its latest AI model designed to provide embeddings for search and retrieval for AI applications such as assistants and agents. Enterprise businesses rely heavily on retrieval augmented generation, a technique that allows generative AI models to retrieve and incorporate fresh, accurate information in real-time so that large language models respond to user queries with the best possible data. Embedding models, such as Embed take data from documents and turn it into mathematical representations called vectors that can be used to represent the dynamic, multifaceted context of the information. In the case of Embed 4, this can include anything from text-based documents or images to tables, graphs, code and diagrams. Cohere said the new model includes an extremely large context length - up to 128,000 tokens, or around a 200-page document - allowing it to ingest a lengthy annual financial report, product manual or detailed legal contract. It is also deeply multilingual with over 100 languages, featuring key business languages including Arabic, Japanese, Korean and French proficiencies in addition to English. "Embed 4 enables organizations to search their unstructured documents, where a large majority of their important data resides," Cohere said about Embed 4 in the announcement. "It is uniquely capable of generating high-quality representations of complex mixed-modality documents - all within a unified vector." The AI startup said it designed the new model to excel in regulated industries such as finance, healthcare and manufacturing with domain-specific understanding of these industries. These include capabilities for searching investor presentations, annual financial reports, medical records, procedural charts, product specification documents, repair guides and supply chain documents. Cohere also noted that Embed 4 can deal with noisy real-world data by handling fuzzy images and poorly oriented documents. The model was trained against numerous documents of scanned documents, handwriting and other distressed documents, the company said. These are the types of complex data that many businesses will encounter in day-to-day multimodal data pre-processing which are part of the human manual pipeline. Agora, an AI-driven search engine for 35,000 online stores and a customer of Cohere used the model to assist with its business, said it was able to build a better search using its advanced multimodal embedding features. "E-commerce data is complex, containing images and multifaceted text descriptions," said Param Jaggi, founder of Agora. "Being able to represent our products in a unified embedding makes our search faster and our internal tooling more efficient," Embed's capabilities are essential for accurate search and retrieval, which power generative AI models such as Cohere's Command A, a low-cost model the company released last month. Models such as Command A power conversational assistants and AI agents, but rely heavily on search engines, which are connected to secure, proprietary company information to source relevant information to user questions. This is necessary to speed up responses, increase accuracy and reduce hallucinations. Cohere said the new Embed 4 model is integrated with North, the company's secure AI agent productivity platform where it powers its semantic search capability in its Compass product. The Embed 4 model is also available starting today from Microsoft Azure AI Foundry, on Amazon SageMaker and for private deployments.
[3]
Cohere Launches Embed 4 for Advanced Multimodal Enterprise Search | AIM Media House
Security and deployment flexibility are central to Embed 4's design. Enterprise AI startup Cohere has released Embed 4, a powerful new embedding model designed to help organisations search and make sense of large volumes of unstructured data, including documents combining text with visuals. Tailored for enterprise use, Embed 4 supports applications like AI assistants and agents that need a deeper understanding of a company's internal knowledge. It can process documents up to 1,28,000 tokens in length, roughly 200 pages, making it ideal for handling long-form content such as annual reports, legal agreements or technical documentation. The model supports more than 100 languages, including widely used business languages like Arabic, French, Japanese and Korean. This makes it easier for global teams to find and access the information they need, regardless of language. Security and deployment flexibility are central to Embed 4's design. It can be run within virtual private clouds or on premises, helping organisations regulate sectors such as finance, healthcare, or manufacturing meet strict data privacy and compliance requirements. One of Embed 4's standout strengths is its ability to deal with real-world document imperfections. Whether formatting issues, typos or scanned images are involved, the model is built to handle messy data often found in business documents such as invoices, charts or handwritten notes. It also excels at representing multimodal content -- documents that mix text with images, tables, graphs and even code -- in a single embedding. This improves the accuracy of enterprise search tools and cuts down on the need for heavy pre-processing when analysing complex files. Embed 4 is a key component in retrieval-augmented generation (RAG) pipelines, where generative models like Cohere's Command R rely on high-quality data retrieval before producing responses. As the engine behind this retrieval, Embed 4 helps ensure that generated content is more reliable, grounded in the right context and less prone to hallucination. Embed 4, along with Cohere's Command A language model, is now available via the Azure AI Foundry model catalogue.
Share
Share
Copy Link
Cohere releases Embed 4, a powerful new embedding model designed for enterprise-grade multimodal search and retrieval, supporting AI assistants and agents with improved accuracy and efficiency.
Cohere, an enterprise AI startup, has launched Embed 4, its latest embedding model designed to revolutionize search and retrieval capabilities for AI applications such as assistants and agents. This release marks a significant advancement in multimodal AI technology, particularly for enterprise use cases 123.
Embed 4 boasts an impressive context length of up to 128,000 tokens, allowing it to process documents of around 200 pages. This extensive capacity enables the model to handle lengthy annual financial reports, product manuals, and detailed legal contracts with ease 12.
The model excels in processing unstructured and multimodal data, including:
This versatility allows Embed 4 to generate high-quality representations of complex mixed-modality documents within a unified vector 2.
Embed 4 supports over 100 languages, with proficiency in key business languages such as Arabic, Japanese, Korean, and French, in addition to English. This multilingual capability facilitates efficient information access for global teams 23.
The model is tailored for regulated industries like finance, healthcare, and manufacturing, demonstrating domain-specific understanding. It can effectively handle:
Cohere has trained Embed 4 to be resilient against noisy real-world data, maintaining accuracy despite common imperfections in enterprise data such as:
This capability significantly reduces the need for complex data preparation and pre-processing pipelines, potentially saving businesses time and operational costs 1.
Embed 4 plays a crucial role in retrieval augmented generation (RAG) pipelines, where it serves as the engine behind high-quality data retrieval. This integration helps ensure that generated content is more reliable, contextually grounded, and less prone to hallucination 3.
Recognizing the importance of data security, especially in regulated sectors, Cohere offers flexible deployment options for Embed 4:
Agora, an AI-driven search engine for 35,000 online stores and a Cohere customer, has successfully implemented Embed 4 to improve its search capabilities. The model's ability to represent complex e-commerce data, including images and multifaceted text descriptions, in a unified embedding has resulted in faster search and more efficient internal tooling 12.
As enterprises continue to grapple with vast amounts of unstructured data, Cohere's Embed 4 emerges as a powerful solution, promising to unlock hidden insights and streamline information retrieval across various industries.
Reference
[2]
[3]
Analytics India Magazine
|Cohere Launches Embed 4 for Advanced Multimodal Enterprise Search | AIM Media HouseCohere launches Embed 3, an advanced multimodal AI model that integrates text and image embeddings, setting new standards for enterprise search and multilingual retrieval tasks.
2 Sources
2 Sources
Cohere releases Command A, a new large language model designed for enterprise use, offering high performance with minimal hardware requirements and expanded multilingual capabilities.
5 Sources
5 Sources
Cohere introduces Command R7B, the smallest model in its R series, designed for enterprise use with a focus on efficiency, performance, and versatility across multiple languages and tasks.
2 Sources
2 Sources
Cohere's research arm releases Aya Expanse, a family of multilingual AI models that outperform leading open-source alternatives, aiming to bridge the global language divide in AI technology.
3 Sources
3 Sources
Cohere, a Canada-based AI startup, introduces North, a secure AI platform designed to enhance enterprise workflows with LLM-powered tools, advanced search capabilities, and customizable AI agents, prioritizing data security and privacy for regulated industries.
4 Sources
4 Sources
The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.
© 2025 TheOutpost.AI All rights reserved