Deep Cogito Emerges from Stealth with Innovative Hybrid AI 'Reasoning' Models

2 Sources

Deep Cogito, a new AI research startup, has unveiled a series of open-source large language models with hybrid reasoning capabilities, aiming to push the boundaries of AI development towards superintelligence.

News article

Deep Cogito Unveils Groundbreaking Hybrid AI Models

Deep Cogito, a San Francisco-based AI research startup, has emerged from stealth mode with the release of Cogito v1, a new family of open-source large language models (LLMs) that feature innovative hybrid reasoning capabilities 1. These models, which can switch between "reasoning" and non-reasoning modes, represent a significant advancement in AI technology and have already shown impressive performance on various benchmarks.

Hybrid Reasoning: A New Approach to AI

The Cogito v1 models are built on a hybrid architecture that combines reasoning components with standard, non-reasoning elements. This approach allows the models to quickly answer simple questions while dedicating additional time to more complex queries that require deeper consideration 1. The ability to toggle between these modes offers a unique flexibility in AI applications, potentially improving efficiency and accuracy across various tasks.

Model Specifications and Availability

Deep Cogito has released five base sizes of the Cogito v1 models, ranging from 3 billion to 70 billion parameters 2. These models are available for download via Hugging Face and Ollama, and can also be accessed through APIs provided by Fireworks AI and Together AI. The company plans to release even larger models, up to 671 billion parameters, in the coming months 2.

Performance and Benchmarks

According to Deep Cogito's internal benchmarking, the largest model, Cogito 70B, outperforms other open models of similar size, including those from Meta and DeepSeek, on several mathematics and language evaluations 1. The company claims that their models are "the strongest open models at their scale," surpassing offerings from LLaMA, DeepSeek, and Qwen 2.

Novel Training Approach

Deep Cogito employs a unique training methodology called iterated distillation and amplification (IDA). This approach, described as an alternative to traditional reinforcement learning from human feedback (RLHF), aims to create a feedback loop for capability growth by allowing the model to generate improved solutions and then distill the enhanced reasoning process into its own parameters 2.

Company Background and Vision

Founded in June 2024, Deep Cogito is led by co-founders Drishan Arora and Dhruv Malhotra, both of whom have backgrounds in prominent AI research institutions 1. The company's ambitious goal is to build "general superintelligence," which they define as AI capable of performing tasks better than most humans and uncovering entirely new capabilities 1.

Open-Source Commitment and Future Plans

Deep Cogito has committed to open-sourcing all of its models, making them available under the Llama licensing terms for commercial usage 2. This approach aligns with the company's vision of pushing AI development forward collaboratively. As they continue to refine and expand their model lineup, Deep Cogito aims to remove dependence on human or static teacher models, paving the way for scalable self-improvement in AI systems 2.

Explore today's top stories

OpenAI Challenges Court Order to Preserve Deleted ChatGPT Conversations Amid NYT Lawsuit

OpenAI appeals a court order requiring it to indefinitely store deleted ChatGPT conversations as part of The New York Times' copyright lawsuit, citing user privacy concerns and setting a precedent for AI data retention.

The Verge logoengadget logoGizmodo logo

9 Sources

Technology

17 hrs ago

OpenAI Challenges Court Order to Preserve Deleted ChatGPT

Anysphere's Cursor AI Coding Assistant Secures $900M Funding, Reaches $9.9B Valuation

Anysphere, the company behind the AI coding assistant Cursor, has raised $900 million in funding, reaching a $9.9 billion valuation. The startup has surpassed $500 million in annual recurring revenue, making it potentially the fastest-growing software startup ever.

TechCrunch logoBloomberg Business logoSiliconANGLE logo

4 Sources

Technology

17 hrs ago

Anysphere's Cursor AI Coding Assistant Secures $900M

US-UAE AI Data Campus Deal Faces Security Hurdles Despite High-Profile Announcement

A multi-billion dollar deal to build one of the world's largest AI data center hubs in the UAE, involving major US tech companies, is far from finalized due to persistent security concerns and geopolitical complexities.

Reuters logoEconomic Times logoInvesting.com logo

4 Sources

Technology

9 hrs ago

US-UAE AI Data Campus Deal Faces Security Hurdles Despite

PwC Report Reveals AI's Positive Impact on Job Market: Workers Become 'More Valuable'

A new PwC study challenges common fears about AI's impact on jobs, showing that AI is actually creating jobs, boosting wages, and increasing worker value across industries.

CNBC logoEconomic Times logo

2 Sources

Business and Economy

9 hrs ago

PwC Report Reveals AI's Positive Impact on Job Market:

AI Film Festival Showcases the Future of Movie-Making Technology

Runway's AI Film Festival in New York highlights the growing role of artificial intelligence in filmmaking, showcasing innovative short films and sparking discussions about AI's impact on the entertainment industry.

AP NEWS logoABC News logoThe Seattle Times logo

5 Sources

Technology

9 hrs ago

AI Film Festival Showcases the Future of Movie-Making
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Β© 2025 Triveous Technologies Private Limited
Twitter logo
Instagram logo
LinkedIn logo