Curated by THEOUTPOST
On Wed, 9 Apr, 8:02 AM UTC
2 Sources
[1]
Deep Cogito emerges from stealth with hybrid AI 'reasoning' models | TechCrunch
A new company, Deep Cogito, has emerged from stealth with a family of openly available AI models that can be switched between "reasoning" and non-reasoning modes.

Reasoning models like OpenAI's o1 have shown great promise in domains like math and physics, thanks to their ability to effectively fact-check themselves by working through complex problems step by step. This reasoning comes at a cost, however: higher compute and latency. That's why labs like Anthropic are pursuing "hybrid" model architectures that combine reasoning components with standard, non-reasoning elements. Hybrid models can quickly answer simple questions while spending additional time considering more challenging queries.

All of Deep Cogito's models, called Cogito 1, are hybrid models. Cogito claims that they outperform the best open models of the same size, including models from Meta and Chinese AI startup DeepSeek. "Each model can answer directly [...] or self-reflect before answering (like reasoning models)," the company explained in a blog post. "[All] were developed by a small team in approximately 75 days."

The Cogito 1 models range from 3 billion to 70 billion parameters, and Cogito says that models of up to 671 billion parameters will join them in the coming weeks and months. Parameters roughly correspond to a model's problem-solving skills, with more parameters generally being better.

Cogito 1 wasn't developed from scratch, to be clear. Deep Cogito built on top of Meta's open Llama and Alibaba's Qwen models to create its own. The company says that it applied novel training approaches to boost the base models' performance and enable toggleable reasoning.

According to the results of Cogito's internal benchmarking, the largest Cogito 1 model, Cogito 70B, outperforms DeepSeek's R1 reasoning model on a few mathematics and language evaluations when reasoning is enabled. With reasoning disabled, Cogito 70B also eclipses Meta's recently released Llama 4 Scout model on LiveBench, a general-purpose AI test. Every Cogito 1 model is available for download or for use via APIs on cloud providers Fireworks AI and Together AI.

"Currently, we're still in the early stages of [our] scaling curve, having used only a fraction of compute typically reserved for traditional large language model post/continued training," wrote Cogito in its blog post. "Moving forward, we're investigating complementary post-training approaches for self-improvement."

According to filings with the state of California, San Francisco-based Deep Cogito was founded in June 2024. The company's LinkedIn page lists two co-founders, Drishan Arora and Dhruv Malhotra. Malhotra was previously a product manager at Google AI lab DeepMind, where he worked on generative search technology. Arora was a senior software engineer at Google.

Deep Cogito, whose backers include South Park Commons, according to PitchBook, ambitiously aims to build "general superintelligence." The company's founders understand the phrase to mean AI that can perform tasks better than most humans and "uncover entirely new capabilities we have yet to imagine."
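Since the models are served through hosted APIs, the mode toggle can be exercised from ordinary client code. The sketch below is a minimal illustration assuming an OpenAI-compatible chat completions endpoint (which Together AI exposes); the model identifier and the system-prompt mechanism for enabling reasoning are hypothetical placeholders, not confirmed details from the article.

```python
# Minimal sketch: toggling a hybrid model's reasoning mode over an OpenAI-compatible API.
# Assumptions: the model id string and the system-prompt toggle are hypothetical placeholders.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.together.xyz/v1",  # Together AI's OpenAI-compatible endpoint
    api_key="YOUR_API_KEY",
)

def ask(question: str, reasoning: bool = False) -> str:
    messages = []
    if reasoning:
        # Hypothetical toggle: a system prompt that switches the model into its
        # self-reflective "reasoning" mode before it answers.
        messages.append({"role": "system", "content": "Enable deep thinking."})
    messages.append({"role": "user", "content": question})
    response = client.chat.completions.create(
        model="deepcogito/cogito-70b",  # hypothetical model identifier; check the provider's catalog
        messages=messages,
    )
    return response.choices[0].message.content

print(ask("What is 17 * 24?"))                                    # fast, direct answer
print(ask("Prove that sqrt(2) is irrational.", reasoning=True))   # slower, step-by-step answer
```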
[2]
New open source AI company Deep Cogito releases first models and they're already topping the charts
Deep Cogito, a new AI research startup based in San Francisco, officially emerged from stealth today with Cogito v1, a new line of open source large language models (LLMs) fine-tuned from Meta's Llama 3.2 and equipped with hybrid reasoning capabilities -- the ability to answer quickly and immediately, or to "self-reflect" like OpenAI's "o" series and DeepSeek R1.

The company aims to push the boundaries of AI beyond current human-overseer limitations by enabling models to iteratively refine and internalize their own improved reasoning strategies. It is ultimately on a quest toward developing superintelligence -- AI smarter than all humans in all domains -- yet the company says that "All models we create will be open sourced."

Deep Cogito's CEO and co-founder Drishan Arora -- a former senior software engineer at Google who says he led large language model (LLM) modeling for Google's generative search product -- also said in a post on X that they are "the strongest open models at their scale - including those from LLaMA, DeepSeek, and Qwen."

The initial model lineup includes five base sizes: 3 billion, 8 billion, 14 billion, 32 billion, and 70 billion parameters, available now on the AI code sharing community Hugging Face, on Ollama, and through application programming interfaces (APIs) on Fireworks AI and Together AI. They are released under the Llama licensing terms, which allow commercial usage -- so third-party enterprises could put them to work in paid products -- up to 700 million monthly users, at which point a paid license from Meta is required. The company plans to release even larger models -- up to 671 billion parameters -- in the coming months.

Arora describes the company's training approach, iterated distillation and amplification (IDA), as a novel alternative to traditional reinforcement learning from human feedback (RLHF) or teacher-model distillation. The core idea behind IDA is to allocate more compute for a model to generate improved solutions, then distill the improved reasoning process into the model's own parameters -- effectively creating a feedback loop for capability growth. Arora likens this approach to Google AlphaGo's self-play strategy, applied to natural language.

Each model supports both a standard mode for direct answers and a reasoning mode, in which the model reflects internally before responding.

Benchmarks and evaluations

The company shared a broad set of evaluation results comparing Cogito models to open-source peers across general knowledge, mathematical reasoning, and multilingual tasks. Cogito models generally show their highest performance in reasoning mode, though some trade-offs emerge, particularly in mathematics. For instance, while Cogito 70B (Standard) matches or slightly exceeds peers on MATH and GSM8K, Cogito 70B (Reasoning) trails DeepSeek R1 on MATH by over five percentage points (83.3% vs. 89.0%).

Tool calling built-in

In addition to general benchmarks, Deep Cogito evaluated its models on native tool-calling performance, a growing priority for agents and API-integrated systems. The improvements it reports are attributed not only to model architecture and training data, but also to task-specific post-training, which many baseline models currently lack.
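As a rough illustration of what native tool calling looks like in practice, the sketch below assumes the OpenAI-compatible chat completions interface that Fireworks AI exposes; the model identifier and the example tool schema are hypothetical and only show the shape of a tool-calling request.

```python
# Minimal tool-calling sketch against an OpenAI-compatible endpoint.
# The model id below is a hypothetical placeholder; swap in the provider's actual Cogito model name.
import json
from openai import OpenAI

client = OpenAI(base_url="https://api.fireworks.ai/inference/v1", api_key="YOUR_API_KEY")

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # example tool; define whatever functions your agent needs
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="accounts/fireworks/models/cogito-70b",  # hypothetical identifier
    messages=[{"role": "user", "content": "What's the weather in Tokyo right now?"}],
    tools=tools,
)

# A tool-capable model should return a structured tool call rather than prose.
for call in response.choices[0].message.tool_calls or []:
    print(call.function.name, json.loads(call.function.arguments))
```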
Looking ahead

Deep Cogito plans to release larger-scale models in the coming months, including mixture-of-experts variants at the 109B, 400B, and 671B parameter scales. The company will also continue updating its current model checkpoints with extended training. It positions its IDA methodology as a long-term path toward scalable self-improvement, removing dependence on human oversight or static teacher models. Arora emphasizes that while performance benchmarks are important, real-world utility and adaptability are the true tests for these models, and that the company is just at the beginning of what it believes is a steep scaling curve. Deep Cogito's research and infrastructure partnerships include teams from Hugging Face, RunPod, Fireworks AI, Together AI, and Ollama. All released models are open source and available now.
Deep Cogito, a new AI research startup, has unveiled a series of open-source large language models with hybrid reasoning capabilities, aiming to push the boundaries of AI development towards superintelligence.
Deep Cogito, a San Francisco-based AI research startup, has emerged from stealth mode with the release of Cogito v1, a new family of open-source large language models (LLMs) that feature innovative hybrid reasoning capabilities [1]. These models, which can switch between "reasoning" and non-reasoning modes, represent a significant advancement in AI technology and have already shown impressive performance on various benchmarks.
The Cogito v1 models are built on a hybrid architecture that combines reasoning components with standard, non-reasoning elements. This approach allows the models to quickly answer simple questions while dedicating additional time to more complex queries that require deeper consideration [1]. The ability to toggle between these modes offers a unique flexibility in AI applications, potentially improving efficiency and accuracy across various tasks.
Deep Cogito has released five base sizes of the Cogito v1 models, ranging from 3 billion to 70 billion parameters [2]. These models are available for download via Hugging Face and Ollama, and can also be accessed through APIs provided by Fireworks AI and Together AI. The company plans to release even larger models, up to 671 billion parameters, in the coming months [2].
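For local experimentation, a minimal sketch using Hugging Face transformers might look like the following. The repository name and the system-prompt mechanism for enabling reasoning mode are assumptions made for illustration; the model cards on Hugging Face document the exact identifiers and prompt format.

```python
# Minimal local-inference sketch with Hugging Face transformers.
# The repo id and the reasoning-mode system prompt are hypothetical placeholders.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "deepcogito/cogito-v1-preview-llama-3B"  # assumed repo name; verify on Hugging Face
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")

def generate(question: str, reasoning: bool = False) -> str:
    messages = []
    if reasoning:
        # Assumed toggle: a system prompt that puts the model in its self-reflective mode.
        messages.append({"role": "system", "content": "Enable deep thinking."})
    messages.append({"role": "user", "content": question})
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    outputs = model.generate(inputs, max_new_tokens=512)
    return tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True)

print(generate("Name the capital of Australia."))                      # direct answer
print(generate("Show that the sum of two odd numbers is even.", reasoning=True))  # reflective answer
```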
According to Deep Cogito's internal benchmarking, the largest model, Cogito 70B, outperforms other open models of similar size, including those from Meta and DeepSeek, on several mathematics and language evaluations [1]. The company claims that its models are "the strongest open models at their scale," surpassing offerings from LLaMA, DeepSeek, and Qwen [2].
Deep Cogito employs a unique training methodology called iterated distillation and amplification (IDA). This approach, described as an alternative to traditional reinforcement learning from human feedback (RLHF), aims to create a feedback loop for capability growth by allowing the model to generate improved solutions and then distill the enhanced reasoning process into its own parameters [2].
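The sources describe IDA only at a high level, but the amplify-then-distill loop they outline can be sketched in Python. Everything below, including the helper functions and their behavior, is a hypothetical stand-in used to illustrate the idea; it is not Deep Cogito's actual training code.

```python
# Conceptual sketch of iterated distillation and amplification (IDA).
# Every name here is a hypothetical stand-in for illustration only.
import random

def generate(model, prompt, reasoning=False):
    # Stand-in for model inference; a real system would call the LLM here.
    return f"answer to {prompt!r} (reasoning={reasoning}, checkpoint v{model})"

def score(prompt, answer):
    # Stand-in for a verifier or reward model that rates candidate answers.
    return random.random()

def finetune(model, pairs):
    # Stand-in for supervised fine-tuning on (prompt, improved answer) pairs.
    return model + 1  # pretend each round yields an improved checkpoint

def amplify(model, prompt, num_samples=16):
    """Amplification: spend extra compute to find a better answer than a single pass gives."""
    candidates = [generate(model, prompt, reasoning=True) for _ in range(num_samples)]
    return max(candidates, key=lambda ans: score(prompt, ans))

def ida(model, prompts, iterations=3):
    """Iterate: amplify answers, then distill them back into the model's parameters."""
    for _ in range(iterations):
        pairs = [(p, amplify(model, p)) for p in prompts]
        model = finetune(model, pairs)  # distillation step
    return model

print(ida(model=0, prompts=["What is 2**10?", "Summarize the Llama license terms."]))
```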
Founded in June 2024, Deep Cogito is led by co-founders Drishan Arora and Dhruv Malhotra, both of whom have backgrounds in prominent AI research institutions [1]. The company's ambitious goal is to build "general superintelligence," which they define as AI capable of performing tasks better than most humans and uncovering entirely new capabilities [1].
Deep Cogito has committed to open-sourcing all of its models, making them available under the Llama licensing terms for commercial usage [2]. This approach aligns with the company's vision of pushing AI development forward collaboratively. As they continue to refine and expand their model lineup, Deep Cogito aims to remove dependence on human or static teacher models, paving the way for scalable self-improvement in AI systems [2].
OpenAI launches o3 and o4-mini, new AI reasoning models with enhanced capabilities in math, coding, science, and visual understanding. These models can integrate images into their reasoning process and use ChatGPT tools independently.
14 Sources
DeepSeek's open-source R1 model challenges OpenAI's o1 with comparable performance at a fraction of the cost, potentially revolutionizing AI accessibility and development.
6 Sources
DeepSeek, a Chinese AI company, has launched R1-Lite-Preview, an open-source reasoning model that reportedly outperforms OpenAI's o1 preview in key benchmarks. The model showcases advanced reasoning capabilities and transparency in problem-solving.
11 Sources
Chinese AI startup DeepSeek releases a major upgrade to its V3 language model, showcasing improved performance and efficiency. The open-source model challenges industry leaders with its ability to run on consumer hardware.
16 Sources
Alibaba releases QwQ-32B-Preview, an open-source AI model that rivals OpenAI's o1 in reasoning capabilities. The model outperforms o1 on specific benchmarks and is available for commercial use.
5 Sources