Curated by THEOUTPOST
On Wed, 9 Apr, 12:03 AM UTC
2 Sources
[1]
Meta's Llama 4 Models Now Available on Krutrim Cloud
Llama 4 models, including Scout and Maverick, are now live on Krutrim Cloud, allowing developers to build and deploy AI applications at competitive pricing.
Ola chief Bhavish Aggarwal on Tuesday announced that Krutrim Cloud will be able to run Meta's Llama 4 models entirely on India-hosted cloud infrastructure. The move will allow developers across the country to access advanced AI capabilities while maintaining full data sovereignty.
"Excited to share that Krutrim is among the world's first to host Meta's Llama 4 models running entirely on its India-hosted cloud. Powering our developers with world-class AI, at industry-disrupting prices with complete data sovereignty," he said in a post on X.
In a separate LinkedIn post, he said that the company is deploying both Llama 4 Scout and Llama 4 Maverick models at even more disruptive prices, just ₹7 to ₹17 per million tokens. "This isn't just about cost savings - it's about democratising access to cutting-edge AI for every Indian developer and startup," he said.
The models are hosted within India's borders, aligning with growing demands for localised data control and privacy.
Krutrim Cloud, launched last year, provides a comprehensive suite of AI services, including Model-as-a-Service (MaaS) and GPU-as-a-Service. It recently added support for DeepSeek models as well.
Meta recently launched two multimodal open-weight models: Llama 4 Scout and Llama 4 Maverick. Both models are built on a mixture-of-experts (MoE) setup. Llama 4 Scout features 17 billion active parameters and 16 experts, and is designed to fit within a single H100 GPU. Meta claims it supports an industry-leading 10 million token context window, enabling complex tasks such as multi-document summarisation and reasoning over large codebases.
Llama 4 Maverick is a 17 billion active parameter model with 128 experts. It has 400 billion total parameters and performs competitively with larger models like DeepSeek V3 on reasoning and coding tasks. Meta said that Maverick exceeds GPT-4o and Gemini 2.0 Flash on several benchmarks. It scored an Elo of 1417 on LMArena in experimental chat settings.
The models were distilled from Llama 4 Behemoth. This unreleased teacher model is also a multimodal mixture-of-experts model, with 288 billion active parameters, 16 experts, and nearly two trillion total parameters.
Questions were also raised about whether the models had been trained on benchmark test data. Ahmad Al-Dahle, the lead of GenAI at Meta, later denied this. "That's simply not true, and we would never do that. Our best understanding is that the variable quality people are seeing is due to needing to stabilise implementations," he said.
[2]
Krutrim Starts Hosting Meta's Llama 4 Models On Its Cloud In India
The startup also reiterated that the second version of Krutrim Assistant is well on track to be rolled out later this month and that it will feature the "DeepSearch" tool.
Bhavish Aggarwal-led artificial intelligence (AI) unicorn Krutrim has said that it has started hosting Meta's Llama 4 models on its cloud platform.
In a statement, Krutrim said that Llama 4 Scout and Llama 4 Maverick will be available for developers to test, build, and deploy applications in the price range of INR 7 to INR 17 per Mn tokens.
While Llama 4 Scout is a 17 Bn active parameter model with 16 experts and a 10 Mn token context window, Llama 4 Maverick has 17 Bn active parameters with 128 experts and a 1 Mn token context window. For context, tokens are the fundamental units of text that LLMs process and generate.
With this, the unicorn claims to have become the first Indian AI company to deploy Meta's Llama 4 models on Indian servers.
"Excited to share that @Krutrim is among the world's first to host Meta's Llama 4 models running entirely on its India-hosted cloud. Powering our developers with world-class AI, at industry-disrupting prices, with complete data sovereignty," said Aggarwal in a post on X.
In a statement, the startup reiterated that the second version of Krutrim Assistant is well on track to be rolled out later this month. It also said that the Assistant, V2, will feature "DeepSearch", a tool designed to make data searches more precise and efficient.
The AI startup also reiterated plans to scale up its data centre capacity to 1 GW by 2028.
The development comes a few months after the Bhavish Aggarwal-led AI unicorn, in January 2025, began hosting open source AI models of Chinese GenAI company DeepSeek on its cloud platform. A month later, in February, it also deployed DeepSeek's R1 671B model on Nvidia's H100 graphics processing units in India.
Founded in 2023, Krutrim offers GPU-as-a-service and model-as-a-service, along with multiple no-code platforms.
It became a unicorn in January 2024 after raising $50 Mn in a round led by Z47 (erstwhile Matrix Partners India). The startup has also submitted a proposal to the government to build indigenous AI foundational models under the INR 10,037 Cr IndiaAI Mission. Earlier this week, IT minister Ashwini Vaishnaw said that the evaluation process of the proposals is in the "final leg", adding that the first few of the selected startups will be offered funding by the Centre.
Krutrim, an Indian AI startup, now hosts Meta's Llama 4 models on its Krutrim Cloud platform, offering developers access to the models at ₹7 to ₹17 per million tokens while keeping data hosted within India.
Krutrim has deployed Meta's Llama 4 models on Krutrim Cloud, its India-hosted cloud infrastructure. The company presents the move as a step toward democratizing access to cutting-edge AI for Indian developers and startups [1].
Ola chief and Krutrim founder Bhavish Aggarwal said the company is offering Llama 4 Scout and Llama 4 Maverick at ₹7 to ₹17 per million tokens, pricing intended to make advanced AI capabilities accessible to a wider range of developers and businesses [1].
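To make the quoted price band concrete, the following minimal sketch estimates a cost range for a given token volume. It assumes a flat per-million-token rate; the sources do not say which model sits at which end of the band, or whether input and output tokens are billed differently. The function names and the 250 million token workload are hypothetical.

```python
from decimal import Decimal

# Price band quoted in the article, in rupees per million tokens.
PRICE_LOW_INR_PER_MILLION = Decimal("7")
PRICE_HIGH_INR_PER_MILLION = Decimal("17")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_inr(total_tokens: int, price_per_million: Decimal) -> Decimal:
    """Cost in rupees for total_tokens at a flat per-million rate (an assumption)."""
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")
    return (Decimal(total_tokens) / TOKENS_PER_MILLION) * price_per_million


def cost_range_inr(total_tokens: int) -> tuple[Decimal, Decimal]:
    """Lowest and highest cost implied by the quoted price band."""
    return (
        estimate_cost_inr(total_tokens, PRICE_LOW_INR_PER_MILLION),
        estimate_cost_inr(total_tokens, PRICE_HIGH_INR_PER_MILLION),
    )


if __name__ == "__main__":
    monthly_tokens = 250_000_000  # hypothetical workload, not from the article
    low, high = cost_range_inr(monthly_tokens)
    print(f"INR {low:,.2f} to INR {high:,.2f} per month")
```

Under these assumptions, a workload of 250 million tokens a month would fall between ₹1,750 and ₹4,250.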
Because the models are hosted within India's borders, the deployment addresses growing concerns about data sovereignty and privacy, allowing developers to use the models while maintaining data sovereignty [2].
Llama 4 Scout has 17 billion active parameters and 16 experts. Meta claims it supports a 10 million token context window, which enables complex tasks such as multi-document summarization and reasoning over large codebases [1].
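As a rough illustration of what the context-window figures mean for multi-document work, the sketch below checks whether a set of documents, measured in tokens, fits within a model's window. The window sizes are the figures reported in the sources (10 million tokens for Scout, 1 million for Maverick). The model labels are hypothetical dictionary keys, not API identifiers, and the sources do not say whether a hosted endpoint exposes the full window.

```python
from typing import Iterable

# Context-window sizes as reported in the sources; the keys are hypothetical labels.
CONTEXT_WINDOW_TOKENS = {
    "llama-4-scout": 10_000_000,
    "llama-4-maverick": 1_000_000,
}


def fits_in_context(
    doc_token_counts: Iterable[int],
    model: str,
    reserved_for_output: int = 0,
) -> bool:
    """Return True if all documents plus the reserved output budget fit in the window."""
    budget = CONTEXT_WINDOW_TOKENS[model] - reserved_for_output
    return sum(doc_token_counts) <= budget


if __name__ == "__main__":
    docs = [400_000, 900_000]  # hypothetical token counts for two large documents
    for name in CONTEXT_WINDOW_TOKENS:
        print(name, fits_in_context(docs, name, reserved_for_output=4_000))
```

Token counts depend on the tokenizer, so in practice the inputs would come from counting tokens with the model's own tokenizer rather than from word counts.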
Llama 4 Maverick also has 17 billion active parameters, but with 128 experts and 400 billion total parameters. It performs competitively with larger models like DeepSeek V3 on reasoning and coding tasks, and Meta says it exceeds GPT-4o and Gemini 2.0 Flash on several benchmarks [1].
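The gap between active and total parameters is what distinguishes a mixture-of-experts model: only part of the network is used for any given token. A minimal sketch of that ratio, using the Maverick figures reported above (the function name is hypothetical, and the calculation says nothing about how Meta routes tokens to experts):

```python
def active_fraction(active_params: int, total_params: int) -> float:
    """Share of a model's parameters that are active for a single token."""
    if total_params <= 0:
        raise ValueError("total_params must be positive")
    if not 0 <= active_params <= total_params:
        raise ValueError("active_params must be between 0 and total_params")
    return active_params / total_params


# Figures reported for Llama 4 Maverick in the article.
MAVERICK_ACTIVE_PARAMS = 17_000_000_000
MAVERICK_TOTAL_PARAMS = 400_000_000_000

if __name__ == "__main__":
    share = active_fraction(MAVERICK_ACTIVE_PARAMS, MAVERICK_TOTAL_PARAMS)
    print(f"Active share per token: {share:.2%}")
```

On these figures, roughly 4.25% of Maverick's parameters are active for each token, which is why a model with 400 billion total parameters can be compared with much smaller dense models on per-token compute.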
Krutrim Cloud, launched last year, provides a comprehensive suite of AI services, including Model-as-a-Service (MaaS) and GPU-as-a-Service. The company has been actively expanding its portfolio, having recently added support for DeepSeek models as well [1].
The startup has ambitious plans to scale up its data center capacity to 1 GW by 2028, indicating its commitment to building robust AI infrastructure in India [2].
Krutrim is set to release the second version of its Krutrim Assistant later this month, featuring a new "DeepSearch" tool designed to enhance data search precision and efficiency [2].
The company has also submitted a proposal to the Indian government under the INR 10,037 Cr IndiaAI Mission, aiming to build indigenous AI foundational models. This aligns with the government's efforts to boost India's AI capabilities, with IT minister Ashwini Vaishnaw indicating that the evaluation process for such proposals is in its final stages [2].
© 2025 TheOutpost.AI All rights reserved