Floworks' ThorV2 Architecture: A Game-Changer for API Calling in LLMs

2 Sources

Floworks, in collaboration with IIT Bombay and IIT Kharagpur, introduces ThorV2, a novel architecture that enhances LLMs' API calling capabilities, offering improved accuracy, reliability, and cost-effectiveness compared to leading models.

News article

Floworks Unveils ThorV2: A Breakthrough in LLM Function Calling

Floworks, a YC-backed cloud-based enterprise automation startup, has introduced ThorV2, a novel architecture designed to revolutionize how Large Language Models (LLMs) handle API calls. Developed in collaboration with IIT Bombay and IIT Kharagpur, ThorV2 promises to address critical challenges in agentic workflows for market-leading LLMs 12.

Key Features and Innovations

ThorV2 incorporates several innovative features:

  1. Edge of Domain Modeling: This approach provides minimal upfront instructions, allowing the agent to begin tasks and receive additional information through error corrections post-task. This method reduces token usage in prompts, potentially leading to cost savings 12.

  2. Agent Validator Architecture: ThorV2 introduces a static agent, including a Domain Expert Validator (DEV), which inspects LLM outputs for errors. This approach overcomes limitations of traditional agentic workflows that rely on multiple LLMs for feedback 12.

  3. Multiple API Calls in a Single Step: ThorV2 can generate multiple API calls simultaneously, using placeholders for unknown values and injecting them once retrieved. This capability significantly improves upon the sequential API call handling in current LLMs 12.

Performance and Benchmarks

Floworks claims that ThorV2 outperforms leading models like OpenAI's GPT-4o, GPT-4 Turbo, and Claude 3 Opus in several key areas:

  • Accuracy: 36% more accurate than GPT-4o
  • Cost-effectiveness: 4x cheaper than competing models
  • Speed: 30% faster in terms of latency
  • Reliability: Achieved a 100% score in consistency tests 12

These claims were supported by benchmarks conducted on a dataset called HubBench, focusing on operations within HubSpot's CRM. ThorV2, connected to the Llama 3 70B model, demonstrated superior performance across accuracy, reliability, speed, and cost metrics 12.

Implications for the AI Industry

The introduction of ThorV2 could have significant implications for the AI industry:

  1. Cost Reduction: At $1 per thousand queries, ThorV2 is reportedly three times cheaper than OpenAI's models, potentially disrupting the pricing structure of AI services 12.

  2. Improved Efficiency: The ability to handle multiple API calls in a single step could streamline complex AI-driven processes in various industries.

  3. Enhanced Reliability: With claims of 100% reliability for API call tasks, ThorV2 could set a new standard for dependability in AI applications 12.

Future Developments and Limitations

While ThorV2 shows promise, there are considerations for its future:

  1. Adaptability: As an architecture rather than a standalone model, ThorV2 is designed to enhance existing LLMs, potentially improving as underlying models advance 12.

  2. Ongoing Development: Floworks has announced plans for Thor v3, indicating continued innovation in this space 12.

  3. Current Limitations: ThorV2 relies on established error patterns and has been tested primarily on single and two API call functions, leaving room for expansion in handling more complex scenarios 12.

As the AI landscape continues to evolve rapidly, innovations like ThorV2 underscore the importance of architectural improvements alongside model advancements in pushing the boundaries of AI capabilities.

Explore today's top stories

Baidu's Open-Source Ernie AI: A Game-Changer in the Global AI Race

Baidu, China's tech giant, is set to open-source its Ernie AI model, potentially disrupting the global AI market and intensifying competition with Western rivals like OpenAI and Anthropic.

CNBC logoSiliconANGLE logoDataconomy logo

4 Sources

Technology

13 hrs ago

Baidu's Open-Source Ernie AI: A Game-Changer in the Global

Microsoft's AI Diagnostic Tool Outperforms Human Doctors in Accuracy and Cost-Efficiency

Microsoft unveils a powerful AI-powered medical diagnostic tool that claims to be four times more accurate than human doctors, potentially transforming healthcare with improved diagnosis and reduced costs.

Wired logoFinancial Times News logo

2 Sources

Technology

5 hrs ago

Microsoft's AI Diagnostic Tool Outperforms Human Doctors in

Apple's Ambitious Roadmap: Seven Head-Mounted Devices in Development, Including Smart Glasses for 2027

Apple is reportedly developing seven different head-mounted devices, including smart glasses and VR headsets, with the first smart glasses expected to launch in 2027. This move signals Apple's view of head-mounted devices as the next major trend in consumer electronics.

Tom's Guide logoMashable logoLaptopMag logo

6 Sources

Technology

13 hrs ago

Apple's Ambitious Roadmap: Seven Head-Mounted Devices in

AI Recruiters: The New Gatekeepers of Job Applications

AI-powered virtual recruiters are increasingly conducting initial job interviews, transforming the hiring process and raising questions about the future of recruitment.

Washington Post logoEconomic Times logo

2 Sources

Technology

5 hrs ago

AI Recruiters: The New Gatekeepers of Job Applications

Microsoft Ties Employee Performance Reviews to AI Tool Usage, Sparking Debate

Microsoft is reportedly pressuring employees to use AI tools by incorporating their usage into performance evaluations, signaling a shift from optional to mandatory AI adoption in the workplace.

pcgamer logoEconomic Times logoBenzinga logo

3 Sources

Business and Economy

21 hrs ago

Microsoft Ties Employee Performance Reviews to AI Tool
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Β© 2025 Triveous Technologies Private Limited
Twitter logo
Instagram logo
LinkedIn logo