Floworks' ThorV2 Architecture: A Game-Changer for API Calling in LLMs

2 Sources

Share

Floworks, in collaboration with IIT Bombay and IIT Kharagpur, introduces ThorV2, a novel architecture that enhances LLMs' API calling capabilities, offering improved accuracy, reliability, and cost-effectiveness compared to leading models.

News article

Floworks Unveils ThorV2: A Breakthrough in LLM Function Calling

Floworks, a YC-backed cloud-based enterprise automation startup, has introduced ThorV2, a novel architecture designed to revolutionize how Large Language Models (LLMs) handle API calls. Developed in collaboration with IIT Bombay and IIT Kharagpur, ThorV2 promises to address critical challenges in agentic workflows for market-leading LLMs

1

2

.

Key Features and Innovations

ThorV2 incorporates several innovative features:

  1. Edge of Domain Modeling: This approach provides minimal upfront instructions, allowing the agent to begin tasks and receive additional information through error corrections post-task. This method reduces token usage in prompts, potentially leading to cost savings

    1

    2

    .

  2. Agent Validator Architecture: ThorV2 introduces a static agent, including a Domain Expert Validator (DEV), which inspects LLM outputs for errors. This approach overcomes limitations of traditional agentic workflows that rely on multiple LLMs for feedback

    1

    2

    .

  3. Multiple API Calls in a Single Step: ThorV2 can generate multiple API calls simultaneously, using placeholders for unknown values and injecting them once retrieved. This capability significantly improves upon the sequential API call handling in current LLMs

    1

    2

    .

Performance and Benchmarks

Floworks claims that ThorV2 outperforms leading models like OpenAI's GPT-4o, GPT-4 Turbo, and Claude 3 Opus in several key areas:

  • Accuracy: 36% more accurate than GPT-4o
  • Cost-effectiveness: 4x cheaper than competing models
  • Speed: 30% faster in terms of latency
  • Reliability: Achieved a 100% score in consistency tests

    1

    2

These claims were supported by benchmarks conducted on a dataset called HubBench, focusing on operations within HubSpot's CRM. ThorV2, connected to the Llama 3 70B model, demonstrated superior performance across accuracy, reliability, speed, and cost metrics

1

2

.

Implications for the AI Industry

The introduction of ThorV2 could have significant implications for the AI industry:

  1. Cost Reduction: At $1 per thousand queries, ThorV2 is reportedly three times cheaper than OpenAI's models, potentially disrupting the pricing structure of AI services

    1

    2

    .

  2. Improved Efficiency: The ability to handle multiple API calls in a single step could streamline complex AI-driven processes in various industries.

  3. Enhanced Reliability: With claims of 100% reliability for API call tasks, ThorV2 could set a new standard for dependability in AI applications

    1

    2

    .

Future Developments and Limitations

While ThorV2 shows promise, there are considerations for its future:

  1. Adaptability: As an architecture rather than a standalone model, ThorV2 is designed to enhance existing LLMs, potentially improving as underlying models advance

    1

    2

    .

  2. Ongoing Development: Floworks has announced plans for Thor v3, indicating continued innovation in this space

    1

    2

    .

  3. Current Limitations: ThorV2 relies on established error patterns and has been tested primarily on single and two API call functions, leaving room for expansion in handling more complex scenarios

    1

    2

    .

As the AI landscape continues to evolve rapidly, innovations like ThorV2 underscore the importance of architectural improvements alongside model advancements in pushing the boundaries of AI capabilities.

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo