Telnyx - Global Communications Platform ProviderHome
Voice AIVoice APIInferenceMobile VoiceSpeech-to-TextText-to-speechSIP TrunkingSMS APIWhatsApp Business APIView all productsHealthcareFinanceTravel and HospitalityLogistics and TransportationContact CenterInsuranceRetail and E-CommerceSales and MarketingServices and DiningView all solutionsVoice AIVoice APIInferenceMobile VoiceSpeech-to-TextText-to-SpeechSIP TrunkingSMS APIWhatsApp Business APIGlobal NumbersIoT SIM CardView all pricingOur NetworkMission Control PortalCustomer storiesGlobal coveragePartnersCareersEventsResource centerSupport centerAI TemplatesSETIDev DocsIntegrations
Contact usLog in
Contact usLog inSign up

Social

Company

  • Our Network
  • Global Coverage
  • Release Notes
  • Careers
  • Voice AI
  • AI Glossary
  • Shop

Legal

  • Data and Privacy
  • Report Abuse
  • Privacy Policy
  • Cookie Policy
  • Law Enforcement
  • Acceptable Use
  • Trust Center
  • Country Specific Requirements
  • Website Terms and Conditions
  • Terms and Conditions of Service

Compare

  • ElevenLabs
  • Vapi
  • Baseten
  • Together.ai
  • Twilio
  • Bandwidth
  • Vonage
  • Amazon Connect
© Telnyx LLC 2026
ISO • PCI • HIPAA • GDPR • SOC2 Type II

Ask AI

  • GPT
  • Claude
  • Perplexity
  • Gemini
  • Grok

Hermes 2 Pro Mistral 7B

A 7B model from Nous Research built on Mistral, optimized for function calling, JSON structured output, and general conversational tasks.

Start buildingGET Available Models

about

Nous Research fine-tuned Mistral 7B with a custom dataset built specifically for structured tool use and function calling at small scale. Using the ChatML prompt format and a dedicated <tool_call> token, it handles nested function schemas and complex JSON output at 7B parameters, scoring competitively with models ten times its size on agentic benchmarks.

Licenseapache-2.0

Explore Our LLM Library

Discover the power and diversity of large language models available with Telnyx. Explore the options below to find the perfect model for your project.

No data available at this time, please try again later.
OrganizationModel NameTasksLanguages SupportedContext LengthParametersModel TierLicense
No data available at this time, please try again later.
TRY IT OUT

Chat with an LLM

Powered by our own GPU infrastructure, select a large language model, add a prompt, and chat away. For unlimited chats, sign up for a free account on our Mission Control Portal here.

Loading...
HOW IT WORKS

Selecting LLMs for Voice AI

GET Available Models
RESOURCES

Get started

Check out our helpful tools to help get you started.

  • Icon Resources ebook

    Test in the portal

    Easily browse and select your preferred model in the AI Playground.

Sign up and start building

Sign upContact sales

faqs

Is Mistral 7B a good model?

Hermes 2 Pro Mistral 7B builds on the strong Mistral 7B base with enhanced function calling and structured output capabilities. It is a capable model for tool-using applications at the 7B scale.

What format is Hermes 2 Pro?

Hermes 2 Pro uses ChatML format for conversation templating and supports structured JSON outputs for function calling. It is available in standard and GGUF quantized formats on Hugging Face.

Context window(in thousands)32768

Use cases for Hermes 2 Pro Mistral 7B

  1. Structured function calling at 7B scale: Scoring competitively with models 10x its size on tool-use benchmarks, it handles nested and complex function schemas via a dedicated tool_call token format.
  2. Guaranteed JSON output: Built-in JSON mode support produces valid structured data for API integrations, database inserts, and data pipelines without parsing failures.
  3. Lightweight agentic deployment: At 7B parameters with 32K context, it runs tool-augmented agent loops on single-GPU infrastructure where larger function-calling models would be too slow or expensive.

Quality

Arena Elo1074
MMLUN/A
MT BenchN/A

Hermes 2 Pro Mistral 7B scores 62.2% on MMLU, comparable to Nous Hermes 2 Mistral 7B DPO (63.4%) on the same sheet. Its differentiation is in function calling, where it scores competitively with models 10x its size through a dedicated tool_call token format and built-in JSON mode. For general knowledge it trails Gemma 7B IT (64.3%) by about 2 points.

Llama 2 Chat 70B

1093

Nous Hermes 2 Mixtral 8x7B

1084

Hermes 2 Pro Mistral 7B

1074

Mistral 7B Instruct v0.2

1072

GPT-3.5 Turbo-1106

1068

pricing

The cost of running Hermes 2 Pro Mistral 7B with Telnyx Inference is $0.0002 per 1,000 tokens. Processing 1,000,000 function-calling tasks at 1,000 tokens each would cost $200, the same as other 7B-class models but with structured JSON output and tool-use capabilities that typically require larger models.

What's Twitter saying?

  • Developers praise Hermes 2 Pro Mistral 7B for excelling in function calling and JSON structured outputs, scoring competitively with much larger models on agentic benchmarks.
  • It ranks highly on the Chatbot Arena Leaderboard with an Elo of 1074, outperforming other 7B models like Gemma 2B in conversational tasks.
  • Tech enthusiasts on Hacker News express surprise and enthusiasm, recommending it alongside models like Starling for impressive chat capabilities under RAM constraints.
Test today
  • Icon Resources Docs

    Explore the docs

    Don’t wait to scale, start today with our public API endpoints.

    Get started
  • Icon Resources Article

    Stay up to date

    Keep an eye on our AI changelog so you don't miss a beat.

    See updates
  • What are the limitations of Mistral 7B?

    The main limitations are the 7B parameter count constraining complex reasoning, and a 32K context window. For tasks requiring deeper analysis, larger models in the Mixtral or Llama families are better suited.

    What makes Hermes 2 Pro special?

    Hermes 2 Pro adds structured function calling and JSON output mode to the base Mistral 7B, making it one of the first small models with reliable tool-use capability. It is available through Telnyx and other providers.

    Is Hermes 2 Pro free?

    Yes, Hermes 2 Pro Mistral 7B is released under the Apache 2.0 license for free commercial use. Weights are on Hugging Face.

    Loading...