Telnyx - Global Communications Platform ProviderHome
Voice AIVoice APIInferenceMobile VoiceSpeech-to-TextText-to-speechSIP TrunkingSMS APIWhatsApp Business APIView all productsHealthcareFinanceTravel and HospitalityLogistics and TransportationContact CenterInsuranceRetail and E-CommerceSales and MarketingServices and DiningView all solutionsVoice AIVoice APIInferenceMobile VoiceSpeech-to-TextText-to-SpeechSIP TrunkingSMS APIWhatsApp Business APIGlobal NumbersIoT SIM CardView all pricingOur NetworkMission Control PortalCustomer storiesGlobal coveragePartnersCareersEventsResource centerSupport centerAI TemplatesSETIDev DocsIntegrations
Contact usLog in
Contact usLog inSign up

Social

Company

  • Our Network
  • Global Coverage
  • Release Notes
  • Careers
  • Voice AI
  • AI Glossary
  • Shop

Legal

  • Data and Privacy
  • Report Abuse
  • Privacy Policy
  • Cookie Policy
  • Law Enforcement
  • Acceptable Use
  • Trust Center
  • Country Specific Requirements
  • Website Terms and Conditions
  • Terms and Conditions of Service

Compare

  • ElevenLabs
  • Vapi
  • Baseten
  • Together.ai
  • Twilio
  • Bandwidth
  • Vonage
  • Amazon Connect
© Telnyx LLC 2026
ISO • PCI • HIPAA • GDPR • SOC2 Type II

Ask AI

  • GPT
  • Claude
  • Perplexity
  • Gemini
  • Grok

GPT-3.5 Turbo-0613

The June 2023 snapshot of GPT-3.5 Turbo, recognized for strong system prompt adherence and reliable instruction-following in chat and completion tasks.

Start buildingGET Available Models

about

The June 2023 snapshot introduced function calling to GPT-3.5 Turbo, enabling structured JSON output for tool use through a new functions API parameter. It also improved system message steerability over the original 0301 snapshot, making it the first 3.5 variant widely adopted for production API workflows requiring consistent instruction-following.

Licenseopenai
Context window(in thousands)4096

Use cases for GPT-3.5 Turbo-0613

  1. Structured API tool use: As the first GPT-3.5 model with native function calling, the 0613 snapshot converts natural language into structured JSON function calls for reliable API orchestration.
  2. System prompt-driven agents: Improved system message adherence makes it effective for applications where consistent persona, tone, and behavioral constraints must be maintained across conversations.
  3. Legacy workflow compatibility: As a pinned snapshot with deterministic behavior, it serves as a stable inference target for production systems that depend on consistent output formatting.

Quality

Arena Elo1117
MMLUN/A
MT Bench8.39

GPT-3.5 Turbo 0613 shares the 70.0% MMLU (5-shot) and 7.94 MT-Bench baseline of the GPT-3.5 Turbo family. As the first snapshot to introduce function calling, its quality differentiation from later snapshots (1106, 0125) is in structured output reliability rather than raw benchmark performance. Compared to Mixtral 8x7B (70.6% MMLU, 8.30 MT-Bench) on the sheet, it trails by a narrow margin on both measures.

Llama 3 Instruct 8B

1152

Claude-Sonnet-4-20250514

1138

GPT-3.5 Turbo-0613

1117

Mixtral 8x7B Instruct v0.1

1114

GPT-3.5 Turbo-0125

1106

pricing

The cost per 1,000 tokens for running the model with Telnyx Inference is $0.0010. To illustrate, if a marketing ops team were to analyze 1,000,000 customer chats, assuming each chat is 1,000 tokens long, the total cost would be $1,000.

What's Twitter saying?

  • Developers report worse output quality with GPT-3.5-turbo-0613 compared to 0301, as it ignores system prompts and produces unusable results across inputs.
  • Users note poorer performance in non-English languages and overall "more stupid" behavior, leading to calls not to deprecate older versions.
  • Benchmarks show lower labeling accuracy on 6/8 datasets versus prior models, though it's ~40% faster.

Explore Our LLM Library

Discover the power and diversity of large language models available with Telnyx. Explore the options below to find the perfect model for your project.

No data available at this time, please try again later.
OrganizationModel NameTasksLanguages SupportedContext LengthParametersModel TierLicense
No data available at this time, please try again later.
TRY IT OUT

Chat with an LLM

Powered by our own GPU infrastructure, select a large language model, add a prompt, and chat away. For unlimited chats, sign up for a free account on our Mission Control Portal here.

Loading...
HOW IT WORKS

Selecting LLMs for Voice AI

GET Available Models
RESOURCES

Get started

Check out our helpful tools to help get you started.

  • Icon Resources ebook

    Test in the portal

    Easily browse and select your preferred model in the AI Playground.

    Test today
  • Icon Resources Docs

    Explore the docs

    Don’t wait to scale, start today with our public API endpoints.

    Get started
  • Icon Resources Article

    Stay up to date

    Keep an eye on our AI changelog so you don't miss a beat.

    See updates

Sign up and start building

Sign upContact sales

faqs

Does GPT-3.5 Turbo still exist?

GPT-3.5 Turbo 0613 was deprecated by OpenAI in favor of newer snapshots. The 0125 variant is the recommended replacement, and OpenAI suggests migrating to GPT-4o mini for new projects.

Is GPT-3.5 Turbo a good model?

GPT-3.5 Turbo 0613 was the first snapshot to support function calling, making it a foundational model for tool-using applications. While surpassed by newer models, it remains functional for basic chat and classification tasks at low cost.

What is the difference between GPT-3 and GPT-3.5 Turbo?

GPT-3.5 Turbo added chat optimization, function calling, and significantly lower pricing compared to GPT-3's completion-based API. The 0613 snapshot was the first to introduce function calling, enabling structured tool integration.

What is GPT-4 0613?

GPT-4 0613 is a separate model, the June 2023 snapshot of GPT-4 with function calling support. It is not related to GPT-3.5 Turbo 0613 beyond sharing the same release date convention. Both are available through OpenAI's API.

How much does GPT-3.5 Turbo cost?

GPT-3.5 Turbo 0613 is priced at $1.50 per million input tokens and $2.00 per million output tokens. The newer 0125 snapshot offers the same capability at lower pricing.

Loading...