GPT-3.5 Turbo-1106

The November 2023 GPT-3.5 Turbo snapshot introducing JSON mode, parallel function calling, and a 16k context window for structured output tasks.

about

Released at OpenAI DevDay in November 2023, this snapshot merged the standard and 16K context variants into a single model defaulting to 16,384 tokens and introduced JSON mode, parallel function calling, and reproducible outputs via a seed parameter. Input pricing dropped 50% compared to the 0613 snapshot, and the training data cutoff moved forward to April 2023.

Licenseopenai
Context window(in thousands)4096

Use cases for GPT-3.5 Turbo-1106

  1. Content generation: GPT-3.5 Turbo-1106 efficiently produces diverse content types such as articles, blog posts, and creative writing.
  2. Chatbots: Leveraging its language comprehension, GPT-3.5 Turbo-1106 builds sophisticated chatbots capable of handling complex user queries.
  3. Translation: With its impressive MT Bench score, the model performs well in language translation across different languages.

Quality

Arena Elo1068
MMLUN/A
MT Bench8.32

GPT-3.5 Turbo-1106 demonstrates high-caliber performance across key metrics, providing effective language responses, solid translation benchmarks, and strong knowledge-based task understanding.

Hermes 2 Pro Mistral 7B

1074

Mistral 7B Instruct v0.2

1072

GPT-3.5 Turbo-1106

1068

Llama 2 Chat 13B

1063

Dolphin 2.5 Mixtral 8X7B

1063

pricing

The cost per 1,000 tokens for running the model with Telnyx Inference is $0.0010. For instance, analyzing 1,000,000 customer chats, assuming each chat is 1,000 tokens long, would cost $1,000.

What's Twitter saying?

  • Performance:William Tweet highlights discussions on GPT-3.5-turbo-0613 model performance compared to its predecessors, sparking interest in the model's metrics. (Source: @wgussml)
  • Steerability improvements: Simon Willison questions if steerability improvements in GPT-3.5 Turbo-0613 also apply to GPT-3.5 Turbo-16k, prompting a debate on update reliability. (Source: @simonw)
  • Function calling in GPT models: Jayjen highlights function calling capabilities in OpenAI's GPT models, enhancing versatility in GPT-3.5 Turbo-0613 and GPT-4-0613. (Source: @jayjen_x)

Explore Our LLM Library

Discover the power and diversity of large language models available with Telnyx. Explore the options below to find the perfect model for your project.

Organizationdeepseek-ai
Model NameDeepSeek-R1-Distill-Qwen-14B
Taskstext generation
Languages SupportedEnglish
Context Length43,000
Parameters14.8B
Model Tiermedium
Licensedeepseek

TRY IT OUT

Chat with an LLM

Powered by our own GPU infrastructure, select a large language model, add a prompt, and chat away. For unlimited chats, sign up for a free account on our Mission Control Portal here.

HOW IT WORKS

Selecting LLMs for Voice AI

RESOURCES

Get started

Check out our helpful tools to help get you started.

  • Icon Resources ebook

    Test in the portal

    Easily browse and select your preferred model in the AI Playground.

  • Icon Resources Docs

    Explore the docs

    Don’t wait to scale, start today with our public API endpoints.

  • Icon Resources Article

    Stay up to date

    Keep an eye on our AI changelog so you don't miss a beat.

Sign up and start building

faqs

Is GPT-3.5 Turbo a good model?

GPT-3.5 Turbo 1106 introduced JSON mode and parallel function calling, making it a notable upgrade for developers building structured applications. It offered solid performance for its time on chat, summarization, and code tasks at a low price point.

Does GPT-3.5 Turbo 1106 still exist?

GPT-3.5 Turbo 1106 remains accessible through the API but has been superseded by the 0125 snapshot and newer models. OpenAI recommends GPT-4o mini for new projects requiring similar capabilities at better performance.

What is the difference between GPT-3 and GPT-3.5 Turbo?

The 1106 variant of GPT-3.5 Turbo was a major step forward from GPT-3, adding chat optimization, a 16K context window, JSON mode, and parallel function calling. It was designed for the Chat Completions API rather than the older completions format.

What is GPT-3.5 Turbo 1106?

GPT-3.5 Turbo 1106 is the November 2023 snapshot of OpenAI's GPT-3.5 Turbo model. It was the first GPT-3.5 variant to support JSON mode and parallel function calling, with improved instruction-following for structured output tasks.