Telnyx - Global Communications Platform ProviderHome
Voice AIVoice APIInferenceMobile VoiceSpeech-to-TextText-to-speechSIP TrunkingSMS APIWhatsApp Business APIView all productsHealthcareFinanceTravel and HospitalityLogistics and TransportationContact CenterInsuranceRetail and E-CommerceSales and MarketingServices and DiningView all solutionsVoice AIVoice APIInferenceMobile VoiceSpeech-to-TextText-to-SpeechSIP TrunkingSMS APIWhatsApp CallingGlobal NumbersIoT SIM CardView all pricingOur NetworkMission Control PortalCustomer storiesGlobal coveragePartnersCareersEventsResource centerSupport centerAI TemplatesSETIDev DocsIntegrations
Contact usLog in
Contact usLog inSign up

Social

Company

  • Our Network
  • Global Coverage
  • Release Notes
  • Careers
  • Voice AI
  • AI Glossary
  • Shop

Legal

  • Data and Privacy
  • Report Abuse
  • Privacy Policy
  • Cookie Policy
  • Law Enforcement
  • Acceptable Use
  • Trust Center
  • Country Specific Requirements
  • Website Terms and Conditions
  • Terms and Conditions of Service

Compare

  • ElevenLabs
  • Vapi
  • Baseten
  • Together.ai
  • Twilio
  • Bandwidth
  • Vonage
  • Amazon Connect
© Telnyx LLC 2026
ISO • PCI • HIPAA • GDPR • SOC2 Type II

Ask AI

  • GPT
  • Claude
  • Perplexity
  • Gemini
  • Grok

GPT-4 0125 Preview

The January 2024 GPT-4 Turbo preview with a 128k context window, improved instruction-following, and reduced laziness in code generation tasks.

Start buildingGET Available Models

about

OpenAI released this January 2024 GPT-4 Turbo snapshot specifically to address the "laziness" problem where the prior 1106-preview would truncate code outputs or respond with "rest remains the same." It also fixed a UTF-8 encoding bug in non-English function calls and improved format-following compliance, serving as the last preview before the April 2024 general availability release.

Licenseopenai
Context window(in thousands)128000

Use cases for GPT-4 0125 Preview

  1. Reliable code generation: OpenAI specifically addressed the "laziness" problem from the 1106 preview, making this snapshot produce complete implementations instead of truncating with "rest remains the same."
  2. 128K-context codebase analysis: The full 128K window processes large repositories, documentation sets, or log files for cross-file analysis, dependency mapping, and root-cause investigation.
  3. Format-precise structured output: Improved accuracy on requested output formats (JSON, YAML, XML) makes it suited for data extraction pipelines that require valid structured output without retry loops.

Quality

Arena Elo1245
MMLUN/A
MT Bench9.15

GPT-4 0125 preview shares the ~86.5% MMLU baseline of the GPT-4 Turbo family on the sheet, with its primary improvement being reduced code generation laziness compared to the 1106 preview. Its Arena ELO of 1,245 is close to the 1106 preview (1,251), confirming that the update was a reliability fix rather than a capability upgrade. It was the final preview before the GA GPT-4 Turbo release in April 2024.

Llama-4-Scout-Instruct

1250

Llama 3.1 70B Instruct

1248

GPT-4 0125 Preview

1245

Llama 3 Instruct 70B

1206

GPT-4 0314

1186

pricing

The cost per 1,000 tokens for running the model with Telnyx Inference is $0.0010. To put this into perspective, if an organization were to analyze 1,000,000 customer chats, assuming each chat contains 100 tokens, the total cost would be $100.

What's Twitter saying?

  • Developers report GPT-4 Turbo 0125-preview is lazier at coding than prior GPT-4 versions, underperforming on benchmarks like Aider's lazy coding suite despite OpenAI's intent to fix laziness.
  • Users note slower response speeds compared to GPT-4-0613 or GPT-3.5 Turbo, with token rates dropping to ~9 tps in tests versus higher on older models.
  • Community observes weaker logic reasoning than GPT-4-0314, requiring prompt rewrites for consistent results, though it's more stable and cheaper.

Explore Our LLM Library

Discover the power and diversity of large language models available with Telnyx. Explore the options below to find the perfect model for your project.

No data available at this time, please try again later.
OrganizationModel NameTasksLanguages SupportedContext LengthParametersModel TierLicense
No data available at this time, please try again later.
TRY IT OUT

Chat with an LLM

Powered by our own GPU infrastructure, select a large language model, add a prompt, and chat away. For unlimited chats, sign up for a free account on our Mission Control Portal here.

Loading...
HOW IT WORKS

Selecting LLMs for Voice AI

GET Available Models
RESOURCES

Get started

Check out our helpful tools to help get you started.

  • Icon Resources ebook

    Test in the portal

    Easily browse and select your preferred model in the AI Playground.

    Test today
  • Icon Resources Docs

    Explore the docs

    Don’t wait to scale, start today with our public API endpoints.

    Get started
  • Icon Resources Article

    Stay up to date

    Keep an eye on our AI changelog so you don't miss a beat.

    See updates

Sign up and start building

Sign upContact sales

faqs

What is GPT-4 0125 Preview?

GPT-4 0125 Preview is the January 2024 snapshot of GPT-4 Turbo, fixing format-following issues from the 1106 release and improving code generation. It is available through Telnyx and OpenAI's API.

Is GPT-4 Vision Preview deprecated?

GPT-4 Vision has been folded into GPT-4o and GPT-4.1, which support vision natively. The standalone vision preview is being deprecated in favor of multimodal successors.

Why is GPT-4 going away?

OpenAI is consolidating older snapshots as GPT-4o and GPT-4.1 supersede them. The 0125 preview remains available but migration to newer models is recommended.

How does 0125 compare to 1106 Preview?

The 0125 snapshot fixed lazy formatting issues in the 1106 release and improved code generation accuracy. Both share the 128K context window and JSON mode, but 0125 is more reliable for structured output.

How much does GPT-4 0125 cost?

GPT-4 0125 Preview is priced at $10 per million input tokens and $30 per million output tokens. Newer models like GPT-4o offer better performance at lower pricing.

Is GPT-4 0125 Preview still worth using?

GPT-4 0125 Preview has been superseded by GPT-4o, GPT-4.1, and GPT-5. Unless you need to pin to this specific version, upgrading will give better results at lower cost.

CHOOSE MODEL
CHAT TO AN AGENT