Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Search overlay panel for performing site-wide searches

Boost Performance & Scale with Postgres Advanced. Join Pilot Now!

  • Dashboard icon Dashboard
  • Databases icon Databases
  • Dataclips icon Dataclips
  • Elements icon Elements
  • Documentation icon Documentation
  • Support icon Support
Dashboard
Login Sign Up
Heroku - from Salesforce
  • Products
        • Products

          Discover Heroku’s AI PaaS (Platform as a Service), designed for effortless app deployment and scaling. Explore our cloud application platform features, reliable managed data services, and a robust ecosystem to power your modern applications.

        • Heroku PlatformDeploy, manage, and scale apps on Heroku, an AI PaaS based on a managed container system.
        • Heroku AIBuild intelligent applications with managed inference and agents, MCP, and more.
        • Heroku Data ServicesSimplify your data management with Heroku’s fully managed cloud databases and data services.
        • Heroku EnterpriseThe Heroku experience developers love, with the enterprise features large companies need.
        • Heroku SuccessExplore Heroku’s support options from Standard to Signature, with resources for developers and partners.
        • Heroku Elements MarketplaceAccelerate your app development with our ecosystem of add-ons, integrations, and buildpacks.
        • Illustration of a laptop with a webcam on top, displaying a virtual meeting interface with user icons and stylized cloud elements around the screen.
        • Salesforce OrgFarm: Boosting Developer Productivity with Heroku and AI

          Explore how Salesforce scaled developer productivity for 15,000 engineers using Heroku and advanced AI solutions.

  • Developers
        • Developers

          With Heroku's flexible app platform, you can focus on building apps, not infrastructure. Benefit from a meticulously designed developer experience, a polyglot platform supporting your favorite languages, and innovative AI PaaS features to enhance your development workflow.

        • Dev CenterDive into our comprehensive documentation and guides. Learn about building, deploying, managing, and scaling your apps.
        • Languages
          • Python
          • .NET
          • Java
          • Node.js
          • Go
          • PHP
          • Ruby
          • Scala
          • Clojure
        • Illustration of a laptop with a webcam on top, displaying a virtual meeting interface with user icons and stylized cloud elements around the screen.
        • Salesforce OrgFarm: Boosting Developer Productivity with Heroku and AI

          Explore how Salesforce scaled developer productivity for 15,000 engineers using Heroku and advanced AI solutions.

  • Customers
        • Customers

          Discover how companies in diverse industries like Healthcare, Entertainment, Automotive, Retail, FinTech, and more achieve growth and foster innovation with our powerful cloud application platform. Learn how they leverage Heroku’s AI PaaS for cutting-edge solutions.

        • Customer StoriesHeroku in action: See how organizations of all sizes and industries are succeeding with Heroku.
        • Community StoriesExplore real-world experiences and perspectives on Heroku from developers and communities online.
        • Fastcall

          How Fastcall Delivers Enterprise-Scale Voice and Messaging with Heroku

          • Learn how Fastcall achieved 5-10X performance improvements with Heroku, enabling their Salesforce-native phone & SMS app to scale for enterprise accounts.
  • Pricing
  • Resources
        • Resources

          Learn more about Heroku's AI PaaS and stay up to date on all Heroku announcements. Gain insights from our team through insightful content and discover how to leverage our platform effectively.

        • BlogStay up-to-date on Heroku news, important product releases, and valuable insights from the Heroku team.
        • What is Heroku?Learn how Heroku’s AI PaaS simplifies app development, deployment, and scaling.
        • EventsFind Heroku at an event near you! Explore our calendar of tech conferences and meetups.
        • PartnersFor Consultants, ISVs, and technical solution providers looking to grow their business with pro-code solutions.
        • Compliance CenterHeroku is a platform you can trust. Explore our compliance certifications and security measures.
        • Help CenterFind answers to your questions in the Help Center. Browse FAQs, articles, and get support.
        • Latest News from the Heroku Blog

          Whats New in Heroku AI: New Models and a Flexible Standard Plan

          • Last Updated: February 19, 2026
          • Anush DSouza, Josh Lewis

          Heroku is introducing significant updates to Managed Inference and Agents. These changes focus on reducing developer friction, expanding model catalogue, and streamlining deployment workflows.

Blog / News / Heroku AI: Accelerating AI Development With New Models, Performance Improvements, and Messages API

Heroku AI: Accelerating AI Development With New Models, Performance Improvements, and Messages API

Anush DSouza
Posted By Anush DSouza
  • Last Updated: December 18, 2025

This month marks significant expansion for Heroku Managed Inference and Agents, directly accelerating our AI PaaS framework. We’re announcing a substantial addition to our model catalog, providing access to leading proprietary AI models such as Claude Opus 4.5, Nova 2, and open-weight models such as Kimi K2 thinking, MiniMax M2, and Qwen3. These resources are fully managed, secure, and accessible via a single CLI command. We have also refreshed aistudio.heroku.com, please navigate to aistudio.heroku.com from your Managed Inference and Agents add-on to access the models you have provisioned.

Whether you are building complex reasoning agents or high-performance consumer applications, here’s what’s new in our platform. All of the open-weight models you access on Heroku are running on secure compute on AWS servers. Neither Heroku nor the model provider has access to your data and it is not used in training.

Expanding Heroku’s AI catalog with new state of the art models

Claude 4.5 models

We now support the full Claude 4.5 family in both US and EU regions, replacing the prior Claude 3 models which are scheduled for depreciation in January of 2026.

  • Claude Opus 4.5: Designed for deep reasoning, complex task orchestration, and long-horizon planning. Recommended for demanding agentic workflows.
  • Claude Sonnet 4.5: Balanced model for enterprise workloads, coding, and analysis.
  • Claude Haiku 4.5: Low-latency model for high-volume tasks and classification.

Open-weight models

We have added several open-weights models to Heroku Managed Inference and Agents.

  • Kimi K2 Thinking: Specialized for chain-of-thought processing, writing, and reasoning tasks.
  • MiniMax M2: optimized for creative generation, roleplay, and coding agents.
  • Qwen3 (235B & Coder 480B): Large models delivering exceptional performance as coding agents.

Nova models

  • Amazon Nova 2 Lite: The Nova 2 family is now available, replacing the previous generation. These models provide updated multimodal capabilities and improved price-performance ratios.

Anthropic’s Messages API (Heroku preview)

Heroku now offers preview support for the Messages API format for all Anthropic models on Heroku. The API format is an alternative to the standard chatCompletions API and aligns with the Claude SDKs, enabling direct integration with Claude Code and the Claude Agents SDK.

Technical implementation and authentication

Authentication detail for the v1/messages endpoint, the authentication structure mirrors Anthropic’s standard practice. Set the value of your Heroku add-on’s INFERENCE_KEY as the value for the x-api-key HTTP header in your request.

Quickstart with Anthropic Python SDK

import os
from anthropic import Anthropic
 
inference_url = os.getenv("INFERENCE_URL")
inference_key = os.getenv("INFERENCE_KEY")
inference_model = os.getenv("INFERENCE_MODEL")


client = Anthropic(
    api_key=inference_key,
    base_url=inference_url
)


message = client.messages.create(
    model=inference_model,
    max_tokens=1024,
    messages=[
        {"role": "user", "content": "Hello, what should I build today?"}
    ]
)

Key Constraints for Developers

  • Beta Features: We do not currently support the anthropic-beta header.
  • Claude Code: To ensure compatibility, set CLAUDE_CODE_DISABLE_EXPERIMENTAL_BETAS=1.
  • Scope: The Messages API is exclusively available for Anthropic models.

Performance boost: automatic prompt caching

Heroku now caches system prompts and tool definitions to reduce latency on repeated requests. Prompt caching is enabled by default with no code changes required. Only system prompts and tool definitions are cached; user messages and conversation history are excluded and automatically expire to ensure privacy and security. You can disable caching for any request by adding a single HTTP header: X-Heroku-Prompt-Caching: false.

Lifecycle updates

Deprecations

  • Claude 3 Family: The Claude 3 models (Sonnet 3.5, Sonnet 3.7, Haiku 3, and Haiku 3.5) will be deprecated as of Jan 30th, 2026. Workloads should migrate to the Claude 4.5 family.
  • Nova 1st Gen: will be deprecated as of Feb 28th, 2026 in favor of Nova 2.
  • Model Fallback: We are working on a default model fallback mechanism where if your model is deprecated, you’ll automatically switch over to a similar more recent model in the same family of models.

Heroku AI PaaS: Accelerating AI Development

This release brings state-of-the-art reasoning and efficient open-weight models to the Heroku platform. With the addition of prompt caching you can now optimize latency with minimal configuration. We recommend validating your applications with the Claude 4.5 and Nova 2 families ahead of the upcoming deprecation cycle. We would love to hear your feedback and feature requests, please reach out to heroku-ai-feedback@salesforce.com.

  • Originally Published: December 18, 2025
  • AIHeroku AIManaged Inference and Agents
Post
Share
Share

Related Posts

Whats New in Heroku AI: New Models and a Flexible Standard Plan
Code Execution Sandbox for Agents on Heroku
Building AI-Powered Observability with Heroku Managed Inference and Agents
Heroku and GitHub Enterprise Server: Stronger Security, Seamless Delivery
An Update on Heroku

Ready to Get Started?

Stay focused on building great data-driven applications and let Heroku tackle the rest.

Talk to A Heroku Rep   Sign Up Now

More from the Author
Anush DSouza
Senior Product Manager at Heroku
Heroku Staff

  • Whats New in Heroku AI: New Models and a Flexible Standard Plan
  • Code Execution Sandbox for Agents on Heroku
  • Building AI Search on Heroku

Browse the archives for News or all blogs. Subscribe to the RSS feed for News or all blogs.

  • Products
    • Heroku Platform
    • Heroku AI
    • Heroku Managed Inference and Agents
    • pgvector for Heroku Postgres
    • MCP on Heroku
    • Heroku Data Services
    • Heroku Postgres
    • Heroku Key-Value Store
    • Apache Kafka on Heroku
    • Heroku Enterprise
    • Heroku Private Spaces
    • Heroku Connect
    • Heroku Shield
    • Heroku Success
    • Heroku Teams
    • Elements Marketplace
  • Languages
    • Python
    • .NET
    • Java
    • Node.js
    • Go
    • PHP
    • Ruby
    • Scala
    • Clojure
Resources
  • Dev Center
  • Training & Education
  • Get Started
  • Pricing
  • Blog
  • Customers
  • Partners
  • Compliance Center
  • Solutions
  • Podcasts
  • About Us
    • What is Heroku?
    • Heroku & Salesforce
    • Careers
  • Help Center
  • Status
  • Contact
Bluesky X-twitter-square Linkedin Github Rss
Heroku - from Salesforce
  • Legal
  • Terms of Service
  • Privacy Information
  • Responsible Disclosure
  • Trust
  • Contact
  • Cookie Preferences
  • Your Privacy Choices
  • Sitemap
  • Legal
  • Terms of Service
  • Privacy Information
  • Responsible Disclosure
  • Trust
  • Contact
  • Cookie Preferences
  • Your Privacy Choices
  • Sitemap

© Copyright 2026 Salesforce, Inc. All rights reserved. Various trademarks held by their respective owners. Salesforce Tower, 415 Mission Street, 3rd Floor, San Francisco, CA 94105, United States