API Versioning
Vellum uses date-based API versioning via request headers to give you control over when to enable new API features that may introduce breaking changes. This allows you to upgrade to new features on your own timeline while maintaining backward compatibility.
Why API Versioning Exists
API versioning in Vellum serves several important purposes:
- Backward Compatibility: Ensures your existing integrations continue to work when new features are released
- Controlled Upgrades: Allows you to test and adopt new features when you’re ready, rather than being forced to handle breaking changes immediately
- Feature Gating: Enables access to new capabilities like Reasoning Outputs and enhanced response formats
- Stable Production: Keeps your production systems running smoothly while new features are being developed
How to Use API Versioning
Setting the API Version
Specify which version of Vellum’s API you want to use by including the X-API-VERSION
header in your requests:
Using API Versioning in Sandboxes
API versioning is available in both Prompt Sandboxes and Workflows Sandboxes, allowing you to specify which version of the Vellum API you want to use when testing and developing your prompts and workflows. This gives you the same level of control over new features and breaking changes in your development environment.
To set the API version in a sandbox:
For Prompt Sandboxes:
- Open your prompt in the Prompt Sandbox
- Click on the Prompt Editor Settings/Prompt Comparison Settings
- Select your desired API version from the API Version Selector
For Workflows Sandboxes:
- Open your workflow in the Workflows Sandbox
- Click on the Workflow Builder Settings
- Select your desired API version from the API Version Selector
This enhancement ensures consistency between your development and production environments, allowing you to test new API features before deploying them to production. The API version selector works the same way across both sandbox environments - simply choose your desired API version to access the corresponding feature set.
This provides a unified experience whether you’re working with Prompt Deployments, Workflow Deployments, or developing in either sandbox environment.
Default Behavior
The default API version behavior differs depending on how you make requests:
- Raw API requests: If you don’t specify an
X-API-VERSION
header, Vellum will use the default version (2024-10-25
) to ensure backward compatibility with existing integrations. - SDK v1.0.0+: The latest API version (
2025-07-30
) is used by default, giving you access to the newest features automatically. You can override this to use a prior API version if you wish. - SDK versions prior to 1.0.0: Defaults to using API version
2024-10-25
, but can be overridden.
SDK Version Requirements
Different SDK versions support different API versions:
- SDK versions prior to 1.0.0: Only support API version
2024-10-25
- SDK versions 1.0.0 and later: Support both
2024-10-25
and2025-07-30
, with2025-07-30
as the default
Make sure to upgrade your SDK to version 1.0.0 or later to access the latest API features.
Available API Versions
2024-10-25
(Legacy Default)
This is the original API version that maintains full backward compatibility. It’s used by default for raw API requests when no X-API-VERSION
header is provided.
Features:
- Standard prompt execution responses
- Traditional output formats
- Full compatibility with all existing integrations
2025-07-30
(Latest, SDK Default)
The latest API version that includes new features and improvements. This is the default in our SDKs beginning v1.0.0.
New Features:
- Reasoning Outputs: Support for thinking/reasoning blocks from compatible models
- Enhanced Response Format: Differentiated blocks in API responses for reasoning-capable models
- Future Features: This version will continue to receive new capabilities as they’re developed
Breaking Changes Across Versions
Changes in 2025-07-30
Reasoning Outputs Support
The most significant change in 2025-07-30
is the introduction of Reasoning Outputs for models that support thinking/reasoning capabilities.
Non-Streaming Output Before (2024-10-25
):
Non-Streaming Output After (2025-07-30
):
Streaming outputs also support Reasoning Outputs with the API Version set to 2025-07-30
.
Streaming Output before (2024-10-25
):
Streaming Output After (2025-07-30
):
If your application parses API responses and expects a specific structure, you may need to update your code to handle the new THINKING
output type and introduction of additional outputs when upgrading to 2025-07-30
.
Thinking Output in Workflows
With API version 2025-07-30
, thinking output support has been extended to Prompt Nodes and Prompt Deployment Nodes within the Workflows Sandbox. This enhancement brings the same transparency and insight capabilities directly into your workflow development process.
When a Prompt Node or Prompt Deployment Node executes with a model that supports reasoning outputs, you can view the model’s thinking process directly in the node’s results view. This provides valuable debugging and development insights, allowing you to understand how the model arrives at its conclusions within your workflow context.
Important: While the thinking output is visible for your review and debugging purposes, downstream nodes in your workflow will only receive the final output - the thinking process remains isolated to the results view. This ensures your workflow logic remains unaffected while providing you with valuable insight into the model’s reasoning process.
Models Affected
Reasoning Outputs are currently supported by:
- All Anthropic Claude models with reasoning capabilities
- All OpenAI Models with reasoning capabilities invoked via Responses API
Only models that actually support reasoning will include the THINKING
output in responses. Standard models will continue to return responses in the same format as before.
Best Practices
Testing New Versions
- Test in Development: Always test new API versions in your development environment first
- Gradual Rollout: Consider rolling out API version changes gradually across your systems
- Monitor Responses: Watch for any changes in response structure that might affect your application
Version Management
- Pin Versions: Explicitly specify the API version in your requests rather than relying on defaults
- Document Usage: Keep track of which API versions you’re using across different parts of your application
- Plan Upgrades: Review changelog entries for new API versions to understand what changes to expect
Migration Guide
Upgrading to 2025-07-30
- Update SDK: Upgrade to SDK version 1.0.0 or later
- Test Reasoning Models: If you use reasoning-capable models, test that your application handles the new
thinking
field appropriately - Update Headers: Add
X-API-VERSION: 2025-07-30
to your API requests (or rely on SDK v1.0.0+ defaults) - Monitor: Watch for any unexpected behavior after the upgrade
Handling Reasoning Outputs
If you’re upgrading to 2025-07-30
and use reasoning-capable models, you may need to update your response handling:
Future Versions
Vellum will continue to release new API versions as new features are developed. Each new version will be documented with:
- A comprehensive list of new features
- Breaking changes and migration guidance
- SDK compatibility requirements
- Examples of new capabilities
Stay tuned to the changelog for announcements of new API versions and features.