Batching Executions

When you need to process many Workflow executions at once, the async execution endpoint is ideal. Async executions queue automatically once you exceed your concurrency limit, so you can initiate a large batch quickly without waiting for each execution to complete.

Why Use Async Execution for Batch Jobs

Async execution is perfect for batch processing scenarios because:

  • Automatic queuing: Executions automatically queue when you exceed your concurrency limit
  • Non-blocking: You can initiate many executions quickly without waiting for completion
  • Efficient resource usage: Executions process as capacity becomes available
  • Scalable: Handle large batches without overwhelming your system

Basic Batch Job Pattern

The simplest pattern is to initiate all executions at once. They’ll queue automatically if needed:

Python SDK - Basic Batch Job

```python
import vellum

client = vellum.VellumClient(api_key="your-api-key")

# Process a batch of items
items_to_process = [
    {"user_query": "Process item 1"},
    {"user_query": "Process item 2"},
    {"user_query": "Process item 3"},
    # ... many more items
]

# Initiate all executions - they'll queue automatically if needed
execution_ids = []
for i, item in enumerate(items_to_process):
    response = client.execute_workflow_async(
        workflow_deployment_name="your-workflow",
        inputs=[
            vellum.WorkflowRequestStringInput(
                name="user_query",
                value=item["user_query"]
            )
        ],
        external_id=f"batch-item-{i}"  # Track each item
    )
    execution_ids.append(response.execution_id)
    print(f"Initiated execution {i+1}/{len(items_to_process)}: {response.execution_id}")

print(f"\nInitiated {len(execution_ids)} executions. They'll process as capacity becomes available.")
```

Tracking Batch Job Completion

After initiating your batch, you have several options for tracking completion:

Option 1: Webhooks (Recommended)

The most efficient approach is to use webhooks to receive completion notifications. See our Long Running Workflows guide for webhook setup details.

Python SDK - Batch Job with Webhooks
```python
import vellum

client = vellum.VellumClient(api_key="your-api-key")

# Initiate batch executions
items_to_process = [...]  # your items
execution_ids = []

for i, item in enumerate(items_to_process):
    response = client.execute_workflow_async(
        workflow_deployment_name="your-workflow",
        inputs=[...],  # your inputs
        external_id=f"batch-item-{i}"  # Use external_id for webhook correlation
    )
    execution_ids.append(response.execution_id)

# Store execution_ids for tracking
# Webhooks will notify you when each execution completes
print(f"Initiated {len(execution_ids)} executions. Webhooks will notify on completion.")
```

Option 2: Status Polling

Poll the status endpoint to check completion. This is useful when you need to wait for results before proceeding:

Python SDK - Batch Job with Status Polling
```python
import vellum
import time
from typing import Dict

client = vellum.VellumClient(api_key="your-api-key")

# Initiate batch executions
items_to_process = [...]  # your items
execution_ids = []

for i, item in enumerate(items_to_process):
    response = client.execute_workflow_async(
        workflow_deployment_name="your-workflow",
        inputs=[...],  # your inputs
        external_id=f"batch-item-{i}"
    )
    execution_ids.append(response.execution_id)

# Poll for completion
results: Dict[str, dict] = {}
pending = set(execution_ids)

while pending:
    for execution_id in list(pending):
        try:
            status_response = client.check_workflow_execution_status(
                execution_id=execution_id
            )

            if status_response.status == "FULFILLED":
                results[execution_id] = {
                    "status": "completed",
                    "outputs": status_response.outputs
                }
                pending.remove(execution_id)
                print(f"Completed: {execution_id}")
            elif status_response.status == "REJECTED":
                results[execution_id] = {
                    "status": "failed"
                }
                pending.remove(execution_id)
                print(f"Failed: {execution_id}")
        except Exception as e:
            print(f"Error checking {execution_id}: {e}")

    if pending:
        print(f"Still processing {len(pending)} executions...")
        time.sleep(30)  # Poll every 30 seconds

print(f"\nBatch complete! Processed {len(results)} executions.")
```

Option 3: Hybrid Approach

Initiate executions and periodically check status, but rely on webhooks for final notification:

Python SDK - Hybrid Approach
```python
import vellum
import time

client = vellum.VellumClient(api_key="your-api-key")

# Initiate batch executions
items_to_process = [...]  # your items
execution_ids = []

for i, item in enumerate(items_to_process):
    response = client.execute_workflow_async(
        workflow_deployment_name="your-workflow",
        inputs=[...],  # your inputs
        external_id=f"batch-item-{i}"
    )
    execution_ids.append(response.execution_id)

# Optional: Quick status check after a delay
time.sleep(60)  # Wait 1 minute

# Check how many have completed so far
completed = 0
for execution_id in execution_ids:
    try:
        status = client.check_workflow_execution_status(execution_id=execution_id)
        if status.status in ["FULFILLED", "REJECTED"]:
            completed += 1
    except Exception:
        pass  # Treat transient status-check errors as still pending

print(f"Progress: {completed}/{len(execution_ids)} completed")
print("Webhooks will notify when remaining executions complete.")
```

Best Practices

Always include an external_id when initiating batch executions. This allows you to correlate webhook events with your internal records, making it easy to track which item in your batch corresponds to each execution.
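One way to wire up that correlation is to build a lookup table keyed by `external_id` as you initiate executions. The sketch below uses a hypothetical webhook payload shape (`external_id` and `status` fields) purely for illustration; consult the Long Running Workflows guide for the actual event schema.

```python
# Build a correlation map while initiating the batch, then use it to
# resolve webhook events back to the originating item.
items_to_process = [
    {"user_query": "Process item 1"},
    {"user_query": "Process item 2"},
]

records_by_external_id = {
    f"batch-item-{i}": {"item": item, "status": "pending"}
    for i, item in enumerate(items_to_process)
}

def handle_webhook(payload: dict) -> dict:
    """Look up the originating batch item from a completion event."""
    record = records_by_external_id[payload["external_id"]]
    record["status"] = "completed" if payload["status"] == "FULFILLED" else "failed"
    return record

# Hypothetical completion event for the second batch item
event = {"external_id": "batch-item-1", "status": "FULFILLED"}
record = handle_webhook(event)
print(record["item"]["user_query"], record["status"])
```

Because the map is keyed by your own identifier rather than the server-generated `execution_id`, it works even before the initiation response arrives.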

Be aware of your organization’s concurrency limits. While async executions queue automatically, understanding your limits helps you plan batch sizes and processing times.
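A back-of-envelope way to plan around a concurrency limit: executions drain in "waves" of at most the limit at a time. All three numbers below are placeholders; substitute your own limit and measured runtimes.

```python
import math

batch_size = 500            # executions in the batch (placeholder)
concurrency_limit = 20      # your organization's concurrent-execution limit (placeholder)
avg_execution_seconds = 45  # average runtime of one execution (placeholder)

# At most `concurrency_limit` executions run at once, so the batch drains in waves
waves = math.ceil(batch_size / concurrency_limit)
estimated_seconds = waves * avg_execution_seconds

print(f"~{waves} waves, roughly {estimated_seconds / 60:.0f} minutes total")
```

This is only a rough upper-bound estimate; in practice new executions start as soon as slots free up, so real batches finish somewhat faster.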

Some executions in a batch may fail. Use webhooks or status polling to identify failures and implement retry logic or error handling as needed.
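A minimal retry sketch, using a stand-in `run_item` function so the example is self-contained: in a real batch you would call `execute_workflow_async` again for the failed item and track the new `execution_id`.

```python
import time

def run_item(item: dict, attempt: int) -> str:
    # Stand-in for re-running one batch item; here the first attempt
    # "fails" and the retry "succeeds", purely for illustration.
    return "REJECTED" if attempt == 0 else "FULFILLED"

def process_with_retries(items: list, max_retries: int = 2) -> dict:
    outcomes = {}
    for i, item in enumerate(items):
        for attempt in range(max_retries + 1):
            if run_item(item, attempt) == "FULFILLED":
                outcomes[i] = "completed"
                break
            time.sleep(0)  # replace with a real backoff, e.g. 2 ** attempt seconds
        else:
            outcomes[i] = "failed"  # exhausted all retries
    return outcomes

outcomes = process_with_retries([{"user_query": "Process item 1"}])
print(outcomes)
```

Capping retries and recording terminal failures keeps one bad item from stalling the rest of the batch.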

For large batches (hundreds or thousands of executions), webhooks are more efficient than polling. They reduce API calls and provide real-time notifications.
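To see why, compare the API traffic each approach generates. The numbers below are placeholders for illustration:

```python
import math

batch_size = 1000            # executions being tracked (placeholder)
poll_interval_seconds = 30   # polling cadence from the example above
batch_duration_minutes = 20  # how long the batch takes to drain (placeholder)

polls = math.ceil(batch_duration_minutes * 60 / poll_interval_seconds)
polling_calls = batch_size * polls   # worst case: every execution checked each cycle
webhook_deliveries = batch_size      # one completion notification per execution

print(f"Polling: up to {polling_calls} status calls; webhooks: {webhook_deliveries} deliveries")
```

Even with pending-set pruning as in the polling example, call volume grows with both batch size and duration, while webhook traffic stays proportional to batch size alone.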

There’s no hard limit on batch size, but consider:

  • Your organization’s concurrency limits
  • Processing time per execution
  • Webhook endpoint capacity
  • Error handling complexity
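One way to manage those trade-offs for very large batches is to submit in chunks rather than all at once. The sketch below uses an arbitrary `chunk_size` of 100 for illustration; tune it against your concurrency limits.

```python
from typing import Iterator, List

def chunked(items: List[dict], chunk_size: int) -> Iterator[List[dict]]:
    """Yield successive slices of at most chunk_size items."""
    for start in range(0, len(items), chunk_size):
        yield items[start:start + chunk_size]

items_to_process = [{"user_query": f"Process item {n}"} for n in range(250)]

for batch_number, chunk in enumerate(chunked(items_to_process, chunk_size=100)):
    # In practice: initiate each item in the chunk with execute_workflow_async,
    # then wait for webhook/polling confirmation before starting the next chunk.
    print(f"Chunk {batch_number}: {len(chunk)} items")
```

Chunking bounds how many executions are queued at a time, keeps webhook bursts manageable, and gives you natural checkpoints for error handling between chunks.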