Serverless DBU consumption by SKU

This article explains the SKUs and DBU multipliers used to bill for various Databricks serverless offerings.

For Azure Databricks pricing, see pricing details.

What is a DBU multiplier?

When using certain features, a multiplier is applied to the underlying DBUs consumed. For instance, Lakehouse Monitoring has a 2X multiplier. If the associated background job uses 5 DBUs, you are billed for 10 DBUs after applying the multiplier. The DBUs shown on your bill and in system tables reflect the final amount after this multiplier is applied. See What is a DBU for the definition of a DBU.

Automated Serverless SKU

The following capabilities are billed against the Automated Serverless SKU.

Feature DBU multiplier
Serverless Jobs 1X
Serverless DLT 1X
Serverless DLT with Advanced Pipeline Features 1.5X
Predictive Optimization 1X
Lakehouse Monitoring 2X
Fine-Grained Access Control (Preview) 1X
Online tables synchronization (Preview) 1X
Online tables synchronization with Advanced Pipeline Features (Preview) 1.5X
Online tables Capacity Unit (Preview) 2X
Materialized Views and Streaming Tables in Databricks SQL 1X
Materialized Views and Streaming Tables in Databricks SQL with Advanced Pipeline Features 1.5X

Interactive Serverless SKU

The following capabilities are billed against the Interactive Serverless SKU.

Product / Feature DBU Multiplier
Serverless Notebook 1X
Databricks App capacity hour 0.5X

SQL Serverless SKU

The following capabilities are billed against the SQL Serverless SKU.

Product / Feature DBU Multiplier
Warehouse Size DBU/hour
2X-Small 4
X-Small 6
Small 12
Medium 24
Large 40
X-Large 80
2X-Large 144
3X-Large 272
4X-Large 528

Model Serving SKU

The following capabilities are billed against the Serverless Real-Time Inference SKU.

AI Gateway (Preview)

Product / Feature DBU Multiplier
AI Guardrails 21.429 DBUs / M tokens
Inference Tables for FM API endpoints 2.857 DBUs / M tokens
Inference Tables for CPU, GPU endpoints 7.143 DBUs / 1 GB of payload
Usage Tracking for FM API endpoints 0.571 DBUs / M tokens
Usage Tracking for CPU, GPU endpoints 1.429 DBUs / 1 GB of payload

CPU Model Serving

1 concurrent request/hr = 1 DBU/hr

GPU Model Serving

Instance Size GPU configuration DBUs / hour
Small T4 or equivalent 10.48
Medium A10G x 1GPU or equivalent 20.00
Medium 4X A10G x 4GPU or equivalent 112.00
Medium 8x A10G x 8GPU or equivalent 290.80
XLarge A100 40GB x 8GPU or equivalent 538.40
XLarge A100 80GB x 8GPU or equivalent 628.00

Foundation Models Serving

Model Pay-Per-Token Provisioned Throughput
DBU / 1M INPUT tokens DBU / 1M OUTPUT tokens DBU per hour
Current Models
Llama 3.1 405B 71.429 214.286 700.000
Llama 3.1 70B 14.286 42.857 424.286
Llama 3.1 8B n/a n/a 106.000
Llama 3.2 3B n/a n/a 92.857
Llama 3.2 1B n/a n/a 85.714
DBRX 10.714 32.143 171.429
Mixtral 8x7B 7.143 14.286 157.143
Legacy Models
Llama 3 70B n/a n/a 212.143
Llama 3 8B n/a n/a 106.000
Llama 2 70B 7.143 21.429 157.143
Llama 2 13B n/a n/a 78.571
MPT 30B n/a n/a 112.000
MPT 7B n/a n/a 20.000
GTE 1.857 1.857 n/a
BGE Large 1.429 1.429 10.480

Shutterstock Image AI

1 image = 0.857 DBUs

1 vector search unit = 4 DBU/hr

Agent Evaluation

1 judge request = 1 DBU

Model Training

The following capabilities are billed against the Model Training SKU.

Model Training - Fine Tuning

Model Training word count Approximate DBUs
Current Models
Llama 3.1 405B 10,000,000 1,150
500,000,000 57,150
Llama 3.1 70B 10,000,000 375
500,000,000 17,600
Llama 3.1 8B 10,000,000 150
500,000,000 6,600
Llama 3.2 3B 10,000,000 75
500,000,000 3,300
Llama 3.2 1B 10,000,000 25
500,000,000 1,100
DBRX 10,000,000 300
500,000,000 14,300
Mixtral 8x7B 10,000,000 150
500,000,000 6,600
Mistral 7B 10,000,000 50
500,000,000 1,325
Legacy Models (to be deprecated on Dec 13, 2024)
Llama 3 70B 10,000,000 375
500,000,000 17,600
Llama 3 8B 10,000,000 150
500,000,000 6,600
Llama 2 70B 10,000,000 275
500,000,000 13,200
Llama 2 13B 10,000,000 50
500,000,000 2,475
Llama 2 7B 10,000,000 25
500,000,000 1,175
Codellama 34B 10,000,000 100
500,000,000 4,950
Codellama 13B 10,000,000 75
500,000,000 2,650
Codellama 7B 10,000,000 50
500,000,000 1,325

Databricks Storage

The following capabilities are billed against the Databricks Storage SKU

Product / Feature DSU Multiplier
Vector Search 10X
Online Tables Storage (preview) 15X