Dataform | Manage data pipelines in BigQuery

Transform your raw data into reliable, documented, up-to-date datasets.

Dataform is where your data team works together to build a single source of truth for your company’s data. Collaborate on SQL pipelines in BigQuery without writing code or managing infrastructure.

contacts
salesforce
updated
identifies
segment
source
tracks
segment
source
customers
dataform
success
customer_stats
dataform
success
user_stats
dataform
cached
users
segment
cached
domain_stats
dataform
error
daily_customer_stats
dataform
running

Dataform is where the whole data team works together

Analytics leads

See how Dataform enables your team to collaborate together and make sure data is discoverable across the organization.

Data engineers

See how Dataform's open source framework enables you to save costs and reduce latency across 1000's of datasets.

Data analysts

See how Dataform's cloud IDE makes writing and maintaining SQL pipelines a breeze.

Kaleva
Livup
Echo
Charlotte Tilbury
Acordo Certo
Big Tree
Teatime Games
Slite
Tasman Analytics
Butternut Box
Outshine
Dreamdata
Mittelbayerische
SME Capital
Mixcloud
Rapidoo
Prolific
Unbound Solar

Where Dataform fits in your stack

Dataform enables analysts to manage all data processes in BigQuery, turning raw data into the clean datasets you need for analytics.

How Dataform works

A software engineering inspired approach to data modeling, pipelines, and data quality management.

1

Write data transformations as code

Develop your data workflows in SQL with Dataform's built-in templating syntax and APIs.

2

Deploy to BigQuery with Dataform

Dataform deploys your SQL definitions to BigQuery, creates tables and views, and runs tests.

3

Create a single source of truth for your data

Finish with trusted, documented, and well-tested datasets ready to be consumed by other teams, or visualized in dashboards.

What our customers say

Mike Lee

Head of Data

speech marks

Being able to produce analytics tables that we are confident in the output of (because of assertions) and are as up to date as we need them to be (because of scheduling) makes our lives really easy. The UI is incredibly easy and intuitive to use, meaning we spend little of our time setting these things up, and most of our time writing SQL!

Echo Logo
Jim Lumsden

Growth Analyst

speech marks

I love the dependency tree in Dataform. For me this is a central place for sanity checking my data flows, understanding if I'm reimplementing a dataset which already exists, and verifying logic. Secondly, I love SQLX for generating SQL of a similar structure again and again, it really speeds up development and let's your abstract away logic.

Prolific Logo
Saadat Photo
Saadat Qadri

Analytics Practice Lead

speech marks

Having modeled data using other tools in the past, this is much simpler and an easier environment to code in. The code compiles in real time and lets you know if there are errors in the syntax. It also helps generate a dependency graph for the data pipeline which is insanely useful.

Outshine Logo