Workspace + Genie Code

You'll learn how to build a pipeline interactively using the Databricks Workspace and Genie Code in ~10 min.

Prereqs: Build the first pipeline

What you'll walk away with

A working Spark Declarative Pipeline you built without leaving the browser. You describe the transformation to Genie Code in plain language, it writes the pipeline code, and you validate and run it in the Workspace editor. This is where most people should start before going code-first with DABs.

How it works

The Workspace has a pipeline editor for building Spark Declarative Pipelines visually. Genie Code sits alongside it: tell it what you want, and it generates the transformation code, suggests data quality expectations, and auto-completes schema definitions from your actual data.

Getting started

This video builds a pipeline from scratch in the Workspace with Genie Code:

Take Genie Code further

Once the basics run, Genie Code handles the harder parts: multi-table joins, layered transformations, and data quality rules.

When to reach for this

Build here when you want fast feedback on your first pipeline, when you are prototyping a transformation before committing it to code, or when you just prefer working in a visual editor.

Switch to DABs once you need version-controlled, repeatable deployments, the pipeline has to run in staging and prod, or you want the schemas, volumes, and warehouses defined alongside the pipeline.

Do next: DABs pipeline
Learn why: Spark Declarative Pipelines overview
Reference: Create a pipeline (Databricks docs)

What you'll walk away with​

How it works​

Getting started​

Take Genie Code further​

When to reach for this​

Next​

What you'll walk away with

How it works

Getting started

Take Genie Code further

When to reach for this

Next