Workspace + Genie Code
In about 10 minutes, you'll learn how to build a pipeline interactively using the Databricks Workspace and Genie Code.
Prereqs: Build the first pipeline
What you'll walk away with
A working Spark Declarative Pipeline you built without leaving the browser. You describe the transformation to Genie Code in plain language, it writes the pipeline code, and you validate and run it in the Workspace editor. This is where most people should start before going code-first with Databricks Asset Bundles (DABs).
How it works
The Workspace has a pipeline editor for building Spark Declarative Pipelines visually. Genie Code sits alongside it: tell it what you want, and it generates the transformation code, suggests data quality expectations, and auto-completes schema definitions from your actual data.
Getting started
This video builds a pipeline from scratch in the Workspace with Genie Code:
Take Genie Code further
Once the basics run, Genie Code handles the harder parts: multi-table joins, layered transformations, and data quality rules.
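To make that concrete, here is a minimal sketch of the kind of layered, multi-table pipeline you might end up with after a few prompts. It is not actual Genie Code output. The table names (`samples.raw.orders`, `samples.raw.customers`), column names, and expectation rules are all hypothetical, and it assumes the `dlt` Python module and the `spark` session that the Databricks pipeline runtime provides. The small fallback at the top is not a Databricks API; it only lets the file be imported outside a pipeline, for example by a linter.

```python
# Illustrative sketch only: table names, columns, and rules are hypothetical.
try:
    import dlt  # supplied by the Databricks pipeline runtime
except ImportError:
    import types

    def _passthrough(*args, **kwargs):
        """No-op stand-in so this file can be imported outside a pipeline."""
        if len(args) == 1 and callable(args[0]) and not kwargs:
            return args[0]        # used as a bare decorator: @dec
        return lambda fn: fn      # used with arguments: @dec(...)

    dlt = types.SimpleNamespace(
        table=_passthrough, expect=_passthrough, expect_or_drop=_passthrough
    )

# `spark` below is the session the pipeline runtime injects; it is only
# touched when the pipeline actually runs, not when this file is imported.


@dlt.table(comment="Bronze: raw orders as landed.")
def orders_bronze():
    return spark.read.table("samples.raw.orders")  # hypothetical source


@dlt.table(comment="Bronze: raw customers as landed.")
def customers_bronze():
    return spark.read.table("samples.raw.customers")  # hypothetical source


@dlt.table(comment="Silver: typed orders with basic quality rules.")
@dlt.expect_or_drop("valid_order_id", "order_id IS NOT NULL")  # drop bad rows
@dlt.expect("positive_amount", "amount > 0")  # record violations, keep rows
def orders_silver():
    return dlt.read("orders_bronze").selectExpr(
        "order_id",
        "customer_id",
        "CAST(amount AS DOUBLE) AS amount",
        "to_date(order_ts) AS order_date",
    )


@dlt.table(comment="Gold: daily revenue per customer segment.")
def revenue_by_segment():
    orders = dlt.read("orders_silver")
    customers = dlt.read("customers_bronze")
    return (
        orders.join(customers, on="customer_id", how="inner")
        .groupBy("segment", "order_date")
        .agg({"amount": "sum"})
        .withColumnRenamed("sum(amount)", "revenue")
    )
```

Each function declares one table, and the pipeline infers the dependency order from the `dlt.read` calls. Whatever Genie Code generates for your data, review the expectation rules before running: `expect_or_drop` silently removes failing rows, while `expect` only records them in the pipeline's data quality metrics.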
When to reach for this
Build here when you want fast feedback on your first pipeline, when you are prototyping a transformation before committing it to code, or when you just prefer working in a visual editor.
Switch to DABs once you need version-controlled, repeatable deployments, once the pipeline has to run in both staging and production, or once you want the schemas, volumes, and warehouses defined alongside the pipeline.
Next
- Do next: DABs pipeline
- Learn why: Spark Declarative Pipelines overview
- Reference: Create a pipeline (Databricks docs)