Infra Setup
You'll deploy the core Databricks infrastructure — workspaces, identity, and governance — in this section.
Prereqs: Before you Start
Why this matters
A Databricks deployment without proper workspace layout, centralized identity, and metastore ownership becomes hard to govern and harder to scale. This section walks through each piece in the order it should be done.
Journey checklist
-
Get started. -
Before you start. - Infra setup
- Create workspaces.
- Add users.
- Add groups.
- Change ownership to metastore admins.
- Activate SSO.
- Cost monitoring.
- Data Governance Strategy.
- Access your data.
- Build the first pipeline.
- Automation and orchestration.
- Query and explore.
- Databricks AI/BI.
- Business semantics.
What you'll set up
Work through these sub-sections in order. Each one depends on the previous.
- Create Workspaces — Deploy workspaces on AWS, Azure, or GCP (manual or Terraform).
- Add Users — Register users manually or via SCIM provisioning.
- Add Groups — Create groups that map to data personas and assign users.
- Metastore Admins — Set the admin group and transfer UC asset ownership.
- Activate SSO — Configure single sign-on with your identity provider.
Next
- Do next: Create Workspaces
- Learn why: Unity Catalog foundations
- Reference: Databricks administration overview