Recap and learning

Workspace

What is a Workspace?

A workspace is the environment where you create and integrate Databricks products to build end-to-end Data + AI solutions.

How many workspaces do I need?
  • The best practice is three separate workspaces aligned with the Software Development Lifecycle (SDLC):
    • Development workspace.
    • Staging (UAT and testing) workspace.
    • Production workspace.
Do I need a new workspace for each team in my organization?
  • No. Add each team's group to an existing workspace, either as users or as admins.
  • This is discussed in more detail in section 6.
How can I isolate my organization's data?
  • Physical and logical isolation is achieved through Unity Catalog (next section).
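
As a rough illustration of logical isolation, the sketch below assumes one Unity Catalog catalog per SDLC environment and builds three-level names of the form catalog.schema.table. The catalog names (dev, staging, prod), the ENV_CATALOGS mapping and the qualified_name helper are hypothetical and only serve the example; your own naming convention may differ.

```python
# Hypothetical catalog names, one per SDLC environment (an assumption of this sketch).
ENV_CATALOGS = {
    "development": "dev",
    "staging": "staging",
    "production": "prod",
}


def qualified_name(environment: str, schema: str, table: str) -> str:
    """Return a Unity Catalog three-level name: catalog.schema.table."""
    try:
        catalog = ENV_CATALOGS[environment]
    except KeyError:
        raise ValueError(f"unknown environment: {environment!r}") from None
    return f"{catalog}.{schema}.{table}"


# The same logical table resolves to a different catalog in each environment.
for env in ENV_CATALOGS:
    print(env, "->", qualified_name(env, "sales", "orders"))
```

Keeping the environment-to-catalog mapping in one place means code promoted from development to production only changes the environment name, not every table reference.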

Unity Catalog

What is UC?

Unity Catalog is a centralized metadata layer designed to manage data access, security, lineage, and governance across Databricks workspaces, enabling unified data management and secure collaboration.

Where is the data stored?

The data, metadata and AI models are always stored in your cloud object storage.

How should every Data + AI interaction be done?

Through Unity Catalog.

What should be avoided?
  • Accessing data using hardcoded credentials.
  • Configuring data access on clusters using external libraries and environment variables.
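
Instead of embedding credentials in code or cluster configuration, access can be expressed as Unity Catalog privileges granted to a group. The following is a minimal sketch, not a definitive implementation: the grant_read_statements helper and the catalog, schema, table and group names are hypothetical, and the inputs are assumed to be trusted identifiers. The GRANT statements themselves use the standard Unity Catalog privileges USE CATALOG, USE SCHEMA and SELECT.

```python
def grant_read_statements(catalog: str, schema: str, table: str, group: str) -> list[str]:
    """Build the SQL a group needs to read one table through Unity Catalog."""
    principal = f"`{group}`"  # backticks allow group names with spaces or dashes
    return [
        f"GRANT USE CATALOG ON CATALOG {catalog} TO {principal}",
        f"GRANT USE SCHEMA ON SCHEMA {catalog}.{schema} TO {principal}",
        f"GRANT SELECT ON TABLE {catalog}.{schema}.{table} TO {principal}",
    ]


# Hypothetical names; in a Databricks notebook each statement would
# typically be executed with spark.sql(statement).
for statement in grant_read_statements("prod", "sales", "orders", "analysts"):
    print(statement)
```

Once such grants are in place, users in the group read the table by its three-level name and no secret has to appear in notebooks, libraries or environment variables.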

Databricks Account Console

What is the purpose of the Databricks Account Console?

A portal similar to your cloud provider's console or UI, where you can manage:

  • Workspaces.
  • UC Metastores.
  • Users, Groups and Service Principals.
  • Billing and Budgets.
  • SCIM and SSO.