Deploying a Pipeline Bundle
Deploying From Your Local Machine
Once you have created a data pipeline bundle and deployed the Lakeflow Framework, you can deploy the bundle to your Databricks workspace.
Ensure you have the Databricks CLI installed and configured. If not, please refer to the Databricks CLI documentation.
Ensure the workspace host field in the databricks.yml file points to the correct Databricks workspace. The Databricks CLI must be configured with credentials to access this workspace.
Run the following command to validate the data pipeline bundle:
databricks bundle validate
This command runs a series of checks to ensure the bundle is correctly set up and ready for deployment.
Run the following command to deploy the data pipeline bundle to your Databricks workspace:
databricks bundle deploy --var="pipeline_framework_path=/Workspace/Users/<your_databricks_user_id>/.bundle/<framework_bundle_name>/<environment>/current/files/src"
The owner in this path (<your_databricks_user_id>) is your Databricks user ID.
Once the deployment is successful, you should see the data pipeline bundle in your Databricks workspace.
To verify, go to your Databricks workspace and check that the bundle is present in the .bundle directory. Also verify that a Spark Declarative Pipeline has been created in the workspace, with the name given in the resources YAML file for the Spark Declarative Pipeline.
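The validate, deploy, and verify steps can be chained in a small shell helper. This is a minimal sketch, not part of the Lakeflow Framework: the function names (framework_path, deploy_pipeline_bundle, verify_deployment) and the example argument values are hypothetical. It assumes the Databricks CLI is installed and authenticated, that the script runs from the pipeline bundle's root directory, and that the bundle deploys under the default .bundle directory of the given user.

```shell
#!/usr/bin/env bash

# Build the workspace path to the deployed framework source.
# Arguments: <user_id> <framework_bundle_name> <environment>
framework_path() {
  printf '/Workspace/Users/%s/.bundle/%s/%s/current/files/src\n' "$1" "$2" "$3"
}

# Validate the bundle, then deploy it. Deployment is skipped if validation fails.
# Arguments: <user_id> <framework_bundle_name> <environment>
deploy_pipeline_bundle() {
  local path
  path="$(framework_path "$1" "$2" "$3")" || return 1
  databricks bundle validate &&
    databricks bundle deploy --var="pipeline_framework_path=${path}"
}

# List the user's .bundle directory and the pipelines in the workspace,
# so the deployed bundle and pipeline can be checked by eye.
# Assumption: the bundle uses the default deployment root under .bundle.
# Arguments: <user_id>
verify_deployment() {
  databricks workspace list "/Workspace/Users/$1/.bundle" &&
    databricks pipelines list-pipelines
}

# Example call with placeholder values (replace with your own):
# deploy_pipeline_bundle "someone@example.com" "my_framework_bundle" "dev"
# verify_deployment "someone@example.com"
```

Keeping the path construction in its own function means the three placeholders from the deploy command are filled in one place, which makes a mistyped path easier to spot before anything is deployed.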
Deploying via CI/CD
Please refer to the CI/CD documentation for more information on how to deploy the Lakeflow Framework samples using CI/CD.