Development Environment Setup
The sections below assume that the Lakeflow Framework repository has been cloned from git and that you are in its root directory. If not, please do so first.
Setting up for development as a contributor to the Lakeflow Framework
Once you have cloned the Lakeflow Framework repository, you’ll need to follow the steps below to set up the framework.
Install requirements
The dev dependencies are pinned and hash-verified in requirements-dev.lock (generated from requirements-dev.txt). Installing from the lockfile guarantees a reproducible environment that matches what CI uses.
Install them from the root directory by running the following command (you may also want to use a virtual environment for this by running python -m venv .venv/ first; see Python Virtual Environments for more details):
    pip install --require-hashes --no-deps -r requirements-dev.lock
requirements-dev.lock includes everything in requirements-docs.lock too, so you do not need a separate install step for building the documentation.
If you change a dependency in any of the requirements*.txt files, regenerate all three lockfiles with the helper script from the repo root:
    ./scripts/generate_lockfiles.sh
See Updating the Documentation for more details on the lockfiles.
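The install steps in this section can be combined into one shell function. This is a minimal sketch, not part of the repository: the function name setup_dev_env is hypothetical, and it assumes a POSIX shell on Linux or macOS (where the virtual environment's interpreter lives under bin/) and that it is run from the repository root.

```shell
# Hypothetical helper (not shipped with the framework): create a virtual
# environment if needed, then install the pinned dev dependencies into it.
# Assumes a POSIX shell on Linux/macOS and the repository root as the
# current directory.
setup_dev_env() {
    venv_dir="${1:-.venv}"   # target directory for the virtual environment

    # Create the virtual environment only if it does not exist yet.
    [ -d "$venv_dir" ] || python -m venv "$venv_dir" || return 1

    # Install exactly what the lockfile pins, verifying every hash.
    # Calling the environment's own interpreter means no activation is needed.
    "$venv_dir/bin/python" -m pip install --require-hashes --no-deps \
        -r requirements-dev.lock
}
```

Calling setup_dev_env with no arguments uses .venv/; run `. .venv/bin/activate` afterwards if you want the environment active in your current shell.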
Set up VS Code extensions
The first time you open the Lakeflow Framework workspace in VS Code, it will prompt you to install the recommended extensions. If you missed this prompt, you can review and install the recommended extensions with the Extensions: Show Recommended Extensions command, or by clicking the Extensions tab on the left side of the window and selecting “Workspace Recommendations”.
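If you prefer the command line, the recommended extension IDs can also be listed from a script. The sketch below is an assumption-laden illustration rather than a documented part of the framework: it assumes the recommendations are stored in .vscode/extensions.json (VS Code's usual location for workspace recommendations; this repository may keep them elsewhere), that the "recommendations" array is written one entry per line, and the function name list_recommended_extensions is hypothetical.

```shell
# Hypothetical helper: print the extension IDs listed under "recommendations"
# in a VS Code recommendations file.
# Assumption: the file path defaults to .vscode/extensions.json and the
# array spans multiple lines (one ID per line).
list_recommended_extensions() {
    file="${1:-.vscode/extensions.json}"

    # 1. Keep only the lines from "recommendations" up to the closing bracket.
    # 2. Extract quoted strings that look like publisher.name IDs.
    # 3. Strip the surrounding quotes.
    sed -n '/"recommendations"/,/]/p' "$file" \
        | grep -oE '"[A-Za-z0-9_-]+\.[A-Za-z0-9_.-]+"' \
        | tr -d '"'
}
```

Each printed ID can then be passed to VS Code's `code --install-extension` command, for example by piping the output through `xargs -n 1 code --install-extension`, provided the code launcher is on your PATH.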
Note
To deploy the Lakeflow Framework to your Databricks workspace, follow the steps in Deploy the Framework.