* Added .devcontainer Updated comment, line 84 of Dockerfile * Update README.md Added links for devcontainer VS Code extension and general devcontainer docs
Devcontainer for DataTalksClub Data Engineering Zoomcamp
This devcontainer sets up a development environment for this class. This can be used with both VS Code and GitHub Codespaces.
Getting Started
To continue, make sure you have Visual Studio Code and Docker Desktop installed OR use GitHub Codespaces.
Option 1: Local VS Code
- Clone the repo and connect to it in VS Code:
$ cd your/desired/repo/location
$ git clone https://github.com/DataTalksClub/data-engineering-zoomcamp.git
-
Download the
Dev Containersextension from the VS Code marketplace. Full docs on devcontainers here -
Press Cmd + Shift + P (Mac) or Ctrl + Shift + P (Windows) to open the Command Pallette. Type in
Dev Containers: Open Folder in Containerand select the repo directory -
Wait for the container to build and the dependencies to install
Option 2: GitHub Codespaces
-
Fork this repo
-
From the repo page in GitHub, select the green
<> Codebutton and choose Codespaces -
Click
Create Codespace on Main, or checkout a branch if you prefer -
Wait for the container to build and the dependencies to install
-
Start developing!
Included Tools and Languages:
Python 3.9PandasSQLAlchemyPySparkPyArrowPolarsPrefect 2.7.7and all required Python dependenciesconfluent-kafka
Google Cloud SDKdbt-coredbt-postgresdbt-bigquery
TerraformJupyter Notebooks for VS CodeDockerSparkJDKversion 11Oh-My-Posh Powershell themes- Popular VS Code themes (GitHub, Atom One, Material Icons etc.)
Customization
Feel free to modify the Dockerfile, devcontainer.json or requirements.txt file to include any other tools or packages that you need for your development environment. In the Dockerfile, you can customize the POSH_THEME environment variable with a theme of your choosing from here