54 Commits

Author SHA1 Message Date
7cb9d9a60b Padilha notes week5 (#361) 2023-03-18 15:56:58 +01:00
2e1d49449b Update download_data.sh (#345)
Fixed the "downloading" spelling error on line 18
2023-03-04 11:57:57 +01:00
14af32258e missing \ (#344) 2023-03-03 17:43:32 +01:00
f8604f3885 moved homework 2023-03-03 16:05:39 +01:00
afdf597bac Update README.md (#339) 2023-03-01 21:47:20 +01:00
0539a852fa Update README.md 2023-03-01 17:13:42 +01:00
acc211182a Updated download_data.sh file for week 05 (#338)
* updated download_data.sh for week 05

Updated to use new data repo and correct file extensions

* Update download_data.sh
2023-02-28 20:49:38 +01:00
f4cc1dc33b Update windows.md (#335)
changed Spark version from 3.0.3 to 3.3.2
2023-02-26 22:20:26 +01:00
52277cce88 Update linux.md (#324)
corrected url, made spark versions consistent
2023-02-25 12:16:48 +01:00
258415077b Update macos.md (#332)
* Update macos.md

added useful information about the path where Java is installed, in case someone has trouble finding it

* Update macos.md

again, just clarified how to get the path to the Spark installation
2023-02-25 12:16:15 +01:00
3e7dfb6857 Add a link to my notes (#330) 2023-02-24 11:45:42 +01:00
7f52481821 Update linux.md (#329)
* Update linux.md

Consistent changes.

* Update linux.md

---------

Co-authored-by: Alexey Grigorev <alexeygrigorev@users.noreply.github.com>
2023-02-23 10:22:03 +01:00
3a127a6588 Homework Week5 (#322)
* Added week 5 homework

* Updated HW Answer Options

* Added Deadline to Week 5 Homework
2023-02-22 08:28:28 +01:00
2c4461668a Typo while exporting the spark path (#294)
The installed Spark version is 3.3.1, but the path exported in the bash file uses version 3.3.0, so the exported path is updated to version 3.3.1.
Previously:

```bash
export SPARK_HOME="${HOME}/spark/spark-3.3.0-bin-hadoop3"
export PATH="${SPARK_HOME}/bin:${PATH}"
```
Updated:
```bash
export SPARK_HOME="${HOME}/spark/spark-3.3.1-bin-hadoop3"
export PATH="${SPARK_HOME}/bin:${PATH}"
```
2023-02-08 20:17:22 +01:00
af8b8ff5ed Merge branch 'main' of github.com:DataTalksClub/data-engineering-zoomcamp 2023-01-10 10:26:03 +01:00
5867e0d71f Updated Spark Version in Linux Instructions (#220) 2022-12-29 11:32:48 +01:00
78d9532ee4 Update README.md (#217) 2022-11-21 16:37:50 +01:00
518ad3f77a Updated spark version for Linux install (#211) 2022-09-20 05:51:48 +02:00
73ae05e4d7 hadoop 2022-04-22 15:06:10 +02:00
00caff1b66 docs 2022-03-30 20:27:38 +02:00
d8065cef73 dataproc 2022-03-30 18:12:56 +00:00
a868f23066 homework 5 2022-03-07 22:27:18 +00:00
0ab54d899b Update macos.md 2022-03-02 17:51:37 +01:00
9aac95b54f week_5_contribution (#103) 2022-03-02 17:42:49 +01:00
a2b6af4680 Added JAVA_HOME env var to avoid no JRE found error (#101) 2022-03-02 17:40:54 +01:00
0ad4e1c6cd Update homework.md 2022-03-02 17:40:20 +01:00
1c03cd3205 Update README.md 2022-02-26 08:52:32 +01:00
b6396f86cc Update homework.md 2022-02-26 07:03:15 +01:00
caf330c281 Update homework.md 2022-02-25 22:47:32 +01:00
2c16edecd1 inferschema note (#100) 2022-02-25 22:05:14 +01:00
e2f1a3748a Notes by Alvaro Navas (#99) 2022-02-25 16:20:30 +01:00
433e379a94 Update pyspark.md 2022-02-22 19:46:41 +01:00
9f40637c86 Added a note for different file names for py4j (#96)
Added a note because brew picks up the latest minor version, so the py4j filename used in the PYTHONPATH might not match the installed version.
2022-02-22 19:44:28 +01:00
329cdaf5b3 Update linux.md (#95) 2022-02-22 19:35:36 +01:00
d9789e8ef2 Update homework.md 2022-02-22 17:45:46 +01:00
160eca57e9 new video 2022-02-22 12:05:34 +01:00
ac444c887e Edit linux.md (#94)
Edit JAVA_HOME and SPARK_HOME to match video example
2022-02-22 11:38:29 +01:00
9569913169 homework update 2022-02-22 07:29:37 +01:00
c45f63410f note 2022-02-22 07:09:11 +01:00
bed8de2728 homework 2022-02-22 07:02:32 +01:00
301e2b0dda rdd 2022-02-21 23:49:41 +00:00
0f603c775a Update README week 5 (#91)
* update README week5

* update README week5

* update README week5

* update README week5

* update README week5
2022-02-21 20:01:33 +01:00
68f03b11a6 Update macos.md (#93) 2022-02-21 20:01:05 +01:00
34f5827e5c Remove cask for brew cask install java. (#89)
I believe using `brew cask` here is incorrect (at least for me it didn't work)
1. `cask` has been deprecated since version `2.6.0` (see [changelog](https://brew.sh/2020/12/01/homebrew-2.6.0/))
2. `brew install --cask java` does not work either (no cask `java`), but `brew install java` does work
2022-02-20 20:09:01 +01:00
3fb08caff7 spark notebooks 2022-02-18 23:13:50 +00:00
41c6d766de Update macos.md 2022-02-18 08:56:57 +01:00
760d222036 spark setup for MacOs (#85)
* spark setup for macos

* added bash blocks
2022-02-18 08:51:53 +01:00
24401f78d8 data preparation video 2022-02-17 22:40:56 +00:00
3eadd01037 code for week 5 2022-02-16 22:21:35 +00:00
39bbbe59ae Update windows.md 2022-02-16 20:56:47 +01:00