7cb9d9a60b
Padilha notes week5 ( #361 )
2023-03-18 15:56:58 +01:00
2e1d49449b
Update download_data.sh ( #345 )
...
Fixed the "downloading" spelling error on line 18
2023-03-04 11:57:57 +01:00
14af32258e
missing \ ( #344 )
2023-03-03 17:43:32 +01:00
f8604f3885
moved homework
2023-03-03 16:05:39 +01:00
afdf597bac
Update README.md ( #339 )
2023-03-01 21:47:20 +01:00
0539a852fa
Update README.md
2023-03-01 17:13:42 +01:00
acc211182a
Updated download_data.sh file for week 05 ( #338 )
...
* updated download_data.sh for week 05
Updated to use new data repo and correct file extensions
* Update download_data.sh
2023-02-28 20:49:38 +01:00
f4cc1dc33b
Update windows.md ( #335 )
...
changed spark version from 3.0.3 to the 3.3.2
2023-02-26 22:20:26 +01:00
52277cce88
Update linux.md ( #324 )
...
corrected url, made spark versions consistent
2023-02-25 12:16:48 +01:00
258415077b
Update macos.md ( #332 )
...
* Update macos.md
added a useful information regarding the path where java is installed, if someone has problems to find it
* Update macos.md
again, just made clear how to get the path to the installation (spark)
2023-02-25 12:16:15 +01:00
3e7dfb6857
Add a link to my notes ( #330 )
2023-02-24 11:45:42 +01:00
7f52481821
Update linux.md ( #329 )
...
* Update linux.md
Consistent changes.
* Update linux.md
---------
Co-authored-by: Alexey Grigorev <alexeygrigorev@users.noreply.github.com >
2023-02-23 10:22:03 +01:00
3a127a6588
Homework Week5 ( #322 )
...
* Added week 5 homework
* Updated HW Answer Options
* Added Deadline to Week 5 Homework
2023-02-22 08:28:28 +01:00
2c4461668a
Typo while exporting the spark path ( #294 )
...
installed spark version is 3.3.1, however, exporting the path to the bash file is with version 3.3.0, so the exporting spark version is updated to 3.3.1
Previously
```bash
export SPARK_HOME="${HOME}/spark/spark-3.3.0-bin-hadoop3"
export PATH="${SPARK_HOME}/bin:${PATH}"
```
Updated
```bash
export SPARK_HOME="${HOME}/spark/spark-3.3.1-bin-hadoop3"
export PATH="${SPARK_HOME}/bin:${PATH}"
```
2023-02-08 20:17:22 +01:00
af8b8ff5ed
Merge branch 'main' of github.com:DataTalksClub/data-engineering-zoomcamp
2023-01-10 10:26:03 +01:00
5867e0d71f
Updated Spark Version in Linux Instructions ( #220 )
2022-12-29 11:32:48 +01:00
78d9532ee4
Update README.md ( #217 )
2022-11-21 16:37:50 +01:00
518ad3f77a
Updated spark version for Linux install ( #211 )
2022-09-20 05:51:48 +02:00
73ae05e4d7
hadoop
2022-04-22 15:06:10 +02:00
00caff1b66
docs
2022-03-30 20:27:38 +02:00
d8065cef73
dataproc
2022-03-30 18:12:56 +00:00
a868f23066
homework 5
2022-03-07 22:27:18 +00:00
0ab54d899b
Update macos.md
2022-03-02 17:51:37 +01:00
9aac95b54f
week_5_contribution ( #103 )
2022-03-02 17:42:49 +01:00
a2b6af4680
Added JAVA_HOME env var to avoid no JRE found error ( #101 )
2022-03-02 17:40:54 +01:00
0ad4e1c6cd
Update homework.md
2022-03-02 17:40:20 +01:00
1c03cd3205
Update README.md
2022-02-26 08:52:32 +01:00
b6396f86cc
Update homework.md
2022-02-26 07:03:15 +01:00
caf330c281
Update homework.md
2022-02-25 22:47:32 +01:00
2c16edecd1
inferschema note ( #100 )
2022-02-25 22:05:14 +01:00
e2f1a3748a
Notes by Alvaro Navas ( #99 )
2022-02-25 16:20:30 +01:00
433e379a94
Update pyspark.md
2022-02-22 19:46:41 +01:00
9f40637c86
Added a note for different file names for py4j ( #96 )
...
Added a note since brew picks up the latest minor version and the filename might not match the version installed for the PYTHONPATH.
2022-02-22 19:44:28 +01:00
329cdaf5b3
Update linux.md ( #95 )
2022-02-22 19:35:36 +01:00
d9789e8ef2
Update homework.md
2022-02-22 17:45:46 +01:00
160eca57e9
new video
2022-02-22 12:05:34 +01:00
ac444c887e
Edit linux.md ( #94 )
...
Edit JAVA_HOME and SPARK_HOME to match video example
2022-02-22 11:38:29 +01:00
9569913169
homework update
2022-02-22 07:29:37 +01:00
c45f63410f
note
2022-02-22 07:09:11 +01:00
bed8de2728
homework
2022-02-22 07:02:32 +01:00
301e2b0dda
rdd
2022-02-21 23:49:41 +00:00
0f603c775a
Update README week 5 ( #91 )
...
* update README week5
* update README week5
* update README week5
* update README week5
* update README week5
2022-02-21 20:01:33 +01:00
68f03b11a6
Update macos.md ( #93 )
2022-02-21 20:01:05 +01:00
34f5827e5c
Remove cask for brew cask install java. ( #89 )
...
I believe using `brew cask` here is incorrect (at least for me it didn't work)
1. `cask` has been deprecated since version `2.6.0` (see [changelog](https://brew.sh/2020/12/01/homebrew-2.6.0/ ))
2. `brew install --cask java` does not work either (no cask `java`), but `brew install java` does work
2022-02-20 20:09:01 +01:00
3fb08caff7
spark notebooks
2022-02-18 23:13:50 +00:00
41c6d766de
Update macos.md
2022-02-18 08:56:57 +01:00
760d222036
spark setup for MacOs ( #85 )
...
* spark setup for macos
* added bash blocks
2022-02-18 08:51:53 +01:00
24401f78d8
data preparation video
2022-02-17 22:40:56 +00:00
3eadd01037
code for week 5
2022-02-16 22:21:35 +00:00
39bbbe59ae
Update windows.md
2022-02-16 20:56:47 +01:00