23 lines
876 B
Markdown
23 lines
876 B
Markdown
# parser_OLX
|
|
forked from https://github.com/KrzysztofSajdok/parser_OLX
|
|
simple parser used to retrieve amount of sale, rent and exchange avertisements from OLX.pl in real estate category.
|
|
|
|
Installation:
|
|
```
|
|
python3 -m venv .venv && source .venv/bin/activate
|
|
pip3 install -r requirements.txt
|
|
sudo apt install chromium
|
|
chromium --version
|
|
```
|
|
Then download the appropriate version of `chromedriver` from [this webpage](https://chromedriver.chromium.org/downloads), extract it, give execution permissions and put it in .venv/bin/. Here I'm downloading chromedriver for chromium 112
|
|
```
|
|
wget "https://chromedriver.storage.googleapis.com/112.0.5615.49/chromedriver_linux64.zip"
|
|
unzip chromedriver_linux64.zip
|
|
mv chromedriver .venv/bin
|
|
chmod +x .venv/bin/chromedriver
|
|
rm LICENSE.chromedriver chromedriver_linux64.zip
|
|
```
|
|
|
|
`main.py` puts data in the database in the `olx_data` table.
|
|
|