python version: 3.12.0
enter pyspark folder:
cd pyspark
establish venv:
python3 -m venv venv
source venv/bin/activate
install requirements:
pip install poetry
poetry install
example run:
python3 run_pyspark.py -t 1
-t / --task: task number
-i / --inventoryfile: path to inventory parquet; default: data/inventory.parquet
-u / --usersfile: path to users parquet; default: data/selected_users.parquet
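The options above could be declared with argparse roughly as follows. This is only a sketch, not the actual contents of run_pyspark.py: the `build_parser` name and the integer type for the task number are assumptions, while the flag names and defaults are taken from the list above.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented run_pyspark.py options."""
    parser = argparse.ArgumentParser(prog="run_pyspark.py")
    # Assumption: the task number is parsed as an integer.
    parser.add_argument("-t", "--task", type=int, help="task number")
    parser.add_argument(
        "-i", "--inventoryfile",
        default="data/inventory.parquet",
        help="path to inventory parquet",
    )
    parser.add_argument(
        "-u", "--usersfile",
        default="data/selected_users.parquet",
        help="path to users parquet",
    )
    return parser


# Equivalent to: python3 run_pyspark.py -t 1
args = build_parser().parse_args(["-t", "1"])
print(args.task, args.inventoryfile, args.usersfile)
```

Passing an explicit list to `parse_args` keeps the example runnable outside the command line; a real script would normally call `parse_args()` with no arguments.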
enter python folder:
cd python
establish venv:
python3 -m venv venv
source venv/bin/activate
install requirements:
pip install -r requirements.txt
example run:
python run_python.py -i path/to/input/csv/file.csv
python run_python.py -i data/sales_report_input.csv
-i / --inputfile: path to input csv
-o / --outputfile: path to output csv
-s / --separator: csv separator; default: ;
-k / --key: api key
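To illustrate what the separator option means in practice, here is a minimal sketch of reading a semicolon-separated csv with the standard csv module. The `read_rows` helper and the sample columns (`product`, `amount`) are made up for the example and are not taken from run_python.py or the real input file.

```python
import csv
import io


def read_rows(text: str, separator: str = ";") -> list[dict[str, str]]:
    """Hypothetical helper: parse csv text using the given separator."""
    reader = csv.DictReader(io.StringIO(text), delimiter=separator)
    return list(reader)


# Illustrative data only; the real input columns are not documented here.
sample = "product;amount\nwidget;3\n"
rows = read_rows(sample)
print(rows)
```

With a different separator, for example `-s ,`, the same helper would be called as `read_rows(text, separator=",")`.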
with default settings, run_python.py saves the result csv in the data folder
example output: data/sales_report_output_20220919_020001.csv
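The example output name suggests a timestamp in `YYYYMMDD_HHMMSS` form appended to the file name. A sketch of how such a name could be built is shown here; the `default_output_path` helper and the rule that `_input` is replaced by `_output` are assumptions inferred from the example names, not confirmed behaviour of run_python.py.

```python
from datetime import datetime
from pathlib import Path


def default_output_path(input_path: str, now: datetime) -> str:
    """Hypothetical helper: derive a timestamped output path next to the input."""
    source = Path(input_path)
    # Assumption: a trailing "_input" in the file name is swapped for "_output".
    stem = source.stem.removesuffix("_input")
    stamp = now.strftime("%Y%m%d_%H%M%S")
    return source.with_name(f"{stem}_output_{stamp}.csv").as_posix()


print(default_output_path("data/sales_report_input.csv", datetime(2022, 9, 19, 2, 0, 1)))
# prints data/sales_report_output_20220919_020001.csv
```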