This is a top project of using PySpark to conduct "small data" analysis. Env: Linux: Fedora in VM PySpark: 3.3 Conda: Python 3.7