Run the project
Run PySpark
Navigate to the PySpark directory
cd pyspark
Generate dummy data
The dummy data will be generated in the database of the active node.
python seed_products.py
Execute PySpark functionality
This process retrieves all data from the node, converts it into a PySpark DataFrame, and performs various analyses.
python get_products_paginated.py