
Run the project

Run PySpark

cd pyspark

Generate dummy data

This script generates dummy data in the database of the active node.

python seed_products.py

Execute PySpark functionality

This script retrieves all data from the node, converts it into a PySpark DataFrame, and runs several analyses on it.

python get_products_paginated.py
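
The retrieve-then-convert flow described above can be sketched roughly as follows. This is a minimal illustration, not the contents of get_products_paginated.py. The names `iter_pages`, `fetch_all`, `to_spark_dataframe`, and `fake_fetch_page`, the page size, and the `id`/`name`/`price` fields are all assumptions made for the example. The real script's node client and schema may differ.

```python
from typing import Callable, Dict, Iterator, List

Row = Dict[str, object]
Page = List[Row]


def iter_pages(fetch_page: Callable[[int, int], Page], page_size: int = 100) -> Iterator[Page]:
    """Yield pages from a paginated source until it is exhausted."""
    page = 0
    while True:
        rows = fetch_page(page, page_size)
        if not rows:
            break  # empty page: nothing left to read
        yield rows
        if len(rows) < page_size:
            break  # short page: this was the last one
        page += 1


def fetch_all(fetch_page: Callable[[int, int], Page], page_size: int = 100) -> Page:
    """Collect every page into a single list of rows."""
    rows: Page = []
    for page_rows in iter_pages(fetch_page, page_size):
        rows.extend(page_rows)
    return rows


def to_spark_dataframe(rows: Page):
    """Convert a non-empty list of dict rows into a PySpark DataFrame."""
    # Imported lazily so the pagination helpers work without Spark installed.
    from pyspark.sql import Row as SparkRow, SparkSession

    spark = SparkSession.builder.appName("products-sketch").getOrCreate()
    # Schema is inferred from the rows, so the list must not be empty.
    return spark.createDataFrame([SparkRow(**r) for r in rows])


def fake_fetch_page(page: int, page_size: int) -> Page:
    """Hypothetical stand-in for the node's paginated endpoint (250 fake products)."""
    data = [{"id": i, "name": f"product-{i}", "price": 10.0 + i} for i in range(250)]
    start = page * page_size
    return data[start:start + page_size]
```

With PySpark installed, `to_spark_dataframe(fetch_all(fake_fetch_page))` would return a DataFrame that the usual DataFrame operations (for example `df.describe()`) can analyze. Replacing `fake_fetch_page` with a function that calls the real node keeps the rest of the flow unchanged.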