A cluster is a collection of one or more VMs that work together
Master Node/ Name Node
Coordination (Driver Program)
Worker Node/ Data Node
Performs actual processing
Workers have Executor which are responsible for running the task
Each Worker node can have multiple executors (Depends on core/ memory configuration)
One worker can be assigned more than one task for execution
The tasks and processed results are stored in workers memory