A stream of entities is called a discretized stream, or DStream
DStream = Sequence of RDDs
Entities in a Stream are grouped into batches
Each batch = 1 RDD
Batch is formed based on Batch Interval
All entities received within one batch interval form one RDD
Within a DStream, Spark performs operations on the individual RDDs
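
The batching idea above can be sketched in plain Python, without Spark. This is only an illustration of the concept, not how Spark implements it: the function name group_into_batches, the sample events, and the 2-second interval are all made up for the example. Each inner list stands in for one RDD, and the per-batch count stands in for an operation Spark would run on each RDD.

```python
from collections import defaultdict


def group_into_batches(events, batch_interval):
    """Group (timestamp_seconds, value) pairs into consecutive batches.

    Hypothetical helper for illustration only. Each returned list plays
    the role of one RDD in a DStream.
    """
    by_index = defaultdict(list)
    for timestamp, value in events:
        # An event belongs to the batch whose interval contains its timestamp.
        by_index[int(timestamp // batch_interval)].append(value)
    if not by_index:
        return []
    # Emit one batch per interval, including empty ones, so the result
    # is a gap-free sequence of batches ordered by time.
    return [by_index.get(i, []) for i in range(max(by_index) + 1)]


# Made-up sample data: (arrival time in seconds, entity)
events = [(0.5, "a"), (1.2, "b"), (2.1, "c"), (6.3, "d"), (7.0, "e")]
batches = group_into_batches(events, batch_interval=2)

# An operation applied to each batch individually, as Spark does per RDD.
counts = [len(batch) for batch in batches]

print(batches)
print(counts)
```

Note that the interval from 4 to 6 seconds received nothing, so the sketch emits an empty batch for it rather than skipping it.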
We can use Netcat to create a streaming data source
nc -lk 9999
-l: Listen on port
-k: Keep active (Remain listening)
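
A minimal sketch of how a Spark job might read from the Netcat source above, assuming a PySpark installation that still ships the DStream API (pyspark.streaming). The function name stream_from_socket, the application name, the 5-second batch interval, and the local[2] master are assumptions chosen for the example, not values from these notes.

```python
def stream_from_socket(host="localhost", port=9999, batch_interval=5):
    """Print each batch of lines received from a TCP text source.

    Hypothetical helper for illustration. Blocks until the streaming
    job is stopped.
    """
    # Imported inside the function so that defining it does not require
    # Spark to be installed; calling it does.
    from pyspark import SparkContext
    from pyspark.streaming import StreamingContext

    # local[2]: at least two threads, one for the receiver and one for processing.
    sc = SparkContext("local[2]", "NetcatDStreamSketch")
    # batch_interval (seconds) decides how the stream is cut into RDDs.
    ssc = StreamingContext(sc, batch_interval)

    # DStream of text lines read from the socket that Netcat is listening on.
    lines = ssc.socketTextStream(host, port)

    # Per-batch operations: show how many lines each RDD holds, then the lines.
    lines.count().pprint()
    lines.pprint()

    ssc.start()
    ssc.awaitTermination()
```

To try it, start nc -lk 9999 in one terminal, call stream_from_socket() in another, and type lines into the Netcat terminal. Each batch interval should then print the lines received during that interval.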