I am running a 300 MB file through some transformations, and the job takes 30 minutes.
I am using PySpark.
Most of that time seems to be spent while the log shows "storage.BlockManager: Removing RDD 30". The job is taking too long.
My cluster has 4 nodes with 16 cores in total and 8 GB of memory per node.
I am using all of the available memory for this job.
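
To put those numbers in perspective, here is a rough back-of-the-envelope sketch in plain Python (no Spark needed). It assumes the 16 cores are spread evenly across the 4 nodes and that the input is split evenly across cores; the variable names are only for illustration and are not taken from my actual job.

```python
# Figures taken from the description above
file_size_mb = 300
nodes = 4
total_cores = 16
memory_per_node_gb = 8

# Assumption: cores are spread evenly across the nodes
cores_per_node = total_cores // nodes

# Total memory across the whole cluster
total_memory_gb = nodes * memory_per_node_gb

# Assumption: the input is split evenly, one slice per core
data_per_core_mb = file_size_mb / total_cores

print(cores_per_node, total_memory_gb, data_per_core_mb)
```

So each core would only need to handle a small slice of the file, and the cluster memory is far larger than the input, which is why 30 minutes seems too long to me.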