ADD_JARS doubt.!!!!!

ADD_JARS doubt.!!!!!

Hi,

What does the parameter add_jars in the sc constructor exactly do? Does it add all the files to the classpath of worker JVM?

I have some text files that I read data from while processing.
Can I add it in add jars so that it doesn't have to read it again from HDFS and read from local (Something like Distributed Cache in Hadoop Mapreduce). What path would I read it from?

Re: ADD_JARS doubt.!!!!!

I would not recommend putting your text files in via ADD_JARS. The better thing to do is to put those files in HDFS or locally on your driver server, load them into memory and then use Spark's broadcast variable concept to spread the data out across the cluster.

What does the parameter add_jars in the sc constructor exactly do? Does it add all the files to the classpath of worker JVM?

I have some text files that I read data from while processing.
Can I add it in add jars so that it doesn't have to read it again from HDFS and read from local (Something like Distributed Cache in Hadoop Mapreduce). What path would I read it from?