pyspark.SparkContext.addPyFile

SparkContext.addPyFile(path: str) → None

Add a .py or .zip dependency for all tasks executed on this SparkContext in the future. The path can be a local file, a file in HDFS (or another Hadoop-supported filesystem), or an HTTP, HTTPS, or FTP URI.

Notes

A path can be added only once. Subsequent additions of the same path are ignored.
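
Examples

The following is a minimal sketch of one way to ship a helper module to executors with `addPyFile`; it is not taken from the Spark sources. The module name `my_helpers`, the archive name `deps.zip`, the functions `add_one`, `build_dependency_zip`, and `run_job`, the `local[2]` master, and the application name are all hypothetical choices made for illustration. The sketch assumes PySpark is installed and that no other SparkContext is active in the process.

```python
import os
import tempfile
import zipfile


def build_dependency_zip(directory=None):
    """Write a tiny helper module and package it as a .zip archive.

    All file and module names here are hypothetical, chosen for illustration.
    """
    directory = directory or tempfile.mkdtemp()
    module_path = os.path.join(directory, "my_helpers.py")
    with open(module_path, "w") as f:
        f.write("def add_one(x):\n    return x + 1\n")

    zip_path = os.path.join(directory, "deps.zip")
    with zipfile.ZipFile(zip_path, "w") as zf:
        # Store the module at the archive root so `import my_helpers` resolves.
        zf.write(module_path, arcname="my_helpers.py")
    return zip_path


def run_job(zip_path):
    """Register the archive with a local SparkContext and use it in a task."""
    # Imported lazily so the packaging step above works without Spark.
    from pyspark import SparkContext

    sc = SparkContext("local[2]", "addPyFile-sketch")
    try:
        # Tasks launched after this call can import from the archive.
        sc.addPyFile(zip_path)

        def apply_helper(x):
            # Import inside the task so it is resolved where the task runs.
            import my_helpers
            return my_helpers.add_one(x)

        return sc.parallelize([1, 2, 3]).map(apply_helper).collect()
    finally:
        sc.stop()
```

On a machine with PySpark available, `run_job(build_dependency_zip())` should return each input incremented by one. Importing the helper inside the task function, rather than at the top of the driver script, is a deliberate choice in this sketch: it keeps the import from running before the dependency has been registered.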