pyspark.RDD.keys

RDD.keys() → pyspark.rdd.RDD[K]

Return an RDD with the keys of each tuple.

Examples

>>> m = sc.parallelize([(1, 2), (3, 4)]).keys()
>>> m.collect()
[1, 3]