pyspark.streaming.DStream.partitionBy

DStream.partitionBy(numPartitions: int, partitionFunc: Callable[[K], int] = <function portable_hash>) → pyspark.streaming.dstream.DStream[Tuple[K, V]]

Return a copy of the DStream in which each RDD are partitioned using the specified partitioner.