pyspark.streaming.DStream.combineByKey
DStream.combineByKey(createCombiner: Callable[[V], U], mergeValue: Callable[[U, V], U], mergeCombiners: Callable[[U, U], U], numPartitions: Optional[int] = None) → pyspark.streaming.dstream.DStream[Tuple[K, U]]

Return a new DStream by applying combineByKey to each RDD.
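The following is a minimal sketch of how the three callbacks cooperate within a single batch, written in plain Python so it runs without a Spark installation. The helper `combine_by_key_local`, the sample `batch`, and the per-key mean use case are illustrative assumptions, not part of PySpark. The sketch models the usual combineByKey behaviour: `createCombiner` is applied to the first value seen for a key in a partition, `mergeValue` folds in later values from the same partition, and `mergeCombiners` merges the per-partition results.

```python
from typing import Callable, Dict, Hashable, Iterable, List, Tuple, TypeVar

K = TypeVar("K", bound=Hashable)
V = TypeVar("V")
U = TypeVar("U")


def combine_by_key_local(
    partitions: Iterable[Iterable[Tuple[K, V]]],
    create_combiner: Callable[[V], U],
    merge_value: Callable[[U, V], U],
    merge_combiners: Callable[[U, U], U],
) -> Dict[K, U]:
    """Hypothetical pure-Python model of combineByKey over one batch."""
    # Step 1: build one combiner per key inside each partition.
    per_partition: List[Dict[K, U]] = []
    for part in partitions:
        acc: Dict[K, U] = {}
        for key, value in part:
            if key in acc:
                acc[key] = merge_value(acc[key], value)
            else:
                acc[key] = create_combiner(value)
        per_partition.append(acc)

    # Step 2: merge combiners for the same key across partitions.
    result: Dict[K, U] = {}
    for acc in per_partition:
        for key, combiner in acc.items():
            if key in result:
                result[key] = merge_combiners(result[key], combiner)
            else:
                result[key] = combiner
    return result


# Callbacks that track a running (sum, count) per key.
def create_combiner(v: float) -> Tuple[float, int]:
    return (v, 1)


def merge_value(c: Tuple[float, int], v: float) -> Tuple[float, int]:
    return (c[0] + v, c[1] + 1)


def merge_combiners(a: Tuple[float, int], b: Tuple[float, int]) -> Tuple[float, int]:
    return (a[0] + b[0], a[1] + b[1])


# One batch, split into two partitions of (key, value) pairs (made-up data).
batch = [
    [("a", 1.0), ("b", 4.0), ("a", 3.0)],
    [("a", 5.0), ("b", 6.0)],
]

totals = combine_by_key_local(batch, create_combiner, merge_value, merge_combiners)
means = {key: total / count for key, (total, count) in totals.items()}

# On a real DStream of (key, value) pairs, here assumed to be named `pairs`,
# the same callbacks would be passed as:
#   pairs.combineByKey(create_combiner, merge_value, merge_combiners)
```

Keeping a (sum, count) pair as the combiner type U, rather than a running mean, lets `mergeCombiners` stay a simple element-wise addition, which is why U is allowed to differ from the value type V.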