pyspark.streaming.DStream.countByValue

DStream.countByValue() → pyspark.streaming.dstream.DStream[Tuple[K, int]]

Return a new DStream in which each RDD contains the counts of each distinct value in each RDD of this DStream.