pyspark.RDD.sampleStdev

RDD.sampleStdev() → NumberOrArray

Compute the sample standard deviation of this RDD’s elements. The sample version divides the sum of squared deviations by N-1 instead of N, which corrects for bias in the variance estimate.

Examples

>>> sc.parallelize([1, 2, 3]).sampleStdev()
1.0
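
The N-1 versus N distinction can be illustrated without a Spark cluster. The following is a minimal pure-Python sketch, not Spark's implementation: the helper names `sample_stdev` and `population_stdev` are hypothetical, and returning NaN for fewer than two elements is a choice made for this sketch only.

```python
import math
import statistics


def sample_stdev(values):
    """Sample standard deviation: squared deviations divided by N-1.

    Hypothetical helper for illustration; not part of PySpark.
    """
    n = len(values)
    if n < 2:
        # Sketch-only choice: N-1 would be zero or negative here.
        return float("nan")
    mean = sum(values) / n
    squared_deviations = sum((x - mean) ** 2 for x in values)
    return math.sqrt(squared_deviations / (n - 1))


def population_stdev(values):
    """Population standard deviation: squared deviations divided by N."""
    n = len(values)
    mean = sum(values) / n
    squared_deviations = sum((x - mean) ** 2 for x in values)
    return math.sqrt(squared_deviations / n)


data = [1, 2, 3]
sample_result = sample_stdev(data)          # 1.0, matching the example above
population_result = population_stdev(data)  # smaller, about 0.816

# Cross-check against the standard library's equivalents.
assert math.isclose(sample_result, statistics.stdev(data))
assert math.isclose(population_result, statistics.pstdev(data))
```

For the same data the sample value is always at least as large as the population value, and the gap shrinks as N grows. In PySpark, the population counterpart is `RDD.stdev()`.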