pyspark.pandas.groupby.GroupBy.var

GroupBy.var(ddof: int = 1) → FrameLike

Compute variance of groups, excluding missing values.

Parameters
ddof : int, default 1

Delta Degrees of Freedom. The divisor used in the calculation is N - ddof, where N is the number of non-missing elements in each group.
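
To make the role of the divisor concrete, the following is a minimal plain-Python sketch of the variance of one group's values. It is not the pandas-on-Spark implementation: the helper name `var_with_ddof` is hypothetical, and returning NaN when N - ddof is not positive is an assumption of this sketch.

```python
import math


def var_with_ddof(values, ddof=1):
    """Variance of one group's values, using N - ddof as the divisor.

    Hypothetical helper for illustration only.
    """
    # Exclude missing values (None or NaN) before counting N.
    kept = [
        v for v in values
        if v is not None and not (isinstance(v, float) and math.isnan(v))
    ]
    n = len(kept)
    # Assumption of this sketch: no usable divisor means an undefined result.
    if n - ddof <= 0:
        return float("nan")
    mean = sum(kept) / n
    return sum((v - mean) ** 2 for v in kept) / (n - ddof)


# Three non-missing values, so N = 3.
sample = var_with_ddof([3, 4, None, 5])              # divisor is 3 - 1 = 2
population = var_with_ddof([3, 4, None, 5], ddof=0)  # divisor is 3 - 0 = 3
```

With `ddof=1` the result is the sample variance; with `ddof=0` it is the population variance.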

Examples

>>> df = ps.DataFrame({"A": [1, 2, 1, 2], "B": [True, False, False, True],
...                    "C": [3, 4, 3, 4], "D": ["a", "b", "b", "a"]})
>>> df.groupby("A").var()
     B    C
A
1  0.5  0.0
2  0.5  0.0
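
The figures in this output can be checked by hand. The sketch below recomputes them with the standard library only, assuming that boolean values are treated as 0 and 1 and that the string column D is left out because it is not numeric; both assumptions are inferred from the output above rather than from the implementation.

```python
from collections import defaultdict
from statistics import variance  # sample variance, i.e. divisor N - 1

# Rows of the example frame as (A, B, C); column D is omitted (assumed non-numeric).
rows = [(1, True, 3), (2, False, 4), (1, False, 3), (2, True, 4)]

# Collect the values of B and C for each key in A.
groups = defaultdict(lambda: {"B": [], "C": []})
for a, b, c in rows:
    groups[a]["B"].append(int(b))  # assumption: True/False counted as 1/0
    groups[a]["C"].append(c)

# Per-group, per-column sample variance, matching the default ddof=1.
result = {
    a: {col: float(variance(vals)) for col, vals in cols.items()}
    for a, cols in sorted(groups.items())
}
```

Each group holds one `True` and one `False` in B, which gives 0.5, and two identical values in C, which gives 0.0.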