pyspark.sql.DataFrameStatFunctions.corr¶
-
DataFrameStatFunctions.
corr
(col1: str, col2: str, method: Optional[str] = None) → float¶ Calculates the correlation of two columns of a
DataFrame
as a double value. Currently only supports the Pearson Correlation Coefficient.DataFrame.corr()
andDataFrameStatFunctions.corr()
are aliases of each other.- Parameters
- col1str
The name of the first column
- col2str
The name of the second column
- methodstr, optional
The correlation method. Currently only supports “pearson”