GaussianMixtureSummary

class pyspark.ml.clustering.GaussianMixtureSummary(java_obj: Optional[JavaObject] = None)

Gaussian mixture clustering results for a given model.

Attributes

cluster

DataFrame of predicted cluster centers for each training data point.

clusterSizes

Size of (number of data points in) each cluster.

featuresCol

Name for column of features in predictions.

k

The number of clusters the model was trained with.

logLikelihood

Total log-likelihood for this model on the given data.

numIter

Number of iterations.

predictionCol

Name for column of predicted clusters in predictions.

predictions

DataFrame produced by the model’s transform method.

probability

DataFrame of probabilities of each cluster for each training data point.

probabilityCol

Name for column of predicted probability of each cluster in predictions.

Attributes Documentation

cluster

DataFrame of predicted cluster centers for each training data point.

clusterSizes

Size of (number of data points in) each cluster.

featuresCol

Name for column of features in predictions.

k

The number of clusters the model was trained with.

logLikelihood

Total log-likelihood for this model on the given data.

numIter

Number of iterations.

predictionCol

Name for column of predicted clusters in predictions.

predictions

DataFrame produced by the model’s transform method.

probability

DataFrame of probabilities of each cluster for each training data point.

probabilityCol

Name for column of predicted probability of each cluster in predictions.