Training Set

class databricks.feature_store.training_set.TrainingSet(feature_spec: databricks.feature_store.entities.feature_spec.FeatureSpec, df: pyspark.sql.dataframe.DataFrame, labels: List[str], feature_table_metadata_map: Dict[str, databricks.feature_store.entities.feature_table.FeatureTable], feature_table_data_map: Dict[str, pyspark.sql.dataframe.DataFrame], uc_function_infos: Dict[str, databricks.feature_store.information_schema_spark_client.FunctionInfo])

Bases: object

Class that defines TrainingSet objects.

Note

The TrainingSet constructor should not be called directly. Instead, call FeatureStoreClient.create_training_set.

load_df() → pyspark.sql.dataframe.DataFrame

Load a DataFrame.

Return a DataFrame for training.

The returned DataFrame has columns specified in the feature_spec and labels parameters provided in FeatureStoreClient.create_training_set.

Returns:A DataFrame for training