pyspark.sql.DataFrame.localCheckpoint¶
-
DataFrame.
localCheckpoint
(eager: bool = True) → pyspark.sql.dataframe.DataFrame¶ Returns a locally checkpointed version of this
DataFrame
. Checkpointing can be used to truncate the logical plan of thisDataFrame
, which is especially useful in iterative algorithms where the plan may grow exponentially. Local checkpoints are stored in the executors using the caching subsystem and therefore they are not reliable.- Parameters
- eagerbool, optional
Whether to checkpoint this
DataFrame
immediately
Notes
This API is experimental.