pyspark.SparkContext.binaryRecords

SparkContext.binaryRecords(path: str, recordLength: int) → pyspark.rdd.RDD[bytes]

Load data from a flat binary file, assuming each record is a set of numbers with the specified numerical format (see ByteBuffer), and the number of bytes per record is constant.

Parameters

path : str
    Directory to the input data files
recordLength : int
    The length, in bytes, at which to split the records
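
Examples

The following is a minimal sketch rather than an official example. It assumes a hypothetical record layout of two little-endian 32-bit integers followed by one 64-bit float (16 bytes per record); the names RECORD_FORMAT, write_records, decode_record and run_example are illustrative and not part of PySpark. Each element of the returned RDD is a bytes object of recordLength bytes, which is decoded here with the standard struct module.

```python
import os
import struct
import tempfile

# Hypothetical record layout for illustration: two little-endian int32 values
# followed by one float64, i.e. 4 + 4 + 8 = 16 bytes per record.
RECORD_FORMAT = "<iid"
RECORD_LENGTH = struct.calcsize(RECORD_FORMAT)


def write_records(path, rows):
    """Write each (int, int, float) row as one fixed-length binary record."""
    with open(path, "wb") as f:
        for row in rows:
            f.write(struct.pack(RECORD_FORMAT, *row))


def decode_record(record):
    """Turn one RECORD_LENGTH-byte record back into a tuple of numbers."""
    return struct.unpack(RECORD_FORMAT, record)


def run_example():
    """Round-trip a small file through binaryRecords (requires PySpark)."""
    from pyspark import SparkContext

    rows = [(1, 10, 0.5), (2, 20, 1.5), (3, 30, 2.5)]
    sc = SparkContext.getOrCreate()
    with tempfile.TemporaryDirectory() as d:
        write_records(os.path.join(d, "part-0.bin"), rows)
        # Each RDD element is a bytes object of exactly RECORD_LENGTH bytes.
        decoded = sc.binaryRecords(d, RECORD_LENGTH).map(decode_record).collect()
    return sorted(decoded)
```

Calling run_example() in an environment with PySpark installed should return the three rows that were written. The record length passed to binaryRecords must match the layout used when the file was written, since the method itself has no knowledge of the fields inside a record.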
