pyspark.SparkContext.binaryRecords
SparkContext.binaryRecords(path: str, recordLength: int) → pyspark.rdd.RDD[bytes]
Load data from a flat binary file, assuming each record is a set of numbers with the specified numerical format (see ByteBuffer), and the number of bytes per record is constant.
- Parameters
- path : str
Directory containing the input data files
- recordLength : int
The length, in bytes, at which to split the records
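
The parameters above can be illustrated with a minimal sketch. It assumes a hypothetical record layout (one little-endian int32 followed by one float64, 12 bytes in total) that is not part of the original documentation. The names `RECORD_FORMAT`, `decode_record`, and `load_records` are made up for this example. The Spark call is wrapped in a function that is not invoked here, so the layout check runs without a Spark installation.

```python
import os
import struct
import tempfile

# Hypothetical layout, chosen for illustration: little-endian int32 + float64.
RECORD_FORMAT = "<id"
RECORD_LENGTH = struct.calcsize(RECORD_FORMAT)  # 4 + 8 = 12 bytes


def decode_record(record: bytes) -> tuple:
    """Unpack one fixed-length record into an (int, float) tuple."""
    return struct.unpack(RECORD_FORMAT, record)


def load_records(sc, directory: str):
    """Read the files in `directory` as fixed-length records and decode them.

    `sc` is assumed to be an existing SparkContext. This function is not
    called below, so the rest of the script does not need Spark.
    """
    return sc.binaryRecords(directory, RECORD_LENGTH).map(decode_record)


# Write a small flat binary file into a temporary directory.
rows = [(1, 0.5), (2, 1.5), (3, 2.5)]
data_dir = tempfile.mkdtemp()
data_file = os.path.join(data_dir, "records.bin")
with open(data_file, "wb") as f:
    for row in rows:
        f.write(struct.pack(RECORD_FORMAT, *row))

# Local check of the layout: split the raw bytes every RECORD_LENGTH bytes,
# which mirrors the role of the recordLength parameter.
with open(data_file, "rb") as f:
    raw = f.read()
chunks = [raw[i:i + RECORD_LENGTH] for i in range(0, len(raw), RECORD_LENGTH)]
decoded = [decode_record(chunk) for chunk in chunks]
print(decoded)
```

With a running SparkContext `sc`, calling `load_records(sc, data_dir).collect()` would be expected to yield the same tuples. Note that `path` is the directory (`data_dir`), not the individual file, and that `recordLength` must match the size of the packed layout exactly, or the records will be split at the wrong offsets.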