pyspark.sql.DataFrame.withColumns

DataFrame.withColumns(*colsMap: Dict[str, pyspark.sql.column.Column]) → pyspark.sql.dataframe.DataFrame

Returns a new DataFrame by adding multiple columns or replacing the existing columns that has the same names.

The colsMap is a map of column name and column, the column must only refer to attributes supplied by this Dataset. It is an error to add columns that refer to some other Dataset.

Added support for multiple columns adding

Parameters
colsMapdict

a dict of column name and Column. Currently, only single map is supported.

Examples

>>> df.withColumns({'age2': df.age + 2, 'age3': df.age + 3}).collect()
[Row(age=2, name='Alice', age2=4, age3=5), Row(age=5, name='Bob', age2=7, age3=8)]