pyspark.sql.DataFrame.filter¶
-
DataFrame.filter(condition: ColumnOrName) → DataFrame¶ Filters rows using the given condition.
where()is an alias forfilter().- Parameters
- condition
Columnor str a
Columnoftypes.BooleanTypeor a string of SQL expression.
- condition
Examples
>>> df.filter(df.age > 3).collect() [Row(age=5, name='Bob')] >>> df.where(df.age == 2).collect() [Row(age=2, name='Alice')]
>>> df.filter("age > 3").collect() [Row(age=5, name='Bob')] >>> df.where("age = 2").collect() [Row(age=2, name='Alice')]