[Enh]: Spark Expr missing methods #1714

FBruzzesi · 2025-01-03T21:40:35Z

lucas-nelson-uiuc · 2025-01-04T21:57:26Z

Working on implementing scalar methods like any and all - should be ready to push later today.

Planning on working on the following methods - want to first check if my thought process is "correct".

arg_true
drop_nulls
filter
gather_every
sort
unique

Thinking of implementing two patterns for these methods:

# if predicate-based (e.g. drop_nulls, which uses predicate function `F.isnull`)
def method(self) -> Self:
        def _method(_input: Column) -> Column:
            from pyspark.sql import functions as F  # noqa: N812

            return F.explode(F.filter(F.array(_input), <predicate_func>))

        return self._from_call(_method, "method", returns_scalar=False)


# if not predicate-based (e.g. unique, which uses array function `F.array_distinct`)
def method(self) -> Self:
        def _method(_input: Column) -> Column:
            from pyspark.sql import functions as F  # noqa: N812

            return F.explode(<array_func>(F.array(_input)))

        return self._from_call(_method, "method", returns_scalar=False)

Not sure how expensive doing this is or if it collides with future API developments. Lmk what you think

MarcoGorelli · 2025-01-07T08:05:06Z

thanks @lucas-nelson-uiuc for your efforts here

can we leave the row-order dependent ones out for now, make sure we've got everything done from the others first? there's some broader api decisions we need to make for those

FBruzzesi added enhancement New feature or request help wanted Extra attention is needed good first issue Good for newcomers, but anyone is welcome to submit a pull request! labels Jan 3, 2025

Dhanunjaya-Elluri mentioned this issue Jan 4, 2025

feat: add few missing SparkLikeExpr methods #1721

Merged

10 tasks

lucas-nelson-uiuc mentioned this issue Jan 4, 2025

feat: add more Spark Expressions #1724

Open

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Enh]: Spark Expr missing methods #1714

[Enh]: Spark Expr missing methods #1714

FBruzzesi commented Jan 3, 2025 •

edited

Loading

lucas-nelson-uiuc commented Jan 4, 2025 •

edited

Loading

MarcoGorelli commented Jan 7, 2025

[Enh]: Spark Expr missing methods #1714

[Enh]: Spark Expr missing methods #1714

Comments

FBruzzesi commented Jan 3, 2025 • edited Loading

lucas-nelson-uiuc commented Jan 4, 2025 • edited Loading

MarcoGorelli commented Jan 7, 2025

FBruzzesi commented Jan 3, 2025 •

edited

Loading

lucas-nelson-uiuc commented Jan 4, 2025 •

edited

Loading