Mean fails to compute for very large column of pyarrow type #11113
Thanks for your report. For context: we are basically using
under the hood to compute the mean, which also fails. I agree this is not great; I'll check if and how we can address this.
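As a general illustration of how a grouped mean can be assembled from simpler aggregations across partitions, here is a minimal sketch in plain Python. It is an assumption made for illustration, not a statement of what Dask or pyarrow call internally; the helper names `partition_sum_count` and `combine_means` and the sample data are hypothetical.

```python
from collections import defaultdict


def partition_sum_count(rows):
    """Return {key: (sum, count)} for one partition of (key, value) rows."""
    acc = defaultdict(lambda: [0, 0])
    for key, value in rows:
        acc[key][0] += value  # running sum for this group
        acc[key][1] += 1      # running count for this group
    return {k: (s, c) for k, (s, c) in acc.items()}


def combine_means(partials):
    """Merge per-partition (sum, count) pairs and divide once at the end."""
    total = defaultdict(lambda: [0, 0])
    for partial in partials:
        for key, (s, c) in partial.items():
            total[key][0] += s
            total[key][1] += c
    return {k: s / c for k, (s, c) in total.items()}


# Two hypothetical partitions of (group, value) rows.
partitions = [
    [("a", 1), ("b", 10), ("a", 3)],
    [("a", 5), ("b", 20)],
]
means = combine_means(partition_sum_count(p) for p in partitions)
print(means)
```

In this scheme the mean depends on an intermediate sum, so if the sum step fails for a given dtype, the mean fails with it. Python integers are arbitrary precision, so this sketch cannot reproduce a fixed-width overflow; whether overflow is the cause in this issue is not established here.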
phofl added the dataframe label and removed the needs triage label on May 14, 2024
It's not possible to calculate the grouped mean for a very large column of pyarrow type.
It fails to compute with the legacy DataFrame:
I tried using the latest DataFrame API but this operation does not seem to be supported yet:
Pandas seems to execute this workflow without issues:
Environment: