Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Expose the internal column data so that entire data can be copied directly #221

Open
wants to merge 1 commit into
base: async-await
Choose a base branch
from

Conversation

saherawat
Copy link

This change has helped us copying the data into apache arrow structures directly and it has improved the latency significantly (6 times faster) as compared to copying individual elements.

@saherawat
Copy link
Author

@suharev7 can you please take a look at this PR?

@saherawat
Copy link
Author

@suharev7 can you please take a look at this PR. Please help us get this checked in.

@suharev7
Copy link
Owner

suharev7 commented Aug 9, 2024

I just wonder how are you going to user ArcColumnData.

@saherawat
Copy link
Author

We are using apache arrow and with this change we are able to directly copy the memory blocks into Apache arrow column's data.

@suharev7
Copy link
Owner

suharev7 commented Aug 9, 2024

I would like to be able to change the implementation of ColumnData trait in the future, so I don't want to expose this trait.
I can add a new method that will consume Column and return Vec<T> without memory coping (when it's possible, for example, for primitive date types). Might this be suitable for you?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants