-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Lake][Fetching vs Joining] Fetch available data from subgraph rather than joining w/ sql. #989
Comments
Actually, I couldn't get what the main motivation is. |
Because it feels like the previous work wasn't quite complete. I tagged it as low priority. |
I am now reviewing this in issue #1000 and it's coming up again. The This exists in a few different places, and it gets run every time we process an event... notice how silly of a join it is too.. it's basically a configuration (not some special, unique data) |
This is now being tracked in #1299 and this will be closed. Please reopen as we address backlog. |
Background / motivation
It's easier to just fetch the data already from subgraph rather than perform a join (more costly too).
These, shouldn't have to be obtain through a join with the predictions table.
{etl_bronze_pdr_predictions_table_name}.pair as pair,
{etl_bronze_pdr_predictions_table_name}.timeframe as timeframe,
{etl_bronze_pdr_predictions_table_name}.source as source,
The issue here too, is that in SQL there is 1 slot event being joined <= N prediction events. Again, costly. This is being done as a left join, but it would be good to check the results of
bronze_pdr_slots.py
as a way to verify this.TODOs / DoD
Tasks
The text was updated successfully, but these errors were encountered: