[Lake] Issue-481: Adding Payouts to the Data Factory #535

kdetry · 2024-01-15T12:03:39Z

Fixes #481

Changes proposed in this PR:

Added Payouts schema
Created fetching process for payouts
Added fn to fetch payouts from subgraph
Added mocks and tests

…ssing

…f no timestamp is present

…tly.

… how transform handles datetime

…end and test_load_filtered.

…eng tests that are enabled are passing.

…an is passing.

…_df() to verify that its working as intended. I believe kraken data is returning null at the moment.

* Towards #232: Refactoring towards ppss.yaml part 3/3 * move everything in model_eng/ to data_eng/ * Fix #352: [SW eng] High DRY violation in test_predictoor_agent.py <> test_predictoor_agent3.py * Deprecate backend-dev.md (long obsolete), macos.md (obsolete due to vps), and envvars.md (obsolete because of ppss.yaml). * Rename BaseConfig to web3_pp.py and make it yaml-based * Move scripts into util/, incorporate them into pdr cli, some refactoring. * revamp READMEs for cli. And, tighten up text for getting OCEAN & ROSE * Deprecated ADDRESS_FILE and RPC_URL envvars. * deprecate Predictoor approach 2. Pita to maintain Co-authored-by: trizin <[email protected]>

* Update check script CI * Update cron topup * Workflow dispatch * Nevermind, revert previous commit * Run on push to test * Pass ppss.web3_pp instead of web3_config * Don't run on push

…il; get linters to pass

* Add main.py back * Black * Linter * Linter * Remove "switch back to version v0.1.1" * Black

into yaml-cli2

How fixed: use previous ascynio version. Calina: Asyncio has some known issues, per their changelog. Namely issues with fixture handling etc., which I believe causes the warnings and test skips in our runs. They recommend using the previous version until they are fixed. It is also why my setup didn't spew up any warnings, my asyncio version was 21.1. https://pytest-asyncio.readthedocs.io/en/latest/reference/changelog.html

* Fix web3_config.rpc_url in test_send_encrypted_tx * Add conftest.py for system tests * Add system test for get_traction_info * Add system test for get_predictions_info * Add system test for get_predictoors_info * Add "PDRS" argument to _ArgParser_ST_END_PQDIR_NETWORK_PPSS_PDRS class * Fix feed.exchange type conversion in publish_assets.py * Add print statement for payout completion * Add system level test for pdr topup * Add conditional break for testing via env * Add conditional break for testing via env * Black * Add test for pdr rose payout system * System level test pdr check network * System level test pdr claim OCEAN * System level test pdr trueval agent * Remove unused patchs * Fix wrong import position in conftest.py * Remove unused imports * System level test for pdr dfbuyer * System level tests for pdr trader * System level tests for publisher * Rename publisher test file * Add conditional break in take_step() method * Update dftool->pdr names in system tests * Refactor test_trader_agent_system.py * Add mock fixtures for SubgraphFeed and PredictoorContract * Add system tests for predictoor * Black * Refactor system test files - linter fixes * Linter fixes * Black * Add missing mock * Add savefig assertion in test_topup * Update VPS configuration to use development entry * Patch verify_feed_dependencies * Refactor test_predictoor_system.py to use a common test function * Refactor trader approach tests to improve DRY * Black * Indent * Ditch NETWORK_OVERRIDE * Black * Remove unused imports

* Add publisher feeds filtering.

…537)

codeclimate · 2024-01-16T12:53:40Z

Code Climate has analyzed commit 2ffb490 and detected 3 issues on this pull request.

Here's the issue category breakdown:

Category	Count
Complexity	3

The test coverage on the diff in this pull request is 98.4% (50% is the threshold).

This pull request will bring the total coverage in the repository to 95.2% (0.1% change).

View more on Code Climate.

idiom-bytes · 2024-01-17T20:52:58Z

pdr_backend/subgraph/subgraph_payout.py

+                }
+            """
+            % (start_ts, end_ts, asset_id)
+        )


I'm not sure why this has to be looped... it should already be a list[str] of addresses that can be embedded into the query.

We should be able to use a single where clause for subgraph, which may perhaps reduce the search/query time (and it's easier to read). Example:

That generates the following query for multiple assets

query { predictPayouts ( first: 10 skip: 1 where: { { timestamp_gte: 1622547000, timestamp_lte: 1622548800, prediction_contains: ['0x18f54cc21b7a2fdd011bea06bba7801b280e3151', '0x33334cc21b7a2fdd011bea06bba7801b280e3151'] } } ) { id timestamp payout prediction { user { id } slot { id predictContract{ id token{ name } } } } } }

as defined in this test example

I took another look at your comment, @idiom-bytes. The "aaset_ids" argument is a list of strings that represent the IDs of the contracts we want to retrieve. Unfortunately, the "predictPayouts" query's "where" clause does not have a contract filter. We could use a nested query, like the one below, but our subgraph system does not allow it:

where: { prediction_: { slot_: { predictContract_in: ["0xfeed1", "0xfeed2"] } } }

However, the "prediction_contains" argument searches for text within prediction IDs. It only allows strings. The structure of a prediction ID is as follows:

{contract address}-{slot}-{user}

Therefore, if the contract address is present in the prediction ID, we can retrieve it.

Looked into this and I agree with @kdetry, we can't just pass the list of addresses with the way subgraph it's structured right now

idiom-bytes · 2024-01-17T21:08:26Z

pdr_backend/subgraph/subgraph_payout.py

+                "token": payout["prediction"]["slot"]["predictContract"]["token"][
+                    "name"
+                ],
+                "slot": int(payout["id"].split("-")[1]),


Please make sure that we're retrieving all required data to generate the insights we want. Payout event is where a lot of the stake/revenue/prediction values are provided. Example:

predictedValue

prediction.slot.revenue

prediction.stake

prediction.slot.roundSumStakesUp

prediction.slot.roundSumStakes

So, we need to fetch this additional data such that we can update our local records.

We can then use the data from the payout event to update our lake/ pdr_predictions and pdr_slots tables.

query{ predictPayouts(...){ prediction{ slot { id predictContract { id } slot status revenue roundSumStakesUp roundSumStakes } user { id } stake } payout predictedValue trueValue timestamp } }``` I have tried to put some of this in the `lake.html` file, but that's not an exhaustive set of requirements. All data relating to `stake/revenue/prediction` outcome should be retrieved. To better understand what each contract event provides, you can check the contract for the event, or the subgraph event handler to see what data the subgraph updates/yields as a result of the contract event.

#447
I have followed the BRONZE_FOR_PAYOUTS table on the epic so I didn't add these values.

I am going to add them asap.

KatunaNorbert · 2024-01-18T12:36:11Z

pdr_backend/lake/gql_data_factory.py

@@ -43,6 +47,10 @@ def __init__(self, ppss: PPSS):
        )
        contract_list = [f.lower() for f in contract_list]

+        # For debugging
+        # t_contract_list = [f.lower() for f in contract_list]
+        # contract_list = [t_contract_list[0], t_contract_list[1]]


Do we need to keep this commented lines?

kdetry · 2024-01-19T11:11:05Z

I close this PR, it is replaced with the following ne:
#559

idiom-bytes and others added 30 commits November 21, 2023 23:18

First stab at porting various functions over to polars... lots to go

c5987e1

TOHLCV df initialization and type checking added. 2/8 pdutil tests pa…

c1fedf7

…ssing

black formatted

2a0a94b

Fixing initialization and improving test. datetime is not generated i…

a579c4d

…f no timestamp is present

Restructured pdutil a bit to reduce DRY and utilize schema more stric…

c5efe9f

…tly.

test initializing the df and datetime

6c1f3ec

improve init test to show exception without timestamp

b7a60b2

fixing test_concat such that it verifies that schemas must match, and…

754aaee

… how transform handles datetime

saving parquet enforces datetime and transform. updated test_load_app…

fce6718

…end and test_load_filtered.

black formatted

f9e6cb8

data_eng tests are passing

d0dccaf

initial data_eng tests are passing w/ black, mypy, and pylint.

026ad39

_merge_parquet_dfs updated and create_xy test_1 is passing. all data_…

2006f1a

…eng tests that are enabled are passing.

2exch_2coins_2signals is passing

933f1d8

Added polars support for fill_nans, has_nans, and create_xy__handle_n…

6264c61

…an is passing.

Starting to deprecate references to pandas and csv in data_factory.

1b5b665

Black formatted

a511bd5

Deprecated csv logic in DataFactory and created tests around get_hist…

444862b

…_df() to verify that its working as intended. I believe kraken data is returning null at the moment.

All tests should be passing.

6666618

Update CI to use pdr instead of scripts/ (#399)

80faf81

* Update check script CI * Update cron topup * Workflow dispatch * Nevermind, revert previous commit * Run on push to test * Pass ppss.web3_pp instead of web3_config * Don't run on push

Replace long try/except with _safe*() function; rename pdutil -> plut…

25bfa3e

…il; get linters to pass

Update entrypoint script to use pdr cli (#406)

148cb94

Add main.py back (#404)

6d7661a

* Add main.py back * Black * Linter * Linter * Remove "switch back to version v0.1.1" * Black

Merge from issue388-refactor-csvs-pandas, plus many changes

43a43df

Merge branch 'yaml-cli2' of https://github.com/oceanprotocol/pdr-backend

fa3b672

into yaml-cli2

make black happy

f7bbca4

small bug fix

27ac78e

many bug fixes. Still >=1 left

97370c7

fix warning

036ca3e

calina-c and others added 3 commits January 13, 2024 15:47

issue-481: Add Payouts to the Data Factory

2a36373

kdetry changed the base branch from main to yaml-cli2 January 15, 2024 12:03

kdetry requested a review from idiom-bytes January 15, 2024 12:03

kdetry marked this pull request as draft January 15, 2024 12:05

kdetry and others added 6 commits January 15, 2024 15:11

black fix

479ae8f

Merge branch 'yaml-cli2' into issue481

a8b460d

issue481: del_network_override is removed from tests

d0edb9d

Adds incremental waiting for subgraph tries. (#534)

11f05d2

Add publisher feeds filtering. (#533)

9402e5f

* Add publisher feeds filtering.

Merge branch 'yaml-cli2' into issue481

a16d859

kdetry marked this pull request as ready for review January 15, 2024 13:50

trizin and others added 4 commits January 15, 2024 18:36

Pass the ppss.web3_pp instead of web3_config into WrappedToken class (#…

995a103

…537)

Fix #542: Add code climate usage to developer flow READMEs

9422c82

Merge branch 'yaml-cli2' into issue481

dd184e2

replaced development network with sapphire-testnet

2ffb490

Base automatically changed from yaml-cli2 to main January 16, 2024 16:51

kdetry added 4 commits January 17, 2024 14:43

merge with main and conflict fix

6f05436

ohlcv_data_factory file conflict fix

0b3a11e

import mock_payouts issue

f5143e6

black fix

34c0e10

idiom-bytes reviewed Jan 17, 2024

View reviewed changes

KatunaNorbert reviewed Jan 18, 2024

View reviewed changes

requested changes

abaae89

kdetry closed this Jan 19, 2024

kdetry deleted the issue481 branch February 12, 2024 10:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Lake] Issue-481: Adding Payouts to the Data Factory #535

[Lake] Issue-481: Adding Payouts to the Data Factory #535

kdetry commented Jan 15, 2024

codeclimate bot commented Jan 16, 2024

idiom-bytes Jan 17, 2024 •

edited

Loading

kdetry Jan 18, 2024

KatunaNorbert Jan 18, 2024

idiom-bytes Jan 17, 2024 •

edited

Loading

kdetry Jan 18, 2024

KatunaNorbert Jan 18, 2024

kdetry commented Jan 19, 2024

[Lake] Issue-481: Adding Payouts to the Data Factory #535

[Lake] Issue-481: Adding Payouts to the Data Factory #535

Conversation

kdetry commented Jan 15, 2024

codeclimate bot commented Jan 16, 2024

idiom-bytes Jan 17, 2024 • edited Loading

Choose a reason for hiding this comment

kdetry Jan 18, 2024

Choose a reason for hiding this comment

KatunaNorbert Jan 18, 2024

Choose a reason for hiding this comment

idiom-bytes Jan 17, 2024 • edited Loading

Choose a reason for hiding this comment

kdetry Jan 18, 2024

Choose a reason for hiding this comment

KatunaNorbert Jan 18, 2024

Choose a reason for hiding this comment

kdetry commented Jan 19, 2024

idiom-bytes Jan 17, 2024 •

edited

Loading

idiom-bytes Jan 17, 2024 •

edited

Loading