EIP-7594: Passive sampling #3717

ppopth · 2024-04-23T14:08:11Z

Since a node doesn't know when it should request the samples from its peers, it's better to passively receive them from the subnets, so that the node doesn't have to guess when to request the samples. This method is called "passive sampling"

The former method, now called "active sampling", will be used only if the node wants to do sampling in the past slots or the passive sampling fails.

Since a node doesn't know when it should requests the samples from its peers, it's better to passively receive them from the subnets. This method is called "passive sampling" The former method, now called "active sampling", will be used only if the node wants to do sampling in the past slots or the passive sampling fails.

nisdas · 2024-04-23T15:16:12Z

specs/_features/eip7594/das-core.md

+
+### Passive sampling
+
+A few moments before each slot, the node SHOULD be subscribed to `SAMPLES_PER_SLOT` column subnets to receive the samples from their peers. A node utilizes `get_custody_columns` helper to determine which column subnets to be subscribed to. This should be easy to do because the node already has a diverse set of peers.


This seems to defeat the purpose of only subscribing to gossip from custodied subnets. If you subscribe to an extra SAMPLES_PER_SLOT column subnets before each slot, it is equivalent to increasing the size of the total subnets custodied. I don't think subscribing/unsubscribing quickly makes much of a difference here

So, how about defining "passive sampling" as receiving samples from the custodied subnets (instead of extra subnets) and decide that the data is available if it receives such samples. This hasn't been specified anywhere in the spec yet.

This sounds like the wrong term, I don't think being part of a gossip subnet can be thought as sampling . The level of amplification is 8x on a subnet vs a simple req/resp.

The level of amplification is 8x on a subnet vs a simple req/resp.

It doesn't really have to be 8x. You can just connects to a single node as a mesh peer, so the bandwidth used will be just the same as req/resp.

This sounds like the wrong term, I don't think being part of a gossip subnet can be thought as sampling

I disagree on this. Sampling means take some portion of something. Subscribing to some columns/subnets means taking some columns of all the columns, so I think the term still makes sense.

So, how about defining "passive sampling" as receiving samples from the custodied subnets (instead of extra subnets) and decide that the data is available if it receives such samples. This hasn't been specified anywhere in the spec yet.

I would like to change my mind on this. I think it's okay to be subscribed to extra subnets.

it is equivalent to increasing the size of the total subnets custodied

It's not really equivalent because you don't keep the past samples for the extra subnets. You just get them and throw them away.

In terms of the bandwidth usage, as I mentioned in the previous comment, you can reduce the mesh degree to 1 so that you use as much bandwidth as req/resp.

This requires us to be able to have dynamic mesh sizes for separate topics, currently in go-libp2p-pubsub and possibly in other language implementations all topics have the same mesh size. Also having a mesh size of 1 can be problematic for the general network as you would have increased latency on the propagation of a message. A remote peer will not know if a connected peer has a mesh size of 8 or 1. On some paths, data columns might take a lot longer to be propagated

Maybe for this usecase you would want a new protocol message ? Where instead of random gossip, a peer simply returns the most recent seen message ids for a particular topic.

Also having a mesh size of 1 can be problematic for the general network as you would have increased latency on the propagation of a message

I think that's okay because, even though the message is delayed or not received at all, the passive sampling acts as only a complement to active sampling.

Notice that, without passive sampling, the sampling node has to wait until the sampling time to request the samples. With passive sampling (even with a mesh size of 1), there will be a very likely chance that it will get the samples before the sampling time.

Maybe for this usecase you would want a new protocol message ? Where instead of random gossip, a peer simply returns the most recent seen message ids for a particular topic.

In fact, I have an upgrade to GossipSub in mind which will probably help on this issue. I will create a PR on that in a few days.

Maybe for this usecase you would want a new protocol message ? Where instead of random gossip, a peer simply returns the most recent seen message ids for a particular topic.

In fact, I have an upgrade to GossipSub in mind which will probably help on this issue. I will create a PR on that in a few days.

Here it is libp2p/specs#617

fradamt · 2024-04-24T08:11:08Z

I think we should just increase the custody requirement, though not all the way to SAMPLES_PER_SLOT. Custodying 6 to 8 subnets is already sufficient for the kind of security guarantees we need, and anything more than that is cheaper to do through peer sampling because there's no gossip overhead

ppopth · 2024-05-07T08:16:51Z

@fradamt

I think we should just increase the custody requirement, though not all the way to SAMPLES_PER_SLOT. Custodying 6 to 8 subnets is already sufficient for the kind of security guarantees we need, and anything more than that is cheaper to do through peer sampling because there's no gossip overhead

I don't quite follow this. Is this off-topic? The purpose of this PR is that the nodes don't have to guess when the samples arrive at their peers.

ppopth mentioned this pull request Apr 23, 2024

Network shards (Attnet Revamp + DAS Distribution Columns) #3623

Closed

nisdas reviewed Apr 23, 2024

View reviewed changes

hwwhww added the EIP-7594 PeerDAS label Apr 23, 2024

This was referenced May 21, 2024

p2p: Deprecate TTFB, RESP_TIMEOUT, introduce rate limiting recommenda… #3767

Open

[GossipSub 1.3] Topic observation libp2p/specs#617

Open

ppopth mentioned this pull request Jun 11, 2024

PeerDAS Breakout Room #1 ethereum/pm#1059

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EIP-7594: Passive sampling #3717

EIP-7594: Passive sampling #3717

ppopth commented Apr 23, 2024 •

edited

nisdas Apr 23, 2024

ppopth Apr 24, 2024

nisdas Apr 25, 2024

ppopth May 7, 2024

ppopth May 7, 2024

nisdas May 22, 2024

nisdas May 22, 2024

ppopth May 27, 2024 •

edited

ppopth May 27, 2024

ppopth May 28, 2024

fradamt commented Apr 24, 2024

ppopth commented May 7, 2024 •

edited


		### Passive sampling

		A few moments before each slot, the node SHOULD be subscribed to `SAMPLES_PER_SLOT` column subnets to receive the samples from their peers. A node utilizes `get_custody_columns` helper to determine which column subnets to be subscribed to. This should be easy to do because the node already has a diverse set of peers.

EIP-7594: Passive sampling #3717

Are you sure you want to change the base?

EIP-7594: Passive sampling #3717

Conversation

ppopth commented Apr 23, 2024 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ppopth May 27, 2024 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fradamt commented Apr 24, 2024

ppopth commented May 7, 2024 • edited

ppopth commented Apr 23, 2024 •

edited

ppopth May 27, 2024 •

edited

ppopth commented May 7, 2024 •

edited