all: integrate snapshot into pathdb #29754

Fynnss · 2024-05-11T06:56:56Z

This PR implements a way to fetch account/storage directly from pathdb as an alternative to snapshot for better robustness and simplicity, then reducing data storage space.

Major changes:

add latestaccount/lateststorage/destructset cache to pathdb
use db.seekLT to query the leaf node of account/storage in pathdb
use deleteRange to destruct the storage trie

Status: Testing

Fynnss · 2024-05-11T07:02:10Z

Related issue. #28617

karalabe · 2024-05-13T09:31:52Z

I'm not entirely sure how this will help us. If I understand it correctly, you used range queries on the database to look up path-based leaves "directly". I'm unsude how fast that is (it might actually be reasonable for 1 value lookups).

That said, the primary purpose of snapshots is to serve snap sync, which needs a way to very quickly and cheaply iterate a range of accounts (e.g. 10K accounts), without having to go through the trie. This work doesn't seem to address that at all, so at teh end of the day, it seems we're still needing the snapshots just as before?

holiman · 2024-05-13T10:05:13Z

which needs a way to very quickly and cheaply iterate a range of accounts (e.g. 10K accounts), without having to go through the trie.

One thing that would be interesting to see, is

On a machine which is synced, and has a path-database and also a generated snapshot,
Pick a random key-segment,
- Read 10K accounts from the snapshot
- Read 10K accounts from the path-database
- Compare times
Repeat a few more times.

Fynnss · 2024-05-14T02:56:43Z

I'm not entirely sure how this will help us. If I understand it correctly, you used range queries on the database to look up path-based leaves "directly". I'm unsude how fast that is (it might actually be reasonable for 1 value lookups).

That said, the primary purpose of snapshots is to serve snap sync, which needs a way to very quickly and cheaply iterate a range of accounts (e.g. 10K accounts), without having to go through the trie. This work doesn't seem to address that at all, so at teh end of the day, it seems we're still needing the snapshots just as before?

Thanks for your reply.

db.seek will be about 10% slower than get because get can use bloom filters to accelerate
For snap sync, we can also iterate out all leaf nodes from pathdb. But It may slower than from snapshot. We can test it as @holiman comment above.

The reasons for the PR:

Better robustness and simplicity(as @rjl493456442 mentioned in Integrating state snapshot into pathdb #28617), no need to maintain two sets of similar structures, snapshot & pathdb (difflayer/disklayer/journal/recovery)
Reduce disk storage and db overhead(compaction). snapshop's data is a subset of pathdb.
Improve caching efficiency. The trie leaf node will be cached when execution.

In the end, this PR may not solve all the problems and hope your suggestions.

holiman · 2024-05-14T12:46:20Z

This seems to be based on seek to a key. The key used is the full 32-byte key, e.g. 0xaabbccdd....xx. But, the actual leaf stored on disk is shorter than the full path, since the last extension node contains the last part of the path. So let's say it's 0xaabbcc.
Now, if you seek to 0xaabbccddeeff, you wil step past 0xaabbcc. And wind up on the next leaf. So I don't get it, how can this work, as is?

As I understand it, this can work if you implement a special new seek-method which positions itself on step back, or somehow make the iterator step back on step.

Please explain if I've misunderstood something

Fynnss · 2024-05-15T01:23:27Z

Now, if you seek to 0xaabbccddeeff, you wil step past 0xaabbcc. And wind up on the next leaf. So I don't get it, how can this work, as is?

The db.seekLT will return the last key that less than the given key. So the 0xaabbcc will be returned in your case.

Please explain if I've misunderstood something

And there is a special case which the embedded leaf node in fullnode. The PR handles this special case by redundantly storing a copy of the embedded short node in the database.

https://github.com/ethereum/go-ethereum/pull/29754/files#diff-9fcc04b5701b122917f702ceb9f56bb5c190bcfd1f6728a319e171e819952147R116-R123

rjl493456442 · 2024-05-15T05:38:21Z

According to the leveldb description,

	// Seek moves the iterator to the first key/value pair whose key is greater
	// than or equal to the given key.
	// It returns whether such pair exist.
	//
	// It is safe to modify the contents of the argument after Seek returns.
	Seek(key []byte) bool

Iterator.Seek will return the first key/value pair whose key is greater than or equal to the given key.

Fynnss · 2024-05-15T05:45:21Z

According to the leveldb description,
	// Seek moves the iterator to the first key/value pair whose key is greater
	// than or equal to the given key.
	// It returns whether such pair exist.
	//
	// It is safe to modify the contents of the argument after Seek returns.
	Seek(key []byte) bool
Iterator.Seek will return the first key/value pair whose key is greater than or equal to the given key.

Oh, I made a mistake and you're right. The default behavior of db.seek is seek greater than or equal to in leveldb. In the PR, we use the seekLT in pebble which impl seeking the last key lower than the given key.

// SeekLT moves the iterator to the last key/value pair whose key is less than
// the given key. Returns true if the iterator is pointing at a valid entry and
// false otherwise.
func (i *Iterator) SeekLT(key []byte) bool {
	return i.SeekLTWithLimit(key, nil) == IterValid
}

And in pathdb, it is impossible to have a pathkey that is the same as a 32-bit hash, so the last key lower than it must be what we are looking for.

core,trie,triedb: add getAccount/Storage in pathdb

462e2bf

Fynnss requested review from karalabe, holiman and rjl493456442 as code owners May 11, 2024 06:56

Fynnss force-pushed the integrate_snapshot_into_pathdb branch from 4da023c to 22cee6d Compare May 11, 2024 06:57

core,trie: use direct get for statedb

f9fb58e

Fynnss force-pushed the integrate_snapshot_into_pathdb branch from 22cee6d to f9fb58e Compare May 13, 2024 09:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

all: integrate snapshot into pathdb #29754

all: integrate snapshot into pathdb #29754

Fynnss commented May 11, 2024 •

edited

Fynnss commented May 11, 2024 •

edited

karalabe commented May 13, 2024

holiman commented May 13, 2024

Fynnss commented May 14, 2024 •

edited

holiman commented May 14, 2024

Fynnss commented May 15, 2024 •

edited

rjl493456442 commented May 15, 2024

Fynnss commented May 15, 2024 •

edited

all: integrate snapshot into pathdb #29754

Are you sure you want to change the base?

all: integrate snapshot into pathdb #29754

Conversation

Fynnss commented May 11, 2024 • edited

Fynnss commented May 11, 2024 • edited

karalabe commented May 13, 2024

holiman commented May 13, 2024

Fynnss commented May 14, 2024 • edited

holiman commented May 14, 2024

Fynnss commented May 15, 2024 • edited

rjl493456442 commented May 15, 2024

Fynnss commented May 15, 2024 • edited

Fynnss commented May 11, 2024 •

edited

Fynnss commented May 11, 2024 •

edited

Fynnss commented May 14, 2024 •

edited

Fynnss commented May 15, 2024 •

edited

Fynnss commented May 15, 2024 •

edited