
Journal fixed lru cache #498

Open: wants to merge 2 commits into main

Conversation

@najeal (Contributor) commented Feb 17, 2025

Closes #452

This changes the fixed-length item journal to use an LRU cache for its open blobs.

codecov bot commented Feb 18, 2025

Codecov Report

Attention: Patch coverage is 96.15385% with 6 lines in your changes missing coverage. Please review.

Project coverage is 94.37%. Comparing base (0d7677e) to head (ed141e0).

Files with missing lines Patch % Lines
storage/src/journal/fixed.rs 96.10% 6 Missing ⚠️
@@            Coverage Diff             @@
##             main     #498      +/-   ##
==========================================
+ Coverage   94.34%   94.37%   +0.02%     
==========================================
  Files         109      109              
  Lines       30217    30282      +65     
==========================================
+ Hits        28509    28579      +70     
+ Misses       1708     1703       -5     
Files with missing lines Coverage Δ
storage/src/journal/mod.rs 100.00% <ø> (ø)
storage/src/mmr/journaled.rs 99.06% <100.00%> (+<0.01%) ⬆️
storage/src/journal/fixed.rs 98.07% <96.10%> (-0.13%) ⬇️

... and 2 files with indirect coverage changes


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

//blobs: BTreeMap<u64, B>,
min_blob_index: u64,
newest_index_blob: (u64, B),
lru_cache: Arc<tokio::sync::Mutex<LruCache<u64, B>>>,
@roberto-bayardo (Collaborator) commented Feb 18, 2025

Why does this need the Arc<> ? (& Mutex < > ?)

@@ -84,7 +91,10 @@ pub struct Journal<B: Blob, E: Storage<B>, A: Array> {

// Blobs are stored in a BTreeMap to ensure they are always iterated in order of their indices.
Collaborator:
update comment

@@ -24,6 +24,8 @@ futures-util = { workspace = true }
tracing = { workspace = true }
zstd = { workspace = true }
rangemap = "1.5.1"
lru = "0.13.0"
tokio = { workspace = true }
Contributor (author):
I remember the team said we should not import tokio. It is included here only to open the discussion.

Removing the Arc<Mutex<>> implies changing the read(&self) trait method signature to read(&mut self), which creates cascading changes, and I'm not sure that's expected.

Collaborator:
Ah I see, this introduces an interior mutability concern. I think we can just change the API to read(&mut self) for now?

Contributor:

That is correct. We shouldn't be importing tokio here.

We have a few options:

  1. make read(&mut self) (gross)
  2. use a cache that doesn't require a mutable reference (either external crate or write our own)
  3. refactor code to avoid holding cache references over await

If we aren't thoughtful with this cache integration, we could accidentally prohibit concurrent reads (even if we keep read(&self) and use a cache with interior mutability, it may lock on all requests to update the cache state).

The concurrent_lru crate offers something like this (we shouldn't use this crate, but it gives an idea of an interface more compatible with our goals): https://docs.rs/concurrent_lru/latest/concurrent_lru/.
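As a rough illustration of option 2, here is a minimal std-only sketch (not the crate above and not this PR's implementation; `BlobCache`, its capacity, and the `u64` keys are hypothetical) of an LRU map with interior mutability, so callers can keep a `&self` receiver:

```rust
use std::collections::{HashMap, VecDeque};
use std::sync::Mutex;

// Hypothetical sketch: an LRU map behind a std Mutex so lookups can take
// &self. The lock is held only for the map operation itself, never across
// an .await, which avoids needing an async (tokio) mutex.
struct BlobCache<V: Clone> {
    inner: Mutex<LruState<V>>,
    capacity: usize,
}

struct LruState<V> {
    map: HashMap<u64, V>,
    order: VecDeque<u64>, // front = most recently used
}

impl<V: Clone> BlobCache<V> {
    fn new(capacity: usize) -> Self {
        Self {
            inner: Mutex::new(LruState {
                map: HashMap::new(),
                order: VecDeque::new(),
            }),
            capacity,
        }
    }

    // &self receiver: mutation happens through the Mutex (interior mutability).
    fn get(&self, key: u64) -> Option<V> {
        let mut state = self.inner.lock().unwrap();
        if state.map.contains_key(&key) {
            // Move the key to the front to mark it most recently used.
            state.order.retain(|k| *k != key);
            state.order.push_front(key);
            state.map.get(&key).cloned()
        } else {
            None
        }
    }

    fn put(&self, key: u64, value: V) {
        let mut state = self.inner.lock().unwrap();
        state.order.retain(|k| *k != key);
        state.order.push_front(key);
        state.map.insert(key, value);
        // Evict least-recently-used entries beyond capacity.
        while state.map.len() > self.capacity {
            if let Some(evicted) = state.order.pop_back() {
                state.map.remove(&evicted);
            }
        }
    }
}
```

Note this still serializes all readers on a single lock, which is exactly the concurrent-read concern raised above; a sharded or lock-free design (as in concurrent_lru) would be needed for genuinely concurrent reads.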

}
let items_per_blob = self.cfg.items_per_blob;
let blob = self.get_blob(blob_index).await?;
let item_index = item_position % items_per_blob;
Collaborator:
Suggested change
let item_index = item_position % items_per_blob;
let item_index = item_position % self.cfg.items_per_blob;

then drop L309
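For context, the arithmetic in this snippet maps a global item position to a blob and a byte offset within it. A hedged sketch of that mapping (the `locate` helper and the fixed `CHUNK_SIZE` value are hypothetical; only the division/modulo/offset arithmetic mirrors the snippet):

```rust
// Hypothetical helper mirroring the arithmetic in the snippet above:
// a global item position maps to (blob index, item index within the blob,
// byte offset of the item inside that blob).
const CHUNK_SIZE: u64 = 64; // assumed fixed per-item size in bytes

fn locate(item_position: u64, items_per_blob: u64) -> (u64, u64, u64) {
    let blob_index = item_position / items_per_blob;
    let item_index = item_position % items_per_blob;
    let offset = item_index * CHUNK_SIZE;
    (blob_index, item_index, offset)
}
```

For example, with items_per_blob = 100, item 250 lives in blob 2 at item index 50, i.e. byte offset 50 * CHUNK_SIZE.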

let offset = item_index * Self::CHUNK_SIZE as u64;
let mut buf = vec![0u8; Self::CHUNK_SIZE];
blob.read_at(&mut buf, offset).await?;

// Verify integrity
Collaborator:
while you're here can you delete this comment which is simply stating the obvious?

@@ -37,4 +37,6 @@ pub enum Error {
InvalidItem(u64),
#[error("invalid rewind: {0}")]
InvalidRewind(u64),
#[error("invalid index: {0}")]
InvalidIndex(u64),
Collaborator:
I wouldn't expose the fact that we index blobs using integers to the external API like this.

Successfully merging this pull request may close these issues.

[commonware-storage/journal] fixed-length item journal should use LRU cache for open blobs
3 participants