feat(web): Uniform random distribution during shuffle #19902

Pascal-So · 2025-07-12T21:14:55Z

Description

In the slide show settings, the user can select if a slide show should run in forward, reverse, or shuffle order. This PR modifies the shuffle algorithm to improve the randomness, while not changing anything else.

Previously, the next photo was chosen in three steps: select a month uniformly at random from all months that contain photos, then select a day uniformly at random within that month, then select a random photo within that day. The problem is that this does not result in a uniform distribution over all photos.

Assume that a user has 10 photos in their library, one in January and the other nine in February. Then the January photo will on average be shown on 50% of the slides in the slide show, even though we would expect only 10%.
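The arithmetic behind this example can be checked with a small sketch. The `Library` type and the function name are hypothetical and only model photo counts per day per month; they are not part of the Immich code base.

```typescript
// Hypothetical model: library[m][d] = number of photos on day d of month m.
// Assumes every month has at least one day and every day at least one photo.
type Library = number[][];

// Probability of each photo being picked by the old scheme:
// uniform month, then uniform day in that month, then uniform photo in that day.
function oldSchemeProbabilities(library: Library): number[] {
  const probabilities: number[] = [];
  for (const days of library) {
    for (const photosInDay of days) {
      const p = 1 / library.length / days.length / photosInDay;
      for (let i = 0; i < photosInDay; i++) {
        probabilities.push(p);
      }
    }
  }
  return probabilities;
}

// One photo in January, nine photos on a single day in February.
const example: Library = [[1], [9]];
const probs = oldSchemeProbabilities(example);
console.log(probs[0]); // January photo: 0.5
console.log(probs[1]); // each February photo: 1/18, roughly 0.056
```

With a uniform distribution every entry would instead be 1/10.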

This PR achieves a uniform distribution by first selecting a month, weighted by the number of photos in that month, followed by a weighted selection of the day within that month, and then a uniform selection among the photos within that day.
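To see why this is uniform: for a photo in a month with m photos, on a day with d photos, in a library of N photos, the probability is (m/N) · (d/m) · (1/d) = 1/N. A minimal sketch of the weighted selection step follows. The name `pickWeightedIndex` and its signature are assumptions for illustration and may differ from the `weightedRandomSample` utility added in this PR.

```typescript
// Hypothetical helper: returns index i with probability weights[i] / sum(weights).
// `r` is expected to lie in [0, 1), for example a value from Math.random().
// Assumes non-negative weights with a positive total.
function pickWeightedIndex(weights: number[], r: number): number {
  const total = weights.reduce((sum, w) => sum + w, 0);
  let target = r * total;
  for (let i = 0; i < weights.length; i++) {
    if (target < weights[i]) {
      return i;
    }
    target -= weights[i];
  }
  // Only reachable through floating-point rounding; fall back to the last index.
  return weights.length - 1;
}

// Months weighted by photo count: January has 1 photo, February has 9.
const monthIndex = pickWeightedIndex([1, 9], Math.random());
console.log(monthIndex); // 0 about 10% of the time, 1 about 90% of the time
```

Passing the random value in as a parameter, rather than calling `Math.random()` inside, keeps the helper deterministic and easy to test.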

This change was discussed yesterday in the "contributing" channel on discord.

Note: I set up a quick benchmark here to compare the performance of a linear scan vs. binary search to find the matching element in the prefix sum. The runtime complexity is O(n) either way due to the assembly of the prefix sum. We could theoretically cache the prefix sum over the asset counts of the months, but since n is so small I'd argue that it's not worth it. Overall I prefer the linear scan version due to the shorter and simpler code.
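For reference, the two lookup strategies compared in that note can be sketched as below. The function names are hypothetical and this is not the benchmarked code; it only illustrates that both searches return the same index, and that building the prefix sum is already an O(n) pass.

```typescript
// Inclusive prefix sums: prefix[i] = weights[0] + ... + weights[i]. This is O(n).
function prefixSums(weights: number[]): number[] {
  const prefix: number[] = [];
  let running = 0;
  for (const w of weights) {
    running += w;
    prefix.push(running);
  }
  return prefix;
}

// Linear scan: first index whose prefix sum is greater than target.
// Assumes a non-empty prefix array; clamps to the last index if target is too large.
function findLinear(prefix: number[], target: number): number {
  for (let i = 0; i < prefix.length; i++) {
    if (target < prefix[i]) {
      return i;
    }
  }
  return prefix.length - 1;
}

// Binary search for the same index, O(log n) once the prefix sums exist.
function findBinary(prefix: number[], target: number): number {
  let lo = 0;
  let hi = prefix.length - 1;
  while (lo < hi) {
    const mid = (lo + hi) >> 1;
    if (target < prefix[mid]) {
      hi = mid;
    } else {
      lo = mid + 1;
    }
  }
  return lo;
}

const prefix = prefixSums([3, 0, 5, 2]); // [3, 3, 8, 10]
console.log(findLinear(prefix, 3), findBinary(prefix, 3)); // 2 2
```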

How Has This Been Tested?

Testing was done manually. Without this change I always start to notice duplicates after a short while in the slide show. This has not been the case with this change added.

Note that I also added some basic tests for the sampling utility function.

Checklist:

I have performed a self-review of my own code
I have made corresponding changes to the documentation if applicable
I have no unrelated changes in the PR.
I have confirmed that any new dependencies are strictly necessary.
I have written tests for new code (if applicable)
I have followed naming conventions/patterns in the surrounding code
All code in src/services/ uses repositories implementations for database calls, filesystem operations, etc.
All code in src/repositories/ is pretty basic/simple and does not have any immich specific logic (that belongs in src/services/)

github-actions · 2025-07-12T21:15:05Z

Label error. Requires exactly 1 of: changelog:.*. Found: 🖥️web. A maintainer will add the required label.

Pascal-So · 2025-07-12T21:26:41Z

Hmm, I intuitively set the title of the PR to "feat(web)". Now that I think about it a bit more, would you consider this more of a fix than a feat? I'm not sure, open to your opinion.

midzelis · 2025-07-13T01:23:42Z

web/src/lib/managers/timeline-manager/timeline-manager.svelte.ts

@@ -444,8 +449,12 @@ export class TimelineManager {
  }

  async getRandomMonthGroup() {
-    const random = Math.floor(Math.random() * this.months.length);
-    const month = this.months[random];
+    const weights: number[] = this.months.map((month) => month.assetsCount);


It may be a bit faster just to pass in the total number of assets: (this.assetCount) and then pick one at random: Math.random() * this.assetCount -- and then, just look up the asset by index - something like:

const randomAssetIndex = Math.random() * this.assetCount;
let accumulatedOffset = 0;
for (const month of this.months) {
  if (randomAssetIndex < accumulatedOffset + month.assetsCount) {
    await this.loadMonthGroup(month.yearMonth, { cancelable: false });
    // repeat this search-pattern using month.dayGroups
    break; // stop as soon as the month has been found
  } else {
    accumulatedOffset += month.assetsCount;
  }
}

It's probably super minimal, but this approach avoids having to populate the weights array up front. For example, if you have 300K assets and the randomly chosen asset index is 42, this PR will still generate a weights array for all the months. The example above, by contrast, does not use an array of weights at all, just a single variable for the accumulated offset, and it stops as soon as the month is found. (Additionally, if implemented directly in manager.getRandomAsset(), you could drop the code in manager.getRandomDayGroup and daygroup.getRandomAsset().)

Pascal-So · 2025-07-13

That's a good idea, thanks! I changed it to your suggested version.
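A self-contained sketch of the single-pass idea, extended to the day level, might look like the following. The `MonthGroupLike` and `DayGroupLike` shapes and both function names are simplified assumptions for illustration. The real timeline manager classes hold more state and load month data asynchronously, which this sketch leaves out.

```typescript
// Hypothetical, simplified shapes; not the actual Immich types.
interface DayGroupLike {
  assets: string[];
}
interface MonthGroupLike {
  assetsCount: number;
  dayGroups: DayGroupLike[];
}

// Finds the asset at a global index using one running offset and no weights array.
function getAssetByIndex(months: MonthGroupLike[], index: number): string | undefined {
  let offset = 0;
  for (const month of months) {
    if (index < offset + month.assetsCount) {
      // The real manager would load the month here before reading its day groups.
      for (const day of month.dayGroups) {
        if (index < offset + day.assets.length) {
          return day.assets[index - offset];
        }
        offset += day.assets.length;
      }
      return undefined; // assetsCount did not match the day groups
    }
    offset += month.assetsCount;
  }
  return undefined; // index out of range
}

// Thin random wrapper; `random` is injectable so the lookup itself stays testable.
function getRandomAssetSketch(
  months: MonthGroupLike[],
  random: () => number = Math.random,
): string | undefined {
  const total = months.reduce((sum, month) => sum + month.assetsCount, 0);
  return getAssetByIndex(months, Math.floor(random() * total));
}

const sampleMonths: MonthGroupLike[] = [
  { assetsCount: 1, dayGroups: [{ assets: ["a"] }] },
  { assetsCount: 3, dayGroups: [{ assets: ["b", "c"] }, { assets: ["d"] }] },
];
console.log(getAssetByIndex(sampleMonths, 3)); // "d"
```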

Pascal-So · 2025-07-13T12:15:50Z

About testing: I was thinking about splitting the code up into two methods on the timeline manager: getNthAsset which we could test, and then make getRandomAsset just a thin untested wrapper.

The downside that I could see with that is that getNthAsset might not have a clear semantic meaning, since from looking at DayGroup.sortAssets I saw that there is the concept of AssetOrder which opens up the question if such a getNthAsset method should change behaviour depending on the order. For the shuffle we don't actually care about the order so in a test I'd just ensure that every asset gets visited if we call getNthAsset with every index so that test would not break either way.

Do you think it would be sufficient if we just add a comment to the getNthAsset method warning any possible future users that the behaviour is dependent on the current AssetOrder?
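The order-independent test described above could be sketched roughly as follows. The `GetNthAsset` signature is an assumption, since the discussion only names the method, and the check assumes asset ids are unique.

```typescript
// Hypothetical lookup signature: maps an index in [0, total) to an asset id.
type GetNthAsset = (index: number) => string | undefined;

// True if calling the lookup with every index yields every expected id exactly once,
// regardless of the order in which the ids come back.
function visitsEveryAssetOnce(getNthAsset: GetNthAsset, expectedIds: string[]): boolean {
  const seen = new Set<string>();
  for (let i = 0; i < expectedIds.length; i++) {
    const id = getNthAsset(i);
    if (id === undefined || seen.has(id)) {
      return false;
    }
    seen.add(id);
  }
  return expectedIds.every((id) => seen.has(id));
}

const ids = ["a", "b", "c"];
const ascending: GetNthAsset = (i) => ids[i];
const descending: GetNthAsset = (i) => ids[ids.length - 1 - i];

console.log(visitsEveryAssetOnce(ascending, ids)); // true
console.log(visitsEveryAssetOnce(descending, ids)); // true
```

Because the check only looks at the set of visited ids, it passes for either ordering, which matches the point that such a test would not break if the lookup followed AssetOrder.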

Pascal-So added 3 commits July 12, 2025 18:18

add weightedRandomSample tests

9cf55aa

add weightedRandomSample implementation

bd03ee8

use weighted sampling for random month and day groups

a5009eb

github-actions bot added the 🖥️web label Jul 12, 2025

fix tests

f11ea17

fix lint

c1fb4a2

midzelis reviewed Jul 13, 2025

perform lookup in one single pass

f46bdd2
