Improve performance of excluded files filter #5157

SimplyDanny · 2023-08-05T20:04:21Z

The current algorithm is like "collect all included files and subtract all excluded files". Collecting all included and all excluded files relies on the file system. This can become slow when the patterns used to exclude files resolve to a large number of files.

The new approach only collects all lintable files and checks them against the exclude patterns. This can be done by in-memory string-regex-match and does therefore not require file system accesses.

The most critical part is the conversion of glob patterns to regular expressions. I might have missed cases.

Fixes #5018.

SwiftLintBot · 2023-08-05T20:28:27Z

	17 Messages
📖	Linting Aerial with this PR took 1.12s vs 1.13s on main (0% faster)
📖	Linting Alamofire with this PR took 1.61s vs 1.64s on main (1% faster)
📖	Linting Brave with this PR took 9.35s vs 9.37s on main (0% faster)
📖	Linting DuckDuckGo with this PR took 4.74s vs 4.99s on main (5% faster)
📖	Linting Firefox with this PR took 11.72s vs 11.82s on main (0% faster)
📖	Linting Kickstarter with this PR took 11.37s vs 11.35s on main (0% slower)
📖	Linting Moya with this PR took 0.63s vs 0.64s on main (1% faster)
📖	Linting NetNewsWire with this PR took 3.34s vs 3.31s on main (0% slower)
📖	Linting Nimble with this PR took 0.92s vs 0.92s on main (0% slower)
📖	Linting PocketCasts with this PR took 9.17s vs 9.17s on main (0% slower)
📖	Linting Quick with this PR took 0.42s vs 0.42s on main (0% slower)
📖	Linting Realm with this PR took 5.84s vs 5.73s on main (1% slower)
📖	Linting Sourcery with this PR took 2.84s vs 2.86s on main (0% faster)
📖	Linting Swift with this PR took 5.61s vs 5.54s on main (1% slower)
📖	Linting VLC with this PR took 1.56s vs 1.54s on main (1% slower)
📖	Linting Wire with this PR took 20.87s vs 20.84s on main (0% slower)
📖	Linting WordPress with this PR took 13.73s vs 13.65s on main (0% slower)

Generated by 🚫 Danger

ileitch · 2023-11-02T14:01:53Z

Periphery had a similar performance issue not long ago, and I noticed there wasn't a solid glob to regex implementation, so I ported Python's fnmatch to Swift: https://github.com/ileitch/swift-filename-matcher. It might be useful here too.

jpsim · 2023-11-02T17:07:33Z

Last time we tried to speed this up, it caused some slight differences in the result of what was matched vs not, so please be super careful here.

SimplyDanny · 2023-11-02T21:17:08Z

At the moment, I'm rather concerned here that normal runs without any excludes and includes seem to become much slower sometimes.

Periphery had a similar performance issue not long ago, and I noticed there wasn't a solid glob to regex implementation, so I ported Python's fnmatch to Swift: https://github.com/ileitch/swift-filename-matcher. It might be useful here too.

This is a very helpful tip. I don't want to invent a half-backed version myself. Thanks!

JaviSoto · 2024-03-11T16:42:15Z

Any chance this can land? 🙏

SimplyDanny · 2024-03-16T13:48:29Z

Any chance this can land? 🙏

This is a critical change that needs thorough testing. Unfortunately, I'm lacking own projects with nifty included and excluded specifications.

@JaviSoto: In case this change took effect in your projects, I'd appreciate your feedback.

The current algorithm is like "collect all included files and subtract all excluded files". Collecting all included and all excluded files relies on the file system. This can become slow when the patterns used to exclude files resolve to a large number of files. The new approach only collects all lintable files and checks them against the exclude patterns. This can be done by in-memory string-regex-match and does therefore not require file system accesses.

SimplyDanny requested a review from keith August 5, 2023 20:04

SimplyDanny force-pushed the excluded-performance branch from e99f0a9 to a253a0d Compare August 5, 2023 20:42

SimplyDanny requested a review from jpsim August 13, 2023 18:02

SimplyDanny mentioned this pull request Aug 22, 2023

Excluded files impact the performance of swiftlint #5018

Open

2 tasks

SimplyDanny force-pushed the excluded-performance branch 2 times, most recently from 4094558 to 30d4908 Compare August 30, 2023 06:00

SimplyDanny mentioned this pull request Sep 8, 2023

Running lint without any code changes is still slow (30s vs 40s without cache) #5207

Open

2 tasks

SimplyDanny force-pushed the excluded-performance branch 3 times, most recently from bfd249c to 6a0585a Compare November 20, 2023 22:30

SimplyDanny force-pushed the excluded-performance branch from a4e724b to a95044b Compare January 4, 2024 20:00

SimplyDanny force-pushed the excluded-performance branch from a95044b to ca66b0c Compare January 23, 2024 21:11

SimplyDanny force-pushed the excluded-performance branch from ca66b0c to e5989f1 Compare March 11, 2024 19:29

SimplyDanny added 5 commits March 20, 2024 23:18

Use FilenameMatcher

ede4e12

Fix Bazel tests

2c868bd

Avoid .unique

5b6f077

Fix changelog entry position

dad51eb

SimplyDanny force-pushed the excluded-performance branch from e5989f1 to dad51eb Compare March 20, 2024 22:22

SimplyDanny mentioned this pull request Mar 20, 2024

Glob.resolveGlob() is veeeery sloooow #5501

Open

2 tasks

Accept with files or folders as the last part of the pattern

61acd6d

SimplyDanny force-pushed the excluded-performance branch from 21c0add to 61acd6d Compare March 22, 2024 07:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of excluded files filter #5157

Improve performance of excluded files filter #5157

SimplyDanny commented Aug 5, 2023

SwiftLintBot commented Aug 5, 2023 •

edited

ileitch commented Nov 2, 2023

jpsim commented Nov 2, 2023

SimplyDanny commented Nov 2, 2023

JaviSoto commented Mar 11, 2024

SimplyDanny commented Mar 16, 2024

Improve performance of excluded files filter #5157

Are you sure you want to change the base?

Improve performance of excluded files filter #5157

Conversation

SimplyDanny commented Aug 5, 2023

SwiftLintBot commented Aug 5, 2023 • edited

ileitch commented Nov 2, 2023

jpsim commented Nov 2, 2023

SimplyDanny commented Nov 2, 2023

JaviSoto commented Mar 11, 2024

SimplyDanny commented Mar 16, 2024

SwiftLintBot commented Aug 5, 2023 •

edited