Remove AST-node dependency from `FunctionType` and `ClassType` #14087

MichaReiser · 2024-11-04T08:58:32Z

Summary

This PR improves Red Knot's incremental computation by:

- Making `FunctionType::return_type` and `ClassType::explicit_bases` salsa queries. I suspect this is where the main performance improvement comes from. Making these two functions salsa queries prevents the class's or function's node (accessed through `definition.node`) from being marked as a dependency of the calling query. Instead, the caller only depends on whatever the return type or bases are.
- Removing `FunctionType::definition` and `ClassType::definition` and instead using `semantic_index.definition(..)` to look up the type's definition lazily. This ensures that deserializing a type doesn't require deserializing the AST as well.
- Removing `ScopeId::node` and moving it to `Scope::node`, for the same reason as `FunctionType::definition`. Moving the node from the `ScopeId` to the `Scope` allows deserializing the `ScopeId` without deserializing the AST node. This is important because both `FunctionType` and `ClassType` have a `ScopeId` field.
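The lazy-lookup idea in the last two points can be sketched with plain Rust. This is a toy model under stated assumptions, not the code in this PR: `SemanticIndex`, `FunctionNode`, `add_function`, and `return_annotation` here are simplified, hypothetical stand-ins. The point it illustrates is that the type only holds a small ID, and the AST node is fetched from the index when it is actually needed.

```rust
use std::collections::HashMap;

/// Cheap, copyable handle to a scope. It carries no AST data.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
struct ScopeId(u32);

/// Stand-in for an AST node (hypothetical; real node types are much richer).
#[derive(Debug, Clone, PartialEq)]
struct FunctionNode {
    name: String,
    return_annotation: Option<String>,
}

/// Stand-in for the semantic index: the only place that owns AST nodes.
#[derive(Default)]
struct SemanticIndex {
    nodes: HashMap<ScopeId, FunctionNode>,
}

impl SemanticIndex {
    /// Registers a function and returns the ID of its body scope.
    fn add_function(&mut self, name: &str, return_annotation: Option<&str>) -> ScopeId {
        let id = ScopeId(self.nodes.len() as u32);
        self.nodes.insert(
            id,
            FunctionNode {
                name: name.to_string(),
                return_annotation: return_annotation.map(str::to_string),
            },
        );
        id
    }

    /// Looks up the node for a scope. Panics if the scope is unknown.
    fn node(&self, scope: ScopeId) -> &FunctionNode {
        &self.nodes[&scope]
    }
}

/// The type stores only an ID, so copying or (de)serializing it never touches the AST.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
struct FunctionType {
    body_scope: ScopeId,
}

impl FunctionType {
    /// Resolves the node lazily through the index instead of holding it directly.
    fn return_annotation(self, index: &SemanticIndex) -> Option<&str> {
        index.node(self.body_scope).return_annotation.as_deref()
    }
}
```

In this sketch `FunctionType` is four bytes and contains nothing borrowed from the AST, which is the property that makes it cheap to store and, once persistent caching exists, to deserialize on its own.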

Fixes #14054

Test Plan

cargo test

github-actions · 2024-11-04T09:16:02Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

codspeed-hq · 2024-11-04T14:35:57Z

CodSpeed Performance Report

Merging #14087 will improve performances by 12.36%

Comparing `micha/remove-ast-dependencies-from-type` (eb97246) with `main` (9dddd73)

Summary

⚡ 1 improvements
✅ 31 untouched benchmarks

Benchmarks breakdown

| | Benchmark | `main` | `micha/remove-ast-dependencies-from-type` | Change |
|---|---|---|---|---|
| ⚡ | `red_knot_check_file[incremental]` | 4.6 ms | 4.1 ms | +12.36% |

MichaReiser · 2024-11-04T14:38:37Z

Lol what

MichaReiser · 2024-11-04T15:20:27Z

crates/red_knot_python_semantic/src/types.rs

+    /// would depend on the function's AST and rerun for every change in that file.
+    pub fn return_ty(self, db: &'db dyn Db) -> Type<'db> {
+        let scope = self.body_scope(db);
+        let function_stmt_node = scope.node(db).expect_function();


Without this being a salsa query, the calling query suddenly depends on the function's AST (and, therefore, the file).

In the case of our benchmark: `_parser.py` calls `match_to_datetime`, which is defined in `_re.py`. It shouldn't be necessary to recheck `_parser.py` because no type in `_re.py` changed. However, Salsa had to recheck every call site because the `return_type` query accessed `definition.kind(db)`, which is the AST node of `match_to_datetime`, and its AST has changed (not really, but we use `no_eq`...).

I verified that making `return_ty` a salsa query is the reason for the perf improvement by removing the `#[salsa::tracked]`.
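The effect described above can be modelled in a few lines of plain Rust. This is a hand-rolled sketch, not salsa's API: `Db`, `Memo`, `set_callee_source`, `infer_call`, and `parse_return_annotation` are all hypothetical names, and the "parser" is a string split. It assumes the input is always treated as changed on every write (mimicking `no_eq`), while the memoized `return_type` result keeps its old `changed_at` revision when the recomputed value is equal, so the caller is not re-executed.

```rust
/// Memoized query result with the revisions needed for early cutoff.
#[derive(Clone)]
struct Memo {
    value: String,
    changed_at: u64,  // last revision in which the value actually changed
    verified_at: u64, // last revision in which the memo was checked
}

/// Toy database: one input (the callee's source) and two memoized queries.
#[derive(Default)]
struct Db {
    revision: u64,
    callee_source: String,
    callee_changed_at: u64,
    return_type_memo: Option<Memo>,
    call_memo: Option<Memo>,
    return_type_runs: u32,
    call_runs: u32,
}

/// Crude stand-in for parsing: the text between "->" and the next ':'.
fn parse_return_annotation(source: &str) -> String {
    source
        .split("->")
        .nth(1)
        .and_then(|rest| rest.split(':').next())
        .map(|s| s.trim().to_string())
        .unwrap_or_else(|| "Unknown".to_string())
}

impl Db {
    /// Every write counts as a change, even if the text is semantically identical.
    fn set_callee_source(&mut self, text: &str) {
        self.revision += 1;
        self.callee_source = text.to_string();
        self.callee_changed_at = self.revision;
    }

    /// Callee-side query: re-runs when the source changed, but keeps the old
    /// `changed_at` if the recomputed value is equal to the previous one.
    fn return_type(&mut self) -> Memo {
        if let Some(memo) = &mut self.return_type_memo {
            if memo.verified_at >= self.callee_changed_at {
                memo.verified_at = self.revision;
                return memo.clone();
            }
        }
        self.return_type_runs += 1;
        let value = parse_return_annotation(&self.callee_source);
        let changed_at = match &self.return_type_memo {
            Some(old) if old.value == value => old.changed_at,
            _ => self.revision,
        };
        let memo = Memo { value, changed_at, verified_at: self.revision };
        self.return_type_memo = Some(memo.clone());
        memo
    }

    /// Caller-side query: depends only on the `return_type` memo, never on the source.
    fn infer_call(&mut self) -> String {
        let ret = self.return_type();
        if let Some(memo) = &mut self.call_memo {
            if memo.verified_at >= ret.changed_at {
                memo.verified_at = self.revision;
                return memo.value.clone();
            }
        }
        self.call_runs += 1;
        let value = ret.value;
        self.call_memo = Some(Memo {
            value: value.clone(),
            changed_at: self.revision,
            verified_at: self.revision,
        });
        value
    }
}
```

If `infer_call` read `callee_source` directly instead of going through the memoized `return_type`, it would have to re-run after every edit to the callee's file, which is the situation the benchmark exposed.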

crates/red_knot_python_semantic/src/semantic_index/builder.rs

AlexWaygood

Nice!

AlexWaygood

LGTM other than my remaining docs nits and one further optional comment:

crates/red_knot_python_semantic/src/types.rs

carljm

Code changes here look great, thanks for sorting this out!

My main concern is that I think the guidance here is going to prove quite difficult to remember and follow manually and solely via code review; it's just very easy for these things to leak. It seems like it should be possible to write tests that are similar to what happens in our benchmark, just verifying that if you make a no-op change to one file, it doesn't cause re-execution of queries in another file that shouldn't need to re-execute. I think a few tests like that could go a really long way in helping us avoid regression here.

MichaReiser · 2024-11-05T07:50:49Z

@carljm I agree that a test would be nice. I added one for call inference although I'm not sure how valuable it is because it is rather fragile (a slight change to inference will break the test itself).

I don't know how to write the ideal test that:

- exercises most language features, and
- asserts that `infer_definition_types` or `infer_expression_types` is never called for any file other than the one to which the comment was added.

The problem here is that salsa doesn't provide the necessary reflection API to answer the second question. We could log all events, but that test is likely going to break with every inference test, and it's kind of impossible to tell from an `ExpressionId` which file it belongs to.
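For illustration only, the "log all events" approach could look roughly like the sketch below. It assumes each logged event already carries the name of the file it belongs to, which is exactly the piece that is hard to obtain today; `QueryEvent`, `EventLog`, and `executed_outside` are hypothetical helpers, not salsa or Red Knot APIs.

```rust
/// One recorded query execution (hypothetical shape).
#[derive(Debug, Clone, PartialEq, Eq)]
struct QueryEvent {
    query: &'static str,
    file: String,
}

/// Collects events while a check runs.
#[derive(Default)]
struct EventLog {
    events: Vec<QueryEvent>,
}

impl EventLog {
    fn record(&mut self, query: &'static str, file: &str) {
        self.events.push(QueryEvent { query, file: file.to_string() });
    }

    /// Returns the events recorded so far and clears the log.
    fn take(&mut self) -> Vec<QueryEvent> {
        std::mem::take(&mut self.events)
    }
}

/// Returns every execution of one of `queries` that happened in a file
/// other than `edited_file`. An empty result means nothing leaked.
fn executed_outside<'a>(
    events: &'a [QueryEvent],
    edited_file: &str,
    queries: &[&str],
) -> Vec<&'a QueryEvent> {
    events
        .iter()
        .filter(|e| e.file != edited_file && queries.contains(&e.query))
        .collect()
}
```

A test built on this would edit one file, re-run the check, and assert that `executed_outside` returns an empty list for the inference queries.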

I don't mind adding more tests if you have suggestions on how to test this, but I'll leave it at what I have for now. I do think that our benchmarks do a decent job at this anyway. I would hope that we would investigate a 12% perf regression ;)

I don't think there's a way to test deserialization in a meaningful way before salsa adds persistent caching support.

MichaReiser changed the title ~~Move ScopeId::node to Scope to avoid depending on AST node~~ Remove AST-node dependency from Type Nov 4, 2024

MichaReiser added the red-knot Multi-file analysis & type inference label Nov 4, 2024

MichaReiser force-pushed the micha/remove-ast-dependencies-from-type branch from 817ee80 to 7908514 Compare November 4, 2024 09:01

MichaReiser force-pushed the micha/remove-ast-dependencies-from-type branch from 079e46a to 1fc3dc4 Compare November 4, 2024 14:30

MichaReiser mentioned this pull request Nov 4, 2024

Avoid cloning Name when looking up function and class types #14092

Merged

MichaReiser force-pushed the micha/remove-ast-dependencies-from-type branch from 1fc3dc4 to a1a709f Compare November 4, 2024 14:55

MichaReiser changed the title ~~Remove AST-node dependency from Type~~ Remove AST-node dependency from FunctionType and ClassType Nov 4, 2024

MichaReiser force-pushed the micha/remove-ast-dependencies-from-type branch from a1a709f to 6d5f492 Compare November 4, 2024 15:13

MichaReiser force-pushed the micha/remove-ast-dependencies-from-type branch from 0b80226 to 6d5f492 Compare November 4, 2024 15:21

Skylion007 reviewed Nov 4, 2024

crates/red_knot_python_semantic/src/semantic_index/builder.rs

MichaReiser force-pushed the micha/remove-ast-dependencies-from-type branch 2 times, most recently from f93cea2 to 4bb6209 Compare November 4, 2024 15:25

MichaReiser marked this pull request as ready for review November 4, 2024 15:33

MichaReiser requested review from carljm, AlexWaygood and sharkdp as code owners November 4, 2024 15:33

MichaReiser force-pushed the micha/remove-ast-dependencies-from-type branch from 4bb6209 to a4f8f95 Compare November 4, 2024 15:42

AlexWaygood reviewed Nov 4, 2024

MichaReiser force-pushed the micha/remove-ast-dependencies-from-type branch from a4f8f95 to 51a9983 Compare November 4, 2024 15:51

AlexWaygood approved these changes Nov 4, 2024

crates/red_knot_python_semantic/src/types.rs

AlexWaygood mentioned this pull request Nov 4, 2024

[red-knot] Support metaclasses #14096

Open

carljm approved these changes Nov 4, 2024

dhruvmanila reviewed Nov 5, 2024

crates/red_knot_python_semantic/src/semantic_index/definition.rs

MichaReiser and others added 3 commits November 5, 2024 08:18

Move ScopeId::node to Scope to avoid depending on AST node

44e8223

Remove Definition dependency from Function and Class type

a0b02ba

# Conflicts: # crates/red_knot_python_semantic/src/types.rs

Apply suggestions from code review

aaa407e

Co-authored-by: Alex Waygood <[email protected]>

MichaReiser force-pushed the micha/remove-ast-dependencies-from-type branch from f3bec2d to 26cd7ed Compare November 5, 2024 07:51

MichaReiser added 2 commits November 5, 2024 08:53

Add test that call doesn't add a dependency to the callee's file

da485ed

Add comment to Unpack

969461f

MichaReiser force-pushed the micha/remove-ast-dependencies-from-type branch from 26cd7ed to 969461f Compare November 5, 2024 07:53

MichaReiser enabled auto-merge (squash) November 5, 2024 07:53

AlexWaygood reviewed Nov 5, 2024

crates/red_knot_python_semantic/src/types.rs

Update crates/red_knot_python_semantic/src/types.rs

eb97246

Co-authored-by: Alex Waygood <[email protected]>

MichaReiser merged commit 4323512 into main Nov 5, 2024
16 checks passed

MichaReiser deleted the micha/remove-ast-dependencies-from-type branch November 5, 2024 08:02
