Cache repeated string instances in the lexer (.NET 9) #38

alexrp · 2023-02-22T04:49:38Z

When lexing a typical source file, there's going to be a lot of repeated strings - identifiers, literals, white space, and so on. We can't intern these, but it would make good sense to cache tokens up to a certain length and return the same instance instead of building them up repeatedly.

To implement this, instead of building up the token string in a StringBuilder, we would keep track of where the token starts and ends. When creating the token, if the length is below our caching threshold, we first look it up in the token cache. For larger tokens, we shouldn't bother as the lookup will take too long to be worth it.

The text was updated successfully, but these errors were encountered:

alexrp · 2023-03-18T08:52:07Z

Along with this work, we should also create lexed strings through SourceText.ToString(SourceTextSpan).

alexrp · 2024-07-12T02:27:26Z

dotnet/runtime#27229 should make this quite a bit easier to implement in .NET 9.

alexrp added state: approved Enhancements and tasks that have been approved. type: performance area: analysis Issues related to language analyses. labels Feb 22, 2023

alexrp added this to the v2.0 milestone Feb 22, 2023

alexrp self-assigned this Feb 22, 2023

alexrp modified the milestones: v2.0, Future Jan 1, 2024

alexrp removed their assignment Jan 27, 2024

alexrp changed the title ~~Cache repeated string instances in the lexer~~ Cache repeated string instances in the lexer (.NET 9) Jul 12, 2024

alexrp removed the type: performance label Oct 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache repeated string instances in the lexer (.NET 9) #38

Cache repeated string instances in the lexer (.NET 9) #38

alexrp commented Feb 22, 2023

alexrp commented Mar 18, 2023

alexrp commented Jul 12, 2024

Cache repeated string instances in the lexer (.NET 9) #38

Cache repeated string instances in the lexer (.NET 9) #38

Comments

alexrp commented Feb 22, 2023

alexrp commented Mar 18, 2023

alexrp commented Jul 12, 2024