Fix `,-` parsing #1277

AlienKevin · 2024-04-22T16:12:47Z

Currently, a space is required to separate the `,` and the `-`.


disconcision · 2024-04-22T16:28:36Z

@dm0n3y curious how the new system handles such cases. this is an awkward one in the current arrangement, as we have a notion of operator characters, where a (possibly user-defined soon) operator can consist of any run of those characters. there are a number of ways this categorization could be made more precise, but the case of characters which can be both prefix operators and parts of infix operators like this one seems pernicious.
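To make the problem concrete, here is a minimal sketch of a lexer in which any run of operator characters forms one token. The character set `OP_CHARS` and the function `lex` are assumptions for illustration, not the project's actual lexer.

```python
import re

# Hypothetical operator character class; the real set may differ.
OP_CHARS = ",-+*/<>=!&|"

TOKEN_RE = re.compile(
    r"\s*(?:(?P<num>\d+)|(?P<ident>[A-Za-z_]\w*)"
    r"|(?P<op>[" + re.escape(OP_CHARS) + r"]+)"  # any run of operator chars
    r"|(?P<punct>[()\[\]]))"
)

def lex(src):
    """Maximal-munch lexer: a run of operator characters is a single token."""
    tokens, pos = [], 0
    src = src.rstrip()
    while pos < len(src):
        m = TOKEN_RE.match(src, pos)
        if m is None:
            raise ValueError(f"unexpected character at {pos}: {src[pos]!r}")
        tokens.append(m.group(m.lastgroup))
        pos = m.end()
    return tokens

print(lex("(1,-2)"))   # ['(', '1', ',-', '2', ')']
print(lex("(1, -2)"))  # ['(', '1', ',', '-', '2', ')']
```

Without the space, `,-` comes out as one (unrecognized) operator token, which matches the behavior reported above.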

dm0n3y · 2024-04-22T17:32:09Z

First thought is that comma is special in the same way parens/braces are and would not be included in the arbitrary operator token class

disconcision · 2024-04-22T20:28:41Z

@dm0n3y solves `,-` but not e.g. `*-`

dm0n3y · 2024-04-23T02:08:45Z

Hard to solve in general short of doing some more elaborate context-informed lexing. `,-` lexing into an unrecognized operator is esp annoying though and worth specializing. I'm more ok with `*-` lexing into an unrecognized operator. OCaml makes the same distinction.
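A rough sketch of that distinction, assuming comma is moved out of the operator character class and treated as a delimiter alongside parens. `PUNCT`, `OP_CHARS`, and `lex` are hypothetical names, not the project's code.

```python
import re

# Assumption: comma is a delimiter like parens, not an operator character.
PUNCT = "(),[]"
OP_CHARS = "-+*/<>=!&|"

TOKEN_RE = re.compile(
    r"\s*(?:(?P<num>\d+)|(?P<ident>[A-Za-z_]\w*)"
    r"|(?P<op>[" + re.escape(OP_CHARS) + r"]+)"
    r"|(?P<punct>[" + re.escape(PUNCT) + r"]))"  # always a one-character token
)

def lex(src):
    """Lex `src`; operator runs are still maximal-munch, delimiters are not."""
    tokens, pos = [], 0
    src = src.rstrip()
    while pos < len(src):
        m = TOKEN_RE.match(src, pos)
        if m is None:
            raise ValueError(f"unexpected character at {pos}: {src[pos]!r}")
        tokens.append(m.group(m.lastgroup))
        pos = m.end()
    return tokens

print(lex("(1,-2)"))  # ['(', '1', ',', '-', '2', ')']
print(lex("2*-3"))    # ['2', '*-', '3']
```

The comma case is fixed, while `*-` still lexes as a single unrecognized operator.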

cyrus- · 2024-04-23T02:27:22Z

could try to restrict infix operators to not end in a token that can also be used as a prefix operator?

disconcision · 2024-04-23T05:04:14Z

@cyrus- could work but would involve some slightly grody intermediate states, e.g. if "-" was an operator then it goes from being one operator to two back to one again. could say more restrictively that prefix operator characters can't be used as non-initial characters in infix ops.
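The two rules can be compared with a small sketch that post-processes a non-empty run of operator characters. `PREFIX_CHARS` and both function names are assumptions made up for this example.

```python
# Hypothetical set of characters that can also act as prefix operators.
PREFIX_CHARS = set("-!")

def split_trailing_prefix(run):
    """Rule 1: an infix operator may not END in a prefix-operator character."""
    head, tail = run, []
    while len(head) > 1 and head[-1] in PREFIX_CHARS:
        tail.insert(0, head[-1])  # peel the trailing prefix char into its own token
        head = head[:-1]
    return [head] + tail

def split_noninitial_prefix(run):
    """Rule 2: a prefix-operator character may only be the FIRST char of a token."""
    tokens, start = [], 0
    for i in range(1, len(run)):
        if run[i] in PREFIX_CHARS:
            tokens.append(run[start:i])
            start = i
    tokens.append(run[start:])
    return tokens

# Typing "*", then "-", then ">" under rule 1: one token, two, one again.
print([split_trailing_prefix(s) for s in ["*", "*-", "*->"]])
# [['*'], ['*', '-'], ['*->']]

# Under rule 2 the split is stable as more characters are typed.
print([split_noninitial_prefix(s) for s in ["*", "*-", "*->"]])
# [['*'], ['*', '-'], ['*', '->']]
```

In this sketch the second rule avoids the flickering intermediate state, at the cost of ruling out operators such as `<-` that contain a prefix character in non-initial position.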

i don't find the ocaml approach fully satisfying but the fact that they're doing it suggests it's at least annoying to do better

disconcision · 2024-04-23T05:26:21Z

@dm0n3y i feel like in principle there could be something analogous to your error-counting metric at the lexing level. an invalid token gets broken up if doing so results in a state with fewer total errors
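One way to read this idea, as a sketch only: treat each unrecognized token as one error and pick the segmentation of an operator run with the fewest errors, preferring fewer tokens on ties. `KNOWN_OPS` and `best_split` are hypothetical, and a real metric would presumably also count errors in the surrounding parse, which this ignores.

```python
from functools import lru_cache

# Hypothetical set of recognized operators.
KNOWN_OPS = {",", "-", "*", "+", "->", "<=", "=="}

def best_split(run):
    """Segment `run` to minimize unrecognized tokens, then token count."""
    @lru_cache(maxsize=None)
    def go(i):
        # Returns (errors, token_count, tokens) for the suffix run[i:].
        if i == len(run):
            return (0, 0, ())
        best = None
        for j in range(i + 1, len(run) + 1):
            piece = run[i:j]
            errs, count, rest = go(j)
            cand = (errs + (piece not in KNOWN_OPS), count + 1, (piece,) + rest)
            if best is None or cand[:2] < best[:2]:
                best = cand
        return best
    return list(go(0)[2])

print(best_split(",-"))  # [',', '-']  splitting removes the error
print(best_split("*-"))  # ['*', '-']
print(best_split("->"))  # ['->']      already valid, left whole
print(best_split("$$"))  # ['$$']      splitting would not reduce errors
```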

dm0n3y · 2024-04-23T18:07:14Z

Yeah ultimately I think there should be character-level molding, which is what I really meant by context-informed lexing above. I agree with @disconcision that the OCaml approach is not perfect but is the best bang for buck short of a full solution.

cyrus- added the bug label Apr 22, 2024
