regex/token name separator for regexes that end with trailing spaces

This issue affects current grmtools. It came to light while considering the feature request in #597, and it seemed better to split it out into its own issue than to discuss it there.

If we take the following lex file (foo.l), whose first rule has a regex ending in an escaped trailing space, current grmtools accepts it,
but it seems like it should probably reject it.

```
%%
\ \ "weirdness"
\n ;
```

The thing is that the trailing space in `\ \ ` is part of the regex, but it gets treated as the separator between the regex and the token name. It then seems like perhaps it gets trimmed off the regular expression before that is passed to the regex crate.

```
$ echo " " | cargo run --quiet -p lrlex ./foo.l -
weirdness
$ echo "  " | cargo run --quiet -p lrlex ./foo.l -
weirdness
weirdness
```

I would have expected the echo with two spaces to match a single weirdness token, and the one with a single space to not match.
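To make the expectation concrete, here is a rough sketch of a separator search that skips escaped spaces. This is not lrlex's actual parsing code: `split_rule` is a hypothetical helper invented for illustration, and it makes simplifying assumptions (only ASCII spaces act as separators, and spaces inside character classes or quoted token names are not handled).

```rust
/// Hypothetical helper, not part of lrlex: split a rule line into
/// (regex, token name) at the last space that is not backslash-escaped.
/// Returns `None` when every space on the line is escaped, i.e. when
/// there is no separator between the regex and the token name.
fn split_rule(line: &str) -> Option<(&str, &str)> {
    let bytes = line.as_bytes();
    // Walk candidate separator positions from right to left.
    for (i, &b) in bytes.iter().enumerate().rev() {
        if b != b' ' {
            continue;
        }
        // Count the backslashes immediately preceding this space.
        let backslashes = bytes[..i]
            .iter()
            .rev()
            .take_while(|&&c| c == b'\\')
            .count();
        // An odd count means the space itself is escaped and so belongs
        // to the regex; an even count means it can act as the separator.
        if backslashes % 2 == 0 {
            return Some((&line[..i], &line[i + 1..]));
        }
    }
    None
}
```

Under this sketch, `\ \ "weirdness"` yields `None` because both spaces are escaped, so the rule could be rejected for lacking a separator. With one extra space before the token name, the line splits into the regex `\ \ ` (trailing escaped space kept) and the name `"weirdness"`.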

The following regex *is* rejected, but by the regex crate rather than by lrlex. I assume lrlex is just sending the `\` alone to regex.

```
%%
\ "rejectedByRegex"
\n ;
```
