[geany/geany] Why aren't cyrillic characters accepted as the first letter of the [keywords] section of filetypes.*.conf files? (Discussion #3370)

List overview All Threads

newer

older

Re: [geany/geany-plugins] Add...

[geany/geany] Feature Request:...

PaulStogov

20 Jan 2023 20 Jan '23

1:10 a.m.

My [keywords] section is as follows: `primary=Cheese Käse Сыр Сыp Cыp \u0421ыр Déjà Уже Already Bereits HНOО`

I have a UTF-8 encoded file colored like this: **Cheese Käse** Сыр Сыp **Cыp** **Déjà** Уже **Already Bereits HНOО**

Words in **bold** are treated like keywords but any word with initial Cyrillic letter is ignored.

-- Reply to this email directly or view it on GitHub: https://github.com/geany/geany/discussions/3370 You are receiving this because you are subscribed to this thread. Message ID: geany/geany/repo-discussions/3370@github.com

Attachments:

attachment.html (text/html — 1.9 KB)

Show replies by date

elextr

20 Jan 20 Jan

1:55 a.m.

What filetype?

-- Reply to this email directly or view it on GitHub: https://github.com/geany/geany/discussions/3370#discussioncomment-4733122 You are receiving this because you are subscribed to this thread. Message ID: geany/geany/repo-discussions/3370/comments/4733122@github.com

PaulStogov

6:25 a.m.

lexer_filetype=C

-- Reply to this email directly or view it on GitHub: https://github.com/geany/geany/discussions/3370#discussioncomment-4734175 You are receiving this because you are subscribed to this thread. Message ID: geany/geany/repo-discussions/3370/comments/4734175@github.com

elextr

6:49 a.m.

The way highlighting works is that a language specific lexer (C in your case) analyses the input according to the rules of its language. So "words" (keywords, identifiers, etc depending on the language) are first identified by the lexer, then compared to one or more of the keyword lists.

C keywords and identifiers began with ASCII alphabetic or underscore until recently when the ability to use Unicode escape sequences was introduced [see](https://en.cppreference.com/w/c/language/identifier).

Note that actually allowing unescaped Unicode in identifiers is implementation defined, not standard C. It appears that the lexer is lenient about trailing characters, but has not been updated to allow escape sequences or Unicode as leading characters.

Lexers come from the [Lexilla](https://github.com/ScintillaOrg/lexilla) project, so patches should be provided there first.

-- Reply to this email directly or view it on GitHub: https://github.com/geany/geany/discussions/3370#discussioncomment-4734282 You are receiving this because you are subscribed to this thread. Message ID: geany/geany/repo-discussions/3370/comments/4734282@github.com

PaulStogov

4 Feb 4 Feb

11:50 p.m.

Looks like the patch is provided [here](https://github.com/ScintillaOrg/lexilla/commit/c1a8d798e2cad76aae9d4425819be...) Topic is [here](https://github.com/ScintillaOrg/lexilla/issues/130)

-- Reply to this email directly or view it on GitHub: https://github.com/geany/geany/discussions/3370#discussioncomment-4871850 You are receiving this because you are subscribed to this thread. Message ID: geany/geany/repo-discussions/3370/comments/4871850@github.com

elextr

5 Feb 5 Feb

12:29 a.m.

Thanks for upstreaming, will be imported when Geany is updated to Lexilla 5.2.2

-- Reply to this email directly or view it on GitHub: https://github.com/geany/geany/discussions/3370#discussioncomment-4871991 You are receiving this because you are subscribed to this thread. Message ID: geany/geany/repo-discussions/3370/comments/4871991@github.com

PaulStogov

2:38 a.m.

Thank you!

...

Am 05.02.2023 um 02:29 schrieb elextr ***@***.***>:

Thanks for upstreaming, will be imported when Geany is updated to Lexilla 5.2.2

— Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.

-- Reply to this email directly or view it on GitHub: https://github.com/geany/geany/discussions/3370#discussioncomment-4872327 You are receiving this because you are subscribed to this thread. Message ID: geany/geany/repo-discussions/3370/comments/4872327@github.com

627

Age (days ago)

643

Last active (days ago)

github-comments@lists.geany.org

6 comments

2 participants

tags (0)

participants (2)

elextr
PaulStogov