Most programming languages do not support identifiers with arbitrary bytes, some do support it, but Scintilla lexers are a small subset of that. You could open bug reports for Scintilla lexers which should support arbitrary identifier characters which don't.