The question about the file type is not very clear. Encodings are everywhere UTF-8. This works in any configuration file. For example filetypes.c, or any other. (If you meant it). And yes, it looks like this will only recognize keywords that have [a-zA-Z_] at the beginning. But in the source I did not find any obvious restrictions on this. (Maybe I was looking badly).