[Github-comments] [geany/geany] Failure to open correctly files with cyrillic encoding (#1429)

elextr notifications at xxxxx
Wed Mar 15 05:53:15 UTC 2017


It is not generally possible to always accurately determine the encoding of a file just from its content, and particularly if it is a small file with little content.  This is even more difficult for files encoded in various "code pages" since they resuse the same codes to mean different characters.

What encodings are specified by your locales on Windows and Linux?  My guess is that the Windows locales match the file encoding, but the Linux one is UTF-8.

Inside Geany all files are UTF-8 encoded, they are only converted on opening and closing, thats why changing the encoding specification does nothing. 

On opening a file with no embedded encoding specification, and no explicitly set encoding, Geany will first check to see if the file is already UTF-8, then check the encoding specified by the locale, then get desperate and try the known encodings and accept the first that converts to a valid UTF-8.  At least that gets the file open and you can then often see that its wrong and close and re-open it with a different encoding (you did notice that the "more options" section of the open dialog allows you to select an encoding for this file).

-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/geany/geany/issues/1429#issuecomment-286647888
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.geany.org/pipermail/github-comments/attachments/20170314/824b2560/attachment.html>


More information about the Github-comments mailing list