[Geany-Devel] Building geany using msys2 on Windows 10

Doug Henderson djndnbvg at xxxxx
Tue Sep 14 13:36:37 UTC 2021


On Mon, 13 Sept 2021 at 15:55, Enrico Tröger <enrico.troeger at uvena.de> wrote:
>
> > Doug, I'm assuming you mean the Enrico in the little Authors table at
> > the top of the help manual?
>
> I guess yes because my name is broken there.
>
Attached is an image of what I see:

And here is the offending line from
file:///C:/Program%20Files/Geany/share/doc/geany/html/index.html:for
1.38beta1 on win10.

<meta name="authors" content="Enrico Tröger  Nick Treleaven  Frank
Lanitz  Colomban Wendling  Matthew Brush" />

And a dump of that line:

$ head -8 index.html | tail -1 | od -c
0000000   <   m   e   t   a       n   a   m   e   =   "   a   u   t   h
0000020   o   r   s   "       c   o   n   t   e   n   t   =   "   E   n
0000040   r   i   c   o       T   r 303 203 302 266   g   e   r
0000060   N   i   c   k       T   r   e   l   e   a   v   e   n
0000100   F   r   a   n   k       L   a   n   i   t   z           C   o
0000120   l   o   m   b   a   n       W   e   n   d   l   i   n   g
0000140       M   a   t   t   h   e   w       B   r   u   s   h   "
0000160   /   >  \r  \n
0000164

This tells me that at some point the UTF8 content was loaded as ISO-8859-?

So looking at https://nightly.geany.org/win32/build_win32_geany.log,
the problem is clear: the build is using Python 2.7. Python2 will read
the file as ASCII by default, while Python3 will try to detect the
encoding and fall back to UTF-8. Plus Python2 is now unsupported.

When I tried to recreate the problem on msys and cygwin for 1.31.1,
1.36, and 1.37.1 , I could not reproduce it because I have Python3
installed as the only Python on both systems. I spent more time than I
should have on this, and I think you may have found a solution
yourself.

Also this the results from the locale command:

Doug at mars MINGW64 ~/src/MINGW-packages/mingw-w64-geany
$ locale
LANG=en_CA.UTF-8
LC_CTYPE="en_CA.UTF-8"
LC_NUMERIC="en_CA.UTF-8"
LC_TIME="en_CA.UTF-8"
LC_COLLATE="en_CA.UTF-8"
LC_MONETARY="en_CA.UTF-8"
LC_MESSAGES="en_CA.UTF-8"
LC_ALL=

On cygwin, I get the same, but with LC_ALL having the same value.

Also:

Microsoft Windows [Version 10.0.19043.1165]
(c) Microsoft Corporation. All rights reserved.

Doug at MARS C:\Users\Doug
> chcp
Active code page: 65001

Code page 65001 specifies UTF-8

MINGW64_NT-10.0-19043 mars 3.2.0-340.x86_64 2021-08-02 16:30 UTC x86_64 Msys
CYGWIN_NT-10.0 mars 3.2.0(0.340/5/3) 2021-03-29 08:42 x86_64 Cygwin

Doug

--
Doug Henderson, Calgary, Alberta, Canada - from gmail.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Geany Doug's 1.38beta1 -Enrico.png
Type: image/png
Size: 8913 bytes
Desc: not available
URL: <https://lists.geany.org/pipermail/devel/attachments/20210914/0e600fe3/attachment.png>


More information about the Devel mailing list