<p>Reading the Wikipedia articles made me wonder about so-called "WTF-8", so I played with it and wrote this: <a href="https://github.com/b4n/wtf8tools">https://github.com/b4n/wtf8tools</a></p>

<p>Converting the file here to WTF-8 makes Geany able to open it just fine (like SciTE does), and it's convertible back to the original UTF-16.<br>
I'm not sure why we accept invalid UTF-8 (well, it's structurally valid, but contains reserved surrogate code points), but it's probably because the tools we use are happy so long as it's structurally valid. Or we don't use the same thing for UTF-8 (GLib) as for UTF-16 (iconv through GLib), and GLib is more forgiving.</p>
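
<p>To illustrate what "structurally valid, but contains reserved code points" means, here is a minimal Python sketch of the round trip. It is not how wtf8tools works internally; the function names <code>utf16le_to_wtf8</code> and <code>wtf8_to_utf16le</code> are made up for this example, and it assumes little-endian UTF-16 without a BOM. It relies on Python's <code>surrogatepass</code> error handler to let unpaired surrogates through.</p>

```python
def utf16le_to_wtf8(data: bytes) -> bytes:
    """Convert possibly ill-formed UTF-16LE to WTF-8 (hypothetical helper)."""
    # Proper surrogate pairs are combined into one code point by the decoder;
    # 'surrogatepass' keeps unpaired surrogates instead of raising an error.
    text = data.decode("utf-16-le", errors="surrogatepass")
    # Unpaired surrogates come out as 3-byte sequences (ED A0..BF xx).
    return text.encode("utf-8", errors="surrogatepass")


def wtf8_to_utf16le(data: bytes) -> bytes:
    """Convert WTF-8 back to the original UTF-16LE (hypothetical helper)."""
    text = data.decode("utf-8", errors="surrogatepass")
    return text.encode("utf-16-le", errors="surrogatepass")


# "A", an unpaired high surrogate U+D800, then "B", as UTF-16LE bytes.
broken_utf16 = b"A\x00\x00\xd8B\x00"

wtf8 = utf16le_to_wtf8(broken_utf16)
restored = wtf8_to_utf16le(wtf8)

# A strict UTF-8 decoder rejects the surrogate bytes even though the
# sequence follows the normal 3-byte layout.
try:
    wtf8.decode("utf-8")
    strict_ok = True
except UnicodeDecodeError:
    strict_ok = False
```

<p>In this sketch <code>restored</code> equals <code>broken_utf16</code>, while a strict decoder refuses <code>wtf8</code>. A more forgiving validator that only checks the byte layout would accept it, which might be the kind of difference described above.</p>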

