[Geany-devel] Use of Scintilla word boundaries for word searches

Lex Trotman elextr at xxxxx
Sat Aug 20 13:58:47 UTC 2011

On 20 August 2011 23:11, Dimitar Zhekov <dimitar.zhekov at gmail.com> wrote:
> On Sat, 20 Aug 2011 09:33:29 +1000
> Lex Trotman <elextr at gmail.com> wrote:
>> > Shoudn't we have symbolchars for the current programming language
>> > ([A-Za-z_] if unknown), and wordchars that match the current
>> > locale? They don't have much in common.
>> By wordchars we mean symbolchars, this confusion has existed from the
>> beginnings of C at least, and we ain't gonna change it now. :-)
> It never did, at least not for the countries where the latin letters
> are not [the base of] alphabet.

Ah well, C was invented in an English speaking country & we native
English speakers are not smart enough to handle more than 26 letters

>> Locale/human language word ends are not as simple as sets of
>> characters so lets not go there, we would need something like IIUC to
>> do that.

s/IIUC/ICU/ to correct for fumble fingers.

> Scintilla and regex use charsets, so we can't jump beyond that. Sorry,

I agree, thats what I said above, but therefore it doesn't do words as
in natural language words, so what exactly do you mean, I agree, but
I'm confused?

> Lex, I have to side with Colomban here: the (locale) word chars are
> good for word searching, (symbol) word chars for tags, find usage etc.

I don't think Geany does words, it only ever does symbols, see my
question below.

>> Maybe everything should use the filetype wordchars definition, with
>> GEANY_WORDCHARS moved to filetypes.common as the default.
> Now that would probably be the first editor which works with UTF-8,
> but can't do locale word search. :)

Can you point me to where Geany loads locale dependent wordchars and
where it gets it from?


More information about the Devel mailing list