On 20 August 2011 23:11, Dimitar Zhekov dimitar.zhekov@gmail.com wrote:
On Sat, 20 Aug 2011 09:33:29 +1000 Lex Trotman elextr@gmail.com wrote:
Shoudn't we have symbolchars for the current programming language ([A-Za-z_] if unknown), and wordchars that match the current locale? They don't have much in common.
By wordchars we mean symbolchars, this confusion has existed from the beginnings of C at least, and we ain't gonna change it now. :-)
It never did, at least not for the countries where the latin letters are not [the base of] alphabet.
Ah well, C was invented in an English speaking country & we native English speakers are not smart enough to handle more than 26 letters :-D
Locale/human language word ends are not as simple as sets of characters so lets not go there, we would need something like IIUC to do that.
s/IIUC/ICU/ to correct for fumble fingers.
Scintilla and regex use charsets, so we can't jump beyond that. Sorry,
I agree, thats what I said above, but therefore it doesn't do words as in natural language words, so what exactly do you mean, I agree, but I'm confused?
Lex, I have to side with Colomban here: the (locale) word chars are good for word searching, (symbol) word chars for tags, find usage etc.
I don't think Geany does words, it only ever does symbols, see my question below.
Maybe everything should use the filetype wordchars definition, with GEANY_WORDCHARS moved to filetypes.common as the default.
Now that would probably be the first editor which works with UTF-8, but can't do locale word search. :)
Can you point me to where Geany loads locale dependent wordchars and where it gets it from?
Cheers Lex