[Geany-devel] Use of Scintilla word boundaries for word searches

Colomban Wendling lists.ban at xxxxx
Sat Aug 20 20:30:02 UTC 2011


On 20/08/2011 20:56, Dimitar Zhekov wrote:
> On Sat, 20 Aug 2011 20:19:56 +0200
> Colomban Wendling <lists.ban at herbesfolles.org> wrote:
> 
>>> For Scintilla/Geany, "boza" or "боза", enclosed in non-ascii
>>> quotes, is not a word any more.
>>
>> Ah OK, got it.  Yeah, it doesn't detect the quote as "blank chars",
>> so doesn't fit in "...consists of sequences of non-blank characters
>> separated by blanks".
> 
> If you enclose boza in ascii quotes, it doesn't fit the definition
> either, but is recognized as a word.

Yes it does, since Geany defines whitespace chars to include the quote
(see filetypes.common):

whitespace_chars=\s\t!\"#$%&'()*+,-./:;<=>?@[\\]^`{|}~

> There is no reason to guess,
> Scintilla is open source, and here is the exact definition:

True enough, I should have thought of digging into that weird C++ thing :-'

> /**
>  * Check that the character at the given position is a word or
>  * punctuation character and that the previous character is of
>  * a different character class.
>  */
> bool Document::IsWordStartAt(int pos) {
> 
> And the same goes for word end and whole word. Weird.
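
To make the rule concrete, here is a rough sketch of how I read that comment
combined with our whitespace_chars setting. It is not Scintilla's actual code:
the names CharClass, classify and isWordStartAt are made up for illustration,
and treating every byte >= 0x80 as a word character is an assumption on my
part (my reading of Scintilla's default classification, not checked against
what Geany really sets).

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical character classes, modelled on the comment quoted above.
enum class CharClass { Space, Word, Punctuation };

// whitespace_chars from filetypes.common, with \s and \t expanded.
static const std::string kWhitespaceChars =
    " \t!\"#$%&'()*+,-./:;<=>?@[\\]^`{|}~";

// Classify a single byte.
// Assumption: bytes >= 0x80 (i.e. every byte of a non-ASCII UTF-8
// sequence) are word characters, like ASCII letters, digits and '_'.
CharClass classify(unsigned char ch) {
    if (ch < 0x20 ||
        kWhitespaceChars.find(static_cast<char>(ch)) != std::string::npos)
        return CharClass::Space;
    if (ch >= 0x80 || std::isalnum(ch) || ch == '_')
        return CharClass::Word;
    return CharClass::Punctuation;
}

// A word starts at pos if the byte there is a word or punctuation
// character and the previous byte belongs to a different class.
// The start of the text counts as a class change.
bool isWordStartAt(const std::string &text, std::size_t pos) {
    if (pos >= text.size())
        return false;
    const CharClass here = classify(static_cast<unsigned char>(text[pos]));
    if (here == CharClass::Space)
        return false;
    if (pos == 0)
        return true;
    return here != classify(static_cast<unsigned char>(text[pos - 1]));
}
```

Under these assumptions, the 'b' in "boza" enclosed in ASCII quotes is a word
start, because the quote is in whitespace_chars. With UTF-8 guillemets
(bytes 0xC2 0xAB) the quote bytes land in the same class as the letters, so
the word only "starts" at the quote itself, which would match what you see.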
