[Github-comments] [geany/geany] crash on multiline regexp (#1586)

Chayyoo notifications at xxxxx
Mon Aug 28 08:21:54 UTC 2017


The file consists of sorted lines, each with three numbers separated by a semicolon and followed by two or three capital letters. The first number is repeated on several consecutive lines, typically 1 .. 8 times. Here's an extract:10000006;116680003;29857009;SCT
10000006;116680003;9972008;SCT
10000006;363698007;51185008;SCT
10000006;47429007;22253000;BT
10001005;116676008;409774005;SCT
10001005;116680003;87628006;SCT
10001005;116680003;91302008;SCT
10001005;246075003;409822003;SCT
10001005;370135005;441862004;SCT
10001005;47429007;23583003;BT
10002003;116680003;116175006;SCT
10002003;260507000;309795001;SCT
10002003;260686004;129304002;SCT
10002003;405813007;414003;SCT
10003008;116680003;106234000;SCTThe original file had about a million such lines. I have now discovered that the crash on my PC occurs from somewhere between 5300 and 5400 lines onwards. Below 5300 lines the regex works as expected.To narrow things down, the following regex does NOT crash, not even on a miliion lines:^(\d+?);(\d+?);(\d+?);[BCST]+\n(.*?\n)\1But this one does:^(\d+?);(\d+?);(\d+?);[BCST]+\n(.*?\n)*\1
 

    On Friday, 25 August 2017, 10:34, elextr <notifications at github.com> wrote:
 

 @Chayyoo then you need to post a gist with a small file where it does happen because, as @codebrainz said, we can't reproduce it with any of our files.—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or mute the thread.  

   

-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/geany/geany/issues/1586#issuecomment-325291663
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.geany.org/pipermail/github-comments/attachments/20170828/fb633690/attachment.html>


More information about the Github-comments mailing list