#15630 - grep 2.14 much slower than 2.5.1

GNU bug report logs - #15630
grep 2.14 much slower than 2.5.1

Package: grep;

Reported by: "Z. Majeed" <zmajeed <at> sbcglobal.net>

Date: Wed, 16 Oct 2013 14:11:02 UTC

Severity: normal

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.

View this message in rfc822 format

From: "Z. Majeed" <zmajeed <at> sbcglobal.net> To: "15630 <at> debbugs.gnu.org" <15630 <at> debbugs.gnu.org> Cc: Jim Meyering <jim <at> meyering.net> Subject: bug#15630: Acknowledgement (grep 2.14 much slower than 2.5.1) Date: Sat, 19 Oct 2013 18:39:45 -0700 (PDT)

[Message part 1 (text/plain, inline)]

Thanks - I nearly always use -i so a fix would be highly appreciated - meantime I dug a bit more into this issue - it's not as straightforward as it first seemed - the crux of the problem is not the workaround for UTF-8 but -i "<name>.*russia" causing dfamust to be just the one-character string "<" because "name" turns into character classes - for XML input that practically makes keyword matching worthless and the main loop in EGexecute degenerates to line-by-line processing - it seems to me dfaparse ought to deal with case foldings a little better so the trans table support in cwexec gets used - there have also been some simple patches submitted to make use of trans in bmexec Zartaj ________________________________ From: Jim Meyering <jim <at> meyering.net> To: Z. Majeed <zmajeed <at> sbcglobal.net> Cc: "15630 <at> debbugs.gnu.org" <15630 <at> debbugs.gnu.org> Sent: Saturday, October 19, 2013 8:58 PM Subject: Re: bug#15630: Acknowledgement (grep 2.14 much slower than 2.5.1) On Wed, Oct 16, 2013 at 12:20 PM, Z. Majeed <zmajeed <at> sbcglobal.net> wrote: > I see the reason is the workaround in do_execute that turns on line-by-line matching for -i across the board - I got runtime confirmation by trying "<name>.*[rR][uU][sS][sS][iI][aA]" - the times were faster than for grep 2.5.1 with -i: > 3.59user 2.95system 0:06.55elapsed > > I'm not sure if the workaround is for the -i problem in UTF-8 locales discussed in http://savannah.gnu.org/bugs/?29391. This bug report really should be titled "--ignore-case very slow in grep 2.14" Thanks for the reminder. I'm about to release grep-2.15, but after that, I will be inclined to address that problem.

[Message part 2 (text/html, inline)]

This bug report was last modified 11 years and 92 days ago.

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.

GNU bug report logs - #15630 grep 2.14 much slower than 2.5.1

GNU bug report logs - #15630
grep 2.14 much slower than 2.5.1