GNU bug report logs - #18454
Improve performance when -P (PCRE) is used in UTF-8 locales

Package: grep;

Reported by: Vincent Lefevre <vincent <at> vinc17.net>

Date: Fri, 12 Sep 2014 01:26:02 UTC

Severity: normal

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.

From: Jim Meyering <jim <at> meyering.net>
To: Paul Eggert <eggert <at> cs.ucla.edu>
Cc: Vincent Lefevre <vincent <at> vinc17.net>, 18454 <at> debbugs.gnu.org
Subject: bug#18454: Improve performance when -P (PCRE) is used in UTF-8 locales
Date: Tue, 16 Sep 2014 21:57:53 -0700

On Tue, Sep 16, 2014 at 6:43 PM, Paul Eggert <eggert <at> cs.ucla.edu> wrote:
> I worked on this some more, and came up with the attached patches proposed
> against the current grep Savannah master (commit
> 9ea9254ea58456b84ed2f0c1481ca91cdd325bf7).
>
> For years I've been wanting to write that last patch and I finally got
> around to it.  It improves grep -P's performance by a factor of 1.2 trillion
> on one (admittedly artificial) benchmark.  I hope its 1 ZB/s scan rate is
> some kind of record.  The last patch probably won't help your test cases,
> though I hope the other patches do help somewhat.

Awesome :-)  I found time to look through all but the 5th.
Slightly surprised that 4/6 makes a measurable performance
difference (didn't check), but moving away from file-scoped
is an improvement in any case.




This bug report was last modified 3 years and 181 days ago.
