GNU bug report logs - #38223
grep >=2.28 cannot handle -wF correctly under LANG=ja_JP.eucjp

Package: grep;

Reported by: "NIDE, Naoyuki" <nide <at> ics.nara-wu.ac.jp>

Date: Fri, 15 Nov 2019 19:54:02 UTC

Severity: normal

Tags: moreinfo

Done: Jim Meyering <jim <at> meyering.net>

Bug is archived. No further changes may be made.


Message #27 received at 38223 <at> debbugs.gnu.org:

From: Norihiro Tanaka <noritnk <at> kcn.ne.jp>
To: 38223 <at> debbugs.gnu.org
Subject: Re: bug#38223: grep >=2.28 cannot handle -wF correctly under
 LANG=ja_JP.eucjp
Date: Sun, 17 Nov 2019 09:01:21 +0900
On Sat, 16 Nov 2019 11:00:38 -0800
Jim Meyering <jim <at> meyering.net> wrote:

> I've pushed the complete fix here:
> https://git.savannah.gnu.org/cgit/grep.git/commit/?id=090a4dbe03951e427f03f83be424caacc3303799
> 
> I've also fixed the comment and a variable name and updated gnulib to latest.

After applying the patch, I found an extreme slowdown.

  yes $(printf %040d 0) | head -1000000 >k
  time -p env LC_ALL=ja_JP.eucjp src/grep -F -w 0 k

The first patch fixes it, and the second improves performance further.
[0001-grep-fix-performance-degration-with-previous-patch.patch (text/plain, attachment)]
[0002-grep-performance-improvement-for-grep-F-w-in-non-UTF.patch (text/plain, attachment)]
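
For a quick sanity check before timing, the reproduction above can be scaled down. The following is only a sketch under stated assumptions: the scratch file name k_small and the reduced line count of 1000 are illustrative and not part of the original report, and it uses the system grep under LC_ALL=C rather than src/grep under ja_JP.eucjp, since that locale may not be installed. It shows why the input is a hard case for -F -w: the pattern "0" occurs 40 times on every line, yet never as a whole word, so every candidate has to be examined and rejected.

```shell
# Sketch only: k_small and the 1000-line size are assumptions for illustration.
line=$(printf %040d 0)                  # one line of forty '0' characters
yes "$line" | head -n 1000 > k_small    # scaled-down version of the file "k"

lines=$(( $(wc -l < k_small) ))         # arithmetic expansion strips any padding

# grep -c prints the number of matching lines; it exits with status 1 when
# nothing matches, so "|| true" keeps the script going in that case.
count=$(LC_ALL=C grep -F -w -c 0 k_small || true)

echo "lines=$lines whole-word matches=$count"
```

To reproduce the timing itself, the original commands with the full 1000000-line file and LC_ALL=ja_JP.eucjp are still the ones to use.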

This bug report was last modified 5 years and 174 days ago.


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.