GNU bug report logs - #17203
[PATCH] grep: speed-up of DFA by checking multibyte characters on demand

Previous Next

Package: grep;

Reported by: Norihiro Tanaka <noritnk <at> kcn.ne.jp>

Date: Sun, 6 Apr 2014 09:54:02 UTC

Severity: normal

Tags: patch

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Norihiro Tanaka <noritnk <at> kcn.ne.jp>
Subject: bug#17203: closed (Re: bug#17203: [PATCH] grep: speed-up of DFA
 by checking multibyte characters on demand)
Date: Fri, 18 Apr 2014 06:31:03 +0000
[Message part 1 (text/plain, inline)]
Your bug report

#17203: [PATCH] grep: speed-up of DFA by checking multibyte characters on demand

which was filed against the grep package, has been closed.

The explanation is attached below, along with your original report.
If you require more details, please reply to 17203 <at> debbugs.gnu.org.

-- 
17203: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=17203
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: Paul Eggert <eggert <at> cs.ucla.edu>
To: Norihiro Tanaka <noritnk <at> kcn.ne.jp>, 17203-done <at> debbugs.gnu.org
Subject: Re: bug#17203: [PATCH] grep: speed-up of DFA by checking multibyte
 characters on demand
Date: Thu, 17 Apr 2014 23:30:21 -0700
[Message part 3 (text/plain, inline)]
Thanks.  It's nice to see the code get smaller and simpler.  I rebased 
that patch again and tweaked the commit message (attachment 1), and 
followed up with a minor fixup patch to remove a now-unused macro and 
reindent to fit into 80 columns (attachment 2.)
[0001-dfa-speed-up-by-checking-multibyte-characters-on-dem.patch (text/plain, attachment)]
[0002-dfa-minor-cleanup.patch (text/plain, attachment)]
[Message part 6 (message/rfc822, inline)]
From: Norihiro Tanaka <noritnk <at> kcn.ne.jp>
To: bug-grep <at> gnu.org
Subject: [PATCH] grep: speed-up of DFA by checking multibyte characters on
 demand
Date: Sun, 06 Apr 2014 18:52:17 +0900
[Message part 7 (text/plain, inline)]
If dfaexec() runs in non-UTF8 locales, length and wide character
representation are checked for all characters of a line in a input
string.  However, if matched early in the line, results for remaining
characters are wasted.

This patch checks multibyte characters on demand.  It enables to
accomplish to speed-up for matched early and reduce required memories.

Norihiro
[patch.txt (text/plain, attachment)]

This bug report was last modified 11 years and 97 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.