GNU bug report logs - #17070
[PATCH] grep: optimization of DFA by reuse of multi-byte buffers in non-UTF8 locales

Previous Next

Package: grep;

Reported by: Norihiro Tanaka <noritnk <at> kcn.ne.jp>

Date: Sun, 23 Mar 2014 13:20:03 UTC

Severity: normal

Tags: patch

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.

Full log


Message #14 received at 17070 <at> debbugs.gnu.org (full text, mbox):

From: Norihiro Tanaka <noritnk <at> kcn.ne.jp>
To: 17070 <at> debbugs.gnu.org
Subject: bug#17070: [PATCH] grep: optimization of DFA by reuse of multi-byte
 buffers in non-UTF8 locales
Date: Sun, 30 Mar 2014 20:14:21 +0900
[Message part 1 (text/plain, inline)]
I added further improvement to previous patch.

If dfaexec() runs in non-UTF8 locales, length and wide character
representation are checked for all characters of a line in a input
string.  However, if matched early in the line, results for remaining
characters are wasted.

The new patch doesn't reuse both `mblen_buf' and `inputwcs' but stops
using them, and checks multibyte characters on demand.  It enables to
accomplish to speed-up for matched early and reduce required memories.

Norihiro
[patch.txt (text/plain, attachment)]

This bug report was last modified 11 years and 100 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.