GNU bug report logs - #16912
[PATCH] no longer use CSET for non-UTF8 locale in DFA engine

Previous Next

Package: grep;

Reported by: Norihiro Tanaka <noritnk <at> kcn.ne.jp>

Date: Sat, 1 Mar 2014 09:49:02 UTC

Severity: normal

Tags: patch

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: Paolo Bonzini <bonzini <at> gnu.org>
To: Norihiro Tanaka <noritnk <at> kcn.ne.jp>
Cc: Paul Eggert <eggert <at> cs.ucla.edu>, 16912 <at> debbugs.gnu.org
Subject: bug#16912: [PATCH] no longer use CSET for non-UTF8 locale in DFA engine
Date: Wed, 05 Mar 2014 08:59:52 +0100
Il 05/03/2014 00:12, Norihiro Tanaka ha scritto:
> First, I also thoutht such a case.  But perhaps it's no problem, because
> DFA will never come across CSET on second byte in Shift_JIS.
>
>   "grep -i A" -> [Aa] -> CSET
>   "grep -i $"\x82A" -> [$"\x82\x82A"$"\x82\x82"] -> \x82 A CAT \x82 \x82 CAT OR
>
> Laster will be never \x82 [A\x82] -> \x82 CSET CAT.

What about these two commands:

   grep [a]
   grep -i A

Would they match \x82\x61 ("B", U+0FF22) with your patch?  And without it?

Paolo




This bug report was last modified 11 years and 108 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.