GNU bug report logs - #55331
Improved support for combining diacritics

Previous Next

Package: grep;

Reported by: Benson Muite <benson_muite <at> emailplus.org>

Date: Mon, 9 May 2022 07:04:02 UTC

Severity: wishlist

Full log


View this message in rfc822 format

From: Benson Muite <benson_muite <at> emailplus.org>
To: 55331 <at> debbugs.gnu.org
Subject: bug#55331: Improved support for combining diacritics
Date: Mon, 9 May 2022 09:38:26 +0300
Hi,

Unicode allows for combining diacritics. When using

grep -E "\s[a-z\`\'āáàēéèīíìịị̄ị́ị̀ōóòọọ̄ọọ́ọ̀ūúùụ̄ụ́ụ̀n̄ńǹm̄ḿm̀]{4}$"

to extract 4 letter Igbo words from a text, akụ̀ is incorrectly 
classified as a 4 letter word, when it is a three letter word.  Would a 
patch to fix this be accepted?

Regards,
Benson Muite




This bug report was last modified 3 years and 40 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.