From unknown Mon Jun 16 23:37:56 2025
From: bug#52067 <52067@debbugs.gnu.org>
To: bug#52067 <52067@debbugs.gnu.org>
Subject: Status: 29.0.50; string-glyph-split halts on certain emoji strings
Reply-To: bug#52067 <52067@debbugs.gnu.org>
Date: Tue, 17 Jun 2025 06:37:56 +0000

retitle 52067 29.0.50; string-glyph-split halts on certain emoji strings
reassign 52067 emacs
submitter 52067 PAVLOS MARAGAKIS
severity 52067 normal
thanks

From debbugs-submit-bounces@debbugs.gnu.org Tue Nov 23 18:01:27 2021
From: PAVLOS MARAGAKIS
Subject: 29.0.50; string-glyph-split halts on certain emoji strings
Date: Tue, 23 Nov 2021 18:01:12 -0500
To: bug-gnu-emacs@gnu.org

In a clean instance of Emacs, paste the following lines into the scratch
buffer and evaluate the lines starting with string-glyph-split.  I show
the outputs below:

(string-glyph-split "🌍🦹")
("🌍" "🦹")

(string-glyph-split "✈️✈️")
("✈️" "✈️")

(string-glyph-split "🌍✈️")
("🌍" "✈️")

(string-glyph-split "✈️🌍")

The last form hangs Emacs; C-g stops the evaluation.
The expected behavior is to split the string into two glyphs.

In GNU Emacs 29.0.50 (build 2, aarch64-apple-darwin21.1.0, NS appkit-2113.00 Version 12.0.1 (Build 21A559))
 of 2021-11-21 built on MacbookPro13.local
Repository revision: b7db7eb2c7b8ac1bddf4afa9ccf9b30ebeb0224e
Repository branch: master
Windowing system distributor 'Apple', version 10.3.2113
System Description: macOS 12.0.1

Configured using:
 'configure --disable-silent-rules
 --enable-locallisppath=/usr/local/share/emacs/28.0.50/site-lisp
 --prefix=/usr/local/opt/gccemacs --without-dbus --without-imagemagick
 --with-mailutils --with-ns --disable-ns-self-contained --with-cairo
 --with-modules --with-xml2 --with-gnutls --with-json --with-rsvg
 --with-native-compilation CC=/usr/bin/clang
 CFLAGS=-I/opt/homebrew/lib/gcc/11/include
 'LDFLAGS=-L/opt/homebrew/lib/gcc/11/
 -I/opt/homebrew/lib/gcc/11/include'
 CPPFLAGS=-I/opt/homebrew/opt/libffi/include
 'PKG_CONFIG_PATH=/opt/homebrew/opt/libffi/lib/pkgconfig --no-create
 --no-recursion''

Configured features:
ACL GLIB GNUTLS JSON LCMS2 LIBXML2 MODULES NATIVE_COMP NOTIFY KQUEUE NS
PDUMPER PNG RSVG THREADS TOOLKIT_SCROLL_BARS WEBP XIM ZLIB

Important settings:
  value of $LANG: en_US.UTF-8
  locale-coding-system: utf-8-unix

Major mode: Lisp Interaction

Minor modes in effect:
  tooltip-mode: t
  global-eldoc-mode: t
  eldoc-mode: t
  show-paren-mode: t
  electric-indent-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  indent-tabs-mode: t
  transient-mark-mode: t

Load-path shadows:
None found.

Features:
(shadow sort mail-extr emacsbug message mailcap yank-media rmc puny
dired dired-loaddefs rfc822 mml mml-sec epa derived epg rfc6068
epg-config gnus-util rmail rmail-loaddefs auth-source cl-seq eieio
eieio-core cl-macs eieio-loaddefs password-cache json map
text-property-search seq gv byte-opt bytecomp byte-compile cconv
mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev gmm-utils
mailheader sendmail rfc2047 rfc2045 ietf-drums mm-util mail-prsvr
mail-utils time-date subr-x help-fns radix-tree cl-print debug backtrace
find-func help-mode cl-loaddefs cl-lib iso-transl tooltip eldoc paren
electric uniquify ediff-hook vc-hooks lisp-float-type elisp-mode mwheel
term/ns-win ns-win ucs-normalize mule-util term/common-win tool-bar dnd
fontset image regexp-opt fringe tabulated-list replace newcomment
text-mode lisp-mode prog-mode register page tab-bar menu-bar rfn-eshadow
isearch easymenu timer select scroll-bar mouse jit-lock font-lock syntax
font-core term/tty-colors frame minibuffer cl-generic cham georgian
utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao korean
japanese eucjp-ms cp51932 hebrew greek romanian slovak czech european
ethiopic indian cyrillic chinese composite emoji-zwj charscript charprop
case-table epa-hook jka-cmpr-hook help simple abbrev obarray
cl-preloaded nadvice button loaddefs faces cus-face macroexp files
window text-properties overlay sha1 md5 base64 format env code-pages
mule custom widget keymap hashtable-print-readable backquote threads
kqueue cocoa ns lcms2 multi-tty make-network-process native-compile
emacs)

Memory information:
((conses 16 76126 7924)
 (symbols 48 7053 0)
 (strings 32 21398 1914)
 (string-bytes 1 725508)
 (vectors 16 15268)
 (vector-slots 8 317989 13284)
 (floats 8 26 61)
 (intervals 56 341 0)
 (buffers 992 13))

From debbugs-submit-bounces@debbugs.gnu.org Tue Nov 23 22:51:16 2021
From: Paul Maragakis
Subject: Re: bug#52067: Acknowledgement (29.0.50; string-glyph-split halts on certain emoji strings)
Date: Tue, 23 Nov 2021 22:51:07 -0500
To: 52067@debbugs.gnu.org
Message-Id: <829C7136-B97A-47CB-9F35-76EF73D971E5@icloud.com>

The logic in string-glyph-split expects the first two elements in the result
from find-composition-internal to give the start and end of a multibyte grapheme
and return nil when there is a regular character at position POS.  However, this
isn't always the case.

Let's call x the argument POS in find-composition-internal,
and "interval" the first two elements of the return value.

The following example works as expected, i.e. x of 0 or 1 returns the interval (0 2),
and x of 2 or 3 returns (2 4).

(null (pp (mapcar '(lambda (x) (list x (find-composition-internal x nil "✈️✈️" nil))) '(0 1 2 3 4))))
((0 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (1 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (2 (2 4 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (3 (2 4 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (4 nil))
nil

In the following case, however, x of 2 returns the interval (0 2), so
string-glyph-split sets its start position back to 2 and never advances:

(null (pp (mapcar '(lambda (x) (list x (find-composition-internal x nil "✈️🌍" nil))) '(0 1 2 3))))
((0 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (1 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (2 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (3 nil))
nil

Interestingly, in the following case, x of 0, 1, 2, or 3 all return (0 2).

(null (pp (mapcar '(lambda (x) (list x (find-composition-internal x nil "✈️🌍🌍" nil))) '(0 1 2 3 4))))
((0 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (1 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (2 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (3 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (4 nil))
nil

And in the following case a POS of 3 returns (3 5):

(null (pp (mapcar '(lambda (x) (list x (find-composition-internal x nil "✈️🌍✈️" nil))) '(0 1 2 3 4 5))))
((0 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (1 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (2 (0 2 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (3 (3 5 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (4 (3 5 [[# 9992 65039] 296 [0 1 9992 233 23 0 23 18 4 nil]]))
 (5 nil))
nil

> On Nov 23, 2021, at 6:02 PM, GNU bug Tracking System wrote:
>
> Thank you for filing a new bug report with debbugs.gnu.org.
>
> This is an automatically generated reply to let you know your message
> has been received.
>
> Your message is being forwarded to the package maintainers and other
> interested parties for their attention; they will reply in due course.
>
> Your message has been sent to the package maintainer(s):
> bug-gnu-emacs@gnu.org
>
> If you wish to submit further information on this problem, please
> send it to 52067@debbugs.gnu.org.
>
> Please do not send mail to help-debbugs@gnu.org unless you wish
> to report a problem with the Bug-tracking system.
>
> --
> 52067: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=52067
> GNU Bug Tracking System
> Contact help-debbugs@gnu.org with problems

From debbugs-submit-bounces@debbugs.gnu.org Tue Nov 23 23:58:29 2021
From: Paul Maragakis
Subject: possible fix for string-glyph-split halts on certain emoji strings.
Message-Id: <33CD01AE-0B26-42EA-83F0-A1FFEBE6E11B@icloud.com>
Date: Tue, 23 Nov 2021 23:58:17 -0500
To: 52067@debbugs.gnu.org

The following code fixes this bug, though there might be better ways to
fix it for someone who understands the domain.
I don't know much about glyph/grapheme representations, so although this
code passes my limited tests, it may break other things.

(defun pm-string-glyph-split (string)
  "Split STRING into a list of strings representing separate glyphs.
This takes into account combining characters and grapheme clusters."
  (let ((result nil)
        (start 0)
        (laststart -1) ;; the last start of a character with the composition property
        comp)
    (while (< start (length string))
      (setq comp (find-composition-internal start nil string nil))
      (if (and comp (/= laststart (car comp))) ;; check that we don't return to same start
          (progn
            (push (substring string (car comp) (cadr comp)) result)
            (setq laststart start) ;; keep the start of the last successful search.
            (setq start (cadr comp)))
        (push (substring string start (1+ start)) result)
        (setq start (1+ start))))
    (nreverse result)))

Compare to the original:

(defun string-glyph-split (string)
  "Split STRING into a list of strings representing separate glyphs.
This takes into account combining characters and grapheme clusters."
  (let ((result nil)
        (start 0)
        comp)
    (while (< start (length string))
      (if (setq comp (find-composition-internal start nil string nil))
          (progn
            (push (substring string (car comp) (cadr comp)) result)
            (setq start (cadr comp)))
        (push (substring string start (1+ start)) result)
        (setq start (1+ start))))
    (nreverse result)))

From debbugs-submit-bounces@debbugs.gnu.org Wed Nov 24 02:31:10 2021
From: Lars Ingebrigtsen
To: Paul Maragakis
Cc: 52067@debbugs.gnu.org
Subject: Re: bug#52067: 29.0.50; string-glyph-split halts on certain emoji strings
References: <829C7136-B97A-47CB-9F35-76EF73D971E5@icloud.com>
Date: Wed, 24 Nov 2021 08:30:55 +0100
In-Reply-To: <829C7136-B97A-47CB-9F35-76EF73D971E5@icloud.com> (Paul Maragakis's message of "Tue, 23 Nov 2021 22:51:07 -0500")
Message-ID: <87a6huvwls.fsf_-_@gnus.org>

Paul Maragakis writes:

> The logic in string-glyph-split expects the first two elements in the result
> from find-composition-internal to give the start and end of a multibyte grapheme
> and return nil when there is a regular character at position POS.  However, this
> isn't always the case.

Yup.

Paul Maragakis writes:

> The following code fixes this bug, though there might be better ways
> to fix it for someone who understands the domain.

Thanks.  `find-composition' takes a LIMIT parameter, and that'll
make it avoid searching back into the bit of the string that we've
already handled.  So I did that instead in Emacs 29.

--
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no

From debbugs-submit-bounces@debbugs.gnu.org Wed Nov 24 02:31:13 2021
Date: Wed, 24 Nov 2021 08:31:01 +0100
Message-Id: <878rxevwlm.fsf@gnus.org>
To: control@debbugs.gnu.org
From: Lars Ingebrigtsen
Subject: control message for bug #52067

close 52067 29.1
quit

From debbugs-submit-bounces@debbugs.gnu.org Wed Nov 24 10:15:21 2021
Subject: Re: bug#52067: 29.0.50; string-glyph-split halts on certain emoji strings
From: Paul Maragakis
In-Reply-To: <87a6huvwls.fsf_-_@gnus.org>
Date: Wed, 24 Nov 2021 10:15:07 -0500
Message-Id: <29E98EEC-07C6-4270-9B23-A9734CFC9279@icloud.com>
References: <829C7136-B97A-47CB-9F35-76EF73D971E5@icloud.com> <87a6huvwls.fsf_-_@gnus.org>
To: Lars Ingebrigtsen
Cc: 52067@debbugs.gnu.org

Excellent---and thanks for the explanation!
I confirm that the latest Emacs 29 fixes the bug.
You can close this ticket.

Paul

> On Nov 24, 2021, at 2:30 AM, Lars Ingebrigtsen wrote:
>
> Paul Maragakis writes:
>
>> The logic in string-glyph-split expects the first two elements in the result
>> from find-composition-internal to give the start and end of a multibyte grapheme
>> and return nil when there is a regular character at position POS.  However, this
>> isn't always the case.
>
> Yup.
>
> Paul Maragakis writes:
>
>> The following code fixes this bug, though there might be better ways
>> to fix it for someone who understands the domain.
>
> Thanks.  `find-composition' takes a LIMIT parameter, and that'll
> make it avoid searching back into the bit of the string that we've
> already handled.  So I did that instead in Emacs 29.
>
> --
> (domestic pets only, the antidote for overdose, milk.)
>   bloggy blog: http://lars.ingebrigtsen.no

From debbugs-submit-bounces@debbugs.gnu.org Wed Nov 24 11:15:11 2021
From: Lars Ingebrigtsen
To: Paul Maragakis
Cc: 52067@debbugs.gnu.org
Subject: Re: bug#52067: 29.0.50; string-glyph-split halts on certain emoji strings
References: <829C7136-B97A-47CB-9F35-76EF73D971E5@icloud.com> <87a6huvwls.fsf_-_@gnus.org> <29E98EEC-07C6-4270-9B23-A9734CFC9279@icloud.com>
Date: Wed, 24 Nov 2021 17:14:54 +0100
In-Reply-To: <29E98EEC-07C6-4270-9B23-A9734CFC9279@icloud.com> (Paul Maragakis's message of "Wed, 24 Nov 2021 10:15:07 -0500")
Message-ID: <87wnkxy1hd.fsf@gnus.org>

Paul Maragakis writes:

> Excellent---and thanks for the explanation!
> I confirm that the latest Emacs 29 fixes the bug.
> You can close this ticket.

Thanks for checking; closed now.

--
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no

From unknown Mon Jun 16 23:37:56 2025
Received: (at fakecontrol) by fakecontrolmessage;
To: internal_control@debbugs.gnu.org
From: Debbugs Internal Request
Subject: Internal Control
Message-Id: bug archived.
Date: Thu, 23 Dec 2021 12:24:08 +0000
User-Agent: Fakemail v42.6.9

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator
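
Note on the LIMIT approach mentioned in the thread: the maintainer's reply says that find-composition takes a LIMIT parameter, which keeps it from searching back into the part of the string already handled. The following is a minimal sketch of that idea, not code taken from the Emacs sources. The function name my-string-glyph-split-limited is hypothetical, and the sketch assumes that passing a LIMIT just past POS to find-composition-internal restricts the search to the forward direction. As an extra safeguard it only accepts a composition that begins exactly at the current position, so the loop always advances. The actual splits depend on auto-composition-mode and the available fonts, so no output is shown.

```elisp
;; Hypothetical helper for illustration; not the committed Emacs change.
(defun my-string-glyph-split-limited (string)
  "Split STRING into a list of strings, one per glyph (sketch)."
  (let ((result nil)
        (start 0)
        (len (length string))
        comp)
    (while (< start len)
      ;; Assumption: a LIMIT just past START makes the search look
      ;; forward only, so an earlier composition is not returned.
      (setq comp (find-composition-internal
                  start (min len (1+ start)) string nil))
      (if (and comp (= (car comp) start))
          (progn
            ;; A composition begins at START: take it whole.
            (push (substring string (car comp) (cadr comp)) result)
            (setq start (cadr comp)))
        ;; Otherwise take a single character, which always advances START.
        (push (substring string start (1+ start)) result)
        (setq start (1+ start))))
    (nreverse result)))
```

Evaluating (my-string-glyph-split-limited "✈️🌍") in a graphical Emacs session is one way to compare its behavior with the forms from the original report.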