From debbugs-submit-bounces@debbugs.gnu.org Tue Sep 21 05:28:09 2021 Received: (at submit) by debbugs.gnu.org; 21 Sep 2021 09:28:09 +0000 Received: from localhost ([127.0.0.1]:44932 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mSc4O-0002HX-Vk for submit@debbugs.gnu.org; Tue, 21 Sep 2021 05:28:09 -0400 Received: from lists.gnu.org ([209.51.188.17]:34398) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mSc4J-0002HI-13 for submit@debbugs.gnu.org; Tue, 21 Sep 2021 05:28:07 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:52024) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mSc4I-000112-RB for bug-gnu-emacs@gnu.org; Tue, 21 Sep 2021 05:28:02 -0400 Received: from mail-vk1-xa29.google.com ([2607:f8b0:4864:20::a29]:39729) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1mSc4G-0007sU-Pu for bug-gnu-emacs@gnu.org; Tue, 21 Sep 2021 05:28:02 -0400 Received: by mail-vk1-xa29.google.com with SMTP id f73so5904046vkf.6 for ; Tue, 21 Sep 2021 02:28:00 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:from:date:message-id:subject:to; bh=CeKw8G9cdmMUMch3Lh1tNNu/ItbE0FqXxN3Yn0TmsDk=; b=Ujvvw5FIk7Ab46mbA59ul6lRUZK0msO1P7TauPB1KhjCRa0l/VyTgxQ5oas5qCLed2 kN0qdGE2xB949aZTRmcGS86EUv9MXPETB9zkDp7EKEJBuOBOZAfwdAUECjVFkzmsc89Q cvvWPdw83kupYFBN+Y1YOZHNr3Rwx5DxZuHXP65Bbay1DKrtQ3OGZO4h8Lao25KM6gUb vLzL0bG0THimGF/l5Sw0tLBlnt740cFJgBou2BW9kb809CeUBnRmQ89g4OiuCJxf7IVz Syf3QvnZykGfk4ys2b1H1twQ4ZMrLKROASrwDltlPFHM4cc2Z3kg37ZjjEVSEWo2e1t/ KfJw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=CeKw8G9cdmMUMch3Lh1tNNu/ItbE0FqXxN3Yn0TmsDk=; b=dLhVJpF1cvw0j7FonYra/+oIa0EQvRJX+Ucnu1uYaGgZ2hTuvCpHjsobsF9BPF5AMg OqEboqLM9c3xAldiLj/WydIClN3zcQoWeRMZCISKQyPNx+pAFSeG+0QvUEDjJ7qEeBEU HDZrnK3kVYNLym0AunJ9wjIVOmEwCgeDdKOZrY9FM72JUp5T8os059RDBIoKUbw5L1Wy 035Svh7mbwVJsGw9IZ4f6qnasIqHhm+AVHGnD+KWOPAMi0UzvyZ0marAXdWPTzBI98ru Rsu7hCAvLL0VoDlhwcBxuqORqf+hWhV0aF4Jee3gpSU3PjTUd447FupbRbXlqxeh7ZOH GwoA== X-Gm-Message-State: AOAM531Mzpu+OAadszg5AyhF3zLovWWVZyBzdeWhOr7StSO+8eu5kN/g wXFZUBmxWD7Nel2PRXq+fZs6AFWCRVenNLd5T8218fJEeWE= X-Google-Smtp-Source: ABdhPJxuTIW16cXK9D8AFcV+zv7L2t+u7s5JYXgJNPk6HOk8XenY3AMNj9nE9UHfYNKUlrU7N1PgeoS7VX19zSN21uM= X-Received: by 2002:a1f:9f10:: with SMTP id i16mr10434342vke.0.1632216479217; Tue, 21 Sep 2021 02:27:59 -0700 (PDT) MIME-Version: 1.0 From: dalanicolai Date: Tue, 21 Sep 2021 11:27:48 +0200 Message-ID: Subject: 28.0.50; `split-string` fails on certain unicode strings To: bug-gnu-emacs@gnu.org Content-Type: multipart/alternative; boundary="00000000000070ba7005cc7e03d1" Received-SPF: pass client-ip=2607:f8b0:4864:20::a29; envelope-from=dalanicolai@gmail.com; helo=mail-vk1-xa29.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -2.3 (--) --00000000000070ba7005cc7e03d1 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Evaluate: (split-string "=E0=A5=A7=E0=A5=A6.=E0=A5=A9" ".") It wrongly returns a list with only empty string. Of course it should return alist with the individual devanagari numbers. In GNU Emacs 28.0.50 (build 3, x86_64-pc-linux-gnu, GTK+ Version 3.24.30, cairo version 1.17.4) of 2021-09-06 built on daniel-fedora Repository revision: c4724add006e62b81f847937db56335a81bdcc74 Repository branch: master Windowing system distributor 'The X.Org Foundation', version 11.0.12011000 System Description: Fedora 34 (Workstation Edition) Configured using: 'configure --with-mailutils --with-cairo --with-modules --with-pgtk --with-native-compilation' Configured features: ACL CAIRO DBUS FREETYPE GIF GLIB GMP GNUTLS GPM GSETTINGS HARFBUZZ JPEG JSON LCMS2 LIBOTF LIBSELINUX LIBSYSTEMD LIBXML2 M17N_FLT MODULES NATIVE_COMP NOTIFY INOTIFY PDUMPER PNG RSVG SECCOMP SOUND THREADS TIFF TOOLKIT_SCROLL_BARS X11 XDBE XIM XPM GTK3 ZLIB Important settings: value of $LANG: en_US.UTF-8 value of $XMODIFIERS: @im=3Dnone locale-coding-system: utf-8-unix Major mode: Lisp Interaction Minor modes in effect: tooltip-mode: t global-eldoc-mode: t eldoc-mode: t electric-indent-mode: t mouse-wheel-mode: t tool-bar-mode: t menu-bar-mode: t file-name-shadow-mode: t global-font-lock-mode: t font-lock-mode: t blink-cursor-mode: t auto-composition-mode: t auto-encryption-mode: t auto-compression-mode: t line-number-mode: t indent-tabs-mode: t transient-mark-mode: t Load-path shadows: None found. Features: (shadow sort mail-extr emacsbug comp comp-cstr warnings rx message rmc puny dired dired-loaddefs rfc822 mml mml-sec epa derived epg rfc6068 epg-config gnus-util rmail rmail-loaddefs auth-source cl-seq eieio eieio-core cl-macs eieio-loaddefs password-cache json map mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader sendmail rfc2047 rfc2045 ietf-drums mm-util mail-prsvr mail-utils time-date subr-x cl-extra shortdoc text-property-search seq byte-opt gv bytecomp byte-compile cconv help-fns radix-tree help-mode cl-loaddefs cl-lib iso-transl tooltip eldoc electric uniquify ediff-hook vc-hooks lisp-float-type mwheel term/x-win x-win term/common-win x-dnd tool-bar dnd fontset image regexp-opt fringe tabulated-list replace newcomment text-mode elisp-mode lisp-mode prog-mode register page tab-bar menu-bar rfn-eshadow isearch easymenu timer select scroll-bar mouse jit-lock font-lock syntax font-core term/tty-colors frame minibuffer cl-generic cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao korean japanese eucjp-ms cp51932 hebrew greek romanian slovak czech european ethiopic indian cyrillic chinese composite charscript charprop case-table epa-hook jka-cmpr-hook help simple abbrev obarray cl-preloaded nadvice button loaddefs faces cus-face macroexp files window text-properties overlay sha1 md5 base64 format env code-pages mule custom widget hashtable-print-readable backquote threads dbusbind inotify lcms2 dynamic-setting system-font-setting font-render-setting cairo move-toolbar gtk x-toolkit x multi-tty make-network-process native-compile emacs) Memory information: ((conses 16 94870 10759) (symbols 48 7970 1) (strings 32 23722 1760) (string-bytes 1 872683) (vectors 16 16528) (vector-slots 8 305866 17210) (floats 8 71 35) (intervals 56 444 0) (buffers 992 14)) --00000000000070ba7005cc7e03d1 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Evaluate: (split-str= ing "=E0=A5=A7=E0=A5=A6.=E0=A5=A9" ".")
It wrongly returns a list with only empty s= tring.
Of course it should r= eturn alist with the individual devanagari numbers.

In GNU Emacs 28.0.50 (build 3,= x86_64-pc-linux-gnu, GTK+ Version 3.24.30, cairo version 1.17.4)
=
=C2=A0of 2021-09-06 built on daniel-f= edora
Repository revision: c= 4724add006e62b81f847937db56335a81bdcc74
Repository branch: master
Windowing system distributor 'The X.Org Foundation', vers= ion 11.0.12011000
System Des= cription: Fedora 34 (Workstation Edition)

Configured us= ing:
=C2=A0'configure --= with-mailutils --with-cairo --with-modules --with-pgtk
=C2=A0--with-native-compilation'
<= div style=3D"color:rgb(46,52,54);font-family:monospace;font-size:13.3333px;= font-style:normal;font-variant-caps:normal;font-weight:normal;letter-spacin= g:normal;text-align:start;text-indent:0px;text-transform:none;word-spacing:= 0px;text-decoration:none;width:71ch">
Configured features:
ACL CAIRO DBUS FREETYPE GIF GLIB GMP GNUTLS GPM GSETTINGS HARFBUZZ JPEG<= br>
JSON LCMS2 LIBOTF LIBSELINUX= LIBSYSTEMD LIBXML2 M17N_FLT MODULES
NATIVE_COMP NOTIFY INOTIFY PDUMPER PNG RSVG SECCOMP SOUND THREADS = TIFF
TOOLKIT_SCROLL_BARS X11= XDBE XIM XPM GTK3 ZLIB

=
Important settings:
=C2=A0 value of $LANG: en_US.UTF-8
<= /div>
=C2=A0 value of $XMODIFIERS: @im= =3Dnone
=C2=A0 locale-coding= -system: utf-8-unix

Major mode: Lisp Interaction

Minor modes in effect:
=C2=A0 tooltip-mode: t
=C2=A0 global-eldoc-mode: t
=C2=A0 eldoc-mode: t
=C2= =A0 electric-indent-mode: t
= =C2=A0 mouse-wheel-mode: t
= =C2=A0 tool-bar-mode: t
=C2= =A0 menu-bar-mode: t
=C2=A0 = file-name-shadow-mode: t
=C2= =A0 global-font-lock-mode: t
=C2=A0 font-lock-mode: t
= =C2=A0 blink-cursor-mode: t
= =C2=A0 auto-composition-mode: t
=C2=A0 auto-encryption-mode: t
=C2=A0 auto-compression-mode: t
=C2=A0 line-number-mode: t
=C2=A0 indent-tabs-mode: t
=C2=A0 transient-mark-mode: t

Load-path shadow= s:
None found.

Features:
(shadow sor= t mail-extr emacsbug comp comp-cstr warnings rx message rmc
puny dired dired-loaddefs rfc822 mml mml-se= c epa derived epg rfc6068
ep= g-config gnus-util rmail rmail-loaddefs auth-source cl-seq eieio
<= div style=3D"color:rgb(46,52,54);font-family:monospace;font-size:13.3333px;= font-style:normal;font-variant-caps:normal;font-weight:normal;letter-spacin= g:normal;text-align:start;text-indent:0px;text-transform:none;word-spacing:= 0px;text-decoration:none;width:71ch">eieio-core cl-macs eieio-loaddefs pass= word-cache json map mm-decode
sendmail rfc2047 rfc2045 ietf-= drums mm-util mail-prsvr mail-utils
time-date subr-x cl-extra shortdoc text-property-search seq byte-op= t gv
bytecomp byte-compile c= conv help-fns radix-tree help-mode cl-loaddefs
cl-lib iso-transl tooltip eldoc electric uniquify ediff-= hook vc-hooks
lisp-float-typ= e mwheel term/x-win x-win term/common-win x-dnd tool-bar
dnd fontset image regexp-opt fringe tabulated-= list replace newcomment
text= -mode elisp-mode lisp-mode prog-mode register page tab-bar menu-bar
rfn-eshadow isearch easymenu timer = select scroll-bar mouse jit-lock
font-lock syntax font-core term/tty-colors frame minibuffer cl-generic=
cham georgian utf-8-lang mi= sc-lang vietnamese tibetan thai tai-viet lao
korean japanese eucjp-ms cp51932 hebrew greek romanian slo= vak czech
european ethiopic = indian cyrillic chinese composite charscript charprop
case-table epa-hook jka-cmpr-hook help simple a= bbrev obarray
cl-preloaded n= advice button loaddefs faces cus-face macroexp files
window text-properties overlay sha1 md5 base64 for= mat env code-pages
mule cust= om widget hashtable-print-readable backquote threads dbusbind
inotify lcms2 dynamic-setting system-font= -setting font-render-setting
cairo move-toolbar gtk x-toolkit x multi-tty make-network-process
native-compile emacs)

Memory information:
((= conses 16 94870 10759)
=C2= =A0(symbols 48 7970 1)
=C2= =A0(strings 32 23722 1760)
= =C2=A0(string-bytes 1 872683)
= =C2=A0(vector-slots 8 305866 17210)
=C2=A0(floats 8 71 35)
=C2=A0(intervals 56 444 0)

--00000000000070ba7005cc7e03d1-- From debbugs-submit-bounces@debbugs.gnu.org Tue Sep 21 05:44:36 2021 Received: (at 50718) by debbugs.gnu.org; 21 Sep 2021 09:44:36 +0000 Received: from localhost ([127.0.0.1]:44963 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mScKK-0002nG-6g for submit@debbugs.gnu.org; Tue, 21 Sep 2021 05:44:36 -0400 Received: from eggs.gnu.org ([209.51.188.92]:37700) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mScKH-0002my-Qy; Tue, 21 Sep 2021 05:44:34 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:41984) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mScKC-0004ve-Gh; Tue, 21 Sep 2021 05:44:28 -0400 Received: from 84.94.185.95.cable.012.net.il ([84.94.185.95]:4278 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mScKB-0000Rb-KB; Tue, 21 Sep 2021 05:44:28 -0400 Date: Tue, 21 Sep 2021 12:44:22 +0300 Message-Id: <83czp2z2ix.fsf@gnu.org> From: Eli Zaretskii To: dalanicolai In-Reply-To: (message from dalanicolai on Tue, 21 Sep 2021 11:27:48 +0200) Subject: Re: bug#50718: 28.0.50; `split-string` fails on certain unicode strings References: MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 50718 Cc: 50718@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) tags 50718 notabug thanks > From: dalanicolai > Date: Tue, 21 Sep 2021 11:27:48 +0200 > > Evaluate: (split-string "१०.३" ".") > It wrongly returns a list with only empty string. > Of course it should return alist with the individual devanagari numbers. That's a cockpit error: the SEPARATORS argument should be a regular expression, so you should use "\\." instead. From debbugs-submit-bounces@debbugs.gnu.org Tue Sep 21 05:51:49 2021 Received: (at 50718) by debbugs.gnu.org; 21 Sep 2021 09:51:49 +0000 Received: from localhost ([127.0.0.1]:44977 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mScRJ-000329-Ed for submit@debbugs.gnu.org; Tue, 21 Sep 2021 05:51:49 -0400 Received: from mail-out.m-online.net ([212.18.0.9]:38502) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mScRH-000320-Cr for 50718@debbugs.gnu.org; Tue, 21 Sep 2021 05:51:48 -0400 Received: from frontend01.mail.m-online.net (unknown [192.168.8.182]) by mail-out.m-online.net (Postfix) with ESMTP id 4HDGss6GyNz1qxHJ; Tue, 21 Sep 2021 11:51:45 +0200 (CEST) Received: from localhost (dynscan1.mnet-online.de [192.168.6.70]) by mail.m-online.net (Postfix) with ESMTP id 4HDGss4hDkz1qqkB; Tue, 21 Sep 2021 11:51:45 +0200 (CEST) X-Virus-Scanned: amavisd-new at mnet-online.de Received: from mail.mnet-online.de ([192.168.8.182]) by localhost (dynscan1.mail.m-online.net [192.168.6.70]) (amavisd-new, port 10024) with ESMTP id lv6HvSnukmsh; Tue, 21 Sep 2021 11:51:45 +0200 (CEST) X-Auth-Info: N6Ys6KnoxQBUcbo3mdS/Z91UgsM/u0u86nZY+C/1whRoFs7dW65Jwbr+JGBnP2fN Received: from igel.home (ppp-46-244-182-158.dynamic.mnet-online.de [46.244.182.158]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.mnet-online.de (Postfix) with ESMTPSA; Tue, 21 Sep 2021 11:51:45 +0200 (CEST) Received: by igel.home (Postfix, from userid 1000) id 4D9F12C258D; Tue, 21 Sep 2021 11:51:44 +0200 (CEST) From: Andreas Schwab To: dalanicolai Subject: Re: bug#50718: 28.0.50; `split-string` fails on certain unicode strings References: X-Yow: I'm having a BIG BANG THEORY!! Date: Tue, 21 Sep 2021 11:51:44 +0200 In-Reply-To: (dalanicolai@gmail.com's message of "Tue, 21 Sep 2021 11:27:48 +0200") Message-ID: <87o88mfe8f.fsf@igel.home> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Spam-Score: -0.5 (/) X-Debbugs-Envelope-To: 50718 Cc: 50718@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.5 (-) On Sep 21 2021, dalanicolai wrote: > Evaluate: (split-string "१०.३" ".") > It wrongly returns a list with only empty string. You have specified all characters as separators, since "." matches any character. If you want to match only the period you need to use "\\." has the regexp. Andreas. -- Andreas Schwab, schwab@linux-m68k.org GPG Key fingerprint = 7578 EB47 D4E5 4D69 2510 2552 DF73 E780 A9DA AEC1 "And now for something completely different." From debbugs-submit-bounces@debbugs.gnu.org Tue Sep 21 11:29:35 2021 Received: (at 50718-done) by debbugs.gnu.org; 21 Sep 2021 15:29:36 +0000 Received: from localhost ([127.0.0.1]:47508 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mShiB-0003FE-O3 for submit@debbugs.gnu.org; Tue, 21 Sep 2021 11:29:35 -0400 Received: from mail-pf1-f174.google.com ([209.85.210.174]:39715) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mShi9-0003Ex-IN for 50718-done@debbugs.gnu.org; Tue, 21 Sep 2021 11:29:34 -0400 Received: by mail-pf1-f174.google.com with SMTP id e16so19840804pfc.6 for <50718-done@debbugs.gnu.org>; Tue, 21 Sep 2021 08:29:33 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:in-reply-to:references:user-agent :mime-version:date:message-id:subject:to:cc :content-transfer-encoding; bh=rPfkP07/tuRWiPGo7+gDIYC/TpnRIhvDX0r6zeOx+RQ=; b=Ni5OL8cCi7XrATOXB89qPQutDSmecTNmlazp9UMJKG8vmC53d7AVDB33cmM5gQG6CB 8Inq59pza1UwnrxajYJigTTdNAQFdTKyE/hMhQOHj/I4E3y6ZnZCsXF4xG3Ema0MWRjY E4IQlZbKxDTVxP6Zoaj5HOPUhA1AJmMwJTPyda1MDv+DsF7Gr/rvVsVAQI5Z/aXA+bNd XuzjY7LNky20+vicraEbJ5pqMbeJwLOMsTlon/GgGUCLF4y5XmJ2pBnMqjL52M/LZiMF 8C1AT32gbWfxKKJE9WYwWA7TmYi7eUHKgVD6nRZVvY2iZgBr6QlVqZbAYKigVQ+xlDiX R3cw== X-Gm-Message-State: AOAM5337cdqmlUsJLF9VWXMWl3FlwLxgutpdfY1Mrw2OGvBU5eshghA0 8F6r5JuGf3IUNTSaFnpHsQzFXpD8JZOaDRHyJLo= X-Google-Smtp-Source: ABdhPJyeeFZ0CKqD56tbY21/lUxTPCeVPGh1njKDQBGaevFvvd9uuhstFHNLAfvBwZmcdA0hAg0Ivw1WyW8Q5GossNo= X-Received: by 2002:a63:a311:: with SMTP id s17mr28305009pge.359.1632238167539; Tue, 21 Sep 2021 08:29:27 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Tue, 21 Sep 2021 08:29:27 -0700 From: Stefan Kangas In-Reply-To: <83czp2z2ix.fsf@gnu.org> (Eli Zaretskii's message of "Tue, 21 Sep 2021 12:44:22 +0300") References: <83czp2z2ix.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Date: Tue, 21 Sep 2021 08:29:27 -0700 Message-ID: Subject: Re: bug#50718: 28.0.50; `split-string` fails on certain unicode strings To: Eli Zaretskii Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Spam-Score: 0.5 (/) X-Debbugs-Envelope-To: 50718-done Cc: 50718-done@debbugs.gnu.org, dalanicolai X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.5 (/) Eli Zaretskii writes: > tags 50718 notabug > thanks > >> From: dalanicolai >> Date: Tue, 21 Sep 2021 11:27:48 +0200 >> >> Evaluate: (split-string "=E0=A5=A7=E0=A5=A6.=E0=A5=A9" ".") >> It wrongly returns a list with only empty string. >> Of course it should return alist with the individual devanagari numbers. > > That's a cockpit error: the SEPARATORS argument should be a regular > expression, so you should use "\\." instead. I'm therefore closing this bug report. From debbugs-submit-bounces@debbugs.gnu.org Wed Sep 22 03:59:41 2021 Received: (at 50718) by debbugs.gnu.org; 22 Sep 2021 07:59:41 +0000 Received: from localhost ([127.0.0.1]:48778 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mSxAL-0003AM-6f for submit@debbugs.gnu.org; Wed, 22 Sep 2021 03:59:41 -0400 Received: from mail-vs1-f42.google.com ([209.85.217.42]:44646) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mSxAJ-0003A6-QH for 50718@debbugs.gnu.org; Wed, 22 Sep 2021 03:59:40 -0400 Received: by mail-vs1-f42.google.com with SMTP id u4so2005749vsu.11 for <50718@debbugs.gnu.org>; Wed, 22 Sep 2021 00:59:39 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=L7EWT97T98fo4p8D6ssnFumiO2pGLL4EUA2L+qLA4nI=; b=MTssBRtNvZmmvNoa9jouFdYFCQUsPxnkWU3FGgcaxPqLzhmyE5jZ8EzPsvRgxkfbKr rAIJxLR66/2MBqOpPwcZO6WWpQWW/M4gozyZEC3QM7pPueqDdwk/Z8SJ9EqRC/J+Gjbd eSmPCNM6hlHQWIIcH25qeQcYdDDH5DJCpJhboq2SFQfyW6AllDwJrmsP5WUUteQP9UN/ 6Kiht8zy4LKR02gzgbUpfR+5qBMbdg0eocrI+dtFf4UaZNXOnkaPdOnc1x2AS6YyUyM0 49Cl4MdlRMn2Y5KS1YjLJO4DvadckLor/qzZhEOF6NER2jUGEvBKs+JgL0PKo+WGWqnC Sb8w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=L7EWT97T98fo4p8D6ssnFumiO2pGLL4EUA2L+qLA4nI=; b=F9c2A4sf6Vaay5dmAXbsQIN9c675bIrSgSUJ+lJXvDIubfzrxEnxKBGcp9huy8zKw3 LIi+ie75V/SNa7fw/ZBjNH1QrvNC7VqKnWGL9LEfw/FMfu3N8jdXz6nv7AytibwMW2bR uzPOQ1jQBmuQXJ3AQ8+GGeov6BMIuQTZ4lZc8e/ep6cvvv8Pk3h7bY0fKpxxe9cdxBlx p53EefhxoXvtPbOH9JqZQGdfJpKMcr8j0NNizU4fM1vUTmOg4Zz9p5HQRfMr9Oh6Hq1o lJwZoIW9pTvzsWuphmbgm7xgJXNLvTuKqwTFk5XfV4MYnfGRQtLvW11+lOi7mDgrgu6J xEjw== X-Gm-Message-State: AOAM532NiQwBq5V/NMtSTmEVyqeLc+IRqQyqZoXrnaLomQ0d+IzsGANi twmF+9s2QtBBmBdrg5rX5he6ANtKfU07WMoC4o8DWJto X-Google-Smtp-Source: ABdhPJzLTn4RV8XByiXLh3EvYc+v5hzvrhGREa2krSNqGGrICaDkOOAKj68/XMbNh65lm+W60cB9t7LUWY+Fz2fmwKA= X-Received: by 2002:a67:e10a:: with SMTP id d10mr5905300vsl.29.1632297573911; Wed, 22 Sep 2021 00:59:33 -0700 (PDT) MIME-Version: 1.0 References: <87o88mfe8f.fsf@igel.home> In-Reply-To: <87o88mfe8f.fsf@igel.home> From: dalanicolai Date: Wed, 22 Sep 2021 09:59:22 +0200 Message-ID: Subject: Re: bug#50718: 28.0.50; `split-string` fails on certain unicode strings To: Andreas Schwab Content-Type: multipart/alternative; boundary="0000000000000f8f8805cc90e572" X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 50718 Cc: 50718@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) --0000000000000f8f8805cc90e572 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Haha, okay that is a some unexperienced (or not fully awake) mistake. Anyway, will not forget about that again, I guess. Thanks for the reply! On Tue, 21 Sept 2021 at 11:51, Andreas Schwab wrote= : > On Sep 21 2021, dalanicolai wrote: > > > Evaluate: (split-string "=E0=A5=A7=E0=A5=A6.=E0=A5=A9" ".") > > It wrongly returns a list with only empty string. > > You have specified all characters as separators, since "." matches any > character. If you want to match only the period you need to use "\\." > has the regexp. > > Andreas. > > -- > Andreas Schwab, schwab@linux-m68k.org > GPG Key fingerprint =3D 7578 EB47 D4E5 4D69 2510 2552 DF73 E780 A9DA AEC= 1 > "And now for something completely different." > --0000000000000f8f8805cc90e572 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Haha, okay that is a some unexperienced (or not fully awak= e) mistake. Anyway, will not forget about that again, I guess. Thanks for t= he reply!

On Tue, 21 Sept 2021 at 11:51, Andreas Schwab <schwab@linux-m68k.org> wrote:
=
On Sep 21 2021, dalanicol= ai wrote:

> Evaluate: (split-string "=E0=A5=A7=E0=A5=A6.=E0=A5=A9" "= ;.")
> It wrongly returns a list with only empty string.

You have specified all characters as separators, since "." matche= s any
character.=C2=A0 If you want to match only the period you need to use "= ;\\."
has the regexp.

Andreas.

--
Andreas Schwab, = schwab@linux-m68k.org
GPG Key fingerprint =3D 7578 EB47 D4E5 4D69 2510=C2=A0 2552 DF73 E780 A9DA = AEC1
"And now for something completely different."
--0000000000000f8f8805cc90e572-- From unknown Tue Jun 17 01:48:32 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Wed, 20 Oct 2021 11:24:06 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator