From unknown Fri Jun 20 05:37:38 2025 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Mailer: MIME-tools 5.509 (Entity 5.509) Content-Type: text/plain; charset=utf-8 From: bug#36085 <36085@debbugs.gnu.org> To: bug#36085 <36085@debbugs.gnu.org> Subject: Status: find-dired could handle/avoid octal escapes printed by GNU find -ls for non-ASCII filenames Reply-To: bug#36085 <36085@debbugs.gnu.org> Date: Fri, 20 Jun 2025 12:37:38 +0000 retitle 36085 find-dired could handle/avoid octal escapes printed by GNU fi= nd -ls for non-ASCII filenames reassign 36085 emacs submitter 36085 Nikita severity 36085 wishlist thanks From debbugs-submit-bounces@debbugs.gnu.org Tue Jun 04 00:11:54 2019 Received: (at submit) by debbugs.gnu.org; 4 Jun 2019 04:11:54 +0000 Received: from localhost ([127.0.0.1]:43606 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hY0ng-0000Pl-Lc for submit@debbugs.gnu.org; Tue, 04 Jun 2019 00:11:53 -0400 Received: from eggs.gnu.org ([209.51.188.92]:53910) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hY0M8-00088g-7M for submit@debbugs.gnu.org; Mon, 03 Jun 2019 23:43:25 -0400 Received: from lists.gnu.org ([209.51.188.17]:42074) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1hY0M3-0002ue-2j for submit@debbugs.gnu.org; Mon, 03 Jun 2019 23:43:19 -0400 Received: from eggs.gnu.org ([209.51.188.92]:40057) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hY0M1-0001Nh-H6 for bug-gnu-emacs@gnu.org; Mon, 03 Jun 2019 23:43:18 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.8 required=5.0 tests=BAYES_50,FREEMAIL_FROM, URIBL_BLOCKED autolearn=disabled version=3.3.2 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1hY0M0-0002rJ-3Y for bug-gnu-emacs@gnu.org; Mon, 03 Jun 2019 23:43:17 -0400 Received: from forward103p.mail.yandex.net ([2a02:6b8:0:1472:2741:0:8b7:106]:52790) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1hY0Ly-0002mU-DP for bug-gnu-emacs@gnu.org; Mon, 03 Jun 2019 23:43:16 -0400 Received: from mxback5o.mail.yandex.net (mxback5o.mail.yandex.net [IPv6:2a02:6b8:0:1a2d::1f]) by forward103p.mail.yandex.net (Yandex) with ESMTP id 9993718C016B for ; Tue, 4 Jun 2019 06:43:08 +0300 (MSK) Received: from smtp3o.mail.yandex.net (smtp3o.mail.yandex.net [2a02:6b8:0:1a2d::27]) by mxback5o.mail.yandex.net (nwsmtp/Yandex) with ESMTP id dRhwhRNd5t-h8eC9jEZ; Tue, 04 Jun 2019 06:43:08 +0300 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yandex.ru; s=mail; t=1559619788; bh=jB/Y8jVMCWcoLxoVztZHIASZmPriT8w8v43nmc6/yC0=; h=From:Subject:To:Date:Message-ID; b=ZGiGHaYVmLvKbYqdJqFFUwstU7IXtRt9irMuPH7KGaYGfY6DE2HTbACUxsWkte3C9 xzrWaAK5O8Mex2ChOetxLT3V7AeYWanFvgXhS5jnemUvNsiqOpfxkI/Exz5wBrBEs2 0NtItNnuV+Pts0SYS4DaYqAzsdc01uvE3Cxi65ck= Authentication-Results: mxback5o.mail.yandex.net; dkim=pass header.i=@yandex.ru Received: by smtp3o.mail.yandex.net (nwsmtp/Yandex) with ESMTPSA id R0kKdFWJaZ-h7xS1Wu1; Tue, 04 Jun 2019 06:43:07 +0300 (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client certificate not present) To: bug-gnu-emacs@gnu.org Subject: 26.2; find-dired octal escapes instead of Cyrillic text From: Nikita Message-ID: <02150b33-7105-f106-c8ea-afb4e7d04b67@yandex.ru> Date: Tue, 4 Jun 2019 08:43:06 +0500 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.7.0 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Content-Language: en-US X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2a02:6b8:0:1472:2741:0:8b7:106 X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Spam-Score: -1.3 (-) X-Debbugs-Envelope-To: submit X-Mailman-Approved-At: Tue, 04 Jun 2019 00:11:51 -0400 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -2.3 (--) When i open dired, go to the needed directory, run "M-x dired-find" "-name "*=D0=9F=D0=BE=D1=80=D1=82=D1=80=D0=B5=D1=82*" (or anything at all= that will give some results) results come back with octal escapes instead of Cyrillic letters. I cannot open pictures that it finds for example. ---- In GNU Emacs 26.2 (build 2, x86_64-pc-linux-gnu, GTK+ Version 3.22.30) of 2019-04-13 built on lgw01-amd64-060 Windowing system distributor 'The X.Org Foundation', version 11.0.1190600= 0 System Description: Linux Mint 19.1 Tessa Recent messages: For information about GNU Emacs and the GNU system, type C-h C-a. user-error: No further undo information [2 times] Quit Mark saved where search started Making completion list... find-dired *Find* finished. Making completion list... [3 times] user-error: Beginning of history; no preceding item user-error: End of history; no default available user-error: Beginning of history; no preceding item Configured using: 'configure --build=3Dx86_64-linux-gnu --prefix=3D/usr '--includedir=3D${prefix}/include' '--mandir=3D${prefix}/share/man' '--infodir=3D${prefix}/share/info' --sysconfdir=3D/etc --localstatedir=3D= /var --disable-silent-rules '--libdir=3D${prefix}/lib/x86_64-linux-gnu' '--libexecdir=3D${prefix}/lib/x86_64-linux-gnu' --disable-maintainer-mode= --disable-dependency-tracking --prefix=3D/usr --sharedstatedir=3D/var/lib= --program-suffix=3D26 --with-modules --with-file-notification=3Dinotify --with-mailutils --with-x=3Dyes --with-x-toolkit=3Dgtk3 --with-xwidgets --with-lcms2 'CFLAGS=3D-g -O2 -fdebug-prefix-map=3D/build/emacs26-CYbeHB/emacs26-26.2~1.gitfd1b34b=3D. -fstack-protector-strong -Wformat -Werror=3Dformat-security -no-pie' 'CPPFLAGS=3D-Wdate-time -D_FORTIFY_SOURCE=3D2' 'LDFLAGS=3D-Wl,-Bsymbolic-functions -Wl,-z,relro -no-pie'' Configured features: XPM JPEG TIFF GIF PNG RSVG IMAGEMAGICK SOUND GPM DBUS GSETTINGS GLIB NOTIFY LIBSELINUX GNUTLS LIBXML2 FREETYPE M17N_FLT LIBOTF XFT ZLIB TOOLKIT_SCROLL_BARS GTK3 X11 XDBE XIM MODULES THREADS XWIDGETS LIBSYSTEMD LCMS2 Important settings: value of $LC_MONETARY: ru_RU.UTF-8 value of $LC_NUMERIC: ru_RU.UTF-8 value of $LANG: ru_RU locale-coding-system: utf-8-unix Major mode: Dired by name Minor modes in effect: shell-dirtrack-mode: t pdf-occur-dired-minor-mode: t pdf-occur-global-minor-mode: t engine-mode: t which-key-mode: t xah-fly-keys: t recentf-mode: t tooltip-mode: t global-eldoc-mode: t electric-indent-mode: t mouse-wheel-mode: t file-name-shadow-mode: t global-font-lock-mode: t font-lock-mode: t blink-cursor-mode: t auto-composition-mode: t auto-encryption-mode: t auto-compression-mode: t buffer-read-only: t column-number-mode: t line-number-mode: t global-visual-line-mode: t visual-line-mode: t transient-mark-mode: t abbrev-mode: t Load-path shadows: /usr/share/emacs/site-lisp/dictionaries-common/flyspell hides /usr/share/emacs/26.2/lisp/textmodes/flyspell /usr/share/emacs/site-lisp/dictionaries-common/ispell hides /usr/share/emacs/26.2/lisp/textmodes/ispell /usr/share/emacs/site-lisp/latex-cjk-thai/thai-word hides /usr/share/emacs/26.2/lisp/language/thai-word Features: (shadow sort mail-extr emacsbug message rmc puny rfc822 mml mml-sec epa derived epg gnus-util rmail rmail-loaddefs mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader sendmail rfc2047 rfc2045 ietf-drums mm-util mail-prsvr mail-utils shell find-dired misearch multi-isearch dired-aux elec-pair ob-R ob-python pdf-occur ibuf-ext ibuffer ibuffer-loaddefs tablist tablist-filter semantic/wisent/comp semantic/wisent semantic/wisent/wisent semantic/util-modes semantic/util semantic semantic/tag semantic/lex semantic/fw mode-local cedet pdf-isearch let-alist pdf-misc imenu pdf-tools compile cus-edit cus-start cus-load pdf-view bookmark pp jka-compr pdf-cache pdf-info tq pdf-util image-mode engine-mode which-key org-clock org-element avl-tree generator org org-macro org-footnote org-pcomplete pcomplete org-list org-faces org-entities org-version ob-emacs-lisp ob ob-tangle org-src ob-ref ob-lob ob-table ob-keys ob-exp ob-comint comint ansi-color ob-core ob-eval org-compat org-macs org-loaddefs format-spec advice find-func cal-menu calendar cal-loaddefs pandoc-mode cl-extra pandoc-mode-utils hydra ring lv cl markdown-toc dash s markdown-mode-table markdown-mode color thingatpt noutline outline easy-mmode edit-indirect xah-fly-keys ido finder-inf info package epg-config url-handlers url-parse auth-source cl-seq eieio eieio-core cl-macs eieio-loaddefs password-cache url-vars seq byte-opt gv bytecomp byte-compile cconv quail help-mode dired-x dired dired-loaddefs edmacro kmacro recentf tree-widget wid-edit cl-loaddefs cl-lib easymenu server time-date mule-util tooltip eldoc electric uniquify ediff-hook vc-hooks lisp-float-type mwheel term/x-win x-win term/common-win x-dnd tool-bar dnd fontset image regexp-opt fringe tabulated-list replace newcomment text-mode elisp-mode lisp-mode prog-mode register page menu-bar rfn-eshadow isearch timer select scroll-bar mouse jit-lock font-lock syntax facemenu font-core term/tty-colors frame cl-generic cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao korean japanese eucjp-ms cp51932 hebrew greek romanian slovak czech european ethiopic indian cyrillic chinese composite charscript charprop case-table epa-hook jka-cmpr-hook help simple abbrev obarray minibuffer cl-preloaded nadvice loaddefs button faces cus-face macroexp files text-properties overlay sha1 md5 base64 format env code-pages mule custom widget hashtable-print-readable backquote threads dbusbind inotify lcms2 dynamic-setting system-font-setting font-render-setting xwidget-internal move-toolbar gtk x-toolkit x multi-tty make-network-process emacs) Memory information: ((conses 16 393368 12549) (symbols 48 41286 2) (miscs 40 873 266) (strings 32 122175 2569) (string-bytes 1 3374293) (vectors 16 44321) (vector-slots 8 833717 14994) (floats 8 436 68) (intervals 56 2681 0) (buffers 992 21)) From debbugs-submit-bounces@debbugs.gnu.org Tue Jun 04 07:45:07 2019 Received: (at 36085) by debbugs.gnu.org; 4 Jun 2019 11:45:07 +0000 Received: from localhost ([127.0.0.1]:44071 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hY7sJ-0007W3-Bh for submit@debbugs.gnu.org; Tue, 04 Jun 2019 07:45:07 -0400 Received: from forward102p.mail.yandex.net ([77.88.28.102]:60079) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hY7sG-0007VE-Tw for 36085@debbugs.gnu.org; Tue, 04 Jun 2019 07:45:06 -0400 Received: from mxback16o.mail.yandex.net (mxback16o.mail.yandex.net [IPv6:2a02:6b8:0:1a2d::67]) by forward102p.mail.yandex.net (Yandex) with ESMTP id 78DB61D404E2 for <36085@debbugs.gnu.org>; Tue, 4 Jun 2019 14:44:55 +0300 (MSK) Received: from localhost (localhost [::1]) by mxback16o.mail.yandex.net (nwsmtp/Yandex) with ESMTP id hdCsHy23yh-iseGgHdB; Tue, 04 Jun 2019 14:44:54 +0300 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yandex.ru; s=mail; t=1559648694; bh=fiXVFGwftFw2zMAOcFpk8qsDX99WnQWtEZ/jnPMR+qU=; h=Message-Id:Date:Subject:To:From; b=UFbeMATVDaeHQm+qKnrzfKFS+qkynP1yk6O5+QDtlT/4iVY2WWRxHoKIK3gK5D8Z9 hfia5BJ1xtbCix6oWhKW52nZaKpI4C7ulRdVSpQE5Ff2IM5Od7xrb8lCGuosCdImo3 9+d4SzjSb/8d2nBpwlUCKWRuueAe21KkPz0r+riU= Authentication-Results: mxback16o.mail.yandex.net; dkim=pass header.i=@yandex.ru Received: by myt5-96980ae04900.qloud-c.yandex.net with HTTP; Tue, 04 Jun 2019 14:44:54 +0300 From: =?utf-8?B?0J3QuNC60LjRgtCwINCd0LjQutC40YLQsA==?= Envelope-From: grindeg@yandex.ru To: 36085@debbugs.gnu.org Subject: Screenshots for th bug MIME-Version: 1.0 X-Mailer: Yamail [ http://yandex.ru ] 5.0 Date: Tue, 04 Jun 2019 16:44:54 +0500 Message-Id: <7257461559648694@myt5-96980ae04900.qloud-c.yandex.net> Content-Transfer-Encoding: 8bit Content-Type: text/html; charset=utf-8 X-Spam-Score: 1.1 (+) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: Some screenshots I open dired, go to the needed directory, run "M-x dired-find" "-name "*Портрет*" https://i.imgur.com/FThZxim.png Content analysis details: (1.1 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- 0.0 URIBL_BLOCKED ADMINISTRATOR NOTICE: The query to URIBL was blocked. See http://wiki.apache.org/spamassassin/DnsBlocklists#dnsbl-block for more information. [URIs: imgur.com] -0.7 RCVD_IN_DNSWL_LOW RBL: Sender listed at https://www.dnswl.org/, low trust [77.88.28.102 listed in list.dnswl.org] -0.0 SPF_PASS SPF: sender matches SPF record 0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider (grindeg[at]yandex.ru) 0.0 SPF_HELO_NONE SPF: HELO does not publish an SPF Record 1.1 MIME_HTML_ONLY BODY: Message only has text/html MIME parts 0.0 HTML_MESSAGE BODY: HTML included in message 0.6 HTML_MIME_NO_HTML_TAG HTML-only message, but there is no HTML tag 0.1 FROM_EXCESS_BASE64 From: base64 encoded unnecessarily X-Debbugs-Envelope-To: 36085 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.1 (/)

Some screenshots

I open dired, go to the needed directory, run "M-x dired-find" "-name "*Портрет*"

https://i.imgur.com/FThZxim.png

When i open file it's not showing as image

https://i.imgur.com/cmYo07G.png

"dired-jump" works though

https://i.imgur.com/vXpCH7y.png

This is when i go to that image manually

https://i.imgur.com/FpMzADt.png

 
From debbugs-submit-bounces@debbugs.gnu.org Sat Jun 08 08:20:36 2019 Received: (at 36085) by debbugs.gnu.org; 8 Jun 2019 12:20:36 +0000 Received: from localhost ([127.0.0.1]:52960 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZaKq-0005UK-Ic for submit@debbugs.gnu.org; Sat, 08 Jun 2019 08:20:36 -0400 Received: from eggs.gnu.org ([209.51.188.92]:36531) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZaKo-0005U6-92 for 36085@debbugs.gnu.org; Sat, 08 Jun 2019 08:20:34 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:40600) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hZaKh-0006u1-Qs; Sat, 08 Jun 2019 08:20:28 -0400 Received: from [176.228.60.248] (port=1113 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1hZaKh-0005cG-A0; Sat, 08 Jun 2019 08:20:27 -0400 Date: Sat, 08 Jun 2019 15:20:18 +0300 Message-Id: <83v9xgnu9p.fsf@gnu.org> From: Eli Zaretskii To: Nikita In-reply-to: <02150b33-7105-f106-c8ea-afb4e7d04b67@yandex.ru> (message from Nikita on Tue, 4 Jun 2019 08:43:06 +0500) Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text References: <02150b33-7105-f106-c8ea-afb4e7d04b67@yandex.ru> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 36085 Cc: 36085@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > From: Nikita > Date: Tue, 4 Jun 2019 08:43:06 +0500 > > When i open dired, go to the needed directory, run "M-x dired-find" > "-name "*Портрет*" (or anything at all that will give some results) > results come back with octal escapes instead of Cyrillic letters. > I cannot open pictures that it finds for example. Turns out the octal escapes are produced by 'find' itself in this case. Try the following command in that directory from the shell prompt: find . \( -iname "*Портрет*" \) -ls and you will see the same octal escape instead of the Cyrillic characters. The man page for 'find' clearly documents this, under "Unusual Filenames": Unusual characters are handled differently by various actions, as described below. [...] -ls, -fls Unusual characters are always escaped. White space, backslash, and double quote characters are printed using C-style escaping (for example `\f', `\"'). Other unusual characters are printed using an octal escape. Other printable characters (for -ls and -fls these are the characters between octal 041 and 0176) are printed as-is. What this means is that any non-ASCII character will be converted to a series of octal escapes. IMO, this is a terrible misfeature in GNU Findutils, as such "handling" of non-ASCII characters has no place in today's global environment. I suggest to report this bug to the GNU Findutils developers. Thanks. P.S. Emacs could perhaps go above and beyond the call of duty, and attempt to convert the octal escapes back to readable text. But I don't think we should do it, as it's a clear bug in 'find'. Nonetheless, if someone wants to submit patches to do such a conversion, I won't block them. From debbugs-submit-bounces@debbugs.gnu.org Sat Jun 08 09:15:29 2019 Received: (at control) by debbugs.gnu.org; 8 Jun 2019 13:15:29 +0000 Received: from localhost ([127.0.0.1]:53006 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZbBx-0006sn-01 for submit@debbugs.gnu.org; Sat, 08 Jun 2019 09:15:29 -0400 Received: from eggs.gnu.org ([209.51.188.92]:44697) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZbBv-0006sa-QA for control@debbugs.gnu.org; Sat, 08 Jun 2019 09:15:28 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:41235) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hZbBo-0001Du-54 for control@debbugs.gnu.org; Sat, 08 Jun 2019 09:15:21 -0400 Received: from [176.228.60.248] (port=4839 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1hZbBj-0007CQ-62 for control@debbugs.gnu.org; Sat, 08 Jun 2019 09:15:20 -0400 Date: Sat, 08 Jun 2019 16:15:07 +0300 Message-Id: <83r284nrqc.fsf@gnu.org> From: Eli Zaretskii To: control@debbugs.gnu.org In-reply-to: <83v9xgnu9p.fsf@gnu.org> (message from Eli Zaretskii on Sat, 08 Jun 2019 15:20:18 +0300) Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text References: <02150b33-7105-f106-c8ea-afb4e7d04b67@yandex.ru> <83v9xgnu9p.fsf@gnu.org> X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) tags 36085 notabug thanks From debbugs-submit-bounces@debbugs.gnu.org Sat Jun 08 11:14:23 2019 Received: (at 36085) by debbugs.gnu.org; 8 Jun 2019 15:14:23 +0000 Received: from localhost ([127.0.0.1]:53507 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZd31-0003Ne-0p for submit@debbugs.gnu.org; Sat, 08 Jun 2019 11:14:23 -0400 Received: from mail173c50.megamailservers.eu ([91.136.10.183]:57728 helo=mail56c50.megamailservers.eu) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZd2z-0003NS-3g for 36085@debbugs.gnu.org; Sat, 08 Jun 2019 11:14:22 -0400 X-Authenticated-User: mattiase@bredband.net DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=megamailservers.eu; s=maildub; t=1560006857; bh=a91czOg0Eo71qIuyytpoJ+WxOSeKF9oNi+j62WnmYSY=; h=From:Subject:Date:To:From; b=MX66t/9/muLebltiyIFlGsv5MAFoTasR+4MtKThngA2XoA/OL2tT4alcu0/ftbtMz e41GGo3E/PVSN6vf7bwcJB0owrYecNDp7eilgoDsU/vbOTbSKdiaVDYgwQrUFJpuj4 2x+6I2O09mAv1vxY5RDqEA/2O9W3n5jjjvpuywyM= Feedback-ID: mattiase@acm.or Received: from [192.168.0.4] ([188.150.171.71]) (authenticated bits=0) by mail56c50.megamailservers.eu (8.14.9/8.13.1) with ESMTP id x58FEBhb019186; Sat, 8 Jun 2019 15:14:15 +0000 From: =?utf-8?Q?Mattias_Engdeg=C3=A5rd?= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (Mac OS X Mail 12.4 \(3445.104.11\)) Subject: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text Message-Id: Date: Sat, 8 Jun 2019 17:14:11 +0200 To: Eli Zaretskii , 36085@debbugs.gnu.org, grindeg@yandex.ru X-Mailer: Apple Mail (2.3445.104.11) X-CTCH-RefID: str=0001.0A0B020E.5CFBD0C9.002D, ss=1, re=0.000, recu=0.000, reip=0.000, cl=1, cld=1, fgs=0 X-CTCH-VOD: Unknown X-CTCH-Spam: Unknown X-CTCH-Score: 0.000 X-CTCH-Rules: X-CTCH-Flags: 0 X-CTCH-ScoreCust: 0.000 X-CSC: 0 X-CHA: v=2.3 cv=AeKf4UfG c=1 sm=1 tr=0 a=SF+I6pRkHZhrawxbOkkvaA==:117 a=SF+I6pRkHZhrawxbOkkvaA==:17 a=IkcTkHD0fZMA:10 a=zbCZ1y8Z2p7ypQJju2kA:9 a=QEXdDO2ut3YA:10 X-Spam-Score: 0.3 (/) X-Debbugs-Envelope-To: 36085 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.7 (/) Eli wrote: > P.S. Emacs could perhaps go above and beyond the call of duty, and = attempt to convert the octal escapes back to readable text. But I don't = think we should do it, as it's a clear bug in 'find'. Nonetheless, if = someone wants to submit patches to do such a conversion, I won't block = them.=20 The default (BSD) find in macOS does not seem to escape anything; files = named =D0=9F=D0=BE=D1=80=D1=82=D1=80=D0=B5=D1=82 or APL\360 are printed = exactly that way. Thus, Emacs would need to know what 'find' it is = running. This appears to validate your recommendation. From debbugs-submit-bounces@debbugs.gnu.org Sat Jun 08 11:35:08 2019 Received: (at 36085) by debbugs.gnu.org; 8 Jun 2019 15:35:08 +0000 Received: from localhost ([127.0.0.1]:53527 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZdN6-0003sM-FB for submit@debbugs.gnu.org; Sat, 08 Jun 2019 11:35:08 -0400 Received: from eggs.gnu.org ([209.51.188.92]:44683) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZdN5-0003s8-Gw for 36085@debbugs.gnu.org; Sat, 08 Jun 2019 11:35:07 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:43418) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hZdMz-00061s-Er; Sat, 08 Jun 2019 11:35:01 -0400 Received: from [176.228.60.248] (port=1443 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1hZdMu-0008Lb-4j; Sat, 08 Jun 2019 11:34:59 -0400 Date: Sat, 08 Jun 2019 18:34:48 +0300 Message-Id: <83o938nl9j.fsf@gnu.org> From: Eli Zaretskii To: Mattias =?utf-8?Q?Engdeg=C3=A5rd?= In-reply-to: (message from Mattias =?utf-8?Q?Engdeg=C3=A5rd?= on Sat, 8 Jun 2019 17:14:11 +0200) Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text References: MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 36085 Cc: grindeg@yandex.ru, 36085@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > From: Mattias Engdegård > Date: Sat, 8 Jun 2019 17:14:11 +0200 > > Eli wrote: > > > P.S. Emacs could perhaps go above and beyond the call of duty, and attempt to convert the octal escapes back to readable text. But I don't think we should do it, as it's a clear bug in 'find'. Nonetheless, if someone wants to submit patches to do such a conversion, I won't block them. > > The default (BSD) find in macOS does not seem to escape anything; files named Портрет or APL\360 are printed exactly that way. Thus, Emacs would need to know what 'find' it is running. This appears to validate your recommendation. Indeed, the hard part is to distinguish between \nnn an octal escape and the literal string "\nnn". That difficulty is one reason why gdb-mi.el performs a similar decoding only as an opt-in optional behavior. From debbugs-submit-bounces@debbugs.gnu.org Sun Jun 09 01:22:25 2019 Received: (at 36085) by debbugs.gnu.org; 9 Jun 2019 05:22:25 +0000 Received: from localhost ([127.0.0.1]:54072 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZqHh-0001FQ-77 for submit@debbugs.gnu.org; Sun, 09 Jun 2019 01:22:25 -0400 Received: from eggs.gnu.org ([209.51.188.92]:47197) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZqHe-0001F9-0b for 36085@debbugs.gnu.org; Sun, 09 Jun 2019 01:22:22 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:54879) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hZqHY-0006mr-Bt; Sun, 09 Jun 2019 01:22:16 -0400 Received: from [176.228.60.248] (port=4338 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1hZqHX-0006gh-Sw; Sun, 09 Jun 2019 01:22:16 -0400 Date: Sun, 09 Jun 2019 08:22:09 +0300 Message-Id: <83h88zcozi.fsf@gnu.org> From: Eli Zaretskii To: mattiase@acm.org, grindeg@yandex.ru In-reply-to: <83o938nl9j.fsf@gnu.org> (message from Eli Zaretskii on Sat, 08 Jun 2019 18:34:48 +0300) Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text References: <83o938nl9j.fsf@gnu.org> X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 36085 Cc: 36085@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > Date: Sat, 08 Jun 2019 18:34:48 +0300 > From: Eli Zaretskii > Cc: grindeg@yandex.ru, 36085@debbugs.gnu.org > > Indeed, the hard part is to distinguish between \nnn an octal escape > and the literal string "\nnn". That difficulty is one reason why > gdb-mi.el performs a similar decoding only as an opt-in optional > behavior. Here's an idea for making this command work with non-ASCII file names: do NOT add "-ls" to the 'find' command line, then in the process filter function call file-attributes on each file name we receive from 'find', and format the result according to Dired convention before inserting it into the buffer. Any takers? From debbugs-submit-bounces@debbugs.gnu.org Sun Jun 09 05:09:08 2019 Received: (at 36085) by debbugs.gnu.org; 9 Jun 2019 09:09:08 +0000 Received: from localhost ([127.0.0.1]:54250 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZtp4-00073K-41 for submit@debbugs.gnu.org; Sun, 09 Jun 2019 05:09:08 -0400 Received: from mail1423c50.megamailservers.eu ([91.136.14.23]:53820 helo=mail102c50.megamailservers.eu) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZtp0-00072j-Qr for 36085@debbugs.gnu.org; Sun, 09 Jun 2019 05:09:04 -0400 X-Authenticated-User: mattiase@bredband.net DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=megamailservers.eu; s=maildub; t=1560071335; bh=zNYS7VdADCTfnjm1yqw6F2R8WQ5MjTtxOdmc5avLgfI=; h=Subject:From:In-Reply-To:Date:Cc:References:To:From; b=o9k7ekbJgMVo4VTAGFKnDMhitDRGRTMH2JqOkXj6eYJjCAg3/yM4LcvHVYJA5JFPR nvbVtDy5dc1b5witiTwb70jneg7uJwL1o2uvmid/AEniMNk6I2i22CuZ26kNVhFgO8 BNmjxs8uYVEHZ4PHLByLBDB4gi5bmqhMsRZlqGBQ= Feedback-ID: mattiase@acm.or Received: from [192.168.0.4] ([188.150.171.71]) (authenticated bits=0) by mail102c50.megamailservers.eu (8.14.9/8.13.1) with ESMTP id x5998qFV027893; Sun, 9 Jun 2019 09:08:54 +0000 Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Mac OS X Mail 12.4 \(3445.104.11\)) Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text From: =?utf-8?Q?Mattias_Engdeg=C3=A5rd?= In-Reply-To: <83h88zcozi.fsf@gnu.org> Date: Sun, 9 Jun 2019 11:08:51 +0200 Content-Transfer-Encoding: quoted-printable Message-Id: References: <83o938nl9j.fsf@gnu.org> <83h88zcozi.fsf@gnu.org> To: Eli Zaretskii X-Mailer: Apple Mail (2.3445.104.11) X-CTCH-RefID: str=0001.0A0B0205.5CFCCCA7.0015, ss=1, re=0.000, recu=0.000, reip=0.000, cl=1, cld=1, fgs=0 X-CTCH-VOD: Unknown X-CTCH-Spam: Unknown X-CTCH-Score: 0.000 X-CTCH-Rules: X-CTCH-Flags: 0 X-CTCH-ScoreCust: 0.000 X-CSC: 0 X-CHA: v=2.3 cv=F7x5iJpN c=1 sm=1 tr=0 a=SF+I6pRkHZhrawxbOkkvaA==:117 a=SF+I6pRkHZhrawxbOkkvaA==:17 a=jpOVt7BSZ2e4Z31A5e1TngXxSK0=:19 a=kj9zAlcOel0A:10 a=mDV3o1hIAAAA:8 a=0lwT8P5eEbaoNYuv7VsA:9 a=CjuIK1q_8ugA:10 a=_FVE-zBwftR9WsbkzFJk:22 X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 36085 Cc: grindeg@yandex.ru, 36085@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.0 (/) 9 juni 2019 kl. 07.22 skrev Eli Zaretskii : >=20 > Here's an idea for making this command work with non-ASCII file names: > do NOT add "-ls" to the 'find' command line, then in the process > filter function call file-attributes on each file name we receive from > 'find', and format the result according to Dired convention before > inserting it into the buffer. Maybe we can trust -print0 to work everywhere (BSD find has it). It's probably a quaint notion, but I wish Emacs were be able to do = without the help of external programs for something as basic as listing = directories. From debbugs-submit-bounces@debbugs.gnu.org Sun Jun 09 06:58:01 2019 Received: (at 36085) by debbugs.gnu.org; 9 Jun 2019 10:58:01 +0000 Received: from localhost ([127.0.0.1]:54369 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZvWS-0001WC-O1 for submit@debbugs.gnu.org; Sun, 09 Jun 2019 06:58:01 -0400 Received: from eggs.gnu.org ([209.51.188.92]:42678) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZvWO-0001Vv-8t for 36085@debbugs.gnu.org; Sun, 09 Jun 2019 06:57:59 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:57873) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hZvWG-0000jl-BI; Sun, 09 Jun 2019 06:57:48 -0400 Received: from [176.228.60.248] (port=1391 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1hZvWC-000760-Bk; Sun, 09 Jun 2019 06:57:45 -0400 Date: Sun, 09 Jun 2019 13:57:37 +0300 Message-Id: <83pnnnauvy.fsf@gnu.org> From: Eli Zaretskii To: Mattias =?utf-8?Q?Engdeg=C3=A5rd?= In-reply-to: (message from Mattias =?utf-8?Q?Engdeg=C3=A5rd?= on Sun, 9 Jun 2019 11:08:51 +0200) Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text References: <83o938nl9j.fsf@gnu.org> <83h88zcozi.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 36085 Cc: grindeg@yandex.ru, 36085@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > From: Mattias Engdegård > Date: Sun, 9 Jun 2019 11:08:51 +0200 > Cc: grindeg@yandex.ru, 36085@debbugs.gnu.org > > > Here's an idea for making this command work with non-ASCII file names: > > do NOT add "-ls" to the 'find' command line, then in the process > > filter function call file-attributes on each file name we receive from > > 'find', and format the result according to Dired convention before > > inserting it into the buffer. > > Maybe we can trust -print0 to work everywhere (BSD find has it). That's orthogonal, isn't it? It is only needed to make sure we don't get confused by file names with embedded newlines, AFAIU. > It's probably a quaint notion, but I wish Emacs were be able to do without the help of external programs for something as basic as listing directories. We have such capabilities, see directory-files-and-attributes and directory-files-recursively. We also have find-lisp.el. I just assumed these alternatives will be significantly slower, but maybe that's not the case? One other consideration is that for large directory trees the current implementation of find-dired updates the buffer in parallel with 'find' still running, whereas the alternatives will not return until the whole listing has been generated, which might take a long time. But maybe we could run the Lisp implementation in a separate thread, and get the same effect? From debbugs-submit-bounces@debbugs.gnu.org Sun Jun 09 08:34:59 2019 Received: (at 36085) by debbugs.gnu.org; 9 Jun 2019 12:34:59 +0000 Received: from localhost ([127.0.0.1]:54417 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZx2I-0005vT-Mh for submit@debbugs.gnu.org; Sun, 09 Jun 2019 08:34:58 -0400 Received: from mout02.posteo.de ([185.67.36.66]:59733) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZx2G-0005vG-Tx for 36085@debbugs.gnu.org; Sun, 09 Jun 2019 08:34:57 -0400 Received: from submission (posteo.de [89.146.220.130]) by mout02.posteo.de (Postfix) with ESMTPS id BE5A02400E5 for <36085@debbugs.gnu.org>; Sun, 9 Jun 2019 14:34:49 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=posteo.net; s=2017; t=1560083690; bh=PnLh8WjJ6arUK25m+oT2RhgLKNanugGDLNu2V4QcYlU=; h=From:To:Cc:Subject:Date:From; b=TrmMTZqOql6Mdg9uFbiDYhNTTP9n5y0Z2icbVfeswvUatPWQJ74XhX5YL6GdFPYLr mew+ooGqcxxfQq7wHAzw2Pfc3MF5RKirSG6o2MTwaIgadvjxks6/SXG4uTWbeqliQV 0BDPcrq+SJCrNc3YT0e08+SqSaXlqJtQcY9I+qs8zPQ0ss1msrwMYvixRi9jiZdDUU TqR66gnX4al69nv784zkteBUaZPp5whnCpMwGyg9ai5Z18h7VYpo4K6WlnwqY1/h49 WeoTQPNmvdKJbkF/YGtYMbFSnc0pHVgP62QpxsUc9ChaYtGjvrssgFhGvgaF3ri3eL DZnie0zEhheKw== Received: from customer (localhost [127.0.0.1]) by submission (posteo.de) with ESMTPSA id 45MG1P1BkVz9rxW; Sun, 9 Jun 2019 14:34:48 +0200 (CEST) From: Tomas Nordin To: Eli Zaretskii , Nikita Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text In-Reply-To: <83v9xgnu9p.fsf@gnu.org> References: <02150b33-7105-f106-c8ea-afb4e7d04b67@yandex.ru> <83v9xgnu9p.fsf@gnu.org> Date: Sun, 09 Jun 2019 14:34:45 +0200 Message-ID: <87ftojos2i.fsf@fliptop.i-did-not-set--mail-host-address--so-tickle-me> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 36085 Cc: 36085@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) Eli Zaretskii writes: >> From: Nikita >> Date: Tue, 4 Jun 2019 08:43:06 +0500 >>=20 >> When i open dired, go to the needed directory, run "M-x dired-find" >> "-name "*=D0=9F=D0=BE=D1=80=D1=82=D1=80=D0=B5=D1=82*" (or anything at al= l that will give some results) >> results come back with octal escapes instead of Cyrillic letters. >> I cannot open pictures that it finds for example. > > Turns out the octal escapes are produced by 'find' itself in this > case. Try the following command in that directory from the shell > prompt: > > find . \( -iname "*=D0=9F=D0=BE=D1=80=D1=82=D1=80=D0=B5=D1=82*" \) -ls > > and you will see the same octal escape instead of the Cyrillic > characters. The man page for 'find' clearly documents this, under > "Unusual Filenames": > > Unusual characters are handled differently by various actions, as > described below. > [...] > > -ls, -fls > Unusual characters are always escaped. White space, backslash, > and double quote characters are printed using C-style escaping > (for example `\f', `\"'). Other unusual characters are printed > using an octal escape. Other printable characters (for -ls and > -fls these are the characters between octal 041 and 0176) are > printed as-is. > > What this means is that any non-ASCII character will be converted to a > series of octal escapes. IMO, this is a terrible misfeature in GNU > Findutils, as such "handling" of non-ASCII characters has no place in > today's global environment. Here on 27.0.50 the customize option for `find-ls-option` says For example, to use human-readable file sizes with GNU ls: ("-exec ls -ldh {} +" . "-ldh") Is it ignorant to suggest to try this as a workaround? It "worked" here. Thanks for this bug anyway because i have had the same issue sometimes and I will continue use this option and see if it makes any problems. > > I suggest to report this bug to the GNU Findutils developers. Because ls doesn't seem to do this conversion -- inconsistent? :P Best regards -- Tomas From debbugs-submit-bounces@debbugs.gnu.org Sun Jun 09 08:39:45 2019 Received: (at 36085) by debbugs.gnu.org; 9 Jun 2019 12:39:45 +0000 Received: from localhost ([127.0.0.1]:54423 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZx6v-00062T-Cb for submit@debbugs.gnu.org; Sun, 09 Jun 2019 08:39:45 -0400 Received: from mail1424c50.megamailservers.eu ([91.136.14.24]:33858 helo=mail102c50.megamailservers.eu) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZx6s-00062D-CV for 36085@debbugs.gnu.org; Sun, 09 Jun 2019 08:39:43 -0400 X-Authenticated-User: mattiase@bredband.net DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=megamailservers.eu; s=maildub; t=1560083975; bh=Jegiuq4sW3Kw8Lhl6SKV4EaWKPfW17UBMZsK4YhHI8I=; h=Subject:From:In-Reply-To:Date:Cc:References:To:From; b=gkvVm0VSe977QbA54YSluacbC2KYuFN0PLKUAW7N6VSJ288EfeCQKGEZfksO8osIV EwHV4w8lQKxlG0q3pJOQK8tqA8omh96qActWzQj6MYJYYEjYkoz/ZdNKu0TLthwLzP PmXVctJav8K1A1bwfzIT5aRbed+i81Gpxf/lS078= Feedback-ID: mattiase@acm.or Received: from [192.168.0.4] ([188.150.171.71]) (authenticated bits=0) by mail102c50.megamailservers.eu (8.14.9/8.13.1) with ESMTP id x59CdXGm025393; Sun, 9 Jun 2019 12:39:34 +0000 Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Mac OS X Mail 12.4 \(3445.104.11\)) Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text From: =?utf-8?Q?Mattias_Engdeg=C3=A5rd?= In-Reply-To: <83pnnnauvy.fsf@gnu.org> Date: Sun, 9 Jun 2019 14:39:32 +0200 Content-Transfer-Encoding: quoted-printable Message-Id: <377032D3-8461-4E82-9CB5-319DE2E9C1F6@acm.org> References: <83o938nl9j.fsf@gnu.org> <83h88zcozi.fsf@gnu.org> <83pnnnauvy.fsf@gnu.org> To: Eli Zaretskii X-Mailer: Apple Mail (2.3445.104.11) X-CTCH-RefID: str=0001.0A0B0213.5CFCFE07.001E, ss=1, re=0.000, recu=0.000, reip=0.000, cl=1, cld=1, fgs=0 X-CTCH-VOD: Unknown X-CTCH-Spam: Unknown X-CTCH-Score: 0.000 X-CTCH-Rules: X-CTCH-Flags: 0 X-CTCH-ScoreCust: 0.000 X-CSC: 0 X-CHA: v=2.3 cv=F7x5iJpN c=1 sm=1 tr=0 a=SF+I6pRkHZhrawxbOkkvaA==:117 a=SF+I6pRkHZhrawxbOkkvaA==:17 a=jpOVt7BSZ2e4Z31A5e1TngXxSK0=:19 a=kj9zAlcOel0A:10 a=mDV3o1hIAAAA:8 a=r5nI36mXk4gAHC_GvlYA:9 a=CjuIK1q_8ugA:10 a=_FVE-zBwftR9WsbkzFJk:22 X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 36085 Cc: grindeg@yandex.ru, 36085@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.0 (/) 9 juni 2019 kl. 12.57 skrev Eli Zaretskii : >=20 >> Maybe we can trust -print0 to work everywhere (BSD find has it). >=20 > That's orthogonal, isn't it? It is only needed to make sure we don't > get confused by file names with embedded newlines, AFAIU. Not quite orthogonal as the -ls quoting also takes care of newlines, but = I have no strong opinion on the matter. >> It's probably a quaint notion, but I wish Emacs were be able to do = without the help of external programs for something as basic as listing = directories. >=20 > We have such capabilities, see directory-files-and-attributes and > directory-files-recursively. We also have find-lisp.el. I just > assumed these alternatives will be significantly slower, but maybe > that's not the case? You are right, they are slower, but need not be. The directory listing = functions are slow because they throw away information, leading to lots = of unnecessary syscalls and, on remote file systems, network roundtrips. = This is true both on Unix and Windows. Fixing this is not difficult but the elisp interface design requires = care, and this goes beyond the scope of this bug. Your suggestions sound = more realistic in the short term. > One other consideration is that for large directory trees the current > implementation of find-dired updates the buffer in parallel with > 'find' still running, whereas the alternatives will not return until > the whole listing has been generated, which might take a long time. This concern is definitely valid. I don't know to what extent = parallelism is possible in the current thread implementation. Again, = improvements in this respect would have benefits beyond find-dired. From debbugs-submit-bounces@debbugs.gnu.org Sun Jun 09 08:49:17 2019 Received: (at 36085) by debbugs.gnu.org; 9 Jun 2019 12:49:17 +0000 Received: from localhost ([127.0.0.1]:54441 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZxG9-0006GS-8z for submit@debbugs.gnu.org; Sun, 09 Jun 2019 08:49:17 -0400 Received: from eggs.gnu.org ([209.51.188.92]:34652) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZxG5-0006GB-61 for 36085@debbugs.gnu.org; Sun, 09 Jun 2019 08:49:13 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:59049) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hZxFz-0007w4-G7; Sun, 09 Jun 2019 08:49:07 -0400 Received: from [176.228.60.248] (port=4464 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1hZxFy-0001wy-NX; Sun, 09 Jun 2019 08:49:07 -0400 Date: Sun, 09 Jun 2019 15:49:01 +0300 Message-Id: <83lfyac4aq.fsf@gnu.org> From: Eli Zaretskii To: Mattias =?utf-8?Q?Engdeg=C3=A5rd?= In-reply-to: <377032D3-8461-4E82-9CB5-319DE2E9C1F6@acm.org> (message from Mattias =?utf-8?Q?Engdeg=C3=A5rd?= on Sun, 9 Jun 2019 14:39:32 +0200) Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text References: <83o938nl9j.fsf@gnu.org> <83h88zcozi.fsf@gnu.org> <83pnnnauvy.fsf@gnu.org> <377032D3-8461-4E82-9CB5-319DE2E9C1F6@acm.org> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 36085 Cc: grindeg@yandex.ru, 36085@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > From: Mattias Engdegård > Date: Sun, 9 Jun 2019 14:39:32 +0200 > Cc: grindeg@yandex.ru, 36085@debbugs.gnu.org > > > One other consideration is that for large directory trees the current > > implementation of find-dired updates the buffer in parallel with > > 'find' still running, whereas the alternatives will not return until > > the whole listing has been generated, which might take a long time. > > This concern is definitely valid. I don't know to what extent parallelism is possible in the current thread implementation. Just a note: the current "parallel" implementation is not really parallel either: 'find' indeed runs in parallel, but the process filter functions in Emacs only run when Emacs is idle, so if the user types very quickly after invoking find-dired, they will not see the results until they make a break in typing. And our threads work in the same manner, at least in principle, so we should be good running the Lisp implementation in a non-main thread. Of course, until someone actually tries that, we won't know whether there are any obstacles: the devil, as always, is in the details. > Again, improvements in this respect would have benefits beyond find-dired. Sure. From debbugs-submit-bounces@debbugs.gnu.org Sun Jun 09 08:52:12 2019 Received: (at 36085) by debbugs.gnu.org; 9 Jun 2019 12:52:12 +0000 Received: from localhost ([127.0.0.1]:54448 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZxIx-0006Kj-SB for submit@debbugs.gnu.org; Sun, 09 Jun 2019 08:52:12 -0400 Received: from eggs.gnu.org ([209.51.188.92]:35147) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hZxIv-0006KW-Qt for 36085@debbugs.gnu.org; Sun, 09 Jun 2019 08:52:10 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:59104) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1hZxIq-0000eF-K1; Sun, 09 Jun 2019 08:52:04 -0400 Received: from [176.228.60.248] (port=4642 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1hZxIn-00029i-OK; Sun, 09 Jun 2019 08:52:02 -0400 Date: Sun, 09 Jun 2019 15:51:56 +0300 Message-Id: <83k1duc45v.fsf@gnu.org> From: Eli Zaretskii To: Tomas Nordin In-reply-to: <87ftojos2i.fsf@fliptop.i-did-not-set--mail-host-address--so-tickle-me> (message from Tomas Nordin on Sun, 09 Jun 2019 14:34:45 +0200) Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text References: <02150b33-7105-f106-c8ea-afb4e7d04b67@yandex.ru> <83v9xgnu9p.fsf@gnu.org> <87ftojos2i.fsf@fliptop.i-did-not-set--mail-host-address--so-tickle-me> X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 36085 Cc: grindeg@yandex.ru, 36085@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > From: Tomas Nordin > Cc: 36085@debbugs.gnu.org > Date: Sun, 09 Jun 2019 14:34:45 +0200 > > Here on 27.0.50 the customize option for `find-ls-option` says > > For example, to use human-readable file sizes with GNU ls: > ("-exec ls -ldh {} +" . "-ldh") > > Is it ignorant to suggest to try this as a workaround? No, it isn't ignorant. Thanks for mentioning it. Although invoking 'ls' for each and every file reported by 'find' sounds gross to me, and is definitely slower. > > I suggest to report this bug to the GNU Findutils developers. > > Because ls doesn't seem to do this conversion -- inconsistent? :P Mainly because doing so with non-ASCII characters is highly inappropriate nowadays. From debbugs-submit-bounces@debbugs.gnu.org Thu Jun 27 16:59:51 2019 Received: (at control) by debbugs.gnu.org; 27 Jun 2019 20:59:51 +0000 Received: from localhost ([127.0.0.1]:39878 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hgbUl-0000in-9a for submit@debbugs.gnu.org; Thu, 27 Jun 2019 16:59:51 -0400 Received: from mail-io1-f53.google.com ([209.85.166.53]:42814) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1hgbUj-0000iX-8C for control@debbugs.gnu.org; Thu, 27 Jun 2019 16:59:49 -0400 Received: by mail-io1-f53.google.com with SMTP id u19so7808027ior.9 for ; Thu, 27 Jun 2019 13:59:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:subject:date:message-id:mime-version; bh=8j9SEhzK/po7k5AhyuQqFksWz2DjdwMRi6SL2d9AeFc=; b=CWtK3bNW/+JPv48k9x1Cair6AiR+b3TyD3KaI18qeAkRSFjXO6jl8zodxGap16dUxF Q95bP2poOu34+SUC13F8yElIG0QCtWbFTap7NWCLtJXiPyvDAZmugVNmKZihphJJM67F vX+TRct2872AjxZhuQ+qHgGeJaElvTUMhxgbCCVAlj537citmF3osOki+PopZxDgU4/K 97tInqXa+Oto8cH6x3UubzMCZyBwOHuZydFy59Ijmv3wwE0z5Q0Ew/dJ6hMZKTGK7Z3q k26L76GJB+ZvG+wCSyc6MmFJdk0ESB7XzHh081k7exnC0I1GGFfGF8UUMPCyj6bgUrDX 34sA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:subject:date:message-id:mime-version; bh=8j9SEhzK/po7k5AhyuQqFksWz2DjdwMRi6SL2d9AeFc=; b=QtBQlws+wJ3DmUFqh2yJyksGEhvbs+0nFacL2nFghzbL0JLXlfKLx99ItcUJgC/n8D ijxBVVt7eqqX//w3eHWAjE2gGL9Gu2I7u+v14rk/rI40D3KCAwmLcygbIlcm+YB/2wLd PjqEQNXgUhknXcTEQn992RFuzHEIqVy6YkJ/7k2hyf89aS2QmtOnIjAeRwyFyNfd8cu+ RyaZLKHP8ItgiuFpSqLJ7TcGN9ZJEsiR2jomOWNzodE1TI9/xzcQZDHjTL8v8ofMuiii HNqTTmIc38V9oPv8lgsjENE37khVCwfRBA4j+mGyeil6V+IC3f/szInQ/rBsZAnWjsCB ET9A== X-Gm-Message-State: APjAAAXNrtTn2V9+eB68P/RGGYFND8o19dE537orwkIiTYOurX4v9mIa AKJnrAwq+5K6fXwRsgU98ikVGCHC X-Google-Smtp-Source: APXvYqxNjufNxeh7XZ1gDvvZQ/QJRaN62No7hLOWaIoqWntLh/dGPNynYJii1l9ukPqoepUYu8S0Cg== X-Received: by 2002:a6b:5106:: with SMTP id f6mr6267994iob.15.1561669183355; Thu, 27 Jun 2019 13:59:43 -0700 (PDT) Received: from vhost2 (CPE001143542e1f-CMf81d0f809fa0.cpe.net.cable.rogers.com. [99.230.51.196]) by smtp.gmail.com with ESMTPSA id w23sm558616ioa.51.2019.06.27.13.59.42 for (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Thu, 27 Jun 2019 13:59:43 -0700 (PDT) From: Noam Postavsky To: control@debbugs.gnu.org Subject: control message for bug #36085 Date: Thu, 27 Jun 2019 16:59:43 -0400 Message-ID: <85h88a68xs.fsf@gmail.com> MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) retitle 36085 find-dired could handle/avoid octal escapes printed by GNU find -ls for non-ASCII filenames severity 36085 wishlist quit From debbugs-submit-bounces@debbugs.gnu.org Sat May 23 12:47:32 2020 Received: (at control) by debbugs.gnu.org; 23 May 2020 16:47:32 +0000 Received: from localhost ([127.0.0.1]:35439 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1jcXJ6-0003L3-Ig for submit@debbugs.gnu.org; Sat, 23 May 2020 12:47:32 -0400 Received: from eggs.gnu.org ([209.51.188.92]:37148) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1jcXJ5-0003Kr-KU for control@debbugs.gnu.org; Sat, 23 May 2020 12:47:31 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:45940) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1jcXJ0-0001CB-BT for control@debbugs.gnu.org; Sat, 23 May 2020 12:47:26 -0400 Received: from [176.228.60.248] (port=4019 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1jcXIz-0005m9-9k for control@debbugs.gnu.org; Sat, 23 May 2020 12:47:25 -0400 Date: Sat, 23 May 2020 19:47:33 +0300 Message-Id: <835zcmzjh6.fsf@gnu.org> From: Eli Zaretskii To: control@debbugs.gnu.org In-Reply-To: <87imgmyarw.fsf@hwslack.ware> (message from Viktor Larkin on Sat, 23 May 2020 17:40:51 +0300) Subject: Re: bug#41488: 26.3; Cyrillic chars in output of find-dired and find-name-dired looks like '\234\188\123' References: <87imgmyarw.fsf@hwslack.ware> X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) severity 41488 wishlist merge 41488 36085 thanks From debbugs-submit-bounces@debbugs.gnu.org Mon Nov 08 00:33:11 2021 Received: (at control) by debbugs.gnu.org; 8 Nov 2021 05:33:11 +0000 Received: from localhost ([127.0.0.1]:55713 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mjxHL-00044h-0U for submit@debbugs.gnu.org; Mon, 08 Nov 2021 00:33:11 -0500 Received: from quimby.gnus.org ([95.216.78.240]:42806) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mjxHJ-00043y-Jo for control@debbugs.gnu.org; Mon, 08 Nov 2021 00:33:10 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Subject:From:To:Message-Id:Date:Sender:Reply-To:Cc: MIME-Version:Content-Type:Content-Transfer-Encoding:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:In-Reply-To:References:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=8JWRRj4+C3LRAocND4EPqKDpQAQl3ZjCOqzuZcZC3zc=; b=TAbod5+OWGSS5KOKyktBGpPKou VCFk+rDie0O+jm5Eub9B/9nAzpO1DsSmpVeE5It3HJC/QeC8unJTNON0X6ALNh/IjHsX3ILbdV7sc qGKuqX0rmKRaCHZf73sT/Gd+x/MxFgyK6T5BPTmySAeTAGTwWZ7942Z9UbmmZcK7AM5I=; Received: from [84.212.220.105] (helo=elva) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mjxHB-0007YR-Sk for control@debbugs.gnu.org; Mon, 08 Nov 2021 06:33:04 +0100 Date: Mon, 08 Nov 2021 06:33:01 +0100 Message-Id: <87zgqf9ppu.fsf@gnus.org> To: control@debbugs.gnu.org From: Lars Ingebrigtsen Subject: control message for bug #36085 X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: tags 36085 - notabug quit Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) tags 36085 - notabug quit From debbugs-submit-bounces@debbugs.gnu.org Sun Mar 13 01:05:12 2022 Received: (at 36085) by debbugs.gnu.org; 13 Mar 2022 06:05:12 +0000 Received: from localhost ([127.0.0.1]:41463 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1nTHLs-0008Rf-DL for submit@debbugs.gnu.org; Sun, 13 Mar 2022 01:05:12 -0500 Received: from mail-pl1-f195.google.com ([209.85.214.195]:44942) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1nTHLq-0008RO-Fm for 36085@debbugs.gnu.org; Sun, 13 Mar 2022 01:05:10 -0500 Received: by mail-pl1-f195.google.com with SMTP id q11so11007723pln.11 for <36085@debbugs.gnu.org>; Sat, 12 Mar 2022 22:05:10 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=from:to:cc:subject:references:date:in-reply-to:message-id :user-agent:mime-version:content-transfer-encoding; bh=GAwoLJDNR7yHRXHxHEpgO2UqjdTuHclyq+LmTFKycOM=; b=BEYSt7tlEGZPsxWcueH63vw3V4HA17CpLPhfEgI3HAYnHVh/EyrLymfekmBHF32Nxd CsNRfXynMYXPkbgIt8z6/JBsgxfzcAGexad9kmvmBJ1qePd2lHyIDAxqO025f1ywAtvE Rs7UH1Lc0AzvfRdjw/QLoBNPkHMPmTxqH9S2P1LxlcbzozM+YjGeQMJNKfJtETkXHJWh 2guK92poGi1QqzTjEQnoSKsYnSEOPStgLK2mMSsijeuCFR/fB8PmsmkPRWVi4+s9QBmD fOQa3Y26VtzThdMB1xWGcgkX3Ga2hcKcEhOjYPMf9L2xPPdbeKTGRazgiBlK+wzmJOP4 4Lvg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:references:date:in-reply-to :message-id:user-agent:mime-version:content-transfer-encoding; bh=GAwoLJDNR7yHRXHxHEpgO2UqjdTuHclyq+LmTFKycOM=; b=pFV7Atg7MNRhRBiX712rF0fl6s06+htdFgMDBzVn1tAMUoruijnk3XV4oaRUdUKKik ilXO6jvNnYeWXyLE/lXMryKu1BEomfsEuZ5g07PytJqFBxEUvVITO3F0FugAVs9UazPO V6VsGapw8PSz5pHGhL3i02dPGEsC2isUSCxLY7zHBweEPwkqE1e95X857yebMreVqDQR htQah2CgRy9ujfirVhtGGKKxRtAVrH/t8oOjI/M+dNM190ZbBclaRDziH/bfNxqs8JIp YU8CImIJgv18LBx1/qlS5cdF2YvdBeKmLobMwXp4QldUACYAVz5O7UuZUAWjYeEl4lMX 9C1w== X-Gm-Message-State: AOAM532L5XVMF/XbGRED+F06iSMe0Al6kWMGL30aKdkrF0ydItO66O/+ EoyUC/c6a/vCd70v+4R0s/U= X-Google-Smtp-Source: ABdhPJwdHDNU3FQ/ypEdoqpkUvN2ztpBl81wtrDUjxVueDa4sQepYZHvAEtv8bzzTgYsfSbofOQBlA== X-Received: by 2002:a17:90a:8581:b0:1b2:7541:af6c with SMTP id m1-20020a17090a858100b001b27541af6cmr19257223pjn.48.1647151504581; Sat, 12 Mar 2022 22:05:04 -0800 (PST) Received: from localhost ([118.185.152.162]) by smtp.gmail.com with ESMTPSA id oc3-20020a17090b1c0300b001bf8c88a8c4sm17573207pjb.35.2022.03.12.22.05.03 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sat, 12 Mar 2022 22:05:04 -0800 (PST) From: Visuwesh To: Eli Zaretskii Subject: Re: bug#36085: 26.2; find-dired octal escapes instead of Cyrillic text References: <83o938nl9j.fsf@gnu.org> Date: Sun, 13 Mar 2022 11:35:01 +0530 In-Reply-To: <83o938nl9j.fsf@gnu.org> (Eli Zaretskii's message of "Sat, 08 Jun 2019 18:34:48 +0300") Message-ID: <87ilsigzg2.fsf@gmail.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: 3.6 (+++) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: [சனி, ஜூன் 08 2019] Eli Zaretskii wrote: Hi Eli, >> From: Mattias Engdegård >> Date: Sat, 8 Jun 2019 17:14:11 +0200 >> >> Eli wrote: >> >> > P.S. Emacs could perhaps go above and beyond the call of duty, and >> > attempt to convert the octal esca [...] Content analysis details: (3.6 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- 3.6 RCVD_IN_SBL_CSS RBL: Received via a relay in Spamhaus SBL-CSS [118.185.152.162 listed in zen.spamhaus.org] -0.0 SPF_PASS SPF: sender matches SPF record 0.0 SPF_HELO_NONE SPF: HELO does not publish an SPF Record 0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider (visuweshm[at]gmail.com) -0.0 RCVD_IN_DNSWL_NONE RBL: Sender listed at https://www.dnswl.org/, no trust [209.85.214.195 listed in list.dnswl.org] 0.0 RCVD_IN_MSPIKE_H3 RBL: Good reputation (+3) [209.85.214.195 listed in wl.mailspike.net] 0.0 RCVD_IN_MSPIKE_WL Mailspike good senders -0.0 T_SCC_BODY_TEXT_LINE No description available. X-Debbugs-Envelope-To: 36085 Cc: Mattias =?utf-8?Q?Engdeg=C3=A5rd?= , grindeg@yandex.ru, 36085@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 2.6 (++) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: [சனி, ஜூன் 08 2019] Eli Zaretskii wrote: Hi Eli, >> From: Mattias Engdegård >> Date: Sat, 8 Jun 2019 17:14:11 +0200 >> >> Eli wrote: >> >> > P.S. Emacs could perhaps go above and beyond the call of duty, and >> > attempt to convert the octal esca [...] Content analysis details: (2.6 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -0.0 RCVD_IN_DNSWL_NONE RBL: Sender listed at https://www.dnswl.org/, no trust [209.85.214.195 listed in list.dnswl.org] 0.0 RCVD_IN_MSPIKE_H3 RBL: Good reputation (+3) [209.85.214.195 listed in wl.mailspike.net] 3.6 RCVD_IN_SBL_CSS RBL: Received via a relay in Spamhaus SBL-CSS [118.185.152.162 listed in zen.spamhaus.org] -0.0 SPF_PASS SPF: sender matches SPF record 0.0 SPF_HELO_NONE SPF: HELO does not publish an SPF Record 0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider (visuweshm[at]gmail.com) 0.0 RCVD_IN_MSPIKE_WL Mailspike good senders -0.0 T_SCC_BODY_TEXT_LINE No description available. -1.0 MAILING_LIST_MULTI Multiple indicators imply a widely-seen list manager [=E0=AE=9A=E0=AE=A9=E0=AE=BF, =E0=AE=9C=E0=AF=82=E0=AE=A9=E0=AF=8D 08 2019]= Eli Zaretskii wrote: Hi Eli, >> From: Mattias Engdeg=C3=A5rd >> Date: Sat, 8 Jun 2019 17:14:11 +0200 >>=20 >> Eli wrote: >>=20 >> > P.S. Emacs could perhaps go above and beyond the call of duty, and >> > attempt to convert the octal escapes back to readable text. But I >> > don't think we should do it, as it's a clear bug in >> > 'find'. Nonetheless, if someone wants to submit patches to do such >> > a conversion, I won't block them. >>=20 >> The default (BSD) find in macOS does not seem to escape anything; >> files named =D0=9F=D0=BE=D1=80=D1=82=D1=80=D0=B5=D1=82 or APL\360 are pr= inted exactly that way. Thus, >> Emacs would need to know what 'find' it is running. This appears to >> validate your recommendation. > > Indeed, the hard part is to distinguish between \nnn an octal escape > and the literal string "\nnn". That difficulty is one reason why > gdb-mi.el performs a similar decoding only as an opt-in optional > behavior. After being annoyed by the same exact behaviour, and with the helpful hint about gdb-mi.el, I came up with the following function. With a preliminary testing, it does not choke on literal "\nnn" and it does not noticeably slow down find-dired unlike the xargs option. Maybe, we can include something like this, WDYT? (defun vz/find-dired-unescape () "Unescape the C-style octal escape strings." (while (not (eobp)) (when-let ((beg (next-single-property-change (point) 'dired-filenam= e)) (props (text-properties-at beg))) (goto-char beg) (while (and (re-search-forward (rx "\\" (group (any "0-7") (? (an= y "0-7") (? (any "0-7"))))) (line-end-position) 'noerror) (not (eq (char-before (match-beginning 0)) ?\\))) (let ((num (string-to-number (match-string 1) 8))) (replace-match (unibyte-string num) t nil nil 0))) (decode-coding-region beg (line-end-position) buffer-file-coding-= system) (set-text-properties beg (line-end-position) props)) (forward-line))) (custom-set-variables '(find-ls-option (cons "-ls" "-dlis")) '(find-dired-refine-function #'vz/find-dired-unescape))