GNU bug report logs - #65301
29.1; archive-mode can not handle archive named with foreign characters

Package: emacs;

Reported by: awrhygty <at> outlook.com

Date: Tue, 15 Aug 2023 04:19:01 UTC

Severity: normal

Tags: wontfix

Found in version 29.1

Done: Eli Zaretskii <eliz <at> gnu.org>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 65301 in the body.
You can then email your comments to 65301 AT debbugs.gnu.org in the normal way.


Report forwarded to bug-gnu-emacs <at> gnu.org:
bug#65301; Package emacs. (Tue, 15 Aug 2023 04:19:01 GMT)

Acknowledgement sent to awrhygty <at> outlook.com:
New bug report received and forwarded. Copy sent to bug-gnu-emacs <at> gnu.org. (Tue, 15 Aug 2023 04:19:02 GMT)

Message #5 received at submit <at> debbugs.gnu.org:

From: awrhygty <at> outlook.com
To: bug-gnu-emacs <at> gnu.org
Subject: 29.1; archive-mode can not handle archive named with foreign
 characters
Date: Tue, 15 Aug 2023 13:17:54 +0900
If an archive file name contains characters that cannot be encoded in
the local language encoding (cp932 on Japanese Windows), archive-mode
fails to extract subfiles, even if the subfile name consists only of
ASCII characters.

For example, open c:/tmp/test丄.zip and type RET on an entry;
Emacs reports:
  unzip:  cannot find either c:/tmp/test .zip or c:/tmp/test .zip.zip.


In GNU Emacs 29.1 (build 2, x86_64-w64-mingw32) of 2023-08-02 built on
 AVALON
Windowing system distributor 'Microsoft Corp.', version 10.0.19045
System Description: Microsoft Windows 10 Pro (v10.0.2009.19045.3324)

Configured using:
 'configure --with-modules --without-dbus --with-native-compilation=aot
 --without-compress-install --with-tree-sitter CFLAGS=-O2'

Configured features:
ACL GIF GMP GNUTLS HARFBUZZ JPEG JSON LCMS2 LIBXML2 MODULES NATIVE_COMP
NOTIFY W32NOTIFY PDUMPER PNG RSVG SOUND SQLITE3 THREADS TIFF
TOOLKIT_SCROLL_BARS TREE_SITTER WEBP XPM ZLIB

(NATIVE_COMP present but libgccjit not available)

Important settings:
  value of $LANG: JPN
  locale-coding-system: cp932

Major mode: Dired by name

Minor modes in effect:
  shell-dirtrack-mode: t
  tooltip-mode: t
  global-eldoc-mode: t
  show-paren-mode: t
  electric-indent-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  buffer-read-only: t
  line-number-mode: t
  indent-tabs-mode: t
  transient-mark-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t

Load-path shadows:
None found.

Features:
(qp files-x shell pcomplete comint ansi-osc ansi-color ring dired-aux
image-mode exif arc-mode archive-mode pp shadow emacsbug help-mode
gnutls network-stream nsm mailalias smtpmail textsec uni-scripts url
url-proxy url-privacy url-expand url-methods url-history url-cookie
generate-lisp-file url-domsuf url-util url-parse auth-source eieio
eieio-core cl-macs json map byte-opt gv bytecomp byte-compile url-vars
idna-mapping ucs-normalize uni-confusable textsec-check sort cl-seq
misearch multi-isearch mail-extr message sendmail mailcap yank-media
puny dired dired-loaddefs rfc822 mml mml-sec password-cache epa derived
epg rfc6068 epg-config gnus-util text-property-search time-date subr-x
mm-decode mm-bodies mm-encode mail-parse rfc2231 rfc2047 rfc2045 mm-util
ietf-drums mail-prsvr mailabbrev mail-utils gmm-utils mailheader
cl-loaddefs cl-lib term/bobcat japan-util rmc iso-transl tooltip cconv
eldoc paren electric uniquify ediff-hook vc-hooks lisp-float-type
elisp-mode mwheel dos-w32 ls-lisp disp-table term/w32-win w32-win
w32-vars term/common-win tool-bar dnd fontset image regexp-opt fringe
tabulated-list replace newcomment text-mode lisp-mode prog-mode register
page tab-bar menu-bar rfn-eshadow isearch easymenu timer select
scroll-bar mouse jit-lock font-lock syntax font-core term/tty-colors
frame minibuffer nadvice seq simple cl-generic indonesian philippine
cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao
korean japanese eucjp-ms cp51932 hebrew greek romanian slovak czech
european ethiopic indian cyrillic chinese composite emoji-zwj charscript
charprop case-table epa-hook jka-cmpr-hook help abbrev obarray oclosure
cl-preloaded button loaddefs theme-loaddefs faces cus-face macroexp
files window text-properties overlay sha1 md5 base64 format env
code-pages mule custom widget keymap hashtable-print-readable backquote
threads w32notify w32 lcms2 multi-tty make-network-process
native-compile emacs)

Memory information:
((conses 16 235491 21481)
 (symbols 48 8709 3)
 (strings 32 40470 2713)
 (string-bytes 1 870636)
 (vectors 16 45015)
 (vector-slots 8 1266046 131540)
 (floats 8 105 300)
 (intervals 56 1343 0)
 (buffers 984 16))




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#65301; Package emacs. (Tue, 15 Aug 2023 11:14:02 GMT)

Message #8 received at 65301 <at> debbugs.gnu.org:

From: Eli Zaretskii <eliz <at> gnu.org>
To: awrhygty <at> outlook.com
Cc: 65301 <at> debbugs.gnu.org
Subject: Re: bug#65301: 29.1;
 archive-mode can not handle archive named with foreign characters
Date: Tue, 15 Aug 2023 14:13:06 +0300
tags 65301 wontfix
close 65301
thanks

> From: awrhygty <at> outlook.com
> Date: Tue, 15 Aug 2023 13:17:54 +0900
> 
> 
> If an archive file name contains characters that cannot be encoded in
> the local language encoding (cp932 on Japanese Windows), archive-mode
> fails to extract subfiles, even if the subfile name consists only of
> ASCII characters.
>
> For example, open c:/tmp/test丄.zip and type RET on an entry;
> Emacs reports:
>   unzip:  cannot find either c:/tmp/test .zip or c:/tmp/test .zip.zip.

This is because we run the unzip.exe program to extract the files, and
running subprocesses from Emacs on MS-Windows is limited to
command-line arguments which can be encoded in the system codepage.

Basically, this is a duplicate of bug#65300.
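
As an illustration of the limitation described above, the following is a
minimal Python sketch (not Emacs code, and not how Emacs itself invokes
subprocesses) that checks whether a file name survives conversion to a
given codepage. The helper names encodable_in and lossy_form are
hypothetical; cp932 is assumed as the system codepage, as in the report.

```python
def encodable_in(text: str, codec: str) -> bool:
    """Return True if every character of `text` can be encoded with `codec`."""
    try:
        text.encode(codec)
    except UnicodeEncodeError:
        return False
    return True


def lossy_form(text: str, codec: str) -> str:
    """Return `text` after a lossy round trip through `codec`.

    Characters the codec cannot represent come back as "?", which is
    Python's replacement marker; other conversions may substitute a
    different character.
    """
    return text.encode(codec, errors="replace").decode(codec)
```

Calling encodable_in("c:/tmp/test丄.zip", "cp932") shows whether the
archive name from the report can be represented in that codepage, and
lossy_form shows roughly what an external program would receive if it
cannot. The error message in the report shows a blank in place of the
character; the exact substitute depends on the conversion used.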




Added tag(s) wontfix. Request was from Eli Zaretskii <eliz <at> gnu.org> to control <at> debbugs.gnu.org. (Tue, 15 Aug 2023 11:14:02 GMT)

bug closed, send any further explanations to 65301 <at> debbugs.gnu.org and awrhygty <at> outlook.com Request was from Eli Zaretskii <eliz <at> gnu.org> to control <at> debbugs.gnu.org. (Tue, 15 Aug 2023 11:14:03 GMT)

Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#65301; Package emacs. (Tue, 15 Aug 2023 13:11:02 GMT)

Message #15 received at 65301 <at> debbugs.gnu.org:

From: awrhygty <at> outlook.com
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: 65301 <at> debbugs.gnu.org
Subject: Re: bug#65301: 29.1; archive-mode can not handle archive named with
 foreign characters
Date: Tue, 15 Aug 2023 22:09:50 +0900
Eli Zaretskii <eliz <at> gnu.org> writes:

> tags 65301 wontfix
> close 65301
> thanks
>
>> From: awrhygty <at> outlook.com
>> Date: Tue, 15 Aug 2023 13:17:54 +0900
>> 
>> 
>> If an archive file name contains characters that cannot be encoded in
>> the local language encoding (cp932 on Japanese Windows), archive-mode
>> fails to extract subfiles, even if the subfile name consists only of
>> ASCII characters.
>>
>> For example, open c:/tmp/test丄.zip and type RET on an entry;
>> Emacs reports:
>>   unzip:  cannot find either c:/tmp/test .zip or c:/tmp/test .zip.zip.
>
> This is because we run the unzip.exe program to extract the files, and
> running subprocesses from Emacs on MS-Windows is limited to
> command-line arguments which can be encoded in the system codepage.
>
> Basically, this is a duplicate of bug#65300.

I have installed 7z.exe as well as unzip.exe.
7z.exe works fine in a cmd terminal, but I cannot find a way to call
7z.exe from Emacs with parameters encoded in UTF-8.




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#65301; Package emacs. (Tue, 15 Aug 2023 14:17:01 GMT)

Message #18 received at 65301 <at> debbugs.gnu.org:

From: Eli Zaretskii <eliz <at> gnu.org>
To: awrhygty <at> outlook.com
Cc: 65301 <at> debbugs.gnu.org
Subject: Re: bug#65301: 29.1; archive-mode can not handle archive named with
 foreign characters
Date: Tue, 15 Aug 2023 17:16:09 +0300
> From: awrhygty <at> outlook.com
> Cc: 65301 <at> debbugs.gnu.org
> Date: Tue, 15 Aug 2023 22:09:50 +0900
> 
> Eli Zaretskii <eliz <at> gnu.org> writes:
> 
> > This is because we run the unzip.exe program to extract the files, and
> > running subprocesses from Emacs on MS-Windows is limited to
> > command-line arguments which can be encoded in the system codepage.
> >
> > Basically, this is a duplicate of bug#65300.
> 
> I have installed 7z.exe as well as unzip.exe.
> 7z.exe works fine in a cmd terminal, but I cannot find a way to call
> 7z.exe from Emacs with parameters encoded in UTF-8.

You can't.  It's a basic limitation of invoking programs from Emacs on
Windows, as explained above.

In reality, this is rarely a serious limitation, because most programs
users routinely invoke from Emacs on Windows don't support UTF-8
encoded arguments anyway.  But the limitation is deep, so even if we
had more of these situations, we cannot really fix this until Windows
systems support UTF-8 better (in particular in file-related APIs).




bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Wed, 13 Sep 2023 11:24:06 GMT)



GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.