#57531 - 28.1; Character encoding missing for "eo"

GNU bug report logs - #57531
28.1; Character encoding missing for "eo"

Package: emacs

Reported by: Jonathan Reeve <jonathan <at> jonreeve.com>

Date: Thu, 1 Sep 2022 19:34:02 UTC

Severity: normal

Tags: moreinfo

Found in version 28.1

Done: Lars Ingebrigtsen <larsi <at> gnus.org>

Bug is archived. No further changes may be made.

From: Gregory Heytings <gregory <at> heytings.org>
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: jonathan <at> jonreeve.com, 57531 <at> debbugs.gnu.org
Subject: bug#57531: 28.1; Character encoding missing for "eo"
Date: Mon, 05 Sep 2022 13:33:12 +0000

>>> What would you suggest that Emacs does instead?
>>
>> I don't know, because anything that it could do would be backward
>> incompatible.
>
> The only change I could think of that is almost backward-compatible
> (except for this single locale) is the one I posted, if we modify it to
> also make the 'lang-info' pseudo-encoding override the locale.alias
> file.
>

Agreed, yes.

>> What is clear is that, on reasonably modern systems, legacy locales are
>> not used anymore, and their use is discouraged (e.g. the Debian
>> installer does not present you with any legacy encoding, they remain
>> available but to activate them you need to edit the /etc/locale.gen
>> file manually).  So perhaps Emacs could always assume UTF-8, and use
>> another encoding only when there are good reasons to do so (e.g. when
>> opening a file with a legacy encoding).  The presence of the
>> equivalence eo / Latin-3 in locale.alias is IMO not a good enough
>> reason.
>
> I have no idea what this kind of change could do.
>

I have no idea either, I was thinking aloud.  But what is clear (at least
to me) is that this change is inevitable at some point.  UTF-8 has been
the default encoding almost everywhere for two decades or so, and that's
unlikely to change in the foreseeable future.  In that world we cannot
continue forever to let Emacs choose another encoding based on some
heuristics, because "nobody" expects that anymore.  Unless there's a good
reason to do so, of course.

>
> Maybe nothing, maybe breakage across the board.  Keep in mind that the
> default encoding is used for stuff other than decoding text in files
> Emacs visits, and also for some important tasks during startup.
>
> I also think our encoding detection doesn't always succeed to discern
> between UTF-8 and single-byte Latin-N encodings.
>

I keep all that in mind, yes 😃
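
To illustrate the point about UTF-8 versus Latin-N detection, here is a minimal Python sketch. It is not how Emacs detects encodings; `guess_encoding` is a hypothetical helper and the sample word is chosen only for illustration. It shows that Latin-3 bytes for Esperanto text are usually rejected by a UTF-8 decoder, whereas UTF-8 bytes decode as Latin-3 without any error (producing garbage), and that pure ASCII input is valid in both, so the check cannot tell them apart.

```python
# Hypothetical helper for illustration only; not part of Emacs.
def guess_encoding(data: bytes, fallback: str = "iso8859_3") -> str:
    """Return "utf-8" if data is valid UTF-8, otherwise the fallback."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return fallback
    return "utf-8"


text = "ĉirkaŭ"  # Esperanto word using letters covered by Latin-3
utf8_bytes = text.encode("utf-8")
latin3_bytes = text.encode("iso8859_3")

# Latin-3 bytes are not valid UTF-8 here, so the fallback is chosen.
print(guess_encoding(latin3_bytes))

# UTF-8 bytes decode as Latin-3 without an error, but the result is
# not the original text, so a wrong Latin-3 default fails silently.
print(utf8_bytes.decode("iso8859_3") == text)

# Pure ASCII is valid in both encodings; the check cannot discern them.
print(guess_encoding(b"saluton"))
```

The asymmetry is the relevant part: strict UTF-8 validation is a fairly strong signal, while a single-byte encoding accepts almost any byte sequence.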

This bug report was last modified 2 years and 228 days ago.
