GNU bug report logs - #20623
XML and HTML files with encoding/charset="utf-8" declaration lose BOM; Coding system is reset from utf-8-with-signature to utf-8 on save

Previous Next

Package: emacs;

Reported by: Simon Ledergerber <sledergerber <at> gmx.net>

Date: Thu, 21 May 2015 18:53:02 UTC

Severity: normal

Found in version 26.1

Fixed in version 26.2

Done: Eli Zaretskii <eliz <at> gnu.org>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: Eli Zaretskii <eliz <at> gnu.org>
To: Stefan Monnier <monnier <at> iro.umontreal.ca>
Cc: sledergerber <at> gmx.net, 20623 <at> debbugs.gnu.org
Subject: bug#20623: XML and HTML files with encoding/charset="utf-8"	declaration loose BOM; Coding system is reset from utf-8-with-signature to utf-8 on save
Date: Sat, 23 May 2015 09:44:12 +0300
> From: Stefan Monnier <monnier <at> iro.umontreal.ca>
> Cc: sledergerber <at> gmx.net,  20623 <at> debbugs.gnu.org
> Date: Fri, 22 May 2015 17:51:07 -0400
> 
> >> > What would you expect Emacs to do instead?  It just obeys the stated
> >> > encoding, which says nothing about the BOM.  How can Emacs know when
> >> > to use utf-8 and when utf-8-with-signature?
> >> To the extent that Emacs has seen the BOM when opening the file, it
> >> would make sense for Emacs to try and preserve this detail.
> > It does.
> 
> While there are cases where it does, this bug report is about a case
> where it doesn't, IIUC.

AFAIU, that happened because the user has this in ~/.emacs:

  (setq-default buffer-file-coding-system 'utf-8-dos)

IMO, this bad customization should be removed, and then the problem
will go away.




This bug report was last modified 6 years and 279 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.