GNU bug report logs - #20623
XML and HTML files with encoding/charset="utf-8" declaration lose BOM; Coding system is reset from utf-8-with-signature to utf-8 on save

Previous Next

Package: emacs;

Reported by: Simon Ledergerber <sledergerber <at> gmx.net>

Date: Thu, 21 May 2015 18:53:02 UTC

Severity: normal

Found in version 26.1

Fixed in version 26.2

Done: Eli Zaretskii <eliz <at> gnu.org>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: Eli Zaretskii <eliz <at> gnu.org>
To: Stefan Monnier <monnier <at> iro.umontreal.ca>
Cc: sledergerber <at> gmx.net, 20623 <at> debbugs.gnu.org
Subject: bug#20623: XML and HTML files with encoding/charset="utf-8"	declaration loose BOM; Coding system is reset from utf-8-with-signature to utf-8 on save
Date: Fri, 22 May 2015 18:26:57 +0300
> From: Stefan Monnier <monnier <at> iro.umontreal.ca>
> Cc: Simon Ledergerber <sledergerber <at> gmx.net>,  20623 <at> debbugs.gnu.org
> Date: Fri, 22 May 2015 11:22:27 -0400
> 
> > What would you expect Emacs to do instead?  It just obeys the stated
> > encoding, which says nothing about the BOM.  How can Emacs know when
> > to use utf-8 and when utf-8-with-signature?
> 
> To the extent that Emacs has seen the BOM when opening the file, it
> would make sense for Emacs to try and preserve this detail.

It does.




This bug report was last modified 6 years and 279 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.