GNU bug report logs - #42007
ps-print encodes post-ASCII Unicode incorrectly for common characters

Previous Next

Package: emacs;

Reported by: "James P. Ascher" <jpsa <at> protonmail.com>

Date: Mon, 22 Jun 2020 17:02:01 UTC

Severity: normal

Merged with 35355, 41812

Found in versions 27.0.50, 28.0.50

To reply to this bug, email your comments to 42007 AT debbugs.gnu.org.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-gnu-emacs <at> gnu.org:
bug#42007; Package emacs. (Mon, 22 Jun 2020 17:02:01 GMT) Full text and rfc822 format available.

Acknowledgement sent to "James P. Ascher" <jpsa <at> protonmail.com>:
New bug report received and forwarded. Copy sent to bug-gnu-emacs <at> gnu.org. (Mon, 22 Jun 2020 17:02:01 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: "James P. Ascher" <jpsa <at> protonmail.com>
To: bug-gnu-emacs <at> gnu.org, Vinicius Jose Latorre <viniciusjl <at> ig.com.br>
Subject: ps-print encodes post-ASCII Unicode incorrectly for common characters
Date: Mon, 22 Jun 2020 16:47:52 +0000
I'd like to use ps-print to print hard copies of emails from my
colleagues, but it fails for certain common Unicode characters beyond
the basic ASCII, rendering them as '?'.

MINIMAL EXAMPLE:

1. New buffer with "What’s up?—Dude."
2. Call C-u M-x ps-print-buffer and save as test.ps
3. The line in question PostScript code reads:

(What?s up??Dude.) S

which renders:

"What?s up??Dude."

It should render "What’s up?—Dude."


DISCUSSION:

This fails for emoji and most of the hello file as well, but that's not
really what I'm after.  In trying to debug this, I made a little
progress.  In ps-print.el, `ps-output-string-prim` seems to be designed
to handle these, however:

(ps-output-string-prim "What’s up?—Dude.")

gives

(What\3FFFE2\3FFF80\3FFF99s up?\3FFFE2\3FFF80\3FFF94Dude.)

Putting that string in the PostScript file doesn't work quite right
either: it drops the back slash. So I think there are two bugs here:

Bug 1: Post-ASCII Unicode encodes as '?' instead of "\number"
Bug 2: Post-ASCII Unicode should be encoded so at least Ghostscript can
handle it.

Ghostscript supports post-ASCII Unicode:
https://ghostscript.com/doc/9.52/Use.htm#UnicodeTT

But, I don't know if such behavior is standard PostScript.  I also don't
know if there's a more obvious solution.

Respectfully submitted,
-James

-- 
James P. Ascher
Doctoral Student, English Department
University of Virginia





Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#42007; Package emacs. (Mon, 22 Jun 2020 18:06:02 GMT) Full text and rfc822 format available.

Message #8 received at 42007 <at> debbugs.gnu.org (full text, mbox):

From: Eli Zaretskii <eliz <at> gnu.org>
To: "James P. Ascher" <jpsa <at> protonmail.com>
Cc: 42007 <at> debbugs.gnu.org, viniciusjl <at> ig.com.br
Subject: Re: bug#42007: ps-print encodes post-ASCII Unicode incorrectly for
 common characters
Date: Mon, 22 Jun 2020 21:05:29 +0300
> Date: Mon, 22 Jun 2020 16:47:52 +0000
> From: "James P. Ascher" via "Bug reports for GNU Emacs,
>  the Swiss army knife of text editors" <bug-gnu-emacs <at> gnu.org>
> 
> I'd like to use ps-print to print hard copies of emails from my
> colleagues, but it fails for certain common Unicode characters beyond
> the basic ASCII, rendering them as '?'.
> 
> MINIMAL EXAMPLE:
> 
> 1. New buffer with "What’s up?—Dude."
> 2. Call C-u M-x ps-print-buffer and save as test.ps
> 3. The line in question PostScript code reads:
> 
> (What?s up??Dude.) S
> 
> which renders:
> 
> "What?s up??Dude."
> 
> It should render "What’s up?—Dude."

I don't think ps-print supports non-ASCII characters beyond Latin-1,
definitely not unless you customize ps-mule and ps-bdf with
appropriate font.

Sorry.




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#42007; Package emacs. (Mon, 22 Jun 2020 19:12:01 GMT) Full text and rfc822 format available.

Message #11 received at 42007 <at> debbugs.gnu.org (full text, mbox):

From: "James P. Ascher" <jpsa <at> protonmail.com>
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: 42007 <at> debbugs.gnu.org, viniciusjl <at> ig.com.br
Subject: Re: bug#42007: ps-print encodes post-ASCII Unicode incorrectly for
 common characters
Date: Mon, 22 Jun 2020 19:11:07 +0000
> I don't think ps-print supports non-ASCII characters beyond Latin-1,
> definitely not unless you customize ps-mule and ps-bdf with
> appropriate font.
>
> Sorry.

No worries!  Knowing not to bang my head on that wall is
certainly progress.

Thanks for the quick response.





Severity set to 'wishlist' from 'normal' Request was from Stefan Kangas <stefan <at> marxist.se> to control <at> debbugs.gnu.org. (Thu, 13 Aug 2020 00:51:02 GMT) Full text and rfc822 format available.

Forcibly Merged 35355 42007. Request was from Lars Ingebrigtsen <larsi <at> gnus.org> to control <at> debbugs.gnu.org. (Sat, 31 Jul 2021 15:03:02 GMT) Full text and rfc822 format available.

Forcibly Merged 35355 41812 42007. Request was from Lars Ingebrigtsen <larsi <at> gnus.org> to control <at> debbugs.gnu.org. (Sat, 31 Jul 2021 16:32:02 GMT) Full text and rfc822 format available.

Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#42007; Package emacs. (Tue, 07 Dec 2021 16:09:02 GMT) Full text and rfc822 format available.

Message #20 received at 42007 <at> debbugs.gnu.org (full text, mbox):

From: Peter Münster <pm <at> a16n.net>
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: "James P. Ascher" <jpsa <at> protonmail.com>, 42007 <at> debbugs.gnu.org,
 viniciusjl <at> ig.com.br
Subject: Re: bug#42007: ps-print encodes post-ASCII Unicode incorrectly for
 common characters
Date: Tue, 07 Dec 2021 17:08:24 +0100
On Mon, Jun 22 2020, Eli Zaretskii wrote:

> I don't think ps-print supports non-ASCII characters beyond Latin-1,
> definitely not unless you customize ps-mule and ps-bdf with
> appropriate font.

Hi,

Where could I find an example setup please? I.e. what font should be
used, that looks similar to the standard font used by ps-print and that
supports at least latin-9 characters like the €-symbol? And how should
ps-mule be set up to use that font?

TIA for any pointers,
-- 
           Peter




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#42007; Package emacs. (Tue, 07 Dec 2021 16:55:01 GMT) Full text and rfc822 format available.

Message #23 received at 42007 <at> debbugs.gnu.org (full text, mbox):

From: Eli Zaretskii <eliz <at> gnu.org>
To: Peter Münster <pm <at> a16n.net>
Cc: jpsa <at> protonmail.com, 42007 <at> debbugs.gnu.org, viniciusjl <at> ig.com.br
Subject: Re: bug#42007: ps-print encodes post-ASCII Unicode incorrectly for
 common characters
Date: Tue, 07 Dec 2021 18:54:15 +0200
> From: Peter Münster <pm <at> a16n.net>
> Cc: "James P. Ascher" <jpsa <at> protonmail.com>,  42007 <at> debbugs.gnu.org,
>   viniciusjl <at> ig.com.br
> Date: Tue, 07 Dec 2021 17:08:24 +0100
> 
> On Mon, Jun 22 2020, Eli Zaretskii wrote:
> 
> > I don't think ps-print supports non-ASCII characters beyond Latin-1,
> > definitely not unless you customize ps-mule and ps-bdf with
> > appropriate font.
> 
> Hi,
> 
> Where could I find an example setup please? I.e. what font should be
> used, that looks similar to the standard font used by ps-print and that
> supports at least latin-9 characters like the €-symbol? And how should
> ps-mule be set up to use that font?

The only source of documentation I know of are the commentaries at the
beginning of ps-mule.el and the sample settings there which provide
examples.




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#42007; Package emacs. (Tue, 07 Dec 2021 18:42:02 GMT) Full text and rfc822 format available.

Message #26 received at 42007 <at> debbugs.gnu.org (full text, mbox):

From: Peter Münster <pm <at> a16n.net>
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: jpsa <at> protonmail.com, 42007 <at> debbugs.gnu.org
Subject: Re: bug#42007: ps-print encodes post-ASCII Unicode incorrectly for
 common characters
Date: Tue, 07 Dec 2021 19:40:54 +0100
[Message part 1 (text/plain, inline)]
On Tue, Dec 07 2021, Eli Zaretskii wrote:

> The only source of documentation I know of are the commentaries at the
> beginning of ps-mule.el and the sample settings there which provide
> examples.

Unfortunately the bdf-fonts are very ugly...

-- 
           Peter
[signature.asc (application/pgp-signature, inline)]

This bug report was last modified 3 years and 190 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.