GNU bug report logs -
#42007
ps-print encodes post-ASCII Unicode incorrectly for common characters
Previous Next
Full log
View this message in rfc822 format
I'd like to use ps-print to print hard copies of emails from my
colleagues, but it fails for certain common Unicode characters beyond
the basic ASCII, rendering them as '?'.
MINIMAL EXAMPLE:
1. New buffer with "What’s up?—Dude."
2. Call C-u M-x ps-print-buffer and save as test.ps
3. The line in question PostScript code reads:
(What?s up??Dude.) S
which renders:
"What?s up??Dude."
It should render "What’s up?—Dude."
DISCUSSION:
This fails for emoji and most of the hello file as well, but that's not
really what I'm after. In trying to debug this, I made a little
progress. In ps-print.el, `ps-output-string-prim` seems to be designed
to handle these, however:
(ps-output-string-prim "What’s up?—Dude.")
gives
(What\3FFFE2\3FFF80\3FFF99s up?\3FFFE2\3FFF80\3FFF94Dude.)
Putting that string in the PostScript file doesn't work quite right
either: it drops the back slash. So I think there are two bugs here:
Bug 1: Post-ASCII Unicode encodes as '?' instead of "\number"
Bug 2: Post-ASCII Unicode should be encoded so at least Ghostscript can
handle it.
Ghostscript supports post-ASCII Unicode:
https://ghostscript.com/doc/9.52/Use.htm#UnicodeTT
But, I don't know if such behavior is standard PostScript. I also don't
know if there's a more obvious solution.
Respectfully submitted,
-James
--
James P. Ascher
Doctoral Student, English Department
University of Virginia
This bug report was last modified 3 years and 190 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.