GNU bug report logs - #33445
Say "U+1F52E"


Package: emacs

Reported by: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>

Date: Tue, 20 Nov 2018 18:52:02 UTC

Severity: wishlist

Tags: wontfix

Merged with 26477, 35704

Done: Lars Ingebrigtsen <larsi <at> gnus.org>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 33445 in the body.
You can then email your comments to 33445 AT debbugs.gnu.org in the normal way.



Report forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Tue, 20 Nov 2018 18:52:02 GMT)

Acknowledgement sent to 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>:
New bug report received and forwarded. Copy sent to bug-gnu-emacs <at> gnu.org. (Tue, 20 Nov 2018 18:52:02 GMT)

Message #5 received at submit <at> debbugs.gnu.org:

From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
To: bug-gnu-emacs <at> gnu.org
Subject: Say "U+1F52E"
Date: Wed, 21 Nov 2018 02:50:58 +0800
Regarding e.g.,
🔮 U+1F52E CRYSTAL BALL

Well doing C-u C-x = on it should really mention "U+1F52E" somewhere, by
default, without needing customization.

P.S., by the way here on Debian it is just a box with numbers in it,
with no package available to fix it.




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Tue, 20 Nov 2018 19:25:02 GMT)

Message #8 received at 33445 <at> debbugs.gnu.org:

From: Eli Zaretskii <eliz <at> gnu.org>
To: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
Cc: 33445 <at> debbugs.gnu.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Tue, 20 Nov 2018 21:23:59 +0200
> From: 積丹尼 Dan Jacobson
> 	<jidanni <at> jidanni.org>
> Date: Wed, 21 Nov 2018 02:50:58 +0800
> 
> Regarding e.g.,
> 🔮 U+1F52E CRYSTAL BALL
> 
> Well doing C-u C-x = on it should really mention "U+1F52E" somewhere, by
> default, without needing customization.

It does, at least twice.  Or maybe I don't understand what is it that
you are missing in the resulting display.

> P.S., by the way here on Debian it is just a box with numbers in it,
> with no package available to fix it.

You need to install a suitable font, I suggest Symbola.




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Tue, 20 Nov 2018 20:05:02 GMT)

Message #11 received at 33445 <at> debbugs.gnu.org:

From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: 33445 <at> debbugs.gnu.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 04:04:13 +0800
>>>>> "EZ" == Eli Zaretskii <eliz <at> gnu.org> writes:

>> Well doing C-u C-x = on it should really mention "U+1F52E" somewhere, by
>> default, without needing customization.

EZ> It does, at least twice.  Or maybe I don't understand what is it that
EZ> you are missing in the resulting display.

$ grep 1F52E output
code point in charset: 0x1F52E

I want it to say "U+1F52E" somewhere by default.

emacs-version "25.2.2"




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Tue, 20 Nov 2018 20:18:02 GMT)

Message #14 received at 33445 <at> debbugs.gnu.org:

From: Eli Zaretskii <eliz <at> gnu.org>
To: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
Cc: 33445 <at> debbugs.gnu.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Tue, 20 Nov 2018 22:17:28 +0200
> From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
> Cc: 33445 <at> debbugs.gnu.org
> Date: Wed, 21 Nov 2018 04:04:13 +0800
> 
> $ grep 1F52E output
> code point in charset: 0x1F52E

Yes, and the charset is Unicode (which is where "U" comes from).

> I want it to say "U+1F52E" somewhere by default.

Why?




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Tue, 20 Nov 2018 20:23:01 GMT)

Message #17 received at 33445 <at> debbugs.gnu.org:

From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: 33445 <at> debbugs.gnu.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 04:21:55 +0800
>>>>> "EZ" == Eli Zaretskii <eliz <at> gnu.org> writes:

>> I want it to say "U+1F52E" somewhere by default.

EZ> Why?

Because those are their proper names!
https://en.wikipedia.org/wiki/List_of_Unicode_characters




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Tue, 20 Nov 2018 21:48:01 GMT)

Message #20 received at 33445 <at> debbugs.gnu.org:

From: Werner LEMBERG <wl <at> gnu.org>
To: jidanni <at> jidanni.org
Cc: eliz <at> gnu.org, 33445 <at> debbugs.gnu.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Tue, 20 Nov 2018 22:46:52 +0100 (CET)
>>> I want it to say "U+1F52E" somewhere by default.
> 
> EZ> Why?
> 
> Because those are their proper names!
> https://en.wikipedia.org/wiki/List_of_Unicode_characters

I second that.  It might be a small difference to people who are well
acquainted with Emacs coding systems.  However, I believe that Joe
User would prefer the `U+' notation.  (I have a good knowledge of
coding systems, and even I would prefer to see the `U+' prefix :-)


    Werner




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Wed, 21 Nov 2018 03:35:02 GMT)

Message #23 received at 33445 <at> debbugs.gnu.org:

From: Eli Zaretskii <eliz <at> gnu.org>
To: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
Cc: 33445 <at> debbugs.gnu.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 05:34:15 +0200
> From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
> Cc: 33445 <at> debbugs.gnu.org
> Date: Wed, 21 Nov 2018 04:21:55 +0800
> 
> >>>>> "EZ" == Eli Zaretskii <eliz <at> gnu.org> writes:
> 
> >> I want it to say "U+1F52E" somewhere by default.
> 
> EZ> Why?
> 
> Because those are their proper names!
> https://en.wikipedia.org/wiki/List_of_Unicode_characters

The proper name is CRYSTAL BALL, and it is shown.




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Wed, 21 Nov 2018 03:41:01 GMT)

Message #26 received at 33445 <at> debbugs.gnu.org:

From: Eli Zaretskii <eliz <at> gnu.org>
To: Werner LEMBERG <wl <at> gnu.org>
Cc: 33445 <at> debbugs.gnu.org, jidanni <at> jidanni.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 05:40:41 +0200
> Date: Tue, 20 Nov 2018 22:46:52 +0100 (CET)
> Cc: eliz <at> gnu.org, 33445 <at> debbugs.gnu.org
> From: Werner LEMBERG <wl <at> gnu.org>
> 
> I second that.  It might be a small difference to people who are well
> acquainted with Emacs coding systems.  However, I believe that Joe
> User would prefer the `U+' notation.  (I have a good knowledge of
> coding systems, and even I would prefer to see the `U+' prefix :-)

This is not about coding systems at all.  The codepoint is shown at
least twice in the display, just without the U+ prefix.

The display produced by that function is already cluttered beyond
hope, so much so that many important details evade you until you
actually read each and every symbol.  Adding the U+ to them (where,
exactly?) would make a bad situation even worse, for very little
gain.

The proper Unicode name of the character, CRYSTAL BALL, is shown in
the display.




Added tag(s) wontfix. Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Wed, 21 Nov 2018 04:39:02 GMT)

Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Wed, 21 Nov 2018 05:46:02 GMT)

Message #31 received at 33445 <at> debbugs.gnu.org:

From: Werner LEMBERG <wl <at> gnu.org>
To: eliz <at> gnu.org
Cc: 33445 <at> debbugs.gnu.org, jidanni <at> jidanni.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 06:44:35 +0100 (CET)
>> I second that.  It might be a small difference to people who are
>> well acquainted with Emacs coding systems.  However, I believe that
>> Joe User would prefer the `U+' notation.  (I have a good knowledge
>> of coding systems, and even I would prefer to see the `U+' prefix
>> :-)
> 
> This is not about coding systems at all.  The codepoint is shown at
> least twice in the display, just without the U+ prefix.
> 
> The display produced by that function is already cluttered beyond
> hope, so much so that many important details evade you until you
> actually read each and every symbol.

Well, this is a different issue.

> Adding the U+ to them (where, exactly?) would make a bad situation
> even worse, for very little gain.
> 
> The proper Unicode name of the character, CRYSTAL BALL, is shown in
> the display.

I would like to see

  U+1F52E CRYSTAL BALL


    Werner




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Wed, 21 Nov 2018 09:30:02 GMT)

Message #34 received at 33445 <at> debbugs.gnu.org:

From: Eli Zaretskii <eliz <at> gnu.org>
To: Werner LEMBERG <wl <at> gnu.org>
Cc: 33445 <at> debbugs.gnu.org, jidanni <at> jidanni.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 11:28:59 +0200
> Date: Wed, 21 Nov 2018 06:44:35 +0100 (CET)
> Cc: jidanni <at> jidanni.org, 33445 <at> debbugs.gnu.org
> From: Werner LEMBERG <wl <at> gnu.org>
> 
> I would like to see
> 
>   U+1F52E CRYSTAL BALL

Given that the current display is

	       position: 588 of 1015 (58%), column: 2
	      character: 🔮 (displayed as 🔮) (codepoint 128302, #o372456, #x1f52e)
      preferred charset: unicode (Unicode (ISO10646))
  code point in charset: 0x1F52E
		 script: symbol
		 syntax: w 	which means: word
	       category: .:Base
	       to input: type "C-x 8 RET 1f52e" or "C-x 8 RET CRYSTAL BALL"
	    buffer code: #xF0 #x9F #x94 #xAE
	      file code: #xF0 #x9F #x94 #xAE (encoded by coding system utf-8)
		display: by this font (glyph code)
      uniscribe:-outline-Symbola-normal-normal-normal-serif-15-*-*-*-p-*-iso8859-1 (#x1AE4)

  Character code properties: customize what to show
    name: CRYSTAL BALL

With the codepoint shown 3 times above the name, why would you want it
displayed yet another time?  Isn't that highly redundant?

Emacs displays this information for Emacs users.  "C-x =" is a
general-purpose command, not a command intended for learning Unicode
or perusing Unicode blocks.  If someone writes a command for viewing
Unicode blocks, I can understand why U+1F52E should be there.  But
this is not that case.
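
As an aside, the numbers in the display quoted above can be cross-checked outside Emacs. The following is a minimal Python sketch, not Emacs code: the function name `describe` and its dictionary keys are made up for illustration, and it relies only on the standard `unicodedata` module. It derives the decimal, octal, and hex codepoint, the UTF-8 bytes, and the character name, and also formats the same codepoint in the U+ notation requested in this report.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Collect the same facts about one character that the display lists.

    The key names are illustrative only; they are not Emacs identifiers.
    """
    cp = ord(ch)  # the Unicode codepoint as an integer
    return {
        "decimal": cp,
        "octal": f"#o{cp:o}",
        "hex": f"#x{cp:x}",
        # U+ notation: uppercase hex, padded to at least four digits.
        "u_plus": f"U+{cp:04X}",
        # The UTF-8 bytes, written the way the "buffer code" line writes them.
        "utf8": " ".join(f"#x{b:02X}" for b in ch.encode("utf-8")),
        "name": unicodedata.name(ch),
    }


info = describe("\U0001F52E")
for key, value in info.items():
    print(f"{key:>8}: {value}")
```

The hex value and the U+ form carry the same number; the difference is purely one of notation.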




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Wed, 21 Nov 2018 10:38:02 GMT)

Message #37 received at 33445 <at> debbugs.gnu.org:

From: Werner LEMBERG <wl <at> gnu.org>
To: eliz <at> gnu.org
Cc: 33445 <at> debbugs.gnu.org, jidanni <at> jidanni.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 11:36:58 +0100 (CET)
From: Eli Zaretskii <eliz <at> gnu.org>
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 11:28:59 +0200

>> Date: Wed, 21 Nov 2018 06:44:35 +0100 (CET)
>> Cc: jidanni <at> jidanni.org, 33445 <at> debbugs.gnu.org
>> From: Werner LEMBERG <wl <at> gnu.org>
>> 
>> I would like to see
>> 
>>   U+1F52E CRYSTAL BALL
> 
> Given that the current display is
> 
> 	       position: 588 of 1015 (58%), column: 2
> 	      character: 🔮 (displayed as 🔮) (codepoint 128302, #o372456, #x1f52e)

Is this `codepoint' ever different to the value shown two rows later?

>       preferred charset: unicode (Unicode (ISO10646))
>   code point in charset: 0x1F52E

This could be

  code point in charset: U+1F52E

by special-casing Unicode, contrary to other charsets.


    Werner
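
The special-casing suggested in this message could look roughly like the sketch below. It is written in Python purely for illustration and is not how Emacs implements its display; the function name `format_code_point` and the charset strings passed to it are hypothetical.

```python
def format_code_point(charset: str, code: int) -> str:
    """Format a code point, using U+ notation only for the Unicode charset.

    Hypothetical helper: `charset` is just a label compared against
    the string "unicode"; every other charset keeps the 0x prefix.
    """
    if charset == "unicode":
        # U+ notation uses uppercase hex with at least four digits.
        return f"U+{code:04X}"
    return f"0x{code:X}"


print(format_code_point("unicode", 0x1F52E))
print(format_code_point("iso-8859-7", 0xE1))
```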

Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Wed, 21 Nov 2018 11:00:02 GMT)

Message #40 received at 33445 <at> debbugs.gnu.org:

From: Eli Zaretskii <eliz <at> gnu.org>
To: Werner LEMBERG <wl <at> gnu.org>
Cc: 33445 <at> debbugs.gnu.org, jidanni <at> jidanni.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 12:59:44 +0200
> Date: Wed, 21 Nov 2018 11:36:58 +0100 (CET)
> Cc: jidanni <at> jidanni.org, 33445 <at> debbugs.gnu.org
> From: Werner LEMBERG <wl <at> gnu.org>
> 
> > 	       position: 588 of 1015 (58%), column: 2
> > 	      character: 🔮 (displayed as 🔮) (codepoint 128302, #o372456, #x1f52e)
> 
> Is this `codepoint' ever different to the value shown two rows later?

For characters from other charsets, yes.

> >       preferred charset: unicode (Unicode (ISO10646))
> >   code point in charset: 0x1F52E
> 
> This could be
> 
>   code point in charset: U+1F52E
> 
> by special-casing Unicode, contrary to other charsets.

What would be the utility of U+ for Emacs users?  It just makes this
more confusing for people who aren't familiar with the U+nnnn notation
(which obviously excludes me, you, and the OP).  Emacs doesn't accept
that notation in any input mode, and never displays it.

But I'm beginning to repeat myself, so I will shut up on this from now
on.
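
To illustrate the point above that a character's codepoint can differ from its code in another charset, here is a small Python sketch. It uses Python's `iso8859_7` codec as a stand-in for a non-Unicode charset, which is an assumption made for illustration; Emacs charsets are a separate mechanism, and the helper `code_in_charset` is hypothetical.

```python
def code_in_charset(ch: str, codec: str) -> int:
    """Return the single-byte code of `ch` in an 8-bit codec.

    Hypothetical helper; raises ValueError if the codec does not
    encode the character as exactly one byte.
    """
    encoded = ch.encode(codec)
    if len(encoded) != 1:
        raise ValueError(f"{codec} does not encode {ch!r} as one byte")
    return encoded[0]


alpha = "\u03b1"  # GREEK SMALL LETTER ALPHA
unicode_cp = ord(alpha)                           # codepoint in Unicode
greek_code = code_in_charset(alpha, "iso8859_7")  # code in ISO 8859-7
print(f"Unicode: U+{unicode_cp:04X}, ISO 8859-7: 0x{greek_code:02X}")
```

The same character has two different numbers depending on which charset the number is relative to, which is why a display may show both.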




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Wed, 21 Nov 2018 12:42:02 GMT)

Message #43 received at 33445 <at> debbugs.gnu.org:

From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: Werner LEMBERG <wl <at> gnu.org>, 33445 <at> debbugs.gnu.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 20:41:20 +0800
Also currently saying

            character: 🔮 (displayed as 🔮) (codepoint 128302, #o372456, #x1f52e)
    preferred charset: unicode (Unicode (ISO10646))
code point in charset: 0x1F52E

leaves the user still not totally sure the last line really is talking
about the second line or not.

Otherwise instead of saying

    preferred charset: unicode (Unicode (ISO10646))
code point in charset: 0x1F52E

It should say

    preferred charset: unicode (Unicode (ISO10646))
code point in preferred charset: 0x1F52E

That way we could be sure it is really talking about unicode.




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Wed, 21 Nov 2018 13:10:01 GMT)

Message #46 received at 33445 <at> debbugs.gnu.org:

From: Eli Zaretskii <eliz <at> gnu.org>
To: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
Cc: wl <at> gnu.org, 33445 <at> debbugs.gnu.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Wed, 21 Nov 2018 15:09:55 +0200
> From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
> Cc: Werner LEMBERG <wl <at> gnu.org>,  33445 <at> debbugs.gnu.org
> Date: Wed, 21 Nov 2018 20:41:20 +0800
> 
> Otherwise instead of saying
> 
>     preferred charset: unicode (Unicode (ISO10646))
> code point in charset: 0x1F52E
> 
> It should say
> 
>     preferred charset: unicode (Unicode (ISO10646))
> code point in preferred charset: 0x1F52E
> 
> That way we could be sure it is really talking about unicode.

Fine with me, but we will have to realign the other lines, as they are
all aligned to have the colon in the same column.




Merged 26477 33445. Request was from Noam Postavsky <npostavs <at> gmail.com> to control <at> debbugs.gnu.org. (Wed, 21 Nov 2018 14:20:02 GMT)

Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#33445; Package emacs. (Thu, 22 Nov 2018 15:33:02 GMT)

Message #51 received at 33445 <at> debbugs.gnu.org:

From: Eli Zaretskii <eliz <at> gnu.org>
To: jidanni <at> jidanni.org
Cc: 33445 <at> debbugs.gnu.org
Subject: Re: bug#33445: Say "U+1F52E"
Date: Thu, 22 Nov 2018 17:32:23 +0200
> Date: Wed, 21 Nov 2018 15:09:55 +0200
> From: Eli Zaretskii <eliz <at> gnu.org>
> Cc: 33445 <at> debbugs.gnu.org
> 
> > Otherwise instead of saying
> > 
> >     preferred charset: unicode (Unicode (ISO10646))
> > code point in charset: 0x1F52E
> > 
> > It should say
> > 
> >     preferred charset: unicode (Unicode (ISO10646))
> > code point in preferred charset: 0x1F52E
> > 
> > That way we could be sure it is really talking about unicode.
> 
> Fine with me, but we will have to realign the other lines, as they are
> all aligned to have the colon in the same column.

I've now removed the "preferred" part from the display, so it says
just "charset", thus the next line is more clearly related to this
one.

Thanks.




Merged 26477 33445 35704. Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Mon, 13 May 2019 00:31:02 GMT)

bug closed, send any further explanations to 26477 <at> debbugs.gnu.org and 積丹尼 Dan Jacobson <jidanni <at> jidanni.org> Request was from Lars Ingebrigtsen <larsi <at> gnus.org> to control <at> debbugs.gnu.org. (Fri, 23 Aug 2019 05:34:02 GMT)

bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Fri, 20 Sep 2019 11:24:03 GMT)

This bug report was last modified 5 years and 275 days ago.



GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.