GNU bug report logs - #10136
Can't view some strange characters in some of the man pages

Previous Next

Package: coreutils;

Reported by: Harold Raulston <hr0201 <at> gmail.com>

Date: Fri, 25 Nov 2011 18:09:01 UTC

Severity: normal

Tags: moreinfo, notabug

Done: era eriksson <era <at> iki.fi>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 10136 in the body.
You can then email your comments to 10136 AT debbugs.gnu.org in the normal way.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-coreutils <at> gnu.org:
bug#10136; Package coreutils. (Fri, 25 Nov 2011 18:09:01 GMT) Full text and rfc822 format available.

Acknowledgement sent to Harold Raulston <hr0201 <at> gmail.com>:
New bug report received and forwarded. Copy sent to bug-coreutils <at> gnu.org. (Fri, 25 Nov 2011 18:09:02 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Harold Raulston <hr0201 <at> gmail.com>
To: bug-coreutils <at> gnu.org
Subject: Can't view some strange characters in some of the man pages
Date: Fri, 25 Nov 2011 10:50:39 +0000
[Message part 1 (text/plain, inline)]
Hi,

Could you tell me what encoding I need to use to view your man pages?

I've tried Unicode, Western, Western ISO, but still get some unreadable
characters in the EXAMPLES (I've just looked at the find and du commands so
far):

â€â€™ linuxcommand find1 can't display read

BTW, I'm using Win7 Pro English, IE9. All latest updates.
I have the same problem in Chrome...

Thx,
Harold
[Message part 2 (text/html, inline)]

Information forwarded to bug-coreutils <at> gnu.org:
bug#10136; Package coreutils. (Fri, 25 Nov 2011 21:25:02 GMT) Full text and rfc822 format available.

Message #8 received at 10136 <at> debbugs.gnu.org (full text, mbox):

From: "Alan Curry" <pacman-cu <at> kosh.dhis.org>
To: hr0201 <at> gmail.com (Harold Raulston)
Cc: 10136 <at> debbugs.gnu.org
Subject: Re: bug#10136: Can't view some strange characters in some of the man
	pages
Date: Fri, 25 Nov 2011 16:22:42 -0500 (GMT+5)
Harold Raulston writes:
> 
> Hi,
> 
> Could you tell me what encoding I need to use to view your man pages?
> 
> I've tried Unicode, Western, Western ISO, but still get some unreadable
> characters in the EXAMPLES (I've just looked at the find and du commands so
> far):
> 
> =C3=A2=E2=82=AC=C3=A2=E2=82=AC=E2=84=A2 linuxcommand find1 can't display re=
> ad
> 
> BTW, I'm using Win7 Pro English, IE9. All latest updates.
> I have the same problem in Chrome...

man pages are read with the man program. HTML is Not The Way.

[c3 a2 e2 82 ac c3 a2 e2 82 ac e2 84 a2] is what you get when you start with
U+2019 RIGHT SINGLE QUOTATION MARK in UTF8, then misinterpret it as
windows-1252 and convert it to UTF8 again.

We were *so* unfortunate when we didn't have all these extra kinds of
quotation marks.

-- 
Alan Curry




Information forwarded to bug-coreutils <at> gnu.org:
bug#10136; Package coreutils. (Fri, 25 Nov 2011 22:06:02 GMT) Full text and rfc822 format available.

Message #11 received at 10136 <at> debbugs.gnu.org (full text, mbox):

From: Bob Proulx <bob <at> proulx.com>
To: Harold Raulston <hr0201 <at> gmail.com>, 10136 <at> debbugs.gnu.org
Subject: Re: bug#10136: Can't view some strange characters in some of the man
	pages
Date: Fri, 25 Nov 2011 15:03:35 -0700
Alan Curry wrote:
> Harold Raulston writes:
> > Could you tell me what encoding I need to use to view your man pages?
> > 
> > I've tried Unicode, Western, Western ISO, but still get some unreadable
> > characters in the EXAMPLES (I've just looked at the find and du commands so
> > far):
> > 
> > â€â€™ linuxcommand find1 can't display read

I do not see any unicode characters in the man page even with LC_ALL
set to en_US.UTF-8.

> > BTW, I'm using Win7 Pro English, IE9. All latest updates.
> > I have the same problem in Chrome...
> 
> man pages are read with the man program. HTML is Not The Way.

Agreed.

Harold, you said IE9 in the same message talking about man pages.
This creates confusion.  Man pages are read with the man program.

  $ man du

But if you are using the web browser then it sounds like you are
looking at a web page instead?  If that is true then the web page you
are looking at will be the problem.

Harold, Please tell us what web page URL to which you are referering.

Documentation for the coreutils is available on the web here:

  http://www.gnu.org/software/coreutils/manual/

All of those pages look okay to me.  For 'du' here is the relevant
documentation page on the web.

  http://www.gnu.org/software/coreutils/manual/html_node/du-invocation.html#du-invocation

And of course the info pages should be installed on your system.  You
can access them using the 'info' command.

  $ info coreutils

> [c3 a2 e2 82 ac c3 a2 e2 82 ac e2 84 a2] is what you get when you start with
> U+2019 RIGHT SINGLE QUOTATION MARK in UTF8, then misinterpret it as
> windows-1252 and convert it to UTF8 again.

Good observation.

> We were *so* unfortunate when we didn't have all these extra kinds of
> quotation marks.

/me shakes head, People do like eye candy.

Bob




Added tag(s) moreinfo. Request was from Bob Proulx <bob <at> proulx.com> to control <at> debbugs.gnu.org. (Fri, 25 Nov 2011 22:10:02 GMT) Full text and rfc822 format available.

Information forwarded to bug-coreutils <at> gnu.org:
bug#10136; Package coreutils. (Sat, 26 Nov 2011 11:49:02 GMT) Full text and rfc822 format available.

Message #16 received at 10136 <at> debbugs.gnu.org (full text, mbox):

From: Pádraig Brady <P <at> draigBrady.com>
To: Alan Curry <pacman-cu <at> kosh.dhis.org>
Cc: 10136 <at> debbugs.gnu.org, Harold Raulston <hr0201 <at> gmail.com>
Subject: Re: bug#10136: Can't view some strange characters in some of the
	man pages
Date: Sat, 26 Nov 2011 11:46:54 +0000
tags 10136 + notabug
close 10136

You didn't give the source of this mangled man page,
so I'm closing this as it's not related to coreutils.
More notes below.

On 11/25/2011 09:22 PM, Alan Curry wrote:
> Harold Raulston writes:
>>
>> Hi,
>>
>> Could you tell me what encoding I need to use to view your man pages?
>>
>> I've tried Unicode, Western, Western ISO, but still get some unreadable
>> characters in the EXAMPLES (I've just looked at the find and du commands so
>> far):
>>
>> =C3=A2=E2=82=AC=C3=A2=E2=82=AC=E2=84=A2 linuxcommand find1 can't display re=
>> ad
>>
>> BTW, I'm using Win7 Pro English, IE9. All latest updates.
>> I have the same problem in Chrome...
> 
> man pages are read with the man program. HTML is Not The Way.
> 
> [c3 a2 e2 82 ac c3 a2 e2 82 ac e2 84 a2] is what you get when you start with
> U+2019 RIGHT SINGLE QUOTATION MARK in UTF8, then misinterpret it as
> windows-1252 and convert it to UTF8 again.

This is a common manglement. Reversing the process is a little tricky:
http://www.pixelbeat.org/docs/unicode_utils/

> We were *so* unfortunate when we didn't have all these extra kinds of
> quotation marks.

Yes, they are a common source of confusion:
http://www2.cs.sfu.ca/~ggbaker/reference/characters/

cheers,
Pádraig.




Added tag(s) notabug. Request was from era eriksson <era <at> iki.fi> to control <at> debbugs.gnu.org. (Thu, 30 Aug 2012 08:27:03 GMT) Full text and rfc822 format available.

bug closed, send any further explanations to 10136 <at> debbugs.gnu.org and Harold Raulston <hr0201 <at> gmail.com> Request was from era eriksson <era <at> iki.fi> to control <at> debbugs.gnu.org. (Thu, 30 Aug 2012 08:27:04 GMT) Full text and rfc822 format available.

bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Thu, 27 Sep 2012 11:24:03 GMT) Full text and rfc822 format available.

This bug report was last modified 12 years and 269 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.