GNU bug report logs - #13947
bug report for core-utils command : OD

Previous Next

Package: coreutils;

Reported by: Marc Grondin <marc.grondin <at> oracle.com>

Date: Wed, 13 Mar 2013 20:25:02 UTC

Severity: normal

Done: Pádraig Brady <P <at> draigBrady.com>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: Marc Grondin <marc.grondin <at> oracle.com>
To: 13947 <at> debbugs.gnu.org
Cc: Mark.Jaeger <at> oracle.com
Subject: bug#13947: bug report for core-utils command :  OD
Date: Wed, 13 Mar 2013 13:16:16 -0700 (PDT)
Good Afternoon, 

My client was attempting to run the command : od -c on this xml file (sample only) 
------------------------------------------------------------------------------
<?xml version = '1.0' encoding = 'UTF-8'?>
<top>
   <x>丸</x>
   <y>丸</y>
   <z>𠄌</z>
   <x>?</x>
   <x>?</x>
   <x>?丸</x>
   <x>??丸</x>
</top>
------------------------------------------------------------------------------

note : this system is a : 2.6.18-164.0.0.0.1.el5xen #1 SMP Thu Sep 3 00:34:43 EDT 2009 x86_64 x86_64 x86_64 GNU/Linux

He was getting this output : 
------------------------------------------------------------------------------
0000000   <   ?   x   m   l       v   e   r   s   i   o   n       =    
0000020   '   1   .   0   '       e   n   c   o   d   i   n   g       =
0000040       '   U   T   F   -   8   '   ?   >  \n   <   t   o   p   >
0000060  \n               <   x   >   �   �   �   <   /   x   >  \n    
0000100           <   y   >   �   �   � 201   <   /   y   >  \n        
0000120       <   z   >   �   � 204 214   <   /   z   >  \n            
0000140   <   x   >   ?   <   /   x   >  \n               <   x   >   ?
0000160   <   /   x   >  \n               <   x   >   ?   �   �   � 201
0000200   <   /   x   >  \n               <   x   >   ?   ?   �   �   �
0000220 201   <   /   x   >  \n   <   /   t   o   p   >  \n
------------------------------------------------------------------------------

Instead of this : 
------------------------------------------------------------------------------
000000   <   ?   x   m   l       v   e   r   s   i   o   n       =    
0000020   '   1   .   0   '       e   n   c   o   d   i   n   g       =
0000040       '   U   T   F   -   8   '   ?   >  \n   <   t   o   p   >
0000060  \n               <   x   > 344 270 270   <   /   x   >  \n    
0000100           <   y   > 360 257 240 201   <   /   y   >  \n        
0000120       <   z   > 360 240 204 214   <   /   z   >  \n            
0000140   <   x   >   ?   <   /   x   >  \n               <   x   >   ?
0000160   <   /   x   >  \n               <   x   >   ? 360 257 240 201
0000200   <   /   x   >  \n               <   x   >   ?   ? 360 257 240
0000220 201   <   /   x   >  \n   <   /   t   o   p   >  \n
0000235
------------------------------------------------------------------------------

This all based on the LANG env.  He was using : 
LANG=en_US.iso88591, instead of
LANG=en_US.UTF-8 

------------------------------------------------------------------------------

Question : 
Since the output is based on the ASCII character set, should it not, in both cases give a numerical output (as it did in scenario #2) 
for a symbol outside the ascii/extended-ascii character set ? 
------------------------------------------------------------------------------


Regards, 

Marc Grondin, 

__________________________________
Oracle - Quebec city, Qc.
Senior System Administrator, PDIT
---------------------------------
400-330 St-Vallier, G1K 9C5
418.524.5665 # 1256
=================================




This bug report was last modified 12 years and 59 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.