GNU bug report logs -
#13947
Bug report for coreutils command: od
Message #5 received at submit <at> debbugs.gnu.org:
Good afternoon,
My client was attempting to run the command od -c on this XML file (sample only):
------------------------------------------------------------------------------
<?xml version = '1.0' encoding = 'UTF-8'?>
<top>
<x>丸</x>
<y>丸</y>
<z>𠄌</z>
<x>?</x>
<x>?</x>
<x>?丸</x>
<x>??丸</x>
</top>
------------------------------------------------------------------------------
Note: this system is running 2.6.18-164.0.0.0.1.el5xen #1 SMP Thu Sep 3 00:34:43 EDT 2009 x86_64 x86_64 x86_64 GNU/Linux
He was getting this output:
------------------------------------------------------------------------------
0000000 < ? x m l v e r s i o n =
0000020 ' 1 . 0 ' e n c o d i n g =
0000040 ' U T F - 8 ' ? > \n < t o p >
0000060 \n < x > � � � < / x > \n
0000100 < y > � � � 201 < / y > \n
0000120 < z > � � 204 214 < / z > \n
0000140 < x > ? < / x > \n < x > ?
0000160 < / x > \n < x > ? � � � 201
0000200 < / x > \n < x > ? ? � � �
0000220 201 < / x > \n < / t o p > \n
------------------------------------------------------------------------------
Instead of this:
------------------------------------------------------------------------------
0000000 < ? x m l v e r s i o n =
0000020 ' 1 . 0 ' e n c o d i n g =
0000040 ' U T F - 8 ' ? > \n < t o p >
0000060 \n < x > 344 270 270 < / x > \n
0000100 < y > 360 257 240 201 < / y > \n
0000120 < z > 360 240 204 214 < / z > \n
0000140 < x > ? < / x > \n < x > ?
0000160 < / x > \n < x > ? 360 257 240 201
0000200 < / x > \n < x > ? ? 360 257 240
0000220 201 < / x > \n < / t o p > \n
0000235
------------------------------------------------------------------------------
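The octal numbers in the expected output are simply the UTF-8 bytes of each character, one byte per column. A minimal sketch of that mapping follows, assuming od -c shows printable ASCII literally, common control characters as backslash escapes, and every other byte as three octal digits. The helper name od_c_tokens is hypothetical and not part of coreutils.

```python
# Hypothetical helper, not part of coreutils: renders bytes the way the
# expected output above does (printable ASCII literally, the rest in octal).
ESCAPES = {0: "\\0", 7: "\\a", 8: "\\b", 9: "\\t",
           10: "\\n", 11: "\\v", 12: "\\f", 13: "\\r"}

def od_c_tokens(data: bytes) -> list:
    tokens = []
    for b in data:
        if b in ESCAPES:
            tokens.append(ESCAPES[b])      # control character with a C escape
        elif 0x20 <= b <= 0x7E:
            tokens.append(chr(b))          # printable ASCII
        else:
            tokens.append(f"{b:03o}")      # any other byte: three octal digits
    return tokens

# U+4E38 encodes to three UTF-8 bytes, U+2F801 and U+2010C to four each.
print(" ".join(od_c_tokens("<x>\u4e38</x>\n".encode("utf-8"))))
print(" ".join(od_c_tokens("\U0002F801".encode("utf-8"))))
print(" ".join(od_c_tokens("\U0002010C".encode("utf-8"))))
```

The three printed sequences correspond to the 344 270 270, 360 257 240 201 and 360 240 204 214 runs in the listing above.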
This all depends on the LANG environment variable. He was using:
LANG=en_US.iso88591, instead of
LANG=en_US.UTF-8
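One way to read the two outputs is that od -c decides byte by byte whether a value is printable in the current locale. The sketch below works under that assumption and models only two cases: an ISO-8859-1 locale, where 0xA0 to 0xFF are printable and 0x80 to 0x9F are control codes, and a UTF-8 locale, where no single byte above 0x7F is printable on its own. The function names are hypothetical, and RAW marks a byte that would be written to the terminal unchanged.

```python
def latin1_printable(b: int) -> bool:
    # Assumption: models isprint() under an ISO-8859-1 locale.
    return 0x20 <= b <= 0x7E or 0xA0 <= b <= 0xFF

def utf8_single_byte_printable(b: int) -> bool:
    # Assumption: in a UTF-8 locale only ASCII is printable as a single byte.
    return 0x20 <= b <= 0x7E

def render(data: bytes, printable) -> str:
    out = []
    for b in data:
        if not printable(b):
            out.append(f"{b:03o}")   # not printable: three octal digits
        elif b > 0x7F:
            out.append("RAW")        # printable high byte: emitted as-is
        else:
            out.append(chr(b))       # printable ASCII
    return " ".join(out)

sample = b"\xf0\xaf\xa0\x81"         # the four bytes inside <y>...</y>
print(render(sample, latin1_printable))
print(render(sample, utf8_single_byte_printable))
```

Under these assumptions the first call yields three raw bytes followed by 201, which matches the garbled <y> line, and the second yields 360 257 240 201, which matches the numeric one.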
------------------------------------------------------------------------------
Question:
Since the output of od -c is based on the ASCII character set, should it not give numerical (octal) output in both cases, as it did in scenario #2,
for any symbol outside the ASCII/extended-ASCII character set?
------------------------------------------------------------------------------
Regards,
Marc Grondin,
__________________________________
Oracle - Quebec city, Qc.
Senior System Administrator, PDIT
---------------------------------
400-330 St-Vallier, G1K 9C5
418.524.5665 # 1256
=================================
This bug report was last modified 12 years and 59 days ago.
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.