GNU bug report logs - #12659
the join command bug report!


Package: coreutils

Reported by: "Michael" <chinalinux <at> hotmail.com>

Date: Tue, 16 Oct 2012 15:38:02 UTC

Severity: normal

Tags: notabug

Done: Bob Proulx <bob <at> proulx.com>

Bug is archived. No further changes may be made.


Report forwarded to bug-coreutils <at> gnu.org:
bug#12659; Package coreutils. (Tue, 16 Oct 2012 15:38:02 GMT)

Acknowledgement sent to "Michael" <chinalinux <at> hotmail.com>:
New bug report received and forwarded. Copy sent to bug-coreutils <at> gnu.org. (Tue, 16 Oct 2012 15:38:02 GMT)

Message #5 received at submit <at> debbugs.gnu.org:

From: "Michael" <chinalinux <at> hotmail.com>
To: <bug-coreutils <at> gnu.org>
Subject: the join command bug report!
Date: Tue, 16 Oct 2012 16:29:14 +0800
Hi,

I have two sorted files, each with millions of lines, that I want to join. I am sure that at least one third of the keys are common to both files, yet not a single key was joined. This does not happen with small files.

Michael Wu

Message #8 received at 12659 <at> debbugs.gnu.org:

From: Paul Eggert <eggert <at> cs.ucla.edu>
To: Michael <chinalinux <at> hotmail.com>
Cc: 12659 <at> debbugs.gnu.org
Subject: Re: bug#12659: the join command bug report!
Date: Tue, 16 Oct 2012 09:10:40 -0700
Sounds like a locale problem.  What does the "locale"
command say?  How exactly are you invoking 'sort' and
'join'?  What do the input and output lines look like?





Message #11 received at 12659 <at> debbugs.gnu.org:

From: "Michael" <chinalinux <at> hotmail.com>
To: "Paul Eggert" <eggert <at> cs.ucla.edu>
Cc: 12659 <at> debbugs.gnu.org
Subject: Re: bug#12659: the join command bug report!
Date: Wed, 17 Oct 2012 15:19:26 +0800
en_US.UTF-8

# sort -n file1 > file3
# sort -n file2 > file4

# join file3 file4 | wc -l
19
# sort file3 file4 | uniq -d | wc -l
4698
#

Both of the files being joined contain only numbers. I have realized that
join does not currently support the numeric sort order. If I sort without
the '-n' option, the result of the join is correct.

Michael






Message #14 received at 12659 <at> debbugs.gnu.org:

From: Paul Eggert <eggert <at> cs.ucla.edu>
To: Michael <chinalinux <at> hotmail.com>
Cc: 12659 <at> debbugs.gnu.org
Subject: Re: bug#12659: the join command bug report!
Date: Wed, 17 Oct 2012 08:14:47 -0700
On 10/17/2012 12:19 AM, Michael wrote:
> # sort -n file1 > file3
> # sort -n file2 > file4
> 
> # join file3 file4

That won't work.  You have to join with the same
sorting order that you sorted with.  This is discussed
in the manual.
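
A minimal sketch of the mismatch described above: join compares keys as strings in the current collation order, so input sorted with 'sort -n' looks out of order to join whenever numeric and lexical order disagree. The names file1 to file4 mirror the thread, but the sample keys and the extra files file5 and file6 are made up for illustration, and LC_ALL=C is an assumption chosen to make the comparison byte-wise and reproducible.

```shell
# Scratch directory so that no existing files are touched.
tmp=$(mktemp -d)
cd "$tmp" || exit 1

# Hypothetical sample keys; 10 and 100 appear in both files.
printf '%s\n' 2 10 100 > file1
printf '%s\n' 9 10 100 > file2

# Numeric sort, as in the report: file3 = 2 10 100, file4 = 9 10 100.
sort -n file1 > file3
sort -n file2 > file4

# join compares keys as strings, so it reads past 10 and 100 in file3
# while looking for a key that is not less than "9". Some join
# implementations also report the disorder and exit nonzero, hence
# the "|| true".
bad=$( { LC_ALL=C join file3 file4 2>/dev/null || true; } | wc -l | tr -d ' ')

# Sort and join with the same (byte-wise) collation instead.
LC_ALL=C sort file1 > file5
LC_ALL=C sort file2 > file6
good=$(LC_ALL=C join file5 file6 | wc -l | tr -d ' ')

echo "numeric sort then join: $bad lines"
echo "lexical sort then join: $good lines"
```

With this sample data the numerically sorted pair should join no lines at all, while the lexically sorted pair should join both common keys.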




Added tag(s) notabug. Request was from Bob Proulx <bob <at> proulx.com> to control <at> debbugs.gnu.org. (Wed, 17 Oct 2012 18:16:01 GMT)

Bug closed; send any further explanations to 12659 <at> debbugs.gnu.org and "Michael" <chinalinux <at> hotmail.com>. Request was from Bob Proulx <bob <at> proulx.com> to control <at> debbugs.gnu.org. (Wed, 17 Oct 2012 18:16:01 GMT)


Message #21 received at 12659 <at> debbugs.gnu.org:

From: Bob Proulx <bob <at> proulx.com>
To: Michael <chinalinux <at> hotmail.com>, 12659 <at> debbugs.gnu.org
Subject: Re: bug#12659: the join command bug report!
Date: Wed, 17 Oct 2012 12:15:06 -0600
Paul Eggert wrote:
> On 10/17/2012 12:19 AM, Michael wrote:
> > # sort -n file1 > file3
> > # sort -n file2 > file4
> > 
> > # join file3 file4
> 
> That won't work.  You have to join with the same
> sorting order that you sorted with.  This is discussed
> in the manual.

Since this seems to have been resolved satisfactorily I have closed
the bug report.  If you have any further information please feel free
to respond as I have done here and it will be delivered to all of the
interested parties.

Bob





Message #24 received at 12659 <at> debbugs.gnu.org:

From: "Michael" <chinalinux <at> hotmail.com>
To: "Paul Eggert" <eggert <at> cs.ucla.edu>
Cc: 12659 <at> debbugs.gnu.org
Subject: Re: bug#12659: the join command bug report!
Date: Thu, 18 Oct 2012 09:43:27 +0800
" Important: FILE1 and FILE2 must be sorted on the join fields."

The sentence above is all the manual says. It should at least mention the
sort method. I strongly suggest improving the join manual in a later
release.

Thanks

Michael
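
One way to catch this kind of mismatch before joining is to ask sort itself whether a file is already in the order join will assume. The sketch below uses 'sort -c', which only checks the order and does not sort. The sample data, the file names, and the choice of LC_ALL=C are assumptions for illustration.

```shell
# Scratch directory with two hypothetical single-column files.
tmp=$(mktemp -d)
printf '%s\n' 2 10 100 > "$tmp/numeric"   # numeric order
printf '%s\n' 10 100 2 > "$tmp/lexical"   # byte-wise lexical order

# sort -c exits 0 if the file is already sorted under the given
# collation, and nonzero (with a message on stderr) otherwise.
if LC_ALL=C sort -c "$tmp/numeric" 2>/dev/null; then
  numeric_ok=yes
else
  numeric_ok=no
fi

if LC_ALL=C sort -c "$tmp/lexical" 2>/dev/null; then
  lexical_ok=yes
else
  lexical_ok=no
fi

echo "numeric file usable by join as is: $numeric_ok"
echo "lexical file usable by join as is: $lexical_ok"
```

GNU join also offers a --check-order option that turns unsorted input into a fatal error, which may be preferable when the check should happen during the join itself.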






Bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Thu, 15 Nov 2012 12:24:03 GMT)
