GNU bug report logs - #6903
join: support numeric keys

Previous Next

Package: coreutils;

Reported by: Bernhard Schiffner <bernhard <at> schiffner-limbach.de>

Date: Tue, 24 Aug 2010 19:57:01 UTC

Severity: wishlist

Full log


Message #17 received at 6903 <at> debbugs.gnu.org (full text, mbox):

From: Bernhard Schiffner <bernhard <at> schiffner-limbach.de>
To: Paul Eggert <eggert <at> cs.ucla.edu>
Cc: 6903 <at> debbugs.gnu.org
Subject: Re: bug#6903: join: improve paralleles to sort?
Date: Thu, 26 Aug 2010 21:08:29 +0200
[Message part 1 (text/plain, inline)]
Am Mittwoch, 25. August 2010, 18:22:13 schrieb Paul Eggert:
> On 08/24/2010 11:57 PM, Bernhard Schiffner wrote:
> > 2146427	/LBAtoJM/ROOT/WEB-INF/lib/hibernate-3.2.0.cr3.jar
> > 214618118	/temp/marketing_ms/emails.dat
> 
> That won't work, because the two lines are not sorted correctly.
> Recall that join uses lexicographic comparison, not numeric.
> Its input must be sorted lexicographically.

Ok.
I solved my problem using the attached patch.

The patch shows that it is possible to use different sortings for keys 
(joinfield) in join.

I integrated some / most of the code from sort.c verbaly  in order to see 
what's needed to compile it successfully in join.c .
I did no tests beside my special usecase mentioned earlier.

It's clear that a user-friendly key-selection needs a lot more work. Same is 
about a unified version of join and sort.

Thanks to Paul and Christian Perle for their valueable help so far.

The FSF can make any use of the code here. 
It was theirs already before  ;-)


Bernhard


[join_proposal_2.diff (text/x-patch, attachment)]

This bug report was last modified 6 years and 320 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.