From debbugs-submit-bounces@debbugs.gnu.org Fri Nov 06 11:42:13 2015
From: Mike Frysinger
To: bug-coreutils@gnu.org
Subject: `sort` behavior unstable based on neighboring elements ?
Date: Fri, 6 Nov 2015 11:41:41 -0500
Message-ID: <20151106164141.GD5154@vapier.lan>

i got this bug report today about sort mismatches. the order of the
inputs changes the order of the outputs which surprised me. but it
might be a nuance of unicode collation i'm not familiar with ?

$ printf '%s\n' aarch64 abc zed | LC_ALL=nb_NO.UTF-8 sort -u
aarch64
abc
zed
$ printf '%s\n' abc aarch64 zed | LC_ALL=nb_NO.UTF-8 sort -u
abc
zed
aarch64

why aren't the outputs here the same ? a nordic user pointed out
that aa is an alternative for å which comes after z, which is fine,
but that doesn't explain why the output isn't the same here.
-mike

From debbugs-submit-bounces@debbugs.gnu.org Fri Nov 06 11:53:32 2015
From: Pádraig Brady
To: bug-coreutils@gnu.org, Mike Frysinger
Subject: Re: bug#21844: `sort` behavior unstable based on neighboring elements ?
Date: Fri, 6 Nov 2015 16:53:02 +0000
Message-ID: <563CDAEE.6060403@draigBrady.com>
In-Reply-To: <20151106164141.GD5154@vapier.lan>

tag 21844 notabug
close 21844
stop

On 06/11/15 16:41, Mike Frysinger wrote:
> i got this bug report today about sort mismatches. the order of the
> inputs changes the order of the outputs which surprised me. but it
> might be a nuance of unicode collation i'm not familiar with ?
>
> $ printf '%s\n' aarch64 abc zed | LC_ALL=nb_NO.UTF-8 sort -u
> aarch64
> abc
> zed
> $ printf '%s\n' abc aarch64 zed | LC_ALL=nb_NO.UTF-8 sort -u
> abc
> zed
> aarch64
>
> why aren't the outputs here the same ? a nordic user pointed out
> that aa is an alternative for å which comes after z, which is fine,
> but that doesn't explain why the output isn't the same here.
> -mike

strcoll is giving the wrong results:

$ printf '%s\n' abc aarch64 zed | LC_ALL=nb_NO.UTF-8 ltrace -e strcoll sort >/dev/null
sort->strcoll("aarch64", "zed") = 3
sort->strcoll("abc", "zed") = -25

$ printf '%s\n' aarch64 abc zed | LC_ALL=nb_NO.UTF-8 ltrace -e strcoll sort >/dev/null
sort->strcoll("abc", "zed") = -25
sort->strcoll("aarch64", "abc") = -1

I think this is due to:
https://sourceware.org/bugzilla/show_bug.cgi?id=18589
Fixed by:
https://sourceware.org/git/?p=glibc.git;a=commit;h=87701a58

cheers,
Pádraig.
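
Editorial note: taken together, the three strcoll results in the ltrace
output above are not transitive. "aarch64" sorts before "abc", "abc" sorts
before "zed", yet "aarch64" sorts after "zed". A comparison function like
that cannot give a single well-defined order, which would explain why the
output depends on the input order. The sketch below replays the recorded
values to make the cycle explicit. It does not call the real strcoll; the
names RECORDED, recorded_cmp and intransitive_triples are hypothetical, and
the reverse direction of each pair is assumed to be the negation of the
recorded value, since ltrace did not capture it.

```python
from itertools import permutations

# strcoll results copied from the ltrace output in the reply above.
# Only the sign matters for ordering.
RECORDED = {
    ("aarch64", "zed"): 3,
    ("abc", "zed"): -25,
    ("aarch64", "abc"): -1,
}


def recorded_cmp(a, b):
    """Return the recorded comparison result for the pair (a, b)."""
    if a == b:
        return 0
    if (a, b) in RECORDED:
        return RECORDED[(a, b)]
    # Assumption: cmp(b, a) == -cmp(a, b); ltrace did not record this side.
    return -RECORDED[(b, a)]


def intransitive_triples(items, cmp):
    """Return every (x, y, z) where x < y and y < z but x is not < z."""
    return [
        (x, y, z)
        for x, y, z in permutations(items, 3)
        if cmp(x, y) < 0 and cmp(y, z) < 0 and not cmp(x, z) < 0
    ]


violations = intransitive_triples(["aarch64", "abc", "zed"], recorded_cmp)
for x, y, z in violations:
    print(f"{x} < {y} and {y} < {z}, but {x} is not < {z}")
```

To probe a live system instead of the recorded values, the same check could
be given Python's locale.strcoll after calling
locale.setlocale(locale.LC_COLLATE, "nb_NO.UTF-8"), assuming that locale is
installed. On a glibc that includes the fix referenced above, the list of
violations would be expected to be empty.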
From debbugs-submit-bounces@debbugs.gnu.org Wed Oct 24 17:16:46 2018
From: Assaf Gordon
To: control@debbugs.gnu.org
Date: Wed, 24 Oct 2018 15:16:36 -0600
Message-ID: <104428d5-6d3b-fa6c-b5f2-cfa6802f3916@gmail.com>

tags 21844 notabug
close 21844

From unknown Tue Aug 19 10:03:52 2025
From: Debbugs Internal Request
To: internal_control@debbugs.gnu.org
Subject: Internal Control
Message-Id: bug archived.
Date: Thu, 22 Nov 2018 12:24:08 +0000

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator