GNU bug report logs - #17804
RFC: fixing the 32-bit size and time limits in gzip file format

Package: gzip;

Reported by: Petr Stodůlka <pstodulk <at> redhat.com>

Date: Wed, 18 Jun 2014 16:54:01 UTC

Severity: normal

Merged with 29089, 30935, 30936, 38766, 42965, 48424, 52227

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.


From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Paul Eggert <eggert <at> cs.ucla.edu>
Cc: tracker <at> debbugs.gnu.org
Subject: bug#30936: closed (gzip -l reports wrong size for decompressed
 files larger than 4GB)
Date: Thu, 16 Dec 2021 02:34:02 +0000
[Message part 1 (text/plain, inline)]
Your message dated Wed, 15 Dec 2021 18:33:12 -0800
with message-id <b546014b-f19b-8025-cd7b-2266676b41d9 <at> cs.ucla.edu>
and subject line Re: bug#17804: RFC: fixing the 32-bit size and time limits in gzip file format
has caused the debbugs.gnu.org bug report #17804,
regarding gzip -l reports wrong size for decompressed files larger than 4GB
to be marked as done.

(If you believe you have received this mail in error, please contact
help-debbugs <at> gnu.org.)


-- 
17804: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=17804
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: Wolfgang Formann <wformann <at> arcor.de>
To: bug-gzip <at> gnu.org
Subject: gzip -l reports wrong size for decompressed files larger than 4GB
Date: Sun, 25 Mar 2018 11:15:46 +0200
Hello!

I am using gzip 1.6 from openSUSE Leap 42.3 with the latest patches:

$ file /usr/bin/gzip
/usr/bin/gzip: ELF 64-bit LSB executable, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, for GNU/Linux 3.0.0, BuildID[sha1]=7103d56e17e6f81a52db927e393dce601c3af0e1, stripped

A compressed file is available at https://data.dnb.de/opendata/GND.rdf.gz. It is 1,232,465,678 bytes compressed and 19,465,374,298 bytes uncompressed.

The problem is:
$ gzip -l GND.rdf.gz
         compressed        uncompressed  ratio uncompressed_name
         1232465678          2285505114  46.1% GND.rdf

The reported number, 2285505114, is just the low 32 bits of the real size (about 19 GB):
$ echo "19465374298-16*1024*1024*1024" | bc
2285505114
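
The wraparound above matches how the gzip format records the size: the ISIZE field in the last four bytes of a gzip member holds the uncompressed length modulo 2^32 (RFC 1952). The following is a minimal Python sketch of that arithmetic, not code from gzip itself; the helper name isize_from_trailer is hypothetical and the small payload is only there for a self-check.

```python
import gzip
import struct


def isize_from_trailer(data: bytes) -> int:
    """Return ISIZE: the last 4 bytes of a gzip member, little-endian.

    Hypothetical helper for illustration; assumes `data` is a single,
    complete gzip member held in memory.
    """
    return struct.unpack("<I", data[-4:])[0]


# The arithmetic from the report: the true size reduced modulo 2**32.
true_size = 19_465_374_298
wrapped = true_size % 2**32  # equivalent to subtracting 4 * 2**32 (16 GiB)

# Self-check on real gzip data: below 4 GiB, ISIZE equals the input length.
payload = b"x" * 100_000
small_isize = isize_from_trailer(gzip.compress(payload))

print(wrapped, small_isize)
```

Because the field is only 32 bits wide, any tool that reads the size from the trailer alone will see the wrapped value, whatever the word size of the binary.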

Such behaviour is acceptable for 32-bit software, but a 64-bit build should show the correct number.

Thanks
Wolfgang



[Message part 3 (message/rfc822, inline)]
From: Paul Eggert <eggert <at> cs.ucla.edu>
To: Petr Stodůlka <pstodulk <at> redhat.com>
Cc: 17804-done <at> debbugs.gnu.org
Subject: Re: bug#17804: RFC: fixing the 32-bit size and time limits in gzip
 file format
Date: Wed, 15 Dec 2021 18:33:12 -0800
On 6/18/14 15:12, Paul Eggert wrote:
> One simple way forward would be to implement what pigz -tl does, namely, 
> decompress the input stream and discard the output, but print its size.

I finally got around to implementing that suggestion:

https://git.savannah.gnu.org/cgit/gzip.git/commit/?id=cf26200380585019e927fe3cf5c0ecb7c8b3ef14
https://git.savannah.gnu.org/cgit/gzip.git/commit/?id=32fef43b442c7abc70414863d64718cd06f6477a

So I am closing this old bug report.
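
The approach described above (decompress the stream, discard the output, report the byte count) can be sketched in a few lines. This is a Python illustration under stated assumptions, not the C code in the linked commits; the function name decompressed_size and the chunk size are hypothetical.

```python
import gzip
import io


def decompressed_size(fileobj, chunk_size=1 << 16):
    """Count decompressed bytes by streaming, without keeping the output.

    Hypothetical helper: reads fixed-size chunks so memory use stays
    bounded, and does not rely on the 32-bit size field in the trailer.
    """
    total = 0
    with gzip.GzipFile(fileobj=fileobj, mode="rb") as gz:
        while True:
            chunk = gz.read(chunk_size)
            if not chunk:
                break
            total += len(chunk)  # discard the data, keep only the count
    return total


# Usage on an in-memory stream; a real file would be opened with open(path, "rb").
compressed = gzip.compress(b"a" * 1_000_000)
print(decompressed_size(io.BytesIO(compressed)))
```

The trade-off is cost: the whole stream has to be decompressed, so the listing takes time proportional to the data size instead of a single seek to the trailer.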

