From unknown Tue Jun 17 01:48:47 2025
Subject: bug#40018: How to diff PDF files?
From: Peng Yu
Date: Tue, 10 Mar 2020 16:15:52 -0500
To: 40018@debbugs.gnu.org
X-Debbugs-Original-To: bug-diffutils
X-GNU-PR-Message: report 40018
X-GNU-PR-Package: diffutils
Content-Type: text/plain; charset="UTF-8"

Hi,

I'd like to diff PDF files.

$ seq 10 | gs -q -P- -dSAFER -dNOPAUSE -dBATCH -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -sstdout=%stderr -sOutputFile=1.pdf -f -
$ pdftk 1.pdf output 2.pdf uncompress
$ diff {1,2}.pdf
Binary files 1.pdf and 2.pdf differ

Since PDF files contain both text and binary data, just saying that
they are binary files and refusing to show the difference is not very
helpful. Is there a way for me to easily see the differences between
PDF files? Thanks.

-- 
Regards,
Peng

From unknown Tue Jun 17 01:48:47 2025
Subject: bug#40018: [bug-diffutils] bug#40018: How to diff PDF files?
From: Eric Deplagne
Date: Tue, 10 Mar 2020 22:31:19 +0100
To: Peng Yu
Cc: 40018@debbugs.gnu.org
Message-ID: <20200310213119.GW7606@mail.eric.deplagne.name>
X-GNU-PR-Message: followup 40018
X-GNU-PR-Package: diffutils
User-Agent: Mutt/1.5.18 (2008-05-17)

Hi,

$ diff -a {1,2}.pdf

That should show something; whether the output is readable is another story...

Regards.

On Tue, 10 Mar 2020 16:15:52 -0500, Peng Yu wrote:
> Hi,
>
> I'd like to diff PDF files.
>
> $ seq 10 | gs -q -P- -dSAFER -dNOPAUSE -dBATCH -sDEVICE=pdfwrite
> -dCompatibilityLevel=1.4 -sstdout=%stderr -sOutputFile=1.pdf -f -
> $ pdftk 1.pdf output 2.pdf uncompress
> $ diff {1,2}.pdf
> Binary files 1.pdf and 2.pdf differ
>
> Since PDF files contain both text and binary data, just saying that
> they are binary files and refusing to show the difference is not very
> helpful. Is there a way for me to easily see the differences between
> PDF files? Thanks.
>
> -- 
> Regards,
> Peng

-- 
Eric Deplagne
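
For anyone who wants to try the -a suggestion without gs or pdftk installed, here is a minimal sketch. The files a.bin and b.bin and the temporary directory are hypothetical stand-ins, not the 1.pdf and 2.pdf from the report; each contains a NUL byte, so diff classifies it as binary. With real PDFs the -a output may well be far noisier than this.

```shell
# Hypothetical stand-in files: the NUL byte makes diff treat them as binary.
tmp=$(mktemp -d)
printf 'header\0\nline one\nline two\n' > "$tmp/a.bin"
printf 'header\0\nline one\nline 2\n'   > "$tmp/b.bin"

# Default behaviour: diff prints only a one-line "Binary files ... differ" summary.
# diff exits with status 1 when files differ, hence the "|| true".
default_out=$(diff "$tmp/a.bin" "$tmp/b.bin") || true
echo "$default_out"

# With -a (--text), diff compares the files line by line as if they were text.
text_out=$(diff -a "$tmp/a.bin" "$tmp/b.bin") || true
echo "$text_out"

rm -rf "$tmp"
```

The second command reports only the changed line, because the line holding the NUL byte is identical in both files.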