From debbugs-submit-bounces@debbugs.gnu.org Fri Mar 01 10:34:22 2024 Received: (at submit) by debbugs.gnu.org; 1 Mar 2024 15:34:22 +0000 Received: from localhost ([127.0.0.1]:37296 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1rg4ty-0007gJ-Ht for submit@debbugs.gnu.org; Fri, 01 Mar 2024 10:34:22 -0500 Received: from lists.gnu.org ([209.51.188.17]:47652) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1rg4ts-0007g5-2F for submit@debbugs.gnu.org; Fri, 01 Mar 2024 10:34:21 -0500 Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1rg4tP-0001Qj-8s for bug-coreutils@gnu.org; Fri, 01 Mar 2024 10:33:47 -0500 Received: from mail-ed1-x532.google.com ([2a00:1450:4864:20::532]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1rg4tN-0002gN-Pe for bug-coreutils@gnu.org; Fri, 01 Mar 2024 10:33:47 -0500 Received: by mail-ed1-x532.google.com with SMTP id 4fb4d7f45d1cf-5654f700705so3459842a12.1 for ; Fri, 01 Mar 2024 07:33:44 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1709307221; x=1709912021; darn=gnu.org; h=to:subject:message-id:date:from:mime-version:from:to:cc:subject :date:message-id:reply-to; bh=2lb42PEpfIwXBLoXciQ7k/nPT6DitTKFnP0AXBHcSHs=; b=L+dQn/FClYO2AI1+ZWpYL9Gx3N9PxPgJI1QaN8pS6fpKXktJ/XZP4GLeGytHsKaC3f /XBnTE3IHCHmCt6sliIl9i04swe7/pn0mFMcTFYX3HlI4yHZlOY1zNMkcFuyUaLY4DRO i5ScAGkD3jfq6/f3sTrqw19435+UewzE184w5YpXKLCWc4LKhiszgIKb9HhfQI0fZzwt q7b7rhrrI+IA1ZqOlBXbExiEKS7feGv+R0FLCt7Q1Ggb/kChzWSTwHEVwJ7g6GDtcyHm pmL71tTnvSQSUr5DO5hPBz7DVywLmp7RMGYrQI0e8y+NAaC12OTPKq8epNO1JS5IXTq1 21Cw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1709307221; x=1709912021; h=to:subject:message-id:date:from:mime-version:x-gm-message-state :from:to:cc:subject:date:message-id:reply-to; bh=2lb42PEpfIwXBLoXciQ7k/nPT6DitTKFnP0AXBHcSHs=; b=OyADEt9IbiZAiGLYZDbkFDQpG7ZWfC8aob/40qygv16pvrw61jtrpcaZ20znZihHnT V1zDThwEmKQbgKjEErzq5h5nDzA+yLDGHxO6U59K5HkxLGpZTtkKy1PA4qen/3MeN8bx d7SfdTH9CxydN/gR+OiOPhIEMkCVprNpDy8tbJ4BPeX2AlVUFKjfoa9E/hnAF3VvGmyU Y12ovihCpSI3hAww0XcCsn0qlFqBYLfFaDvl2e7b1Ilt69pKr2Cr/d6sJ5/lnKmxwLQ4 Do0orsBHQFcjkOYlgshdMY1FIHdJKeB+80FVku6YuzyYrrzL0MtzsRdAadvhfGQk3ph4 8GwQ== X-Gm-Message-State: AOJu0Yzes6oMOXcn1ey/96qKQIy7c+7Hs42K6q0w4MwlNzBE3XV3hjio eJPK4Qy16EdecZ8R+xGltd6EqrJCxTV00rMFZWjrzAwoioXxklYhe67yksIoiP+bq9wBRH0QA4T 7G1BRCxxp8Uj4hzFkFqZKzCuvrGZ0aEfUKnznHg== X-Google-Smtp-Source: AGHT+IEFIbVpWE0rxtRtxmZuo0BEw0BTCv2LtjC+NwxQvvLutohGKC7vMKcui/9ybFqK6Ck4KB8pnLMBbWW8gBHrdjQ= X-Received: by 2002:a17:906:7809:b0:a44:19df:63dd with SMTP id u9-20020a170906780900b00a4419df63ddmr1551865ejm.8.1709307221237; Fri, 01 Mar 2024 07:33:41 -0800 (PST) MIME-Version: 1.0 From: lacsaP Patatetom Date: Fri, 1 Mar 2024 16:33:29 +0100 Message-ID: Subject: tr (question) To: bug-coreutils@gnu.org Content-Type: multipart/alternative; boundary="000000000000bbf0ed06129b1960" Received-SPF: pass client-ip=2a00:1450:4864:20::532; envelope-from=patatetom@gmail.com; helo=mail-ed1-x532.google.com X-Spam_score_int: 20 X-Spam_score: 2.0 X-Spam_bar: ++ X-Spam_report: (2.0 / 5.0 requ) BAYES_20=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, MIXED_ES=2.199, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=no autolearn_force=no X-Spam_action: no action X-Spam-Score: 0.9 (/) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.1 (/) --000000000000bbf0ed06129b1960 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable hi, I did a few tests with tr and I'm surprised by the results... $ echo =C3=A9=C3=A8=C3=A7=C3=A0 =C3=A9=C3=A8=C3=A7=C3=A0 these characters are encoded in utf-8 on 2 bytes : $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | xxd 00000000: c3a9 c3a8 c3a7 c3a0 0a ......... now I use tr to remove non-printable characters : $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]' $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]' | wc 0 0 0 all characters are deleted by tr now I want to keep the "=C3=A9" character : $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]=C3=A9' =C3=A9=EF=BF=BD=EF=BF=BD=EF=BF=BD why do the "=EF=BF=BD" characters appear ? regards, lacsaP. --000000000000bbf0ed06129b1960 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
hi,

I did a few tests with t= r and I'm surprised by the results...

$ echo =C3=A9=C3=A8= =C3=A7=C3=A0
=C3=A9=C3=A8=C3=A7=C3=A0

these cha= racters are encoded in utf-8 on 2 bytes :

$ echo = =C3=A9=C3=A8=C3=A7=C3=A0 | xxd
00000000: c3a9 c3a8 c3a7 c3a0 0a =C2=A0 = =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 .........

now I use tr to remove non-printable characters :

$ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]'= ;
$ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]' | = wc
=C2=A0 =C2=A0 =C2=A0 0 =C2=A0 =C2=A0 =C2=A0 0 =C2=A0 =C2=A0 =C2=A0 0<= br>

all characters are deleted by tr
now= I want to keep the "=C3=A9" character :

$ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]=C3=A9'
=C3= =A9=EF=BF=BD=EF=BF=BD=EF=BF=BD

why do the "= =EF=BF=BD" characters appear ?

regards, lacsa= P.
--000000000000bbf0ed06129b1960-- From debbugs-submit-bounces@debbugs.gnu.org Fri Mar 01 14:32:12 2024 Received: (at 69488) by debbugs.gnu.org; 1 Mar 2024 19:32:12 +0000 Received: from localhost ([127.0.0.1]:37496 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1rg8c8-0005gP-D7 for submit@debbugs.gnu.org; Fri, 01 Mar 2024 14:32:12 -0500 Received: from mail-wm1-f53.google.com ([209.85.128.53]:44254) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1rg8c6-0005g9-61 for 69488@debbugs.gnu.org; Fri, 01 Mar 2024 14:32:11 -0500 Received: by mail-wm1-f53.google.com with SMTP id 5b1f17b1804b1-412c3f4c6b9so10518765e9.0 for <69488@debbugs.gnu.org>; Fri, 01 Mar 2024 11:31:42 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1709321436; x=1709926236; darn=debbugs.gnu.org; h=content-transfer-encoding:in-reply-to:from:references:to :content-language:subject:user-agent:mime-version:date:message-id :sender:from:to:cc:subject:date:message-id:reply-to; bh=BKSEMMOrYBg4xAqLMRNHctuqzr5UPSHVL6DNN6RHQkk=; b=OogssWMX6mytppDe2rqRy+tdgbNBIbL02x6ks/dKf8rCTpFXy+TGlXg6GULRFzFecx KMdf7r/wQDxGdMh5K0Xg3nZ4LvYDgqxXjeIIxRn9nLE1VJEWWcxVSSISOENq6C3QA3Au nhM20tgLn9D2ZxLnK5gtNFbVHRWEdTDIBIs7qxjMfvoYe2XVI77tyEK6Hg9LdtymvtZZ E7K+w/v/mQyH2A85lnPCF1n0lnk8traDrjGTMSEwshhusLqZkDswBpVrQwNdStnLsD4y zB5BCVVkYoefITHdhsc8T0A5lx9HmOh024CRhZh5fLZg/pJrXlHQn0KwzY87JrFXj6kl I0Ow== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1709321436; x=1709926236; h=content-transfer-encoding:in-reply-to:from:references:to :content-language:subject:user-agent:mime-version:date:message-id :sender:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=BKSEMMOrYBg4xAqLMRNHctuqzr5UPSHVL6DNN6RHQkk=; b=NQtlbyHrnpq0UMNfLjMp6wnNjQNJE45kQK9TYIRSSYV/bpfUjoe8HUYfpdNOI8agLN f+tUx7NxyUnvOVUC1i8cwxONlTVz+8sH1ZDX738Uh/Jx4ymBhCAmfdqCX0h3uZ4KgmO9 YLjwnIlPylSMwE+fo9RVvh3ENDxcPAvLQkHf8UYayY5W0zqdUHdaZbv0QufIM729oU8m yhuA5pIgvh1xD3xqY18qSnHRiJZCZbe9/+vFYvp6c0Nn8zEYJlxbNAFXbBHWt6uNLAcE EE1oJifS6iBi/o7nBDeUUkPyiWtoLYra/v3igDFZn1cqR/8FgVwW9A6mdrpDABx1d3kC 7Z0g== X-Forwarded-Encrypted: i=1; AJvYcCXFbVWUPjYSYDUVsolDuBWCCMTkciqc12iEWkw8gJEYzRWAcOvrS2o2yH2e6tY3CFxVbGmw6YSIsdk4PwSJU3/EHOiiOVk= X-Gm-Message-State: AOJu0YwT0B8gfanRseYe6UdWFurUgyr3LIrEoXe0YcH52KsNt1X5DuTI amArBcHKppk1dUMD1h9/xrZA3J04ZW6rRXHZOTqM/djuQEHKFAEc X-Google-Smtp-Source: AGHT+IFRHT65QBwrU/1uoVAj6R4MXtSJyzjHD+XaVd4eHHcQpLb1oYhUtXp2t9bhhK1N2OIJkJqroA== X-Received: by 2002:a05:600c:b94:b0:412:d0ab:a33f with SMTP id fl20-20020a05600c0b9400b00412d0aba33fmr151965wmb.22.1709321436084; Fri, 01 Mar 2024 11:30:36 -0800 (PST) Received: from [192.168.1.56] (86-40-129-3-dynamic.agg2.lod.rsl-rtd.eircom.net. [86.40.129.3]) by smtp.googlemail.com with ESMTPSA id u22-20020a05600c139600b0040fdf5e6d40sm6608822wmf.20.2024.03.01.11.30.35 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Fri, 01 Mar 2024 11:30:35 -0800 (PST) Message-ID: <4c63daa4-10f9-1a35-2da4-628c7cc88ac7@draigBrady.com> Date: Fri, 1 Mar 2024 19:30:33 +0000 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: bug#69488: tr (question) Content-Language: en-US To: lacsaP Patatetom , 69488@debbugs.gnu.org References: From: =?UTF-8?Q?P=C3=A1draig_Brady?= In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spam-Score: 0.2 (/) X-Debbugs-Envelope-To: 69488 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.8 (/) On 01/03/2024 15:33, lacsaP Patatetom wrote: > hi, > > I did a few tests with tr and I'm surprised by the results... > > $ echo éèçà > éèçà > > these characters are encoded in utf-8 on 2 bytes : > > $ echo éèçà | xxd > 00000000: c3a9 c3a8 c3a7 c3a0 0a ......... > > now I use tr to remove non-printable characters : > > $ echo éèçà | tr -cd '[:print:]' > $ echo éèçà | tr -cd '[:print:]' | wc > 0 0 0 > > all characters are deleted by tr > now I want to keep the "é" character : > > $ echo éèçà | tr -cd '[:print:]é' > é��� > > why do the "�" characters appear ? > > regards, lacsaP. It's a known issue that tr is currently non multi-byte aware. thanks, Pádraig From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 04 03:27:13 2024 Received: (at 69488) by debbugs.gnu.org; 4 Mar 2024 08:27:13 +0000 Received: from localhost ([127.0.0.1]:41849 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1rh3fF-0005kl-AG for submit@debbugs.gnu.org; Mon, 04 Mar 2024 03:27:13 -0500 Received: from mail-ej1-f48.google.com ([209.85.218.48]:56482) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1rh3fD-0005kV-WA for 69488@debbugs.gnu.org; Mon, 04 Mar 2024 03:27:12 -0500 Received: by mail-ej1-f48.google.com with SMTP id a640c23a62f3a-a45670f9508so38862466b.0 for <69488@debbugs.gnu.org>; Mon, 04 Mar 2024 00:26:42 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1709540737; x=1710145537; darn=debbugs.gnu.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=QJncICGX5a8JoWpweeSPpqasTJ2rV+Ayw22GQ3+vQ60=; b=j4t0JwxC7+v25/0IIqTwRUkcgjZJ4EEdLotbTf90w1WQicqvkuSsYoLuTOo77IwWPJ Sc7wAAIpBs7CvCAib6J/Iqvm+PObjS5a4mewiIVkGFZX6cX4xw8ls8+a3PDSroZhUR99 icK+4x/Z1fOZaz2FvnwGsodQlOGtOvcAvPGxBZEJk5iyqyZ6F/XGf3K1kiEfSJe6Wnix yd4uDKkpHJGyqKHyAeQhp3pAEAUZwJmE8fTWg4HZGfGenVW60VCz4Kir5JREKimCUVKQ JcSaVNGrCNazGD6YjQZKhiVSFmLLCiBFWr+AIEpzCoOTviFYMvgN3ZDRpiryUEg/Zo52 Kc/Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1709540737; x=1710145537; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=QJncICGX5a8JoWpweeSPpqasTJ2rV+Ayw22GQ3+vQ60=; b=iKWRyl+ReUs4WLEzv7WDPUo1J1nf+D5H9PfOiSdpbB1A87PicbNjisAW/9ILs/THMi qDKBNswCPL8KT9BQjov7ygGD3uy8vFrEGq53GWiMyV10ESYHyMNAY1Oor8dz/bpvDGDJ 00BYmkd1S227KppvA3EpIN2LEi5+WPCNBi0ikLbACfLH+e6sIVkmrZWjUAmVELwRv5mD wdKrySnSyKnL3E4nOMr0WsSbBCEvWFYw/UUElkrSsNnJWI0KTHhEjq7X8fic7p97eTeH aARjiWhCrhwUePB9Y4Ug+bgsA7sm/g2SPUu6HIjbXxLmr6ZI1VaKUUbDHTRIWaPeN6pF kVnw== X-Gm-Message-State: AOJu0Yzy/xWukT9w/9vNGpwvfkWP4uHKsX9YZuy8zx+4XLvPOMjCXofS KIW/yHRoGm0l2GAKUwwpRa1TF54MMaCWAva32I7+FxtCz6BRealI2BX3ouE+NleLtIx0b+ZjaQg xYaIz8fT4lMhFL1X54Uhd1xRR0No= X-Google-Smtp-Source: AGHT+IG/CWaJ0eh0BQwl5BhBtzkQhw/xq5t106vztd21xXb+F3wkfnq7FRQ+bJVbXpLL2qTlwcR8eAXWRl4+Nt6fUY8= X-Received: by 2002:a17:906:ff53:b0:a3e:c818:b7f with SMTP id zo19-20020a170906ff5300b00a3ec8180b7fmr6001413ejb.29.1709540736477; Mon, 04 Mar 2024 00:25:36 -0800 (PST) MIME-Version: 1.0 References: <4c63daa4-10f9-1a35-2da4-628c7cc88ac7@draigBrady.com> In-Reply-To: <4c63daa4-10f9-1a35-2da4-628c7cc88ac7@draigBrady.com> From: lacsaP Patatetom Date: Mon, 4 Mar 2024 09:25:24 +0100 Message-ID: Subject: Re: bug#69488: tr (question) To: =?UTF-8?Q?P=C3=A1draig_Brady?= Content-Type: multipart/alternative; boundary="00000000000053b74c0612d178cf" X-Spam-Score: -0.0 (/) X-Debbugs-Envelope-To: 69488 Cc: 69488@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) --00000000000053b74c0612d178cf Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Le ven. 1 mars 2024 =C3=A0 20:30, P=C3=A1draig Brady a = =C3=A9crit : > On 01/03/2024 15:33, lacsaP Patatetom wrote: > > hi, > > > > I did a few tests with tr and I'm surprised by the results... > > > > $ echo =C3=A9=C3=A8=C3=A7=C3=A0 > > =C3=A9=C3=A8=C3=A7=C3=A0 > > > > these characters are encoded in utf-8 on 2 bytes : > > > > $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | xxd > > 00000000: c3a9 c3a8 c3a7 c3a0 0a ......... > > > > now I use tr to remove non-printable characters : > > > > $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]' > > $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]' | wc > > 0 0 0 > > > > all characters are deleted by tr > > now I want to keep the "=C3=A9" character : > > > > $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]=C3=A9' > > =C3=A9=EF=BF=BD=EF=BF=BD=EF=BF=BD > > > > why do the "=EF=BF=BD" characters appear ? > > > > regards, lacsaP. > > > It's a known issue that tr is currently non multi-byte aware. > > thanks, > P=C3=A1draig > hi, thank you for this clarification. what alternative to `tr` would you recommend for this type of treatment ? regards, lacsaP. --00000000000053b74c0612d178cf Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Le=C2=A0ven. 1 mars 2024 =C3=A0=C2=A020:30, P=C3=A1draig Brady <<= a href=3D"mailto:P@draigbrady.com">P@draigbrady.com> a =C3=A9crit=C2= =A0:
On 01/03/20= 24 15:33, lacsaP Patatetom wrote:
> hi,
>
> I did a few tests with tr and I'm surprised by the results...
>
> $ echo =C3=A9=C3=A8=C3=A7=C3=A0
> =C3=A9=C3=A8=C3=A7=C3=A0
>
> these characters are encoded in utf-8 on 2 bytes :
>
> $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | xxd
> 00000000: c3a9 c3a8 c3a7 c3a0 0a=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2= =A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0.........
>
> now I use tr to remove non-printable characters :
>
> $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]'
> $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]' | wc
>=C2=A0 =C2=A0 =C2=A0 =C2=A0 0=C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 = =C2=A0 =C2=A00
>
> all characters are deleted by tr
> now I want to keep the "=C3=A9" character :
>
> $ echo =C3=A9=C3=A8=C3=A7=C3=A0 | tr -cd '[:print:]=C3=A9'
> =C3=A9=EF=BF=BD=EF=BF=BD=EF=BF=BD
>
> why do the "=EF=BF=BD" characters appear ?
>
> regards, lacsaP.


It's a known issue that tr is currently non multi-byte aware.

thanks,
P=C3=A1draig
hi,

thank you fo= r this clarification.

what alternative to `tr` wou= ld you recommend for this type of treatment ?

rega= rds, lacsaP.
--00000000000053b74c0612d178cf--