From unknown Thu Aug 14 17:24:36 2025
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-Mailer: MIME-tools 5.509 (Entity 5.509)
Content-Type: text/plain; charset=utf-8
From: bug#11948 <11948@debbugs.gnu.org>
To: bug#11948 <11948@debbugs.gnu.org>
Subject: Status: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
Reply-To: bug#11948 <11948@debbugs.gnu.org>
Date: Fri, 15 Aug 2025 00:24:36 +0000

retitle 11948 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
reassign 11948 emacs
submitter 11948 Reuben Thomas
severity 11948 wishlist
thanks

From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 16 08:48:40 2012
Received: (at submit) by debbugs.gnu.org; 16 Jul 2012 12:48:40 +0000
Received: from localhost ([127.0.0.1]:43882 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from )
	id 1Sqkj9-0002qL-He for submit@debbugs.gnu.org;
	Mon, 16 Jul 2012 08:48:40 -0400
Received: from eggs.gnu.org ([208.118.235.92]:58604)
	by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from )
	id 1Sqkj5-0002qC-Ct for submit@debbugs.gnu.org;
	Mon, 16 Jul 2012 08:48:37 -0400
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
	(envelope-from ) id 1SqkdG-0006lq-8R for submit@debbugs.gnu.org;
	Mon, 16 Jul 2012 08:42:40 -0400
X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org
X-Spam-Level:
X-Spam-Status: No, score=-6.9 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_HI
	autolearn=unavailable version=3.3.2
Received: from lists.gnu.org ([208.118.235.17]:46630)
	by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from )
	id 1SqkdG-0006lj-4s for submit@debbugs.gnu.org;
	Mon, 16 Jul 2012 08:42:34 -0400
Received: from eggs.gnu.org ([208.118.235.92]:56916)
	by lists.gnu.org with esmtp (Exim 4.71) (envelope-from )
	id 1SqkdE-0004Ko-6E for bug-gnu-emacs@gnu.org;
	Mon, 16 Jul 2012 08:42:34 -0400
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
	(envelope-from ) id 1Sqkd5-0006jS-UG for bug-gnu-emacs@gnu.org;
	Mon, 16 Jul 2012 08:42:32 -0400
Received: from exprod7og110.obsmtp.com ([64.18.2.173]:39994)
	by eggs.gnu.org with smtp (Exim 4.71) (envelope-from )
	id 1Sqkd5-0006iv-Hi for bug-gnu-emacs@gnu.org;
	Mon, 16 Jul 2012 08:42:23 -0400
Received: from mail-wg0-f45.google.com ([74.125.82.45]) (using TLSv1)
	by exprod7ob110.postini.com ([64.18.6.12]) with SMTP
	ID DSNKUAQMLHSpchy0Lfpqp/Iy+fEslqRZ1INR@postini.com;
	Mon, 16 Jul 2012 05:42:23 PDT
Received: by mail-wg0-f45.google.com with SMTP id dt14so4776737wgb.14
	for ; Mon, 16 Jul 2012 05:42:20 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
	d=google.com; s=20120113;
	h=from:to:subject:date:message-id:mime-version:content-type
	:x-gm-message-state;
	bh=90YuabekVuDItGW9h0RTJBRGYGqO17oWm3XmRjZI3vA=;
	b=i5Gagg6vf+Z5+s8v2kHu09XDxjDlqXobRBl8hGUs8X7cISm5Hnj8A4zDhEf1WSfj3o
	fI+VJnLmHvJJ9cJuZgdtiBFfyqJkUvWkfM/TSLyYJTpIPgmMJx/NElK4khUMDhdGTjXm
	dndkpKmBd4WrwZ5M0fN0zh70oW83ZitWIjPkr/3vyvq78q/TlTMh8MBZW6QwaHH2pQcC
	1trU8+P37/QQWNh/E1H27Z8TrptPg6nA8oLdZ5OPGOpOLvt7MdSbodnDLOh2Wzo6NdKP
	CPawA5990IPlQQxjmQkZo2DyVasXbSbEkbm4e6loWtLwJRQz45rOrQFogKuGVS9xcN9n
	010Q==
Received: by 10.180.97.135 with SMTP id ea7mr17958671wib.11.1342442540436;
	Mon, 16 Jul 2012 05:42:20 -0700 (PDT)
Received: from skwd (87-194-87-241.bethere.co.uk. [87.194.87.241])
	by mx.google.com with ESMTPS id t7sm31555688wix.6.2012.07.16.05.42.19
	(version=TLSv1/SSLv3 cipher=OTHER);
	Mon, 16 Jul 2012 05:42:19 -0700 (PDT)
From: Reuben Thomas
To: bug-gnu-emacs@gnu.org
Subject: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
Date: Mon, 16 Jul 2012 13:42:18 +0100
Message-ID: <871ukbubbp.fsf@sc3d.org>
MIME-Version: 1.0
Content-Type: text/plain
X-Gm-Message-State: ALoCoQlrv1xji3gN/cjJzZtyBCL7skVeJGfRHo3rZj7kMk7Aja90kQfRsaJ68t7ZFlQWd+cO3dHU
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6, seldom 2.4 (older, 4)
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3)
X-Received-From: 208.118.235.17
X-Spam-Score: -6.2 (------)
X-Debbugs-Envelope-To: submit
X-BeenThere: debbugs-submit@debbugs.gnu.org
X-Mailman-Version: 2.1.13
Precedence: list
Sender: debbugs-submit-bounces@debbugs.gnu.org
Errors-To: debbugs-submit-bounces@debbugs.gnu.org

I noticed this when in visual-line-mode, and it failed to wrap at an em
space (U+2003), but of course there are lots of other breaking space
characters.

In GNU Emacs 24.1.50.2 (x86_64-unknown-linux-gnu, GTK+ Version 2.24.10) of 2012-07-14 on skwd Bzr revision: 109087 cyd@gnu.org-20120714053223-jxkxt958pqg8tisb Windowing system distributor `The X.Org Foundation', version 11.0.11103000 Important settings: value of $LC_MONETARY: en_GB.UTF-8 value of $LC_NUMERIC: en_GB.UTF-8 value of $LC_TIME: en_GB.UTF-8 value of $LANG: en_US.UTF-8 locale-coding-system: utf-8-unix default enable-multibyte-characters: t Major mode: Help Minor modes in effect: shell-dirtrack-mode: t diff-auto-refine-mode: t recentf-mode: t show-paren-mode: t server-mode: t savehist-mode: t minibuffer-electric-default-mode: t iswitchb-mode: t icomplete-mode: t global-auto-revert-mode: t desktop-save-mode: t tooltip-mode: t mouse-wheel-mode: t file-name-shadow-mode: t global-font-lock-mode: t font-lock-mode: t blink-cursor-mode: t auto-composition-mode: t auto-encryption-mode: t auto-compression-mode: t column-number-mode: t line-number-mode: t transient-mark-mode: t Recent input: C-y C-d C-d C-d C-d C-d C-d C-d C-d C-d C-d C-d C-d C-g C-g C-h k C-c l C-n C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-x 1 C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n M-x c u s o t m i z e - t o m i z e - g g r r o u p v i s u C-h f v i s u a l - l i n e - m o d e C-n C-n C-n C-n C-n C-n C-n C-n C-n C-n C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f C-f M-x r e p o r t - b e m a c s - b u g Recent messages: Type "q" to restore previous buffer. uncompressing simple.el.gz...done Note: file is write protected Creating customization items... Creating group... Creating group entries...done Creating customization items ...done Resetting customization items...done Creating customization setup...done Type "q" to restore previous buffer. 
[2 times] Load-path shadows: /home/rrt/.emacs.d/elpa/dictionary-1.8.7/dictionary-init hides /usr/local/share/emacs/24.1.50/site-lisp/dictionary-el/dictionary-init /home/rrt/.emacs.d/elpa/dictionary-1.8.7/dictionary hides /usr/local/share/emacs/24.1.50/site-lisp/dictionary-el/dictionary /home/rrt/.emacs.d/elpa/dictionary-1.8.7/link hides /usr/local/share/emacs/24.1.50/site-lisp/dictionary-el/link /home/rrt/.emacs.d/elpa/dictionary-1.8.7/connection hides /usr/local/share/emacs/24.1.50/site-lisp/dictionary-el/connection /home/rrt/local/share/emacs/site-lisp/dict hides /usr/local/share/emacs/24.1.50/site-lisp/emacs-goodies-el/dict /usr/local/share/emacs/24.1.50/site-lisp/auctex/tex-style hides /usr/share/emacs/site-lisp/auctex/tex-style /usr/local/share/emacs/24.1.50/site-lisp/auctex/tex-mik hides /usr/share/emacs/site-lisp/auctex/tex-mik /usr/local/share/emacs/24.1.50/site-lisp/auctex/multi-prompt hides /usr/share/emacs/site-lisp/auctex/multi-prompt /usr/local/share/emacs/24.1.50/site-lisp/auctex/tex-jp hides /usr/share/emacs/site-lisp/auctex/tex-jp /usr/local/share/emacs/24.1.50/site-lisp/auctex/tex-info hides /usr/share/emacs/site-lisp/auctex/tex-info /usr/local/share/emacs/24.1.50/site-lisp/auctex/latex hides /usr/share/emacs/site-lisp/auctex/latex /usr/local/share/emacs/24.1.50/site-lisp/auctex/tex hides /usr/share/emacs/site-lisp/auctex/tex /usr/local/share/emacs/24.1.50/site-lisp/auctex/texmathp hides /usr/share/emacs/site-lisp/auctex/texmathp /usr/local/share/emacs/24.1.50/site-lisp/auctex/context-nl hides /usr/share/emacs/site-lisp/auctex/context-nl /usr/local/share/emacs/24.1.50/site-lisp/auctex/tex-font hides /usr/share/emacs/site-lisp/auctex/tex-font /usr/local/share/emacs/24.1.50/site-lisp/auctex/toolbar-x hides /usr/share/emacs/site-lisp/auctex/toolbar-x /usr/local/share/emacs/24.1.50/site-lisp/auctex/tex-buf hides /usr/share/emacs/site-lisp/auctex/tex-buf /usr/local/share/emacs/24.1.50/site-lisp/auctex/tex-fptex hides 
/usr/share/emacs/site-lisp/auctex/tex-fptex /usr/local/share/emacs/24.1.50/site-lisp/auctex/bib-cite hides /usr/share/emacs/site-lisp/auctex/bib-cite /usr/local/share/emacs/24.1.50/site-lisp/auctex/context-en hides /usr/share/emacs/site-lisp/auctex/context-en /usr/local/share/emacs/24.1.50/site-lisp/auctex/tex-fold hides /usr/share/emacs/site-lisp/auctex/tex-fold /usr/local/share/emacs/24.1.50/site-lisp/auctex/tex-bar hides /usr/share/emacs/site-lisp/auctex/tex-bar /usr/local/share/emacs/24.1.50/site-lisp/auctex/context hides /usr/share/emacs/site-lisp/auctex/context /usr/local/share/emacs/24.1.50/site-lisp/auctex/font-latex hides /usr/share/emacs/site-lisp/auctex/font-latex Features: (shadow sort gnus-util mail-extr emacsbug message format-spec rfc822 mml mml-sec mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader sendmail rfc2047 rfc2045 ietf-drums mm-util mail-prsvr mail-utils cus-edit find-func vc ediff-merg ediff-diff ediff-wind ediff-help ediff-util ediff-mult ediff-init ediff vc-dispatcher todoo shell pcomplete grep multi-isearch help-mode jka-compr info etags nxml-uchnm rng-xsd xsd-regexp rng-cmpct image-mode rng-nxml rng-valid rng-loc rng-uri rng-parse nxml-parse rng-match rng-dt rng-util rng-pttrn nxml-ns nxml-mode nxml-outln nxml-rap nxml-util nxml-glyph nxml-enc xmltok sgml-mode js byte-opt bytecomp byte-compile cconv json imenu thingatpt inform-mode cc-langs cc-mode cc-fonts cc-guess cc-menus cc-cmds cc-styles cc-align cc-engine cc-vars cc-defs autoconf autoconf-mode make-mode noutline outline lua-mode diff-git diff-mode cperl-mode flymake compile comint ansi-color ring vc-git face-remap flyspell smart-quotes auto-dictionary-autoloads c-eldoc-autoloads dictionary-autoloads diff-git-autoloads dired-isearch-autoloads full-ack-autoloads guess-style-autoloads kill-ring-search-autoloads magit-autoloads mv-shell-autoloads tumble-autoloads http-post-simple-autoloads package completing-help recentf tree-widget wid-edit uniquify 
paren server savehist minibuf-eldef iswitchb icomplete autorevert desktop cus-start cus-load ropemacs pymacs go-mode-load ispell advice advice-preload yasnippet help-fns derived edmacro kmacro cl-macs gv easymenu assoc cl cl-lib macroexp muse-autoloads emacs-goodies-el emacs-goodies-custom emacs-goodies-loaddefs easy-mmode preview-latex tex-site auto-loads user-site-loaddefs time-date tooltip ediff-hook vc-hooks lisp-float-type mwheel x-win x-dnd tool-bar dnd fontset image regexp-opt fringe tabulated-list newcomment lisp-mode register page menu-bar rfn-eshadow timer select scroll-bar mouse jit-lock font-lock syntax facemenu font-core frame cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao korean japanese hebrew greek romanian slovak czech european ethiopic indian cyrillic chinese case-table epa-hook jka-cmpr-hook help simple abbrev minibuffer loaddefs button faces cus-face files text-properties overlay sha1 md5 base64 format env code-pages mule custom widget hashtable-print-readable backquote make-network-process dbusbind dynamic-setting system-font-setting font-render-setting move-toolbar gtk x-toolkit x multi-tty emacs)

--
http://rrt.sc3d.org/

From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 16 12:11:44 2012
Date: Mon, 16 Jul 2012 19:05:49 +0300
From: Eli Zaretskii
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
In-reply-to: <871ukbubbp.fsf@sc3d.org>
To: Reuben Thomas
Message-id: <837gu3bsiq.fsf@gnu.org>
References: <871ukbubbp.fsf@sc3d.org>
Cc: 11948@debbugs.gnu.org
Reply-To: Eli Zaretskii

> From: Reuben Thomas
> Date: Mon, 16 Jul 2012 13:42:18 +0100
>
> I noticed this when in visual-line-mode, and it failed to wrap at an em
> space (U+2003), but of course there are lots of other breaking space
> characters.

A prerequisite for doing something about this is to decide which
characters should allow breaking the line.  Is there some guidance in
the Unicode standard or elsewhere about this?  If not, we will have to
decide on our own.

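One rough way to get a first list of candidates, sketched here purely as
an illustration and not as anything proposed in this thread: take the
characters in Unicode general category Zs (space separators) and drop
those whose compatibility decomposition is tagged <noBreak>.  This is
only a heuristic.  It does not consult the Line_Break property, which
Python's standard unicodedata module does not expose, and the function
name candidate_break_spaces is made up for the example.

```python
import sys
import unicodedata


def candidate_break_spaces():
    """Return space separators (category Zs) not marked <noBreak>.

    Heuristic only: this approximates "breaking spaces" from the general
    category and the decomposition tag, not from LineBreak.txt.
    """
    result = []
    for cp in range(sys.maxunicode + 1):
        ch = chr(cp)
        if unicodedata.category(ch) != "Zs":
            continue
        # NO-BREAK SPACE and friends decompose as "<noBreak> 0020".
        if unicodedata.decomposition(ch).startswith("<noBreak>"):
            continue
        result.append(ch)
    return result


for ch in candidate_break_spaces():
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}")
```

The resulting list contains EM SPACE (U+2003) and leaves out NO-BREAK
SPACE (U+00A0), FIGURE SPACE (U+2007) and NARROW NO-BREAK SPACE (U+202F).
TAB is not in category Zs, so it would have to be added by hand.
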
From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 16 14:27:01 2012
In-Reply-To: <837gu3bsiq.fsf@gnu.org>
References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org>
Date: Mon, 16 Jul 2012 19:21:00 +0100
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
From: Reuben Thomas
To: Eli Zaretskii
Cc: 11948@debbugs.gnu.org

On 16 July 2012 17:05, Eli Zaretskii wrote:
>
> A prerequisite for doing something about this is to decide which
> characters should allow breaking the line.  Is there some guidance in
> the Unicode standard or elsewhere about this?  If not, we will have to
> decide on our own.

The Unicode line breaking algorithm is probably the place to go:

http://unicode.org/reports/tr14/

--
http://rrt.sc3d.org

From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 16 15:53:03 2012
Date: Mon, 16 Jul 2012 22:47:09 +0300
From: Eli Zaretskii
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
To: Reuben Thomas
Message-id: <83wr23a3pe.fsf@gnu.org>
References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org>
Cc: 11948@debbugs.gnu.org
Reply-To: Eli Zaretskii

> Date: Mon, 16 Jul 2012 19:21:00 +0100
> From: Reuben Thomas
> Cc: 11948@debbugs.gnu.org
>
> On 16 July 2012 17:05, Eli Zaretskii wrote:
> >
> > A prerequisite for doing something about this is to decide which
> > characters should allow breaking the line.  Is there some guidance in
> > the Unicode standard or elsewhere about this?  If not, we will have to
> > decide on our own.
>
> The Unicode line breaking algorithm is probably the place to go:
>
> http://unicode.org/reports/tr14/

Thanks, but that's not what I meant.  Implementing UAX#14 in full is
an effort similar (although smaller) to what was required for
implementing UAX#9, the Unicode Bidirectional Algorithm.  The main
problem is that, like with UAX#9, the algorithms in UAX#14 are
specified assuming that text is processed for display in batches.  By
contrast, the Emacs display engine, which implements word-wrap,
examines and processes characters one by one.  So one needs to
"serialize", so to speak, the UAX#14 algorithms so that their decisions
can be made on a character-by-character basis.

I think just supporting more characters from LineBreak.txt on which to
wrap should be a good start, and much easier than implementing UAX#14.
Even for that, we will need an efficient char-table for the related
properties, probably via the uniprop_table machinery, like what bidi.c
uses.  Otherwise, referencing the ordinary char-tables of character
properties for each character we display could slow down redisplay too
much.

From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 16 15:54:52 2012
In-Reply-To: <83wr23a3pe.fsf@gnu.org>
References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org>
Date: Mon, 16 Jul 2012 20:48:52 +0100
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
From: Reuben Thomas
To: Eli Zaretskii
Cc: 11948@debbugs.gnu.org

On 16 July 2012 20:47, Eli Zaretskii wrote:
>
> Thanks, but that's not what I meant.  Implementing UAX#14 in full is
> an effort similar (although smaller) to what was required for
> implementing UAX#9, the Unicode Bidirectional Algorithm.  The main
> problem is that, like with UAX#9, the algorithms in UAX#14 are
> specified assuming that text is processed for display in batches.

I wasn't suggesting you should implement the algorithm; I just assumed
it would contain a list of breaking space characters.  You seem to have
found such a thing anyway!

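To make the character-by-character constraint discussed above concrete,
here is a minimal sketch (an illustration only, not Emacs code) of a
one-pass wrapper that consults a precomputed lookup set for each
character and remembers the most recent break opportunity.  The names
wrap_one_pass and BREAK_AFTER are hypothetical, and the set is a small
hand-picked sample (space, tab, en space, em space, thin space), not
the full data from LineBreak.txt.  Width is counted in characters,
which ignores real display width.

```python
# Hand-picked sample of characters after which a line may be broken.
# Assumption: an illustrative subset, not the full LineBreak.txt data.
BREAK_AFTER = frozenset(" \t\u2002\u2003\u2009")


def wrap_one_pass(text, width, break_after=BREAK_AFTER):
    """Wrap text in a single left-to-right pass, one character at a time."""
    lines = []
    line = []        # characters of the visual line being built
    last_break = 0   # length of `line` up to the last break opportunity
    for ch in text:
        line.append(ch)
        if ch in break_after:
            last_break = len(line)
        if len(line) > width:
            if last_break:
                # Break after the most recent breakable character.  The
                # whitespace stays at the end of the finished line, so a
                # line may exceed `width` by that one trailing character.
                lines.append("".join(line[:last_break]))
                line = line[last_break:]
            else:
                # No opportunity on this line: break before this character.
                lines.append("".join(line[:-1]))
                line = line[-1:]
            last_break = 0
    if line:
        lines.append("".join(line))
    return lines


print(wrap_one_pass("alpha\u2003beta gamma", 8))
```

Passing break_after=frozenset(" \t") mimics the situation described in
the report, where only space and tab allow a break: the em space is then
not a break opportunity and the first line is cut in the middle of
"beta".
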
--
http://rrt.sc3d.org

From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 16 16:43:33 2012
From: Stefan Monnier
To: Eli Zaretskii
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org>
Date: Mon, 16 Jul 2012 16:37:25 -0400
In-Reply-To: <837gu3bsiq.fsf@gnu.org> (Eli Zaretskii's message of "Mon, 16 Jul 2012 19:05:49 +0300")
Cc: 11948@debbugs.gnu.org, Reuben Thomas

>> I noticed this when in visual-line-mode, and it failed to wrap at an em
>> space (U+2003), but of course there are lots of other breaking space
>> characters.
> A prerequisite for doing something about this is to decide which
> characters should allow breaking the line.  Is there some guidance in
> the Unicode standard or elsewhere about this?  If not, we will have to
> decide on our own.

I think the issue here is whether we want to "render" the text, or
whether we want to show the file's content to the user.
For text-rendering, any space-like thingy that Unicode says isn't
unbreakable would probably be fine, but for the other case, it can be
important for the user to see the difference between a normal space and
some other space, and wrapping the line can hide the difference.

Maybe we can rely on a variable such as nobreak-char-display (where
there's a similar issue).


        Stefan

From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 16 16:46:45 2012
References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org>
Date: Mon, 16 Jul 2012 21:40:45 +0100
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
From: Reuben Thomas
To: Stefan Monnier
Cc: Eli Zaretskii, 11948@debbugs.gnu.org

On 16 July 2012 21:37, Stefan Monnier wrote:
>
> I think the issue here is whether we want to "render" the text, or
> whether we want to show the file's content to the user.

Isn't whitespace-mode for showing the content, as far as spaces are
concerned?

--
http://rrt.sc3d.org

From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 16 17:22:45 2012
From: Stefan Monnier
To: Reuben Thomas
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org>
Date: Mon, 16 Jul 2012 17:16:42 -0400
In-Reply-To: (Reuben Thomas's message of "Mon, 16 Jul 2012 21:40:45 +0100")
Cc: Eli Zaretskii, 11948@debbugs.gnu.org

>> I think the issue here is whether we want to "render" the text, or
>> whether we want to show the file's content to the user.
> Isn't whitespace-mode for showing the content, as far as spaces
> are concerned?

whitespace mode is good at showing where you have things like
trailing-spaces, but other than anal retentive guys like us,
nobody cares.

OTOH many people (myself included) have wasted hours tracking bugs where
some chunk of code contained some weird char like a NBSP that displayed
exactly like a normal space but isn't parsed the same way.

For similar reasons, we don't treat ~ in TeX as whitespace: while its
rendering will display as whitespace, its meaning in the source code is
non-trivial.

I'm aware that neither ~ in TeX nor NBSP is quite the same as the
problem at hand, but there is still the same general issue of
distinguishing the specification from its rendering, and when editing
a file in Emacs you often want to see what the file specifies more than
what it will render to (which you'll want to see maybe elsewhere such as
in a browser).


        Stefan

From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 16 17:24:57 2012
References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org>
Date: Mon, 16 Jul 2012 22:18:57 +0100
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
From: Reuben Thomas
To: Stefan Monnier
Cc: Eli Zaretskii, 11948@debbugs.gnu.org

On 16 July 2012 22:16, Stefan Monnier wrote:
>>> I think the issue here is whether we want to "render" the text, or
>>> whether we want to show the file's content to the user.
>> Isn't whitespace-mode for showing the content, as far as spaces
>> are concerned?
>
> whitespace mode is good at showing where you have things like
> trailing-spaces, but other than anal retentive guys like us,
> nobody cares.

whitespace-mode is quite a bit more general than that: it allows one
to visualise various space characters, every time they appear.  It
could presumably be extended to visualise more Unicode space
characters.

--
http://rrt.sc3d.org

From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 16 22:54:31 2012
Date: Tue, 17 Jul 2012 05:48:36 +0300
From: Eli Zaretskii
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
To: Reuben Thomas
Message-id: <83vchn9k6z.fsf@gnu.org>
References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org>
Cc: 11948@debbugs.gnu.org
Reply-To: Eli Zaretskii

> Date: Mon, 16 Jul 2012 20:48:52 +0100
> From: Reuben Thomas
> Cc: 11948@debbugs.gnu.org > > On 16 July 2012 20:47, Eli Zaretskii wrote: > > > > Thanks, but that's not what I meant. Implementing UAX#14 in full is > > an effort similar (although smaller) to what was required for > > implementing UAX#9, the Unicode Bidirectional Algorithm. The main > > problem is that, like with UAX#9, the algorithms in UAX#14 are > > specified assuming that text is processed for display in batches. > > I wasn't suggesting you should implement the algorithm; I just assumed > it would contain a list of breaking space characters. You seem to have > found such a thing anyway! Yes, with your help: the file was mentioned in UAX#14. Thanks. From debbugs-submit-bounces@debbugs.gnu.org Tue Jul 17 06:10:12 2012 Received: (at 11948) by debbugs.gnu.org; 17 Jul 2012 10:10:12 +0000 Received: from localhost ([127.0.0.1]:45381 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Sr4jL-0000p4-Uj for submit@debbugs.gnu.org; Tue, 17 Jul 2012 06:10:12 -0400 Received: from mailout-de.gmx.net ([213.165.64.22]:55377) by debbugs.gnu.org with smtp (Exim 4.72) (envelope-from ) id 1Sr4jJ-0000ov-7h for 11948@debbugs.gnu.org; Tue, 17 Jul 2012 06:10:10 -0400 Received: (qmail invoked by alias); 17 Jul 2012 09:48:58 -0000 Received: from 62-47-34-58.adsl.highway.telekom.at (EHLO [62.47.34.58]) [62.47.34.58] by mail.gmx.net (mp034) with SMTP; 17 Jul 2012 11:48:58 +0200 X-Authenticated: #14592706 X-Provags-ID: V01U2FsdGVkX1+H68AB8f+H85uucFJexq8C95qTA6iX+YAnbrPpBL FiC/y2JWySVwEl Message-ID: <50053544.8020608@gmx.at> Date: Tue, 17 Jul 2012 11:49:56 +0200 From: martin rudalics MIME-Version: 1.0 To: Eli Zaretskii Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org> In-Reply-To: <83wr23a3pe.fsf@gnu.org> Content-Type: text/plain; charset=ISO-8859-15; format=flowed Content-Transfer-Encoding: 7bit 
X-Y-GMX-Trusted: 0 X-Spam-Score: -1.9 (-) X-Debbugs-Envelope-To: 11948 Cc: 11948@debbugs.gnu.org, Reuben Thomas X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -1.9 (-) >> The Unicode line breaking algorithm is probably the place to go: >> >> http://unicode.org/reports/tr14/ > > Thanks, but that's not what I meant. Implementing UAX#14 in full is > an effort similar (although smaller) to what was required for > implementing UAX#9, the Unicode Bidirectional Algorithm. The main > problem is that, like with UAX#9, the algorithms in UAX#14 are > specified assuming that text is processed for display in batches. By > contrast, the Emacs display engine, which implements word-wrap, > examines and processes characters one by one. So one needs to > "serialize", so to speak, the UAX#14 algorithms so that its decisions > could be made on a character-by-character basis. > > I think just supporting more characters from LineBreak.txt on which to > wrap should be a good start, and much easier than implementing UAX#14. > Even for that, we will need an efficient char-table for the related > properties, probably via the uniprop_table machinery, like what bidi.c > uses. Otherwise, referencing the ordinary char-tables of character > properties for each character we display could slow down redisplay too > much. While you're all there: If anybody has any idea how to support a practical and simplified version of collation, see http://www.unicode.org/reports/tr10/ in emacs, I'd be all ears. 
martin From debbugs-submit-bounces@debbugs.gnu.org Tue Jul 17 08:06:53 2012 Received: (at 11948) by debbugs.gnu.org; 17 Jul 2012 12:06:53 +0000 Received: from localhost ([127.0.0.1]:45544 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Sr6YG-0005AH-Ez for submit@debbugs.gnu.org; Tue, 17 Jul 2012 08:06:53 -0400 Received: from mail-bk0-f44.google.com ([209.85.214.44]:34087) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Sr6YC-0005A8-GZ for 11948@debbugs.gnu.org; Tue, 17 Jul 2012 08:06:50 -0400 Received: by bkty7 with SMTP id y7so211027bkt.3 for <11948@debbugs.gnu.org>; Tue, 17 Jul 2012 05:00:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc:content-type; bh=7CV5VNQtVQLlDcyeuVReuJqJESEIwVSwWxBUJFzxc1I=; b=FJpY3f3//Fz17zhi3TNsWgTlWImelPsL4DoPHeuLzaD3G8GtLLZRaZyonakJ/732Ml L9Pf5f+DTnCv77SQrc6WpW92BlSW0CaIJHOtueZxOQ7wh5LpILzv2e1UcsI5oL+zbDtJ 2DAfcAasBQkKSoAGAw3rAKQNVBwFKoR1DFgbccKVIReC6xxnLLzYnBESr3+zcwKHYzsS KXqlXFSfPVNV6JC0qItGfaqjhKUCNaJV48+YatAeKkWs137MezP2Tkw49370vRsFgasH 8CRugxmcAwShRL1d5hBJPauQB8fCxa/HZsPmQ6P1mrCzlqFs1dR+5NKJlzqTq0EiFi8e IWLQ== Received: by 10.152.122.9 with SMTP id lo9mr2307481lab.41.1342526447693; Tue, 17 Jul 2012 05:00:47 -0700 (PDT) MIME-Version: 1.0 Received: by 10.112.41.99 with HTTP; Tue, 17 Jul 2012 05:00:17 -0700 (PDT) In-Reply-To: <50053544.8020608@gmx.at> References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org> <50053544.8020608@gmx.at> From: Lennart Borgman Date: Tue, 17 Jul 2012 14:00:17 +0200 Message-ID: Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab To: martin rudalics Content-Type: text/plain; charset=UTF-8 X-Spam-Score: -2.6 (--) X-Debbugs-Envelope-To: 11948 Cc: Eli Zaretskii , 11948@debbugs.gnu.org, Reuben Thomas X-BeenThere: debbugs-submit@debbugs.gnu.org 
X-Mailman-Version: 2.1.13 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -2.6 (--) On Tue, Jul 17, 2012 at 11:49 AM, martin rudalics wrote: > > While you're all there: If anybody has any idea how to support a > practical and simplified version of collation, see > > http://www.unicode.org/reports/tr10/ > > in emacs, I'd be all ears. How do operating systems support this? From debbugs-submit-bounces@debbugs.gnu.org Tue Jul 17 09:18:12 2012 Received: (at 11948) by debbugs.gnu.org; 17 Jul 2012 13:18:12 +0000 Received: from localhost ([127.0.0.1]:45660 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Sr7fH-0007ef-VI for submit@debbugs.gnu.org; Tue, 17 Jul 2012 09:18:12 -0400 Received: from mailout-de.gmx.net ([213.165.64.22]:42628) by debbugs.gnu.org with smtp (Exim 4.72) (envelope-from ) id 1Sr7fF-0007eX-Kq for 11948@debbugs.gnu.org; Tue, 17 Jul 2012 09:18:10 -0400 Received: (qmail invoked by alias); 17 Jul 2012 13:12:08 -0000 Received: from 62-47-34-58.adsl.highway.telekom.at (EHLO [62.47.34.58]) [62.47.34.58] by mail.gmx.net (mp034) with SMTP; 17 Jul 2012 15:12:08 +0200 X-Authenticated: #14592706 X-Provags-ID: V01U2FsdGVkX1/HvnsOp7LHVgSM8fIkwCVyZlrhn42kQ6feRWvCIu ihCwgSQxGwMcmI Message-ID: <500564F0.9080507@gmx.at> Date: Tue, 17 Jul 2012 15:13:20 +0200 From: martin rudalics MIME-Version: 1.0 To: Lennart Borgman Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org> <50053544.8020608@gmx.at> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Y-GMX-Trusted: 0 X-Spam-Score: -1.9 (-) X-Debbugs-Envelope-To: 11948 Cc: Eli Zaretskii , 11948@debbugs.gnu.org, Reuben Thomas 
X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -1.9 (-) > How does operating systems support this? I suppose most of them support it in some locale dependent manner. Sadly, `sort-lines' and `dired' don't support it all. I can't use `dired' because it doesn't. martin From debbugs-submit-bounces@debbugs.gnu.org Tue Jul 17 12:05:22 2012 Received: (at 11948) by debbugs.gnu.org; 17 Jul 2012 16:05:22 +0000 Received: from localhost ([127.0.0.1]:46366 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SrAH3-0005cv-Es for submit@debbugs.gnu.org; Tue, 17 Jul 2012 12:05:21 -0400 Received: from mtaout23.012.net.il ([80.179.55.175]:60357) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SrAH1-0005ci-4Q for 11948@debbugs.gnu.org; Tue, 17 Jul 2012 12:05:19 -0400 Received: from conversion-daemon.a-mtaout23.012.net.il by a-mtaout23.012.net.il (HyperSendmail v2007.08) id <0M7B00I00B02AT00@a-mtaout23.012.net.il> for 11948@debbugs.gnu.org; Tue, 17 Jul 2012 18:58:45 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.210.75]) by a-mtaout23.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0M7B00IBIB1X4N80@a-mtaout23.012.net.il>; Tue, 17 Jul 2012 18:58:45 +0300 (IDT) Date: Tue, 17 Jul 2012 18:58:54 +0300 From: Eli Zaretskii Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab In-reply-to: <500564F0.9080507@gmx.at> X-012-Sender: halo1@inter.net.il To: martin rudalics Message-id: <83mx2y9y69.fsf@gnu.org> References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org> <50053544.8020608@gmx.at> <500564F0.9080507@gmx.at> X-Spam-Score: -1.2 (-) X-Debbugs-Envelope-To: 11948 Cc: 11948@debbugs.gnu.org, 
lennart.borgman@gmail.com, rrt@sc3d.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -1.2 (-) > Date: Tue, 17 Jul 2012 15:13:20 +0200 > From: martin rudalics > CC: Eli Zaretskii , 11948@debbugs.gnu.org, > Reuben Thomas > > > How does operating systems support this? > > I suppose most of them support it in some locale dependent manner. > Sadly, `sort-lines' and `dired' don't support it all. I can't use > `dired' because it doesn't. I suggest to file a separate bug report, and please explain there why you cannot run Dired because of this, because I don't see the relation. From debbugs-submit-bounces@debbugs.gnu.org Wed Jul 18 12:21:41 2012 Received: (at 11948) by debbugs.gnu.org; 18 Jul 2012 16:21:41 +0000 Received: from localhost ([127.0.0.1]:48750 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SrX0P-0000Qg-AG for submit@debbugs.gnu.org; Wed, 18 Jul 2012 12:21:41 -0400 Received: from mailout-de.gmx.net ([213.165.64.22]:37704) by debbugs.gnu.org with smtp (Exim 4.72) (envelope-from ) id 1SrX0N-0000QY-Pj for 11948@debbugs.gnu.org; Wed, 18 Jul 2012 12:21:40 -0400 Received: (qmail invoked by alias); 18 Jul 2012 16:15:32 -0000 Received: from 62-47-60-60.adsl.highway.telekom.at (EHLO [62.47.60.60]) [62.47.60.60] by mail.gmx.net (mp004) with SMTP; 18 Jul 2012 18:15:32 +0200 X-Authenticated: #14592706 X-Provags-ID: V01U2FsdGVkX1/Lm/sVU8bLN9OyfUpzInpvMLefNKoj/k6eRMBrFl aKhpynfOeimtYX Message-ID: <5006E166.8090601@gmx.at> Date: Wed, 18 Jul 2012 18:16:38 +0200 From: martin rudalics MIME-Version: 1.0 To: Eli Zaretskii Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab References: <871ukbubbp.fsf@sc3d.org> 
<837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org> <50053544.8020608@gmx.at> <500564F0.9080507@gmx.at> <83mx2y9y69.fsf@gnu.org> In-Reply-To: <83mx2y9y69.fsf@gnu.org> Content-Type: text/plain; charset=ISO-8859-15; format=flowed Content-Transfer-Encoding: 7bit X-Y-GMX-Trusted: 0 X-Spam-Score: -1.9 (-) X-Debbugs-Envelope-To: 11948 Cc: 11948@debbugs.gnu.org, lennart.borgman@gmail.com, rrt@sc3d.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -1.9 (-) >> Sadly, `sort-lines' and `dired' don't support it all. I can't use >> `dired' because it doesn't. > > I suggest to file a separate bug report, and please explain there why > you cannot run Dired because of this, because I don't see the > relation. I didn't say that I cannot "run" it. I said that I cannot "use" it. And I can't use it because ls/dired and I have different understandings of what "alphabetically" means. 
martin From debbugs-submit-bounces@debbugs.gnu.org Thu Jul 19 15:53:28 2012 Received: (at 11948) by debbugs.gnu.org; 19 Jul 2012 19:53:28 +0000 Received: from localhost ([127.0.0.1]:51243 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Srwmu-0006Xj-9a for submit@debbugs.gnu.org; Thu, 19 Jul 2012 15:53:28 -0400 Received: from mtaout21.012.net.il ([80.179.55.169]:43382) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1Srwmr-0006Xa-Dh for 11948@debbugs.gnu.org; Thu, 19 Jul 2012 15:53:27 -0400 Received: from conversion-daemon.a-mtaout21.012.net.il by a-mtaout21.012.net.il (HyperSendmail v2007.08) id <0M7F00500APGHZ00@a-mtaout21.012.net.il> for 11948@debbugs.gnu.org; Thu, 19 Jul 2012 22:47:10 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.210.75]) by a-mtaout21.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0M7F005F0AYMC970@a-mtaout21.012.net.il>; Thu, 19 Jul 2012 22:47:10 +0300 (IDT) Date: Thu, 19 Jul 2012 22:47:24 +0300 From: Eli Zaretskii Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab In-reply-to: <50053544.8020608@gmx.at> X-012-Sender: halo1@inter.net.il To: martin rudalics Message-id: <83eho78reb.fsf@gnu.org> References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org> <50053544.8020608@gmx.at> X-Spam-Score: -1.2 (-) X-Debbugs-Envelope-To: 11948 Cc: 11948@debbugs.gnu.org, rrt@sc3d.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -1.2 (-) > Date: Tue, 17 Jul 2012 11:49:56 +0200 > From: martin rudalics > CC: Reuben Thomas , 11948@debbugs.gnu.org > > While you're all there: If anybody has any idea how to support a > practical and simplified version of 
collation, see > > http://www.unicode.org/reports/tr10/ > > in emacs, I'd be all ears. We could provide a function suitable to be a PREDICATE argument for 'sort', which would call 'strcoll' in the underlying C library, can't we? From debbugs-submit-bounces@debbugs.gnu.org Sat Jul 21 07:08:17 2012 Received: (at 11948) by debbugs.gnu.org; 21 Jul 2012 11:08:17 +0000 Received: from localhost ([127.0.0.1]:54089 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SsXXk-0002Mu-Ba for submit@debbugs.gnu.org; Sat, 21 Jul 2012 07:08:16 -0400 Received: from mailout-de.gmx.net ([213.165.64.23]:47035) by debbugs.gnu.org with smtp (Exim 4.72) (envelope-from ) id 1SsXXc-0002Mg-4N for 11948@debbugs.gnu.org; Sat, 21 Jul 2012 07:08:14 -0400 Received: (qmail invoked by alias); 21 Jul 2012 11:01:44 -0000 Received: from 62-47-44-153.adsl.highway.telekom.at (EHLO [62.47.44.153]) [62.47.44.153] by mail.gmx.net (mp027) with SMTP; 21 Jul 2012 13:01:44 +0200 X-Authenticated: #14592706 X-Provags-ID: V01U2FsdGVkX18Sld5b19QSZW7IezI3HzNCFN+rX8vaDCEdPWF6js 3K38Z5mOUpHN3o Message-ID: <500A8C32.1010703@gmx.at> Date: Sat, 21 Jul 2012 13:02:10 +0200 From: martin rudalics MIME-Version: 1.0 To: Eli Zaretskii Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org> <50053544.8020608@gmx.at> <83eho78reb.fsf@gnu.org> In-Reply-To: <83eho78reb.fsf@gnu.org> Content-Type: text/plain; charset=ISO-8859-15; format=flowed Content-Transfer-Encoding: 7bit X-Y-GMX-Trusted: 0 X-Spam-Score: -1.9 (-) X-Debbugs-Envelope-To: 11948 Cc: 11948@debbugs.gnu.org, rrt@sc3d.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org 
X-Spam-Score: -1.9 (-) > We could provide a function suitable to be a PREDICATE argument for > 'sort', which would call 'strcoll' in the underlying C library, can't > we? That would be awesome. Can you try doing that? martin From debbugs-submit-bounces@debbugs.gnu.org Sat Jul 21 08:49:11 2012 Received: (at 11948) by debbugs.gnu.org; 21 Jul 2012 12:49:11 +0000 Received: from localhost ([127.0.0.1]:54323 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SsZ7P-0005Ts-3S for submit@debbugs.gnu.org; Sat, 21 Jul 2012 08:49:11 -0400 Received: from mtaout22.012.net.il ([80.179.55.172]:55505) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SsZ7K-0005Th-5H for 11948@debbugs.gnu.org; Sat, 21 Jul 2012 08:49:07 -0400 Received: from conversion-daemon.a-mtaout22.012.net.il by a-mtaout22.012.net.il (HyperSendmail v2007.08) id <0M7I00C00GMUI900@a-mtaout22.012.net.il> for 11948@debbugs.gnu.org; Sat, 21 Jul 2012 15:42:32 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.210.75]) by a-mtaout22.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0M7I00BIUGMVYUC0@a-mtaout22.012.net.il>; Sat, 21 Jul 2012 15:42:32 +0300 (IDT) Date: Sat, 21 Jul 2012 15:42:29 +0300 From: Eli Zaretskii Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab In-reply-to: <500A8C32.1010703@gmx.at> X-012-Sender: halo1@inter.net.il To: martin rudalics Message-id: <83wr1x1e16.fsf@gnu.org> References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org> <50053544.8020608@gmx.at> <83eho78reb.fsf@gnu.org> <500A8C32.1010703@gmx.at> X-Spam-Score: -1.2 (-) X-Debbugs-Envelope-To: 11948 Cc: 11948@debbugs.gnu.org, rrt@sc3d.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: 
debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -1.2 (-) > Date: Sat, 21 Jul 2012 13:02:10 +0200 > From: martin rudalics > CC: rrt@sc3d.org, 11948@debbugs.gnu.org > > > We could provide a function suitable to be a PREDICATE argument for > > 'sort', which would call 'strcoll' in the underlying C library, can't > > we? > > That would be awesome. Can you try doing that? I can try, but would you please file a separate bug report for this? From debbugs-submit-bounces@debbugs.gnu.org Sun Jul 22 05:47:48 2012 Received: (at 11948) by debbugs.gnu.org; 22 Jul 2012 09:47:48 +0000 Received: from localhost ([127.0.0.1]:56898 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SsslP-0002XK-S1 for submit@debbugs.gnu.org; Sun, 22 Jul 2012 05:47:48 -0400 Received: from pruche.dit.umontreal.ca ([132.204.246.22]:42759) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1SsslN-0002XC-La for 11948@debbugs.gnu.org; Sun, 22 Jul 2012 05:47:46 -0400 Received: from fmsmemgm.homelinux.net (lechon.iro.umontreal.ca [132.204.27.242]) by pruche.dit.umontreal.ca (8.14.1/8.14.1) with ESMTP id q6M9fE9Q027270; Sun, 22 Jul 2012 05:41:15 -0400 Received: by fmsmemgm.homelinux.net (Postfix, from userid 20848) id 95E2BAE17C; Sun, 22 Jul 2012 05:41:14 -0400 (EDT) From: Stefan Monnier To: Eli Zaretskii Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab Message-ID: References: <871ukbubbp.fsf@sc3d.org> <837gu3bsiq.fsf@gnu.org> <83wr23a3pe.fsf@gnu.org> <50053544.8020608@gmx.at> <83eho78reb.fsf@gnu.org> Date: Sun, 22 Jul 2012 05:41:14 -0400 In-Reply-To: <83eho78reb.fsf@gnu.org> (Eli Zaretskii's message of "Thu, 19 Jul 2012 22:47:24 +0300") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.1.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-NAI-Spam-Flag: NO X-NAI-Spam-Threshold: 5 X-NAI-Spam-Score: 0 X-NAI-Spam-Rules: 1 Rules triggered RV4286=0 X-NAI-Spam-Version: 2.2.0.9309 : core 
<4286> : streams <788656> : uri <1171811> X-Spam-Score: -3.5 (---) X-Debbugs-Envelope-To: 11948 Cc: martin rudalics , 11948@debbugs.gnu.org, rrt@sc3d.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -3.5 (---) >> While you're all there: If anybody has any idea how to support a >> practical and simplified version of collation, see >> http://www.unicode.org/reports/tr10/ >> in emacs, I'd be all ears. > We could provide a function suitable to be a PREDICATE argument for > 'sort', which would call 'strcoll' in the underlying C library, can't > we? We'd need to select a utf-8 locale before doing that, right? Stefan From debbugs-submit-bounces@debbugs.gnu.org Fri Mar 24 23:30:58 2017 Received: (at control) by debbugs.gnu.org; 25 Mar 2017 03:30:58 +0000 Received: from localhost ([127.0.0.1]:43047 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1crcPp-0003Aq-Pj for submit@debbugs.gnu.org; Fri, 24 Mar 2017 23:30:58 -0400 Received: from mail-it0-f54.google.com ([209.85.214.54]:34845) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1crcPn-00034h-O8 for control@debbugs.gnu.org; Fri, 24 Mar 2017 23:30:55 -0400 Received: by mail-it0-f54.google.com with SMTP id y18so27359714itc.0 for ; Fri, 24 Mar 2017 20:30:55 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=sender:from:to:subject:date:message-id:mime-version; bh=cdZttWpbv/lN8UzoGIx+X0Dy6P+cka1YFtPwNNdVyAY=; b=lnmfCqq3CNtlD0q2int+o2ox/n4+/xxl3zdxewelFzYblP8ZUqrTYGOAvN8Dhvcs9k +88AwEb9xvuN9jROU0nEywtrCLJa3IBDovUMovMUAzxuFdL3UQ2j5r0KhwIeHWmhGXId VcrYfhE39+YmstYSCfqWuKlYOBCptBBgKXT2MN0w1Wjd6scg5JvnjLiaimrIIhMQPaZk tAhRcrTfWIoMD1+MK5U/zR0YnMEccMx9B3NdfDddoVN+O0qhEEys61jNVzo9te/K6BOa 
HwAzQ5CspvjeHkL2SJ5Qp0IRjSGCeIm4MJCEsbzzB27/X7cOLDEgB3Uee5lRduzAimVW cl0w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:from:to:subject:date:message-id :mime-version; bh=cdZttWpbv/lN8UzoGIx+X0Dy6P+cka1YFtPwNNdVyAY=; b=VStxVx5NOEANnmE+Y+fC8qyA9ue0RKXlm13DTDGSqrSEavw6wl/IgZb3/4P4jTraa0 2TIh59ZEHsLYVY63xVEMBx5w5IcZ19d2u8JQOzYbOUCgK80Id1aGrVxBA9Up83zVyn1x ey2uz2grXQN1vjy0bw7FpitsDtbQ+Ju0V3XXafd5hoROHo9Nt+NK6ok4gR5tXd4ZLMv3 jt2L/w/z7AWu4+Fs1NZqkRrl8ptRfa2NWekb0XNwwj4/sCe1T83YsB1g2hYM1WHiq+hy kJpXLcByZWX9B4J7G98XYi+Zpos8N7pOr7RIlToqKnYiAazJR9isv2Ci94jy6YJcoKlq Sy6g== X-Gm-Message-State: AFeK/H35yICVbncH5cyeIww7VwsD0wb2hP0ML1vPSNZ0w7s5iYM0/F6Sm5V8PZbZ98d6IA== X-Received: by 10.107.162.76 with SMTP id l73mr11167388ioe.1.1490412650276; Fri, 24 Mar 2017 20:30:50 -0700 (PDT) Received: from zony ([45.2.7.65]) by smtp.googlemail.com with ESMTPSA id x69sm1946814ite.25.2017.03.24.20.30.49 for (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Fri, 24 Mar 2017 20:30:49 -0700 (PDT) From: npostavs@users.sourceforge.net To: control@debbugs.gnu.org Subject: control message for bug #11948 Date: Fri, 24 Mar 2017 23:32:13 -0400 Message-ID: <87wpbe116q.fsf@users.sourceforge.net> MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: -0.0 (/) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.0 (/) severity 11948 wishlist quit From debbugs-submit-bounces@debbugs.gnu.org Fri Apr 22 08:47:36 2022 Received: (at 11948) by debbugs.gnu.org; 22 Apr 2022 12:47:36 +0000 Received: from localhost ([127.0.0.1]:51926 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1nhshE-0007Ez-D9 for submit@debbugs.gnu.org; Fri, 22 Apr 2022 08:47:36 -0400 
Received: from quimby.gnus.org ([95.216.78.240]:38292) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1nhshC-000798-Nj for 11948@debbugs.gnu.org; Fri, 22 Apr 2022 08:47:35 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Type:MIME-Version:Message-ID:In-Reply-To:Date: References:Subject:Cc:To:From:Sender:Reply-To:Content-Transfer-Encoding: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=eZ62C0H/iUuQLy5OzU01Ps3xfYroofBjzZCF4LMOyas=; b=NE9gKzFplq/EztX5UzCSMcwsA8 zc4v4lb0Yp45xwhL4kLgnikT6tE1Tp81TYvTo0jNrSB+fwBx0rIRlSKevL68KI5EDckwvlzd9Kk8z wZZcBsIUl7s0wQ2MuOUdV50oAsYu23+h3wdaJTf1UDm0BpSj073ApSIIO4w04x2SIRQY=; Received: from [84.212.220.105] (helo=xo) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1nhsh2-0006WM-MU; Fri, 22 Apr 2022 14:47:27 +0200 From: Lars Ingebrigtsen To: Reuben Thomas Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab References: <871ukbubbp.fsf@sc3d.org> X-Now-Playing: ELpH's _Protection_: "pHILM #1 (Vox)" Date: Fri, 22 Apr 2022 14:47:23 +0200 In-Reply-To: <871ukbubbp.fsf@sc3d.org> (Reuben Thomas's message of "Mon, 16 Jul 2012 13:42:18 +0100") Message-ID: <87czh91e04.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. 
Content preview: Reuben Thomas writes: > I noticed this when in visual-line-mode, and it failed to wrap at an em > space (U+2003), but of course there are lots of other breaking space > characters. Eli, now that we have word-wrap-by-category, wouldn't this be easy to implement? I.e., do Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 11948 Cc: Eli Zaretskii , 11948@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) Reuben Thomas writes: > I noticed this when in visual-line-mode, and it failed to wrap at an em > space (U+2003), but of course there are lots of other breaking space > characters. Eli, now that we have word-wrap-by-category, wouldn't this be easy to implement? I.e., do (modify-category-entry #x2003 ?|) for all characters of general-category Zs in character.el? Or would that have other negative consequences? -- (domestic pets only, the antidote for overdose, milk.) 
   bloggy blog: http://lars.ingebrigtsen.no

From debbugs-submit-bounces@debbugs.gnu.org Fri Apr 22 08:56:30 2022
Date: Fri, 22 Apr 2022 15:56:17 +0300
Message-Id: <83mtgdp98u.fsf@gnu.org>
From: Eli Zaretskii
To: Lars Ingebrigtsen
Cc: 11948@debbugs.gnu.org, rrt@sc3d.org
In-Reply-To: <87czh91e04.fsf@gnus.org> (message from Lars Ingebrigtsen on Fri, 22 Apr 2022 14:47:23 +0200)
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
References: <871ukbubbp.fsf@sc3d.org> <87czh91e04.fsf@gnus.org>

> From: Lars Ingebrigtsen
> Cc: 11948@debbugs.gnu.org, Eli Zaretskii
> Date: Fri, 22 Apr 2022 14:47:23 +0200
>
> Reuben Thomas writes:
>
> > I noticed this when in visual-line-mode, and it failed to wrap at an em
> > space (U+2003), but of course there are lots of other breaking space
> > characters.
>
> Eli, now that we have word-wrap-by-category, wouldn't this be easy to
> implement?  I.e., do
>
>   (modify-category-entry #x2003 ?|)
>
> for all characters of general-category Zs in character.el?  Or would
> that have other negative consequences?

Yes, now people who want what the OP wanted should be able to have
that easily.  But I would hesitate making that the default, instead
leaving it to user customizations.  We could have a minor mode to do
that, though, so that users who want this won't need to customize
individually each character's category set.

Of course, the harder part here is to decide which of the Zs
characters will allow word-wrap on them.  I don't think all of them
should.

From debbugs-submit-bounces@debbugs.gnu.org Sat Apr 23 07:33:05 2022
From: Lars Ingebrigtsen
To: Eli Zaretskii
Cc: 11948@debbugs.gnu.org, rrt@sc3d.org
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
References: <871ukbubbp.fsf@sc3d.org> <87czh91e04.fsf@gnus.org> <83mtgdp98u.fsf@gnu.org>
X-Now-Playing: Herbert's _Part Four_: "Pump"
Date: Sat, 23 Apr 2022 13:32:48 +0200
In-Reply-To: <83mtgdp98u.fsf@gnu.org> (Eli Zaretskii's message of "Fri, 22 Apr 2022 15:56:17 +0300")
Message-ID: <8735i4vxun.fsf@gnus.org>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain

Eli Zaretskii writes:

> Yes, now people who want what the OP wanted should be able to have
> that easily.  But I would hesitate making that the default, instead
> leaving it to user customizations.  We could have a minor mode to do
> that, though, so that users who want this won't need to customize
> individually each character's category set.

Sure, a minor mode would work well here.

> Of course, the harder part here is to decide which of the Zs
> characters will allow word-wrap on them.  I don't think all of them
> should.

Looking over these:

17 matches for "Zs" in buffer: UnicodeData.txt
     33:0020;SPACE;Zs;0;WS;;;;;N;;;;;
    161:00A0;NO-BREAK SPACE;Zs;0;CS;<noBreak> 0020;;;;N;NON-BREAKING SPACE;;;;
   5187:1680;OGHAM SPACE MARK;Zs;0;WS;;;;;N;;;;;
   7354:2000;EN QUAD;Zs;0;WS;2002;;;;N;;;;;
   7355:2001;EM QUAD;Zs;0;WS;2003;;;;N;;;;;
   7356:2002;EN SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
   7357:2003;EM SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
   7358:2004;THREE-PER-EM SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
   7359:2005;FOUR-PER-EM SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
   7360:2006;SIX-PER-EM SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
   7361:2007;FIGURE SPACE;Zs;0;WS;<noBreak> 0020;;;;N;;;;;
   7362:2008;PUNCTUATION SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
   7363:2009;THIN SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
   7364:200A;HAIR SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
   7401:202F;NARROW NO-BREAK SPACE;Zs;0;CS;<noBreak> 0020;;;;N;;;;;
   7449:205F;MEDIUM MATHEMATICAL SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
  11232:3000;IDEOGRAPHIC SPACE;Zs;0;WS;<wide> 0020;;;;N;;;;;

I think only the no-break ones shouldn't trigger wrapping?

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no

From debbugs-submit-bounces@debbugs.gnu.org Sat Apr 23 07:49:48 2022
Date: Sat, 23 Apr 2022 14:49:40 +0300
Message-Id: <83y1zwnhnv.fsf@gnu.org>
From: Eli Zaretskii
To: Lars Ingebrigtsen
Cc: 11948@debbugs.gnu.org, rrt@sc3d.org
In-Reply-To: <8735i4vxun.fsf@gnus.org> (message from Lars Ingebrigtsen on Sat, 23 Apr 2022 13:32:48 +0200)
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
References: <871ukbubbp.fsf@sc3d.org> <87czh91e04.fsf@gnus.org> <83mtgdp98u.fsf@gnu.org> <8735i4vxun.fsf@gnus.org>

> From: Lars Ingebrigtsen
> Cc: rrt@sc3d.org, 11948@debbugs.gnu.org
> Date: Sat, 23 Apr 2022 13:32:48 +0200
>
> > Of course, the harder part here is to decide which of the Zs
> > characters will allow word-wrap on them.  I don't think all of them
> > should.
>
> Looking over these:
>
> 17 matches for "Zs" in buffer: UnicodeData.txt
>      33:0020;SPACE;Zs;0;WS;;;;;N;;;;;
>     161:00A0;NO-BREAK SPACE;Zs;0;CS;<noBreak> 0020;;;;N;NON-BREAKING SPACE;;;;
>    5187:1680;OGHAM SPACE MARK;Zs;0;WS;;;;;N;;;;;
>    7354:2000;EN QUAD;Zs;0;WS;2002;;;;N;;;;;
>    7355:2001;EM QUAD;Zs;0;WS;2003;;;;N;;;;;
>    7356:2002;EN SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
>    7357:2003;EM SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
>    7358:2004;THREE-PER-EM SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
>    7359:2005;FOUR-PER-EM SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
>    7360:2006;SIX-PER-EM SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
>    7361:2007;FIGURE SPACE;Zs;0;WS;<noBreak> 0020;;;;N;;;;;
>    7362:2008;PUNCTUATION SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
>    7363:2009;THIN SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
>    7364:200A;HAIR SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
>    7401:202F;NARROW NO-BREAK SPACE;Zs;0;CS;<noBreak> 0020;;;;N;;;;;
>    7449:205F;MEDIUM MATHEMATICAL SPACE;Zs;0;WS;<compat> 0020;;;;N;;;;;
>   11232:3000;IDEOGRAPHIC SPACE;Zs;0;WS;<wide> 0020;;;;N;;;;;
>
> I think only the no-break ones shouldn't trigger wrapping?

Those marked with "<noBreak>", you mean?  Yes.  But I think we should
add U+200B ZERO WIDTH SPACE to the list, although it's not Zs.
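
A minimal sketch of the rule agreed on above, assuming an Emacs that has
word-wrap-by-category: give every Zs character except the <noBreak> ones,
plus U+200B, the line-breakable category `|' in a buffer-local copy of the
category table.  The my- names are hypothetical and purely illustrative;
this is not claimed to be the code that was later committed.

```elisp
;; Hypothetical variable: Zs characters from the listing above, minus
;; the <noBreak> ones (U+00A0, U+2007, U+202F), plus U+200B.
;; U+0020 is omitted because word-wrap already breaks at it.
(defvar my-breaking-space-characters
  '(#x1680                        ; OGHAM SPACE MARK
    #x2000 #x2001 #x2002 #x2003   ; EN QUAD, EM QUAD, EN SPACE, EM SPACE
    #x2004 #x2005 #x2006          ; THREE-, FOUR-, SIX-PER-EM SPACE
    #x2008 #x2009 #x200A          ; PUNCTUATION, THIN, HAIR SPACE
    #x200B                        ; ZERO WIDTH SPACE (not Zs)
    #x205F                        ; MEDIUM MATHEMATICAL SPACE
    #x3000)                       ; IDEOGRAPHIC SPACE
  "Characters at which word-wrap may break in this sketch.")

;; Hypothetical command: enable wrapping at those characters in the
;; current buffer only.
(defun my-wrap-at-breaking-spaces ()
  "Let `word-wrap' break at `my-breaking-space-characters' in this buffer."
  (interactive)
  ;; Modify a copy so that other buffers' category tables are untouched.
  (set-category-table (copy-category-table (category-table)))
  (dolist (ch my-breaking-space-characters)
    ;; `|' is the category for line-breakable characters.
    (modify-category-entry ch ?|))
  (setq-local word-wrap-by-category t)
  (setq-local word-wrap t))
```

Copying the category table first keeps the change local to one buffer,
which matches the preference above for leaving the default behaviour alone.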

From debbugs-submit-bounces@debbugs.gnu.org Sat Apr 23 08:14:09 2022
From: Lars Ingebrigtsen
To: Eli Zaretskii
Cc: 11948@debbugs.gnu.org, rrt@sc3d.org
Subject: Re: bug#11948: 24.1.50; word-wrap should allow wrapping at all breaking space characters, not just space and tab
References: <871ukbubbp.fsf@sc3d.org> <87czh91e04.fsf@gnus.org> <83mtgdp98u.fsf@gnu.org> <8735i4vxun.fsf@gnus.org> <83y1zwnhnv.fsf@gnu.org>
X-Now-Playing: Herbert's _Part One_: "Oo Licky"
Date: Sat, 23 Apr 2022 14:13:57 +0200
In-Reply-To: <83y1zwnhnv.fsf@gnu.org> (Eli Zaretskii's message of "Sat, 23 Apr 2022 14:49:40 +0300")
Message-ID: <87y1zwuhdm.fsf@gnus.org>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain

Eli Zaretskii writes:

> Those marked with "<noBreak>", you mean?  Yes.  But I think we should
> add U+200B ZERO WIDTH SPACE to the list, although it's not Zs.

Now added as word-wrap-whitespace-mode in Emacs 29, but if you have a
better name, feel free to change (and tweak the list of space characters
further).

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no

From debbugs-submit-bounces@debbugs.gnu.org Sat Apr 23 08:14:18 2022
Date: Sat, 23 Apr 2022 14:14:08 +0200
Message-Id: <87wnfguhdb.fsf@gnus.org>
To: control@debbugs.gnu.org
From: Lars Ingebrigtsen
Subject: control message for bug #11948

close 11948 29.1
quit

From unknown Thu Aug 14 17:24:36 2025
Received: (at fakecontrol) by fakecontrolmessage;
To: internal_control@debbugs.gnu.org
From: Debbugs Internal Request
Subject: Internal Control
Message-Id: bug archived.
Date: Sun, 22 May 2022 11:24:06 +0000
User-Agent: Fakemail v42.6.9

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator
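
For readers who only want the behaviour requested in this report, here is
a hedged init-file sketch that uses the mode named in the thread,
word-wrap-whitespace-mode.  The choice of text-mode-hook is an assumption
made for illustration, not something the thread recommends, and the
fboundp guard is there only so the form does nothing on an Emacs older
than 29.

```elisp
;; Assumption: placed in a user's init file.  The hook is illustrative.
(when (fboundp 'word-wrap-whitespace-mode)
  ;; Turn the mode on in every buffer whose major mode derives from text-mode.
  (add-hook 'text-mode-hook #'word-wrap-whitespace-mode))
```

The thread does not say whether the mode also turns on word-wrap itself,
so check the mode's docstring (C-h f word-wrap-whitespace-mode) before
relying on that.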