GNU bug report logs -
#74155
upcasing strings doesn’t respect standard-case-table
Previous Next
Reported by: "Thomas Voss" <mail <at> thomasvoss.com>
Date: Fri, 1 Nov 2024 12:34:01 UTC
Severity: normal
Tags: notabug
Done: Eli Zaretskii <eliz <at> gnu.org>
Bug is archived. No further changes may be made.
To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 74155 in the body.
You can then email your comments to 74155 AT debbugs.gnu.org in the normal way.
Toggle the display of automated, internal messages from the tracker.
Report forwarded
to
bug-gnu-emacs <at> gnu.org
:
bug#74155
; Package
emacs
.
(Fri, 01 Nov 2024 12:34:02 GMT)
Full text and
rfc822 format available.
Acknowledgement sent
to
"Thomas Voss" <mail <at> thomasvoss.com>
:
New bug report received and forwarded. Copy sent to
bug-gnu-emacs <at> gnu.org
.
(Fri, 01 Nov 2024 12:34:02 GMT)
Full text and
rfc822 format available.
Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):
Hi all, quick bug report.
As of 2017 (I believe), the capital eszett (ẞ) was adopted into the
German alphabet as the uppercase variable of ß which was previously (and
which still can be) uppercased to ‘SS’. Since I prefer to use the newer
ẞ to the older SS, I have the following line in my configuration:
(set-case-syntax-pair ?ẞ ?ß (standard-case-table))
When working with characters, this behaves as intended:
(upcase ?ß)
⇒ ?ẞ
However when working with strings, it doesn’t:
(upcase "ß")
⇒ "SS"
The same goes for the ‘upcase-word’ and ‘upcase-dwim’ functions which
still upcase ß to SS. It seems that whatever code that is handling
case-conversions for multi-character inputs is not respecting the current
case table.
— Thomas
Information forwarded
to
bug-gnu-emacs <at> gnu.org
:
bug#74155
; Package
emacs
.
(Fri, 01 Nov 2024 14:11:02 GMT)
Full text and
rfc822 format available.
Message #8 received at 74155 <at> debbugs.gnu.org (full text, mbox):
> Date: Fri, 01 Nov 2024 13:33:13 +0100
> From: "Thomas Voss" via "Bug reports for GNU Emacs,
> the Swiss army knife of text editors" <bug-gnu-emacs <at> gnu.org>
>
> As of 2017 (I believe), the capital eszett (ẞ) was adopted into the
> German alphabet as the uppercase variable of ß which was previously (and
> which still can be) uppercased to ‘SS’. Since I prefer to use the newer
> ẞ to the older SS, I have the following line in my configuration:
>
> (set-case-syntax-pair ?ẞ ?ß (standard-case-table))
>
> When working with characters, this behaves as intended:
>
> (upcase ?ß)
> ⇒ ?ẞ
>
> However when working with strings, it doesn’t:
>
> (upcase "ß")
> ⇒ "SS"
>
> The same goes for the ‘upcase-word’ and ‘upcase-dwim’ functions which
> still upcase ß to SS. It seems that whatever code that is handling
> case-conversions for multi-character inputs is not respecting the current
> case table.
This is a feature: characters which have the 'special-uppercase'
property defined for them by the Unicode Standard use their special
upper-case rules that override the case-table. If you don't want
that, force the special-uppercase property of ß to be nil:
(upcase "ß")
=> "SS"
(put-char-code-property ?ß 'special-uppercase nil)
(upcase "ß")
=> "ẞ"
Added tag(s) notabug.
Request was from
Stefan Kangas <stefankangas <at> gmail.com>
to
control <at> debbugs.gnu.org
.
(Sun, 03 Nov 2024 05:48:03 GMT)
Full text and
rfc822 format available.
Reply sent
to
Eli Zaretskii <eliz <at> gnu.org>
:
You have taken responsibility.
(Sat, 16 Nov 2024 13:46:02 GMT)
Full text and
rfc822 format available.
Notification sent
to
"Thomas Voss" <mail <at> thomasvoss.com>
:
bug acknowledged by developer.
(Sat, 16 Nov 2024 13:46:02 GMT)
Full text and
rfc822 format available.
Message #15 received at 74155-done <at> debbugs.gnu.org (full text, mbox):
> Cc: 74155 <at> debbugs.gnu.org
> Date: Fri, 01 Nov 2024 16:07:38 +0200
> From: Eli Zaretskii <eliz <at> gnu.org>
>
> > Date: Fri, 01 Nov 2024 13:33:13 +0100
> > From: "Thomas Voss" via "Bug reports for GNU Emacs,
> > the Swiss army knife of text editors" <bug-gnu-emacs <at> gnu.org>
> >
> > As of 2017 (I believe), the capital eszett (ẞ) was adopted into the
> > German alphabet as the uppercase variable of ß which was previously (and
> > which still can be) uppercased to ‘SS’. Since I prefer to use the newer
> > ẞ to the older SS, I have the following line in my configuration:
> >
> > (set-case-syntax-pair ?ẞ ?ß (standard-case-table))
> >
> > When working with characters, this behaves as intended:
> >
> > (upcase ?ß)
> > ⇒ ?ẞ
> >
> > However when working with strings, it doesn’t:
> >
> > (upcase "ß")
> > ⇒ "SS"
> >
> > The same goes for the ‘upcase-word’ and ‘upcase-dwim’ functions which
> > still upcase ß to SS. It seems that whatever code that is handling
> > case-conversions for multi-character inputs is not respecting the current
> > case table.
>
> This is a feature: characters which have the 'special-uppercase'
> property defined for them by the Unicode Standard use their special
> upper-case rules that override the case-table. If you don't want
> that, force the special-uppercase property of ß to be nil:
>
> (upcase "ß")
> => "SS"
> (put-char-code-property ?ß 'special-uppercase nil)
> (upcase "ß")
> => "ẞ"
No further comments within 2 weeks, so I'm now closing this bug.
bug archived.
Request was from
Debbugs Internal Request <help-debbugs <at> gnu.org>
to
internal_control <at> debbugs.gnu.org
.
(Sun, 15 Dec 2024 12:24:08 GMT)
Full text and
rfc822 format available.
This bug report was last modified 243 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.