GNU bug report logs -
#34215
27.0.50; Provide elisp access to Chinese pinyin-to-character mapping
Previous Next
Reported by: Eric Abrahamsen <eric <at> ericabrahamsen.net>
Date: Sun, 27 Jan 2019 05:44:01 UTC
Severity: wishlist
Found in version 27.0.50
Done: Eric Abrahamsen <eric <at> ericabrahamsen.net>
Bug is archived. No further changes may be made.
Full log
View this message in rfc822 format
[Message part 1 (text/plain, inline)]
Your bug report
#34215: 27.0.50; Provide elisp access to Chinese pinyin-to-character mapping
which was filed against the emacs package, has been closed.
The explanation is attached below, along with your original report.
If you require more details, please reply to 34215 <at> debbugs.gnu.org.
--
34215: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=34215
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
[Message part 3 (message/rfc822, inline)]
[Message part 4 (text/plain, inline)]
This bug report is apropos to this[1] emacs.devel thread.
The basic idea is that in the Emacs sources there's a file containing a
mapping between pinyin -- the most common Chinese romanization system --
and Chinese characters themselves. The mapping lives in
leim/MISC-DIC/pinyin.map, and is converted into a quail input method by
the `py-converter' function in titdic-cnv.el, which is part of the
"make" process.
I want this mapping to be available to elisp code in general, because
it's useful for all kinds of other language utilities (searching Chinese
characters using ascii letters, etc).
pinyin.map is a plain text file, each line consisting of a romanized
syllable, a TAB, and a string of the possible corresponding Chinese
characters. `titdic-convert' parses this and feeds it to
`quail-define-rules'.
My first thought was to add an intermediate step, where `titdic-convert'
first composes an alist, then feeds that alist to `quail-define-rules',
which would also allow us access to the alist.
The more I looked at it, the more hacky and awkward that approached
seemed, and it's not like it would save any memory: you still end up
with the data both in a quail method, and in a separate alist.
So this proposed patch simply parses the same file in the same way, but
in a different location. I've put it in china-util.el, but chinese.el
would also be a reasonable spot. Both those files are concerned with
encoding, but at least "china-util" gives the impression that it could
be a grab-bag.
I'm not sure this use of `source-directory' is particularly robust, but
I don't know how else to handle it.
Hope this will be considered!
Eric
[1]: https://lists.gnu.org/archive/html/emacs-devel/2019-01/msg00306.html
[0001-New-constant-chinese-pinyin-character-map.patch (text/x-patch, attachment)]
This bug report was last modified 6 years and 145 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.