GNU bug report logs - #7096
23.1; nxml-mode attribute validation with rnc

Package: emacs;

Reported by: Stefan Majewski <stefan.majewski <at> univie.ac.at>

Date: Fri, 24 Sep 2010 15:11:01 UTC

Severity: normal

To reply to this bug, email your comments to 7096 AT debbugs.gnu.org.


Report forwarded to owner <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org:
bug#7096; Package emacs. (Fri, 24 Sep 2010 15:11:02 GMT)

Acknowledgement sent to Stefan Majewski <stefan.majewski <at> univie.ac.at>:
New bug report received and forwarded. Copy sent to bug-gnu-emacs <at> gnu.org. (Fri, 24 Sep 2010 15:11:02 GMT)

Message #5 received at submit <at> debbugs.gnu.org:

From: Stefan Majewski <stefan.majewski <at> univie.ac.at>
To: bug-gnu-emacs <at> gnu.org
Subject: 23.1; nxml-mode attribute validation with rnc
Date: Fri, 24 Sep 2010 17:10:27 +0200

Dear Emacs devs, @CC James Clark as initial developer of nxml-mode,

first of all, thanks for Emacs and especially for nxml-mode, which I
couldn't do, or wouldn't like to do, without.

I frequently encounter the following bug with nxml-mode:

Whenever I use certain attribute definitions that are well covered by
the RELAX NG Compact Syntax specification, Emacs does not validate some
RNC schemas correctly. Jing (after a conversion with Trang) and other
RELAX NG tools nevertheless do. I tried to debug it myself, but can't
see where it goes wrong, as everything seems to be correctly in place.

The bug certainly relates to the way attribute restrictions can be based
on Unicode Character Classes like \p{P} for punctuation or \p{L} for
letter. For some strange reason the definitions for Letter seem to work
while Punctuation is not handled correctly.

Consider this minimal example (RNC and corresponding XML):

## BEGIN RNC
test = element test {
     empty,
     attribute att {
               xsd:token { pattern = "(\p{L}|\p{P})+" }
     }
}

start = test
##END RNC


<!-- BEGIN XML -->
<test att="att_val"/>
<!-- END XML -->


Strangely enough, it validates when the underscore ("_", Unicode code
point 95, included in \p{Pc} and thus \p{P}) is removed from the XML.

But it is not just the underscore; other punctuation characters fail as
well. I could not find an obvious error in nxml's definition of the
character classes in xsd-regexp.el.

As this behaviour is against the specs, I would consider it a bug in
nxml-mode.

If there is anything more I can do to help track this bug down, please
let me know.

cheers,
Stefan

-- 
| Stefan Majewski  | Department of English, University of Vienna     |
| VOICE Corpus     | Spitalgasse 2-4, Universitätscampus AAKH, Hof 8 |
|                  | A-1090 Vienna                                   |
| Research Ass.(IT)| Phone: +43 1 4277 424 46                        |
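
The report's claim that "_" belongs to \p{Pc}, and that "att_val" should therefore satisfy the pattern (\p{L}|\p{P})+, can be checked independently of nxml-mode. The following is a minimal Python sketch of such a check. It does not use xsd-regexp.el or any RELAX NG tool; it only consults the Unicode database bundled with Python, and it assumes that XSD's \p{L} and \p{P} correspond to the Unicode general categories beginning with "L" and "P". The helper name matches_letter_or_punct is hypothetical.

```python
import unicodedata


def matches_letter_or_punct(value: str) -> bool:
    """Approximate the XSD pattern (\\p{L}|\\p{P})+ on a whole string.

    Assumption: \\p{L} and \\p{P} map to Unicode general categories
    whose two-letter code starts with "L" or "P" respectively.
    XSD patterns are implicitly anchored, so every character must match
    and the string must not be empty.
    """
    return bool(value) and all(
        unicodedata.category(ch)[0] in ("L", "P") for ch in value
    )


# General category of the underscore (code point 95).
print(ord("_"), unicodedata.category("_"))

# The attribute value from the minimal example in the report.
print(matches_letter_or_punct("att_val"))
```

Under these assumptions the underscore is reported as category Pc and "att_val" satisfies the pattern, which agrees with the behaviour the report describes for Jing.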





Forcibly Merged 6499 7096 7162 7212. Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Thu, 14 Oct 2010 15:50:03 GMT)

Disconnected #7096 from all other report(s). Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Thu, 14 Oct 2010 15:53:02 GMT)

Did not alter fixed versions and reopened. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Thu, 14 Oct 2010 15:53:02 GMT)

Changed bug title to 'minor display issue when getting new mail in rmail?' from '23.1; nxml-mode attribute validation with rnc' Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Wed, 26 Jan 2011 04:22:02 GMT)

Severity set to 'minor' from 'normal' Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Wed, 26 Jan 2011 04:22:02 GMT)

Changed bug title to '23.1; nxml-mode attribute validation with rnc' from 'minor display issue when getting new mail in rmail?' Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Wed, 26 Jan 2011 04:25:01 GMT)

Severity set to 'normal' from 'minor' Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Wed, 26 Jan 2011 04:25:01 GMT)

This bug report was last modified 14 years and 144 days ago.

