Add UTR#36 and discuss Cf.

This commit is contained in:
Martin v. Löwis 2007-05-17 18:30:09 +00:00
parent 65d78ad8f2
commit 3319c1a181
1 changed files with 11 additions and 1 deletions

View File

@ -145,12 +145,22 @@ scripts mentioned in section 5? For Confusable Detection, it seems one
needs two identifiers to compare them for confusion - is it possible
to somehow apply it to a single identifier only, and warn?
In follow-up discussion, it turns out that John Nagle actually
meant to suggest UTR#36, level "Highly Restrictive", [3]_.
Several people suggested to allow and ignore formatting control
characters (general category Cf), as is done in Java, JavaScript, and
C#. It's not clear whether this would improve things (it might
for RTL languages); if there is a need, these can be added
later.
References
==========
.. [1] http://www.unicode.org/reports/tr31/
.. [2] http://www.unicode.org/reports/tr39/
.. [3] http://www.unicode.org/reports/tr36/
Copyright
=========