From 900a1cc94353b9202dcaee66b95d67e31331940e Mon Sep 17 00:00:00 2001
From: Arseny Kapoulkine <arseny.kapoulkine@gmail.com>
Date: Tue, 29 Aug 2017 20:46:30 -0700
Subject: docs: Clarify Unicode validation behavior

pugixml has never performed Unicode validation or Unicode character class
validation of tag and attribute names, but this was not obvious from the
documentation.

Fixes #162
---
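Note: because the parser does not reject invalid UTF sequences, callers who
need that guarantee have to check the input themselves before parsing. The
following is a minimal sketch of such a check, assuming UTF-8 input held in a
std::string. The helper name is_valid_utf8 is hypothetical and is not part of
pugixml; it only illustrates what "Unicode validation" would involve.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper, NOT part of pugixml: returns true if `s` is well-formed UTF-8.
// Rejects truncated sequences, stray continuation bytes, overlong encodings,
// UTF-16 surrogates (U+D800..U+DFFF) and code points above U+10FFFF.
bool is_valid_utf8(const std::string& s)
{
    const unsigned char* p = reinterpret_cast<const unsigned char*>(s.data());
    const std::size_t n = s.size();
    std::size_t i = 0;

    while (i < n)
    {
        const unsigned char lead = p[i];
        std::size_t extra = 0;   // number of continuation bytes expected
        unsigned int cp = 0;     // code point decoded so far
        unsigned int min_cp = 0; // smallest code point allowed for this length

        if (lead < 0x80)                { extra = 0; cp = lead;        min_cp = 0; }
        else if ((lead & 0xE0) == 0xC0) { extra = 1; cp = lead & 0x1F; min_cp = 0x80; }
        else if ((lead & 0xF0) == 0xE0) { extra = 2; cp = lead & 0x0F; min_cp = 0x800; }
        else if ((lead & 0xF8) == 0xF0) { extra = 3; cp = lead & 0x07; min_cp = 0x10000; }
        else return false; // stray continuation byte or invalid lead byte

        if (n - i - 1 < extra) return false; // sequence is truncated

        for (std::size_t k = 1; k <= extra; ++k)
        {
            const unsigned char c = p[i + k];
            if ((c & 0xC0) != 0x80) return false; // not a continuation byte
            cp = (cp << 6) | (c & 0x3F);
        }

        if (cp < min_cp) return false;                  // overlong encoding
        if (cp >= 0xD800 && cp <= 0xDFFF) return false; // surrogate code point
        if (cp > 0x10FFFF) return false;                // outside Unicode range

        i += extra + 1;
    }

    return true;
}
```

A caller could run this over the raw buffer and refuse to parse it when the
check fails. Validating name characters against the XML Name production would
be a separate step and is not covered by this sketch.
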
 docs/manual.adoc | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/docs/manual.adoc b/docs/manual.adoc
index 7f4fc8b..b901a54 100644
--- a/docs/manual.adoc
+++ b/docs/manual.adoc
@@ -811,12 +811,13 @@ There is only one non-conformant behavior when dealing with valid XML documents:
 As for rejecting invalid XML documents, there are a number of incompatibilities with W3C specification, including:
 
 * Multiple attributes of the same node can have equal names.
-* All non-ASCII characters are treated in the same way as symbols of English alphabet, so some invalid tag names are not rejected.
+* Tag and attribute names are not fully validated against the set of allowed characters, so some invalid tag and attribute names are not rejected.
 * Attribute values which contain `<` are not rejected.
 * Invalid entity/character references are not rejected and are instead left as is.
 * Comment values can contain `--`.
 * XML data is not required to begin with document declaration; additionally, document declaration can appear after comments and other nodes.
 * Invalid document type declarations are silently ignored in some cases.
+* Unicode validation is not performed, so invalid UTF sequences are not rejected.
 
 [[access]]
 == Accessing document data
-- 
cgit v1.2.3