pugixml.git - Mirror for https://github.com/zeux/pugixml

Age	Commit message (Collapse)	Author
2015-05-02	Fix MSVC build	Arseny Kapoulkine

2015-05-02	Reorder conditions in compact_string implementation	Arseny Kapoulkine
	Now compact_string matches compact_pointer_parent. Turns out PUGI__UNLIKELY is good at reordering conditions but usually does not really affect performance. Since MSVC should treat "if" branches as taken and does not support branch probabilities, don't use them if we don't need to.
2015-05-02	Minor refactoring	Arseny Kapoulkine

2015-05-02	Revise marker deletion strategy	Arseny Kapoulkine
	Instead of checking if the object being removed allocated a marker, mark the marker block as deleted immediately upon allocation. This simplifies the logic and prevents extra markers from being inserted if we allocate/deallocate the same node indefinitely. Also change marker pointer type to uint32_t*.
2015-05-02	Optimize compact_string	Arseny Kapoulkine
	First assignment uses a fast path; second assignment uses a specialized path as well.
2015-05-02	Fix node deallocation	Arseny Kapoulkine
	When we deallocate nodes/attributes that allocated the marker we have to adjust the size accordingly, and dismiss the marker in case it gets overwritten with something else...
2015-05-02	Implement efficient compact_header storage	Arseny Kapoulkine
	Header is now just 2 bytes, with optional additonal 4 bytes that are only allocated for every 85 nodes / 128 attributes.
2015-05-01	Implement compact_string with shared storage	Arseny Kapoulkine

2015-05-01	Rename compact_string to compact_string_fat	Arseny Kapoulkine

2015-05-01	Revert to name/value storage inside node	Arseny Kapoulkine
	This temporarily increases the node size to 16 bytes - we'll bring it back. It allows us to remove the horrible node_pi hack and to reduce the amount of changes against master. This comes at the price of not decreasing basline xml_node_struct size. The compact xml_node_struct is also increased by this change but a followup change will reduce both xml_attribute_struct and xml_node_struct (to 8/12 bytes).
2015-04-29	Refactor offset_debug	Arseny Kapoulkine
	Split a long line into multiple statements.
2015-04-22	Change xml_node_struct field order to match compact	Arseny Kapoulkine
	Also remove useless comments.
2015-04-22	Fix node_pi memory leak	Arseny Kapoulkine

2015-04-22	Make xml_node::value() structure consistent with set_*	Arseny Kapoulkine

2015-04-22	Remove compact_header::operator uintptr_t	Arseny Kapoulkine
	We used this in two cases - to get the page pointer and to test flags. We now use PUGI__GETPAGE for getting the page pointer and operator& to test flags - this makes getting node type significantly faster since it does not require page pointer reconstruction.
2015-04-22	Remove redundant has_value check	Arseny Kapoulkine

2015-04-22	Use has_name/has_value in set_name/set_value	Arseny Kapoulkine

2015-04-22	Optimize and refactor compact_pointer implementations	Arseny Kapoulkine
	Clarify the offset applied when encoding the pointer difference. Make decoding diff slightly more clear - no effect on performance. Adjust branch weighting in compact_string encoding - 0.5% faster. Use uint16_t in compact_pointer_parent - 2% faster.
2015-04-21	Optimize xml_allocator::reserve()	Arseny Kapoulkine
	Make sure compact_hash_table::rehash() is not inlined - that way reserve() is inlined so the fast path has no extra function calls. Also use subtraction instead of multiplication when checking capacity.
2015-04-21	Merge branch 'master' into compact	Arseny Kapoulkine

2015-04-21	XPath: Implement move semantics support	Arseny Kapoulkine
	xpath_query, xpath_node_set and xpath_variable_set are now moveable. This is a nice performance optimization for variable/node sets, and enables storing xpath_query in containers without using pointers (it's only possible now since the query is not copyable).
2015-04-21	Fix compilation warning in some configurations	Arseny Kapoulkine

2015-04-15	Implement copy ctor/assignment for xpath_variable_set	Arseny Kapoulkine
	xpath_variable_set is essentially an associative container; it's about time it became copyable. Implementation is slightly tricky due to out of memory handling. Both copy ctor and assignment operator have strong exception guarantee (even if exceptions are disabled! which translates to "roll back on allocation errors").
2015-04-15	Minor xpath_variable refactoring	Arseny Kapoulkine
	The type of the variable is now initialized correctly in the ctor, so that there is no interim invalid state.
2015-04-14	Fix xpath_node_set assignment to provide strong exception guarantee	Arseny Kapoulkine
	Since the type of the set was updated before assignment, assigning in out-of-memory condition could change the type to not match the content.
2015-04-14	Explicitly call xml_buffered_writer::flush()	Arseny Kapoulkine
	If xml_writer::write throws an exception while being called from flush(), the exception is thrown from destructor. Clang in C++11 mode calls std::terminate in this case.
2015-04-13	Refactor format_indent_attributes implementation	Arseny Kapoulkine
	Fix code style and revert redundant parameters/whitespace changes. Also remove format_each_attribute_on_new_line - we're only introducing one extra formatting flag. The flag implies format_indent but does not include its bitmask. Also add a few more tests. Fixes #14.
2015-04-14	add align each attribute on new line support with format_indent_attribute	halex2005

2015-04-12	Merge branch 'master' into compact	Arseny Kapoulkine

2015-04-12	Fix unused variable warning	Arseny Kapoulkine
	Also fix test in wchar_t mode.
2015-04-12	Permit custom allocation function to throw	Arseny Kapoulkine
	Ensure that all the necessary cleanup is performed in case the allocation fails with an exception - files are closed, buffers are reclaimed, etc. Any test that triggers a simulated out-of-memory condition is ran once again with a throwing allocation function. Unobserved std::bad_alloc count as test failures and require CHECK_ALLOC_FAIL macro. Fixes #17.
2015-04-12	Fix compilation and tests after merge.	Arseny Kapoulkine

2015-04-12	Merge branch 'master' into compact	Arseny Kapoulkine

2015-04-12	Implment copyless copy for attributes	Arseny Kapoulkine
	Previously attributes that were copied with their node used string sharing, but standalone attributes that were copied using xml_node::*_copy(xml_attribute) were not.
2015-04-12	Optimize xml_node::path() to use 1 allocation	Arseny Kapoulkine
	Instead of reallocating the string for every tree level just do two passes over the ancestor chain.
2015-04-12	Move zero-termination out of as_utf8_end	Arseny Kapoulkine
	as_utf8_end was used with std::string, where writing an extra zero-terminating character should probably always work (at least if size is positive) but is not ideal. The only place that needed to zero-terminate was convert_path_heap.
2015-04-11	Fix exception type for out-of-memory for XPath variables	Arseny Kapoulkine
	When parsing XPath variables, we need to perform a heap allocation; if it fails, an xpath_exception instead of bad_alloc used to be thrown. Now we throw the exception of a correct type so that xpath_exception means 'parsing error'.
2015-04-10	Merge branch 'master' into compact	Arseny Kapoulkine

2015-03-20	Update year to 2015	Arseny Kapoulkine

2015-03-18	Update version to 1.6	Arseny Kapoulkine

2015-03-18	Do not emit surrounding whitespace for text nodes	Arseny Kapoulkine
	Previously we omitted extra whitespace for single PCDATA/CDATA children, but in mixed content there was extra indentation before/after text nodes. One of the problems with that is that the text that you saved is not exactly the same as the parsing result using default flags (parse_trim_pcdata helps). Another problem is that parse-format cycles do not have a fixed point for mixed content - the result expands indefinitely. Some XML libraries, like Python minidom, have the same issue, but this is definitely a problem. Pretty-printing mixed content is hard. It seems that the only other sensible choice is to switch mixed content nodes to raw formatting. In a way the code in this change is a weaker version of that - it removes indentation around text nodes but still keeps it around element siblings/children. Thus we can switch to mixed-raw formatting at some point later, which will be a superset of the current behavior. To do this we have to either switch at the first text node (.NET XmlDocument does that), or scan the children of each element for a possible text node and switch before we output the first child. The former behavior seems non-intuitive (and a bit broken); unfortunately, the latter behavior can cost up to 20% of the output time for trees without mixed content. Fixes #13.
2015-03-13	Merge branch 'master' into compact	Arseny Kapoulkine

2015-03-12	Fix buffer overrun when parsing comments inside DOCTYPE	Arseny Kapoulkine

2015-03-10	Fix optimized string header encoding for compact mode	Arseny Kapoulkine
	Since in compact mode we only ever have a guaranteed alignment on 4, the pages are limited to 256k even if pointers are 64 bit.
2015-03-10	Merge branch 'master' into compact	Arseny Kapoulkine

2015-03-10	Escape ?> sequence in PI value during printing	Arseny Kapoulkine
	This prevents malformed PI value from breaking the document structure.
2015-03-05	Use more efficient encoding for string headers	Arseny Kapoulkine
	Since all string allocations are pointer-aligned to avoid aligning more frequent node allocations, we can rely on that in string encoding. Encoding page offset and block size in sizeof(void*) units increases the maximum memory page size from 64k to 256k on 32-bit and 512k on 64-bit platforms. Fixes #35.
2015-03-05	Refactor contents=0 behavior	Arseny Kapoulkine
	Also change the error code to status_io_error
2015-03-05	Merge branch 'master' of https://github.com/mloy/pugixml into mloy-master	Arseny Kapoulkine

2015-03-04	Fix string length for translate and normalize-space	Arseny Kapoulkine
	The implementations generated a string with an internal null terminator; this went unnoticed since unit test string verification did not perform string equality check properly (it compared XPath string result as a C-string, thus stopping at the first null terminator). Fixes #36.