Evaluating Mark Logic XQuery Performance

I’ve recently been doing some work building RESTFul API’s backed by a Mark Logic XML Content Store utilising XQuery for document retrieval. This post details the steps involved in tuning what was deemed to be the most simplest of queries for optimum performance using some useful Mark Logic extensions for profiling.

Original Query

XQuery for looking up documents based on the value of a given attribute in the xml using XPath. (What could be simpler!)

Evaluating Performance

By default Mark Logic indexes the document structure and attributes are indexed by default as part of the universal index.

1. Adding xdmp:query-meters

By adding xdmp:query-meters() to the query we get some immediate feedback about how the query performs including elapsed time and the number of fragments and documents that were selected. Altering the above query as below gives us some interesting metrics and the query is taking nearly 2 seconds.

<qm:elapsed-time>PT1.802604S</qm:elapsed-time> <qm:requests>0</qm:requests> <qm:list-cache-hits>3</qm:list-cache-hits> <qm:list-cache-misses>0</qm:list-cache-misses> <qm:in-memory-list-hits>1</qm:in-memory-list-hits> <qm:expanded-tree-cache-hits>43762</qm:expanded-tree-cache-hits> <qm:documents>

Immediately something looks a bit suspicious as all the documents in the database are being returned which would indicate that the query is not making effective use of Mark Logic’s Indexes.

2. Verifying with xdmp:estimate

This can be verified with xdmp:estimate(), purely focusing on the XPath part of the query e.g.

The evaluator sees the XPath expression above and uses index lookup’s to match some sequence of fragments in the database. xdmp:estimate() gives an estimate of the number of documents in a sequence and is directed at the index-lookup phase, i.e “search”.

Next, the evaluator will fetch those matching fragment(s), if any, from the database. Now we are back in the evaluation phase. It will check to make sure the nodes really match: this is known as “filtering”. Then it will evaluate the entire XPath.

So what we are saying for the xquery above is that the number of matching fragments is all the documents in the database which will then get filtered so we are not making use of any of available Mark Logic indexing which means the query is very inefficient.

3. Looking at the query plan with xdmp:plan

This further verifies that all the documents in the database are being selected and we are not fully leveraging indexes

<qry:query-plan xmlns:qry="http://marklogic.com/cts/query"> <qry:info-trace>xdmp:eval("xquery version &quot;1.0-ml&quot;;&#13;&#10;import module namesp...", (), <options xmlns="xdmp:eval"><database>14032772107247300631</database><modules>77217792867070...</options>)</qry:info-trace> <qry:info-trace>Analyzing path: fn:collection()/*[/*/itemMeta/url[@href = "/news/technology"]]</qry:info-trace> <qry:info-trace>Step 1 is searchable: fn:collection()</qry:info-trace> <qry:info-trace>Step 2 is searchable: *[/*/itemMeta/url[@href = "/news/technology"]]</qry:info-trace> <qry:info-trace>Path is fully searchable.</qry:info-trace> <qry:info-trace>Gathering constraints.</qry:info-trace> <qry:info-trace>Executing search.</qry:info-trace> <qry:final-plan> <qry:and-query> <qry:term-query weight="0"> <qry:key>1458993848217274698</qry:key> </qry:term-query> </qry:and-query> </qry:final-plan> <qry:info-trace>Selected 43762 fragments</qry:info-trace> <qry:result estimate="43762"/> </qry:query-plan>

Looking at the XPath in more detail

/* accesses the entire database and returns every root element in the database, but we do it a second time in the predicate which is very expensive.

Changing the XPath to below and re-running the above steps results in a much more positive result, and look how quick the query is!

<qm:elapsed-time>PT0.000773S</qm:elapsed-time> <v:results v:warning="non-element item">1</v:results> <qry:info-trace>Step 1 predicate 1 contributed 3 constraints: */itemMeta/url/@href eq "/news/technology"</qry:info-trace>

Plugging in xinc:node-expand

So far we have done our query evaluation ignoring the final piece which is to plugin the marklogic xinc:node-expand function which will resolves any x:includes in the results

return xinc:node-expand($asset) 

Running the original query using xinc:node-expand

Not cached – 6 seconds!!

<qm:elapsed-time>PT6.05373S</qm:elapsed-time>

Cached – 2 seconds

<qm:elapsed-time>PT2.154288S</qm:elapsed-time>

With our new optimised query we can see the time is much reduced below. This is a Mark Logic extension so we can’t really do much about the performance of this. However it is interesting to see how much additional time this adds to the processing even for a fully optimised query.

<qm:elapsed-time>PT0.351562S</qm:elapsed-time>

From the above it is easy to see the majority of the query is spent in the xinc:node-expand function but we have increased the overall performance dramatically.

Conclusion

Even what is deemed to be the simplest of xquery/xpath expressions might be inefficient. Mark Logic won’t tell you how to fix your xquery/xpath but it will provide insight into whether your query is utilizing indexes and how it is actually running.

Jon's Blog

Coder, Ex BBC. Cycling obsessive and writer for cyclosport.org