Is there a glossary of Word .docx XML tags?


I'm trying to create a parser to find the tracked changes and author of a Word .docx file...

I found document.xml, but there are so many tags! Is there a glossary somewhere explaining what all those tags stand for?

I'd like to avoid brute forcing my way through this if possible.


Solution

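  • "w:ins" marks content that was inserted while tracked changes were enabled.
    "w:del" marks content that was deleted while tracked changes were enabled.
    "w:commentRangeStart" marks the start of a comment.
    "w:commentRangeEnd" marks the end of the comment.
    
    The author of a tracked change is recorded in the "w:author" attribute of the "w:ins" or "w:del" tag.
    
    Text is found inside "w:t" tags; deleted text inside "w:del" is stored in "w:delText" tags instead.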
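
As a rough illustration of how these tags could be used, here is a minimal Python sketch that lists insertions and deletions together with their authors. It relies only on the standard library. The function names tracked_changes and tracked_changes_in_docx are made up for this example, and it assumes the main document part lives at word/document.xml, that the author is stored in the w:author attribute, and that deleted text sits in w:delText rather than w:t.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace that the "w:" prefix is bound to.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W_NS}


def tracked_changes(document_xml):
    """Return (kind, author, text) tuples for every w:ins and w:del element.

    Hypothetical helper: `document_xml` is the content of document.xml
    as a str or bytes object.
    """
    root = ET.fromstring(document_xml)
    changes = []
    # Inserted runs keep their text in w:t, deleted runs in w:delText.
    for kind, text_tag in (("ins", "w:t"), ("del", "w:delText")):
        for elem in root.iter(f"{{{W_NS}}}{kind}"):
            author = elem.get(f"{{{W_NS}}}author")  # None if the attribute is absent
            text = "".join(t.text or "" for t in elem.findall(f".//{text_tag}", NS))
            changes.append((kind, author, text))
    return changes


def tracked_changes_in_docx(path):
    """Open a .docx (a zip archive) and scan its main document part."""
    with zipfile.ZipFile(path) as docx:
        return tracked_changes(docx.read("word/document.xml"))
```

Calling tracked_changes_in_docx("report.docx") (the file name is a placeholder) would return all insertions first, then all deletions. Comments are not covered here: the w:commentRangeStart and w:commentRangeEnd tags only mark a range, so a parser would have to handle them separately.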