I'm trying to create a parser to find the tracked changes and author of a Word .docx
file...
I found the document.xml
but there are so many tags! Is there a glossary somewhere to what all those tags stand for?
I'd like to avoid brute forcing my way through this if possible.
"w:ins" denotes what was inserted when trackedchanges are enabled.
"w:del" denotes what was deleted when trackedchanges are enabled.
"w:commentRangeStart" denotes the start of a comment
"w:commentRangeEnd" denotes the end of the comment.
All text are found inside
"w:t" tags.