DOCX2STR
A simple library that converts .docx files to plain text in a string.
At the moment it only works in the browser
To contribute
Since this is a very simple library, I don't have much to say, but I'll list some points to consider when building this project locally:
- Some commands in the scripts section of the package.json file might only work in a UNIX context.
- I use typescript to easily generate two builds -> one for ESM, and other for CommonJS
- El demo para testear y ver la funcionalidad lo hice usando Vite
Notes on Word's XMl format
<w:p> Are tags for paragraphs
<w:r> Are tags to divide fragments in one paragraph. They are called runs
<w:t> Are tags that have the actual text
Info about the xml document