python linux ubuntu ms-word automation

Python & MS Word: Convert .doc to .docx?


I found several questions that were similar to mine, but none of the answers came close to what I need.

Specifications: I'm working with Python 3 and do not have MS Word. My development machine runs OS X and the cloud machine runs Linux (Ubuntu).

I'm using python-docx to extract values from a .doc file that is sent to me nightly. However, python-docx only works with .docx files, so I need to convert the file to that extension first.

So, I've got a .doc file that I need to convert to .docx. This script might have to run in the cloud so I can't install any kind of Office or Office-like software. Can this be done?


Solution

  • Since you are working with Linux/Ubuntu, you can use LibreOffice's built-in converter.

    SYNTAX

    lowriter --convert-to docx *.doc
    

    EXAMPLE

    lowriter --convert-to docx testdoc.doc
    

    The wildcard form converts every .doc file in the current folder to .docx and saves the results in that same folder; the example converts a single file. Tested on Ubuntu.
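
Since the question is about a nightly Python script, here is a minimal sketch of calling the same converter from Python 3 with `subprocess`. It assumes LibreOffice is installed and that `lowriter` is on the PATH. The function names `build_convert_command` and `convert_doc_to_docx` are hypothetical helpers written for this illustration, not part of any library. `--headless` and `--outdir` are standard LibreOffice command-line options; they are added here so the conversion runs without a GUI and the output location is explicit.

```python
import shutil
import subprocess
from pathlib import Path


def build_convert_command(doc_path, out_dir, binary="lowriter"):
    """Return the argument list for converting one .doc file to .docx."""
    return [
        binary,
        "--headless",              # no GUI, suitable for a server
        "--convert-to", "docx",
        "--outdir", str(out_dir),  # where the .docx file is written
        str(doc_path),
    ]


def convert_doc_to_docx(doc_path, out_dir=None, binary="lowriter", timeout=120):
    """Convert doc_path to .docx and return the expected output path.

    Assumes LibreOffice is installed; raises FileNotFoundError otherwise.
    """
    doc_path = Path(doc_path)
    # Default to writing next to the source file.
    out_dir = Path(out_dir) if out_dir is not None else doc_path.parent
    if shutil.which(binary) is None:
        raise FileNotFoundError(f"{binary!r} not found on PATH; is LibreOffice installed?")
    # check=True raises CalledProcessError if the converter exits with an error.
    subprocess.run(
        build_convert_command(doc_path, out_dir, binary),
        check=True,
        timeout=timeout,
    )
    return out_dir / (doc_path.stem + ".docx")
```

The returned path can then be opened with python-docx, for example `Document(str(convert_doc_to_docx("testdoc.doc")))`. Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.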