docx2txt is a free and open source, perl based utility built to convert Microsoft Office Docx documents to equivalent text documents.
Here are some key features of "docx2txt":
· Horizontal ruler, line breaks, paragraphs separation, tabs, capitalization of text blocks.
· Character conversion. Euro character is converted to E, however you can change this behavior by comment/uncomment in Perl script.
· Naive nested list formatting - assumed 8 level nesting, however you can handle even deeper nesting by commenting/uncommenting appropriate lines in Perl script.
· Center and right justification of text fitting in a line of (adjustable) 80 columns.
· Indicating hyperlinked text along with the hyperlink.
Requirements:
· Perl
What`s New in This Release: [ read full changelog ]
New features:
· Input argument can also be a directory holding the unzipped content of .docx file.
· Windows wrapper script, and support for using CakeCmd command line unzipper.
· Configuration file support for easy control over settings.
· Windows installation script.
Updates:
· Hyperlink is not displayed if hyperlink and hyperlinked text are same, even though user has enabled hyperlink display.
· Improved handling of short line justification, capturing many cases that were missed in earlier approach.
· Path names containing spaces are now handled.