This project is not covered by Drupal’s security advisory policy.
This module extracts content from Word, OpenDocument Text, and RTF documents for use with Document Loader, using the phpoffice/phpword PHP library.
Supported Input Formats:
- Word 2007+ (.docx)
- Word 2003 (.doc)
- OpenDocument Text (.odt)
- Rich Text Format (.rtf)
Supported Output Formats:
- text
- html
- markdown
Note on RTF: RTF support is best-effort as PHPWord's RTF reader has limitations. It does not preserve headings or lists, and may drop special characters like smart quotes, accented letters, and dashes.
Requirements
This module requires the following modules:
Installation
composer require drupal/document_loader_phpword
Configuration
- Enable the module at Administration > Extend
- Confirm that PHPWord appears as an available plugin in the Document Loader configuration at admin/config/media/document-loader
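The installation and enabling steps above can also be run from the command line. The following is a minimal sketch, not an official procedure: it assumes it is run from the Drupal project root, that Drush is installed at vendor/bin/drush, and that the module's machine name matches the Composer package name. The function name install_and_enable is hypothetical.

```shell
# Assumed machine name, inferred from the Composer package name.
MODULE="document_loader_phpword"

# Hypothetical helper: download the module and its PHP dependencies, then enable it.
install_and_enable() {
  composer require "drupal/${MODULE}" || return 1
  # Assumes Drush is available in the project; otherwise enable the
  # module in the UI at Administration > Extend.
  vendor/bin/drush pm:install "${MODULE}" --yes
}

# Call the helper explicitly once the assumptions above hold:
# install_and_enable
```

After the module is enabled, the plugin still has to be selected in the Document Loader configuration at admin/config/media/document-loader.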
Similar Projects
- AI File To Text: Leverages the AI module to improve the output of loaded documents
Project information
- Project categories: Content editing experience
- Ecosystem: Document Loader
- Created by robloach
This project is not covered by the security advisory policy.
Use at your own risk! It may have publicly disclosed vulnerabilities.
Releases
1.0.0-alpha1
released 29 May 2026
Works with Drupal: ^10.4 || ^11
Development version: 1.0.x-dev updated 29 May 2026 at 17:55 UTC



