Convert PDF to DOCX in Python

High-speed Python library for converting PDF to DOCX

Use our document conversion API to develop high-level, platform-independent software in Python. It is a professional solution for importing and exporting PDF, DOCX, and many other document formats using Python.


Convert PDF to DOCX using Python

Need to convert a document from PDF to DOCX format programmatically? With Aspose.Words for Python via .NET, any developer can convert documents from PDF to DOCX format with just a few lines of Python code.

This document-processing Python API creates a professional-quality DOCX document from a PDF. You can try the PDF to DOCX conversion right in your browser. The same library can save PDF files to DOCX and to many other document formats.

Save PDF as a DOCX document in Python

The following example demonstrates how to convert PDF to DOCX document format in Python.

Follow these steps to turn a PDF file into a DOCX document. Read your PDF file from the local drive, then save it as DOCX; the .docx extension of the output filename selects the target format. You can use fully qualified filenames for both reading the PDF and writing the DOCX. The output DOCX is intended to reproduce the content and formatting of the original PDF document.
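The point about fully qualified filenames and the .docx extension can be sketched as follows. This is a minimal illustration, not the library's prescribed approach: the helper names docx_path_for and convert_pdf_to_docx are hypothetical, and only aw.Document(...) and doc.save(...) come from the example on this page.

```python
from pathlib import Path


def docx_path_for(pdf_path):
    """Return an absolute .docx path that sits next to the given PDF.

    Hypothetical helper: it only manipulates the path and does not
    touch the file system contents.
    """
    return Path(pdf_path).resolve().with_suffix(".docx")


def convert_pdf_to_docx(pdf_path):
    """Convert a PDF to DOCX and return the path of the new file."""
    # Imported here so the path helper above can be used (and tested)
    # even when aspose-words is not installed.
    import aspose.words as aw

    source = Path(pdf_path).resolve()
    target = docx_path_for(source)

    doc = aw.Document(str(source))
    # The .docx extension of the target filename selects the output format.
    doc.save(str(target))
    return target
```

Calling convert_pdf_to_docx("Input.pdf") would write Input.docx into the same folder as the source file.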

Code example in Python to convert PDF to DOCX format
pip install aspose-words
import aspose.words as aw

doc = aw.Document("Input.pdf")
doc.save("Output.docx")

How to convert PDF to DOCX in Python

  1. Install Aspose.Words for Python via .NET.
  2. Add a library reference (import the library) to your Python project.
  3. Open the source PDF file in Python.
  4. Call the save() method, passing an output filename with DOCX extension.
  5. Get the result of PDF conversion as DOCX.
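The steps above can be sketched as one function with some basic input checks. This is a hedged example rather than an official recipe: the function name pdf_to_docx and its validation rules are assumptions added for illustration, while aw.Document(...) and doc.save(...) are the calls shown in the example on this page.

```python
from pathlib import Path


def pdf_to_docx(pdf_file, docx_file):
    """Convert pdf_file to docx_file and return the output path.

    Hypothetical wrapper around the steps listed above.
    """
    pdf_file = Path(pdf_file)
    docx_file = Path(docx_file)

    # Step 4 relies on the output extension, so check it up front.
    if docx_file.suffix.lower() != ".docx":
        raise ValueError(f"Output file must end in .docx: {docx_file}")
    if not pdf_file.is_file():
        raise FileNotFoundError(f"Source PDF not found: {pdf_file}")

    # Step 2: import the library (requires `pip install aspose-words`).
    import aspose.words as aw

    # Step 3: open the source PDF file.
    doc = aw.Document(str(pdf_file))

    # Step 4: save with an output filename that has the DOCX extension.
    doc.save(str(docx_file))

    # Step 5: hand the resulting DOCX path back to the caller.
    return docx_file
```

Validating the arguments before importing the library means mistakes such as a missing input file are reported with a clear message before any conversion work starts.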

Python library to convert PDF to DOCX

We host our Python packages on PyPI. Please follow the step-by-step instructions to install "Aspose.Words for Python via .NET" in your development environment.

System Requirements

This package is compatible with Python ≥3.5 and <3.12. If you develop software for Linux, please see the additional requirements for gcc and libpython in the product documentation.
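If it helps, the version range stated above can be checked at startup with a few lines of standard-library Python. This is only a sketch: the function name is_supported_python is hypothetical, and the bounds are copied from the requirement above, so they would need updating if the supported range changes.

```python
import sys


def is_supported_python(version_info=None):
    """Return True if the interpreter is >= 3.5 and < 3.12.

    Hypothetical helper; pass a tuple such as (3, 11) to check a
    specific version, or nothing to check the running interpreter.
    """
    if version_info is None:
        version_info = sys.version_info
    # Compare only (major, minor); patch releases do not matter here.
    major_minor = tuple(version_info[:2])
    return (3, 5) <= major_minor < (3, 12)
```

A script could call is_supported_python() before importing the package and print a clear message when the interpreter falls outside the range.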

© Aspose Pty Ltd 2001-2024. All Rights Reserved.