Split DOCX into parts in Java

Fast Java library to split one DOCX file into a group of smaller files according to the given criteria.

Use Java library to split DOCX files into parts. You can integrate the extracted DOCX pages with other data and, as a result, get documents of the form and content that you require. Splitting DOCX into parts makes it easier to collaborate on DOCX files.

View code snippet

Split DOCX in Java

This software library provides Java developers with a set of functions to split DOCX files into parts. Splitting a DOCX document into separate files can be used to make it easier to work with sections of a document in parallel. For example, if several people are working on one DOCX document at the same time, splitting it will allow them to speed up the work. The DOCX document splitting may be part of a technology for extracting text from DOCX files and integrating data into automated information systems or databases.

Our library provides Java developers with all the necessary functions to split DOCX files into parts and extract pages according to the specified mode. This is a stand-alone Java solution that does not need Microsoft Word, Acrobat Reader or other applications installed.

Split DOCX document into parts using Java

Split DOCX content using different criteria in Java code. You can use the following page extraction modes for DOCX documents: 'split by headings', 'split by sections', 'split page by page', 'split by page ranges'.

After splitting your DOCX file into parts, you can export the result to the required file format using the 'Document.Save' method. You can also control how the DOCX document parts are exported to HTML or EPUB using the 'DocumentPartSavingCallback' property, which will allow you to redirect output streams.

Split DOCX documents easily with our solution for Java. The following example shows how to split a DOCX document using Java:

Java code example to split a DOCX file
Upload a file you want to split
Run code
Select the target format from the list
// Repository path: https://releases.aspose.com/java/repo/
// Maven, where 'ver' - Aspose.Words version number, for example, 24.4.
<dependency>
  <groupId>com.aspose</groupId>
  <artifactId>aspose-words</artifactId>
  <version>ver</version>
  <classifier>jdk17</classifier>
</dependency>
Copy
// Repository path: https://releases.aspose.com/java/repo/
// Gradle, where 'ver' - Aspose.Words version number, for example, 24.4.
compile(group: 'com.aspose', name: 'aspose-words', version: 'ver', classifier: 'jdk17')
Copy
// Repository path: https://releases.aspose.com/java/repo/
// Ivy, where 'ver' - Aspose.Words version number, for example, 24.4.
<dependency org="com.aspose" name="aspose-words" rev="ver">
 <artifact name="aspose-words" m:classifier="jdk17" ext="jar"/>
</dependency>
Copy
// Repository path: https://releases.aspose.com/java/repo/
// Sbt, where 'ver' - Aspose.Words version number, for example, 24.4.
libraryDependencies += "com.aspose" % "aspose-words" % "ver"
Copy
import com.aspose.words.*;

Document doc = new Document("Input.docx");

for (int page = 0; page < doc.getPageCount(); page++)
{
    Document extractedPage = doc.extractPages(page, 1);
    extractedPage.save(String.format("Output_%d.docx", page + 1));
}
Run code

How to split DOCX Java

  1. Install Java library to split DOCX files programmatically.
  2. Add a library reference (import the library) to your Java project.
  3. Open the DOCX in Java.
  4. Call the extractPages() method to extract specific pages from DOCX.
  5. Get the result of DOCX splitting as separate files.

Java library to split DOCX documents

We host our Java packages in Maven repositories. 'Aspose.Words for Java' is a common JAR file containing byte-code. Please follow the step-by-step instructions on how to install it to your Java developer environment.

System Requirements

Java SE 7 and more recent Java versions are supported. We also provide a separate package for Java SE 6 in case you are obliged to use this outdated JRE.

Our Java package is cross-platform and runs on all operating systems with JVM implementation, including Microsoft Windows, Linux, macOS, Android and iOS.

For information on optional package dependencies, such as JogAmp JOGL, Harfbuzz font engine, Java Advanced Imaging JAI, please refer to Product Documentation.

Other supported DOCX split operations

You can also split DOCX to other file formats:

5%

Subscribe to Aspose Product Updates

Get monthly newsletters and offers directly delivered to your mailbox.

© Aspose Pty Ltd 2001-2024. All Rights Reserved.