Split documents into parts in C++

Fast C++ library to split a document into a group of smaller files according to the given criteria

Use C++ library to split documents into parts. You can integrate the extracted pages with other data and, as a result, get documents of the form and content that you require. Splitting documents into parts makes it easier to collaborate on them.

Split Word, PDF in C++

This software library provides C++ developers with a set of functions to split Word, PDF, EPUB, HTML, DOCX documents into parts. Splitting text into separate files can be used to make it easier to work with sections of documents. The procedure of document splitting can be part of the technology for integrating data into automated information systems or databases.

Our library provides C++ developers with all the necessary functions to extract document pages according to the specified mode. This is a stand-alone C++ solution that does not need Microsoft Word, Acrobat Reader or other applications installed.

Split a document into parts using C++

Split document content using different criteria in C++ code. You can use the following page extraction modes: 'split by headings', 'split by sections', 'split page by page', 'split by page ranges'.

After splitting the document, you can export the result to the required file format using the 'Document.Save' method. You can also control how document parts are exported to HTML or EPUB formats using the 'DocumentPartSavingCallback' property, which will also allow you to redirect output streams.

Split documents easily with our solution for C++. The following example shows how to split a document using C++:

Code example in C++ to split a document into parts
Input file
Upload a file you want to split
Run code
Output format
Select the target format from the list
using namespace Aspose::Words;

auto doc = MakeObject<Document>(u"Input.docx");

for (int page = 0; page < doc->get_PageCount(); page++)
{
    auto extractedPage = doc->ExtractPages(page, 1);
    extractedPage->Save(String::Format(u"Output_{0}.docx", page + 1));
}
Run code
  
Copy C++ code to the clipboard

How to split Word, PDF, HTML and other file formats in C++

  1. Install C++ library to split documents programmatically.
  2. Add a library reference (import the library) to your C++ project.
  3. Open the source document in C++.
  4. Call the 'ExtractPages()' method to extract specific pages from your document.
  5. Get the result of document splitting as separate files.

C++ library to split files

There are three options to install Aspose.Words for C++ to your developer environment. Please choose one that resembles your needs and follow the step-by-step instructions:

System Requirements

You can use this C++ library to develop software on Microsoft Windows, Linux and macOS operating systems:

  • GCC >= 6.3.0 and Clang >= 3.9.1 are required for Linux
  • Xcode >= 12.5.1, Clang and libc++ are required for macOS

If you develop software for Linux or macOS, please check information on additional library dependencies (fontconfig and mesa-glu open-source packages) in Product Documentation.

Most popular file formats for splitting

5%

Subscribe to Aspose Product Updates

Get monthly newsletters and offers directly delivered to your mailbox.

© Aspose Pty Ltd 2001-2022. All Rights Reserved.