Extract images from PDF in C#

Parse Images from PDF document. Use Aspose.PDF for .NET to modify PDF files programmatically

How to extract images from PDF using .NET Library

Do you need to extract images from PDF? Programmatic modification of PDF documents is an essential part of modern digital workflows. With .NET libraries like Aspose.PDF, developers can extract images from PDF. These libraries are stand-alone solutions that don’t rely on other software and are ready for commercial use. They cover all possible needs of professional C# developers.

  • Extract text from PDF
  • Extract Images from PDF
  • Extract Fonts from PDF
  • Extract Data from the Form
  • Extract Text From Stamps
  • Extract Data from Table

In order to extract images from PDF file, we’ll use Aspose.PDF for .NET API which is a feature-rich, powerful and easy to use document manipulation API for net platform. Open NuGet package manager, search for Aspose.PDF and install. You may also use the following command from the Package Manager Console.

Package Manager Console

PM > Install-Package Aspose.PDF

Extract images from PDF in C#


To try the code in your environment, you need Aspose.PDF for .NET.

  1. Load the PDF with an instance of Document.
  2. Create an XImage object to extract images.
  3. Save output image to jpeg file.
  4. Save updated PDF file.

Extract images from PDF - C#

This sample code shows how to extract images from PDF documents

Input file:

File not added

Output format:

PDF

Output file:

    // For complete examples and data files, please go to https://github.com/aspose-pdf/Aspose.PDF-for-.NET
    // The path to the documents directory.
    string dataDir = RunExamples.GetDataDir_AsposePdf_Images();

    // Open document
    Document pdfDocument = new Document(dataDir+ "ExtractImages.pdf");

    // Extract a particular image
    XImage xImage = pdfDocument.Pages[1].Resources.Images[1];

    FileStream outputImage = new FileStream(dataDir + "output.jpg", FileMode.Create);

    // Save output image
    xImage.Save(outputImage, ImageFormat.Jpeg);
    outputImage.Close();

    dataDir = dataDir + "ExtractImages_out.pdf";

    // Save updated PDF file
    pdfDocument.Save(dataDir);

About Aspose.PDF for .NET API

Aspose.PDF for C# API provides a wide range of features for working with PDF files. Some of the features include:

  • Create PDF documents from scratch or from HTML, XML, or images.
  • Edit existing PDF documents by adding or removing pages, text, images, and other content.
  • Convert PDF documents to other formats such as HTML, XML, and images.
  • Render PDF documents to images or XPS format.
  • Print PDF documents directly from your application.
  • Digitally sign PDF documents.

You can find more information on Aspose.PDF for C# API in this Aspose documentation