.NET API for working with real-world HTML
Class library to create, edit, extract data, merge and convert HTML pages to PDF, DOCX, XPS, Images and other formats.
Aspose.HTML for .NET is an advanced HTML processing API for performing a wide range of management and manipulation tasks within cross-platform applications. The API lets you generate, modify, convert, and render HTML documents and extract data from them without any external software. It also supports popular file formats such as EPUB, MHTML, XML, SVG, and Markdown, and renders to PDF, DOCX, XPS, and image file formats. Aspose.HTML for .NET is written entirely in C# and can be used to build any type of 32-bit or 64-bit .NET application, including ASP.NET, WCF, WinForms, and .NET Core.
Moreover, the HTML Document Object Model is integrated out of the box with embedded formats and specifications such as CSS, HTML Canvas, SVG, XPath, and JavaScript, which extend manipulation functionality and improve rendering quality. You can see the full list of Aspose.HTML features in our documentation.
Using the Aspose.HTML C# library in your project, you can perform the following tasks:
- Create or load HTML-based documents from a file, URL, string, or stream.
- Convert documents between popular formats.
- Create custom message handlers to do a specific task.
- Navigate HTML documents using XPath Query or CSS Selector.
- Edit HTML files by inserting new nodes, removing, or editing the content of existing nodes.
- Render documents with high quality.
- Extract data from the web, for example by downloading video or audio by its URL, and more.
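As a minimal sketch of the first task in the list above, the snippet below loads one document from an in-memory string and another from a stream. It assumes the HTMLDocument(string content, string baseUri) and HTMLDocument(Stream content, string baseUri) constructors; the markup and the "." base URI are illustrative placeholders, not values from this page.

```csharp
using System;
using System.IO;
using System.Text;
using Aspose.Html;

// Load from a string of markup; the second argument is the base URI
// used to resolve relative links (illustrative value)
using var fromString = new HTMLDocument("<p>Hello from a string</p>", ".");

// Load from a stream; a MemoryStream stands in for any readable stream
using var stream = new MemoryStream(Encoding.UTF8.GetBytes("<p>Hello from a stream</p>"));
using var fromStream = new HTMLDocument(stream, ".");

// Print the text content of each document body
Console.WriteLine(fromString.Body.TextContent);
Console.WriteLine(fromStream.Body.TextContent);
```

Loading from a file path or URL uses the single-argument constructor, as in the conversion examples elsewhere on this page.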
Advanced C# HTML API Features
Create HTML pages from scratch
Load existing HTML from file, stream or URL
Implement W3C specifications
Implement templates using template merger
Fill the template with various data sources
Render HTML Canvas 2D to PDF
Add, replace or remove nodes
Extract data from HTML documents
Load EPUB and MHTML file formats
Render HTML to raster image formats
Render multiple documents at once
Implement Markdown to HTML converter
Apply header and footer during HTML to PDF conversion
Navigate HTML using XPath Query or CSS Selector
Wide range of conversions between formats
Convert HTML to PDF, Image and Other Formats in C#
The API lets you implement HTML to PDF, HTML to image, or any other conversion in your .NET applications with just a few lines of code. The conversion process is simple and reliable, which makes Aspose.HTML for .NET a strong choice for document conversion.
Convert HTML to PDF - C#
// Load an HTML file to be converted
using var document = new Aspose.Html.HTMLDocument("document.html");
// Convert HTML to PDF
Aspose.Html.Converters.Converter.ConvertHTML(document, new Aspose.Html.Saving.PdfSaveOptions(), "output.pdf");
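HTML to image conversion follows the same pattern as the PDF example above. The sketch below assumes the ImageSaveOptions(ImageFormat) constructor, with ImageSaveOptions in Aspose.Html.Saving and ImageFormat in Aspose.Html.Rendering.Image; the inline markup and the output file name are placeholders.

```csharp
using Aspose.Html;
using Aspose.Html.Converters;
using Aspose.Html.Rendering.Image;
using Aspose.Html.Saving;

// Build a small document from inline markup (placeholder content)
using var document = new HTMLDocument("<h1>Hello, image!</h1>", ".");

// Choose PNG as the output raster format
var options = new ImageSaveOptions(ImageFormat.Png);

// Render the document to an image file
Converter.ConvertHTML(document, options, "output.png");
```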
You can also convert HTML, XHTML, MHTML, Markdown, EPUB, or SVG into many other file formats.
You can try the online HTML Converter here.
Editing HTML Documents
Aspose.HTML for .NET allows you to create and edit HTML documents using a Document Object Model (DOM). The DOM is a programming interface for HTML documents that represents the document as a tree of nodes, where each node represents part of the document. The Aspose.HTML for .NET API lets you connect to the page and change the document structure, style, and content. You can modify the document by inserting new nodes and by removing or editing the content of existing nodes.
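The DOM editing described above can be sketched roughly as follows. This is an illustration under assumptions rather than a canonical recipe: it starts from an empty document, and the element id and text are made-up values.

```csharp
using System;
using Aspose.Html;

// Start from an empty document
using var document = new HTMLDocument();

// Create a paragraph element and give it an id (illustrative value)
var paragraph = document.CreateElement("p");
paragraph.SetAttribute("id", "my-paragraph");

// Create a text node and attach it to the paragraph
paragraph.AppendChild(document.CreateTextNode("my first paragraph"));

// Insert the paragraph into the document body and show the markup
document.Body.AppendChild(paragraph);
Console.WriteLine(document.DocumentElement.OuterHTML);

// Remove the paragraph again and show the markup once more
document.Body.RemoveChild(paragraph);
Console.WriteLine(document.DocumentElement.OuterHTML);
```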
The .NET HTML API helps developers read, modify, navigate, and edit (X)HTML documents. Some of the editing functions that the Aspose.HTML for .NET API can perform are the following:
- Navigate HTML documents using various methods, such as element traversal, document traversal, XPath queries, and CSS selector queries.
- Remove and replace HTML nodes.
- Extract and edit CSS from HTML.
- Configure a document sandbox, and more.
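To illustrate XPath and CSS selector navigation, here is a short sketch that runs both kinds of query against the same inline markup. The markup, the class name "happy", and the query strings are made up for the example; the XPath call assumes the Evaluate method on the document and the XPathResultType enumeration from Aspose.Html.Dom.XPath.

```csharp
using System;
using Aspose.Html;
using Aspose.Html.Dom;
using Aspose.Html.Dom.XPath;

// Sample markup (illustrative)
var code = "<div class='happy'><div><span>Hello,</span></div></div>" +
           "<p class='happy'><span>World!</span></p>";
using var document = new HTMLDocument(code, ".");

// XPath: select every span nested under an element with class 'happy'
var result = document.Evaluate("//*[@class='happy']//span", document, null, XPathResultType.Any, null);
for (Node node; (node = result.IterateNext()) != null;)
{
    Console.WriteLine("[XPath]: " + node.TextContent);
}

// CSS selector: select spans inside paragraphs with class 'happy'
foreach (Node node in document.QuerySelectorAll("p.happy span"))
{
    Console.WriteLine("[CSS]: " + node.TextContent);
}
```

XPath can express queries over the whole tree, including ancestor and attribute conditions, while CSS selectors are usually shorter for class-based and tag-based lookups.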
Markdown Support
Markdown is a markup language with a plain-text formatting syntax. It is often used for documentation and readme files because it is easy to read and easy to write. Aspose.HTML provides a powerful and flexible Markdown converter that works in both directions: from Markdown to HTML and from HTML to Markdown. Moreover, the converter API has a set of predefined rules, so you can convert HTML to Markdown using the original Markdown syntax or the GitLab Flavored Markdown variant, or configure the rules to suit your needs.
Convert HTML to Markdown - C#
// Load an HTML file
using var document = new Aspose.Html.HTMLDocument("document.html");
// Convert HTML to Markdown using a set of features supported by GitLab Flavored Markdown
document.Save("output.md", Aspose.Html.Saving.MarkdownSaveOptions.Git);
The reverse conversion, from Markdown to HTML, is just as simple:
Convert Markdown to HTML - C#
// Convert Markdown to HTML
Aspose.Html.Converters.Converter.ConvertMarkdown("document.md", "output.html");
You can try the online Markdown Converter here.
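Configuring the conversion rules yourself might look like the sketch below. It assumes that MarkdownSaveOptions exposes a Features property taking MarkdownFeatures flags, and that Link and AutomaticParagraph are among the flag names; check the API reference for the exact set before relying on it. The inline markup is a placeholder.

```csharp
using Aspose.Html;
using Aspose.Html.Converters;
using Aspose.Html.Saving;

// Placeholder markup containing a paragraph and a link
using var document = new HTMLDocument(
    "<p>See the <a href='https://example.com'>example</a> page.</p>", ".");

// Enable only the link and paragraph rules
// (flag names are assumptions, see the note above)
var options = new MarkdownSaveOptions
{
    Features = MarkdownFeatures.Link | MarkdownFeatures.AutomaticParagraph
};

// Convert HTML to Markdown with the custom rule set
Converter.ConvertHTML(document, options, "output.md");
```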
Electronic Books and Web Archives
The Electronic Book (EPUB) and Web Archive (MHTML) formats are supported out of the box. The API offers high-fidelity rendering of EPUB and MHTML files to the supported output formats, such as PDF, XPS, and image file formats.
Convert EPUB to PDF - C#
// Convert EPUB to PDF
Aspose.Html.Converters.Converter.ConvertEPUB("document.epub", new Aspose.Html.Saving.PdfSaveOptions(), "output.pdf");
You can try the online EPUB Converter here.
Convert MHTML to PDF - C#
// Convert MHTML to PDF
Aspose.Html.Converters.Converter.ConvertMHTML("document.mht", new Aspose.Html.Saving.PdfSaveOptions(), "output.pdf");
You can try the online MHTML Converter here.
Web Scraping
Web scraping, also known as web harvesting, web data extraction, or web crawling, is a technique for extracting data from a website. Aspose.HTML doesn't include a dedicated web scraping module out of the box. However, because the Aspose.HTML API is based entirely on W3C specifications and supports XPath and CSS selector queries, you can easily inspect the content of any HTML document and build your own web scraping solution.
Simple Web Data Extraction - C#
// Cast<T>() and ToList() are LINQ extension methods
using System.Linq;
// Create an instance of the HTML document with a website as a parameter
using var document = new Aspose.Html.HTMLDocument("https://en.wikipedia.org/wiki/Aspose_API");
// Get all anchor elements
var elements = document.QuerySelectorAll("a");
// Dump the anchor element data to the console
elements.Cast<Aspose.Html.HTMLAnchorElement>().ToList().ForEach(x =>
{
    System.Console.WriteLine("[Href]: " + x.Href);
    System.Console.WriteLine("[Content]: " + x.TextContent);
});
You can try the online Data Scraper apps here.
Aspose.HTML offers free online Data Scraper apps for getting data from websites. The apps are safe, work on any platform, and do not require any software installation. They can be used for extracting images, getting keywords from a webpage, downloading video, and similar tasks. They are simple and clear to use, yet powerful and reliable.
Support and Learning Resources
- Learning Resources
  - Documentation
  - Source Code
  - API References
  - Tutorial Videos
- Product Support
  - Free Support
  - Paid Support
  - Blog
  - Release Notes
- Why Aspose.HTML for .NET?
  - Customers List
  - Success Stories