PNG JPG BMP TIFF XHTML
  Product Family

Search XHTML Formats in Java

Native and high performance XHTML file search using Java APIs, without the use of any software like Microsoft or Adobe PDF.

How to Search XHTML File Using Java

In order to search XHTML file, we’ll use

Aspose.Words for Java

API which is a feature-rich, powerful and easy to use Search API for Java platform. You can download its latest version directly from

Maven

and install it within your Maven-based project by adding the following configurations to the pom.xml.

Repository


<repository>
<id>AsposeJavaAPI</id>
<name>Aspose Java API</name>
<url>https://repository.aspose.com/repo/</url>
</repository>

Dependency

<dependency>
<groupId>com.aspose</groupId>
<artifactId>aspose-words</artifactId>
<version>version of aspose-words API</version>
<classifier>jdk17</classifier>
</dependency>

Steps to Search XHTML Files in Java

Developers can easily integrate code with just few lines as listed.

  • Load XHTML file by instantiating Document Class object.
  • Instantiate FindReplaceOptions.
  • Use Pattern.compile() method to define a regex pattern
  • Use getRange().replace method to find and replace
  • Save XHTML file.

System Requirements

Before integrating the code, make sure that you have the following prerequisites.

  • Microsoft Windows or a compatible OS with Java Runtime Environment for JSP/JSF Application and Desktop Applications.
  • Get latest version of Aspose.Words for Java directly from Maven .
 

Search XHTML Files - Java

// Load XHTML file
Document xhtml = new Document("sourceFile.xhtml");
// Find and replace similar pattern words in the file
FindReplaceOptions options = new FindReplaceOptions();
xhtml.getRange().replace(Pattern.compile("[B|S|M]ad"), "[replaced]", options);
// Save the XHTML file
xhtml.save("output.xhtml");  
 
  • Java Words API can be used to load, view and convert Microsoft Word and OpenDocument Formats like DOC, DOCX, ODT to PDF, XPS, HTML and various other formats. You can also create new documents from scratch and save them in the supported formats. It is a standalone API that is suitable for server side and backend systems where high performance is required. It does not depend on any software like Microsoft or OpenOffice. ‎

    Online XHTML Search Live Demos

    XHTML  What is XHTML  File Format

    The XHTML is a text based file format with markup in the XML, using a reformulation of HTML 4.0. These files are well suited to be open or viewed in a web browser. XHTML was designed to be more structured, less scripting, generic; using all the existing facilities of XML and more device independent. XHTML provides a generally worthwhile set of elements and attributes, with extension options in combination with style sheets. The attributes are used from the metadata attributes collection. XHTML provides flexibility and accessibility by subordinating all HTML presentation elements to style sheets. Style sheets are more versatile than these presentational elements. Specifications for HTML 4.01, HTML5 and XHTML are being dynamically developed by the World Wide Web Consortium (W3C).

    Read More

    Other Supported Search Documents

    Using Java, one can also search other files including.

    DOC (Microsoft Word Binary Format)
    DOCX (Office 2007+ Words Document)
    HTML (Hyper Text Markup Language)
    MHTML (Web Page Archive Format)
    ODT (OpenDocument Text File Format)
    OTT (OpenDocument Standard Format)
    RTF (Rich Text Format)
    TXT (Text Document)