Redact DOTX Formats in Java
Native and high performance Microsoft Word DOTX document sensitive redaction information using Java APIs, without the use of any software like Microsoft or Adobe PDF.
How to Redact DOTX File Using Java
In order to redact Microsoft Word DOTX file, we’ll use
API which is a feature-rich, powerful and easy to use redaction API for Java platform. You can download its latest version directly from
and install it within your Maven-based project by adding the following configurations to the pom.xml.
Repository
<repository>
<id>AsposeJavaAPI</id>
<name>Aspose Java API</name>
<url>https://repository.aspose.com/repo/</url>
</repository>
Dependency
<dependency>
<groupId>com.aspose</groupId>
<artifactId>aspose-words</artifactId>
<version>version of aspose-words API</version>
<classifier>jdk17</classifier>
</dependency>
Steps to Redact DOTX Files in Java
A basic document search and replace text in contents, comments or metadata can be done with just few lines of code. Redact sensitive information through search and replace text in contents, comments or metadata in Word documents.
- Instantiate Document class..
- Create FindReplaceOptions object.
- Set Pattern
- Use Replace method with relevant options.
- Save document.
System Requirements
Before integrating the code, make sure that you have the following prerequisites.
- Microsoft Windows or a compatible OS with Java Runtime Environment for JSP/JSF Application and Desktop Applications.
- Get latest version of Aspose.Words for Java directly from Maven .
Redact DOTX Files - Java
Document dotx = new Document();
DocumentBuilder builder = new DocumentBuilder(dotx);
builder.writeln("sad mad bad");
if(dotx.getText().trim() == "sad mad bad"){
System.out.println("Strings are equal!");
}
// Replaces all occurrences of the words "sad" or "mad" to "bad".
FindReplaceOptions options = new FindReplaceOptions();
dotx.getRange().replace(Pattern.compile("[s|m]ad"), "bad", options);
// Save the DOTX document.
dotx.save(dataDir + "output.dotx");
Online DOTX Redaction Live Demos
DOTX What is DOTX File Format
Files with DOTX extension are template files created by Microsoft Word to have pre-formatted settings for generation of further DOCX files. A template file is created in order to have specific user settings that should be applied to subsequent flies created from these. These settings include page margins, borders, headers, footers, and other page settings. Such templates are used in official documents such as company letterheads and standardized forms. The DOTX file format was introduced with the release of Microsoft Office 2007 to replace the binary DOT file format, but is supported by higher versions as well. Microsoft Word by default opens every new document based on normal.dot file. If modified, all the new files created will result in same settings as from the template file. In Microsoft Word 2007, the DOT file format has been replaced with Office OpenXML based DOTX file format.
Read MoreOther Supported Redaction Documents
Using Java, one can easily redact different formats including.