Transform TEX to MHTML via Java

On-premise Java API to render TEX to MHTML without using any third-party application

 

You can convert TEX to MHTML in two simple steps. First, render the TEX file to DOC using Aspose.PDF for Java. Then convert the DOC to MHTML using Aspose.Words for Java, a powerful document processing API. Both APIs are part of the Aspose.Total for Java package.

Java API to Convert TEX to MHTML

  1. Open the TEX file using the Document class of Aspose.PDF
  2. Convert TEX to DOC by calling the save method
  3. Load the DOC file using the Document class of Aspose.Words
  4. Save the document by calling the save method with MHTML as the SaveFormat

Get Started with Java File Manipulation APIs

You can use Aspose.Total for Java directly from a Maven-based project by including Aspose.PDF for Java and Aspose.Words for Java in your pom.xml.

Alternatively, you can get a ZIP file from downloads.

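// load the TEX file into an Aspose.PDF Document; TeXLoadOptions marks the input as TeX
com.aspose.pdf.Document document = new com.aspose.pdf.Document("template.tex", new com.aspose.pdf.TeXLoadOptions());
// save the TEX content as a DOC file
document.save("DocOutput.doc", com.aspose.pdf.SaveFormat.Doc);
// load the DOC into an Aspose.Words Document
// (class names are fully qualified because both libraries define Document and SaveFormat)
com.aspose.words.Document outputDocument = new com.aspose.words.Document("DocOutput.doc");
// save as MHTML by passing SaveFormat.MHTML
outputDocument.save("output.mhtml", com.aspose.words.SaveFormat.MHTML);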
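
The snippet above hard-codes the intermediate and final file names. As a minimal sketch, the names could instead be derived from the input path with the standard library alone. The class `ConversionPaths` and its method `withExtension` are hypothetical helpers introduced here for illustration; they are not part of Aspose.PDF or Aspose.Words.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper: not part of Aspose.PDF or Aspose.Words.
final class ConversionPaths {
    private ConversionPaths() {}

    // Returns a sibling path whose file extension is replaced by newExtension.
    // A name without an extension simply gets the new extension appended.
    static Path withExtension(Path input, String newExtension) {
        String name = input.getFileName().toString();
        int dot = name.lastIndexOf('.');
        String base = (dot > 0) ? name.substring(0, dot) : name;
        return input.resolveSibling(base + "." + newExtension);
    }

    public static void main(String[] args) {
        Path tex = Paths.get("template.tex");
        // Name for the intermediate DOC written by the first step
        System.out.println(withExtension(tex, "doc"));
        // Name for the final MHTML written by the second step
        System.out.println(withExtension(tex, "mhtml"));
    }
}
```

The resulting paths can be converted with `toString()` and passed to the `save` calls in place of the literal file names.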

Open Password Protected TEX Document via Java

Even if your TEX document is password protected, you can still open it while converting to MHTML by using the PDF manipulation API Aspose.PDF for Java. To open the encrypted file, create a Document object and pass the owner's password along with the file name.

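// open the protected input with Aspose.PDF, passing the password
com.aspose.pdf.Document document = new com.aspose.pdf.Document("input.tex", "password");
// save TEX as a DOC
document.save("DocOutput.doc", com.aspose.pdf.SaveFormat.Doc);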

Save MHTML Document to a Database via Java

Instead of writing the MHTML output to the file system, you can store it in a database. This is useful if you are implementing a content management system or any other application that stores and retrieves documents from a database. To do so, serialize the document to a byte array with the Aspose.Words for Java API, then store that byte array in the database using an SQL statement.

import java.io.ByteArrayOutputStream;
import java.sql.Connection;
import java.sql.PreparedStatement;
import com.aspose.words.Document;
import com.aspose.words.SaveFormat;

public static void storeToDatabase(Document doc, Connection connection) throws Exception {
    // serialize the document to an in-memory byte array in MHTML format
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    doc.save(out, SaveFormat.MHTML);
    byte[] buffer = out.toByteArray();
    // file name the document was loaded from (null if it was loaded from a stream)
    String fileName = doc.getOriginalFileName();
    // bind the values as parameters: concatenating a byte[] into the SQL text
    // would insert its object reference string, not the document content
    String sql = "INSERT INTO Documents (FileName, FileContent) VALUES (?, ?)";
    try (PreparedStatement statement = connection.prepareStatement(sql)) {
        statement.setString(1, fileName);
        statement.setBytes(2, buffer);
        statement.executeUpdate();
    }
}
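
For retrieval, the bytes read from the FileContent column can be wrapped in a stream and handed to a stream-based loader. The following is a minimal sketch of that round trip using only the standard library. `BlobRoundTrip` and `readAll` are hypothetical names, and the sample bytes are a stand-in for real MHTML content. It also shows that a byte array used in string concatenation contributes only its type-and-hash text, which is why binary content should be bound as a statement parameter rather than pasted into the SQL text.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.util.Arrays;

// Hypothetical helper: illustrates the byte-array round trip, no database involved.
final class BlobRoundTrip {
    private BlobRoundTrip() {}

    // Drains a stream (for example, the stream of a binary column) into a byte array.
    static byte[] readAll(InputStream in) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] chunk = new byte[8192];
        try {
            int n;
            while ((n = in.read(chunk)) != -1) {
                out.write(chunk, 0, n);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // Placeholder bytes standing in for a serialized document
        byte[] stored = {0x4D, 0x49, 0x4D, 0x45, (byte) 0xFF, 0x00};

        // Concatenation yields "[B@<hash>", not the content of the array
        System.out.println(("" + stored).startsWith("[B@"));

        // Wrapping the bytes in a stream and reading them back preserves every byte
        byte[] restored = readAll(new ByteArrayInputStream(stored));
        System.out.println(Arrays.equals(stored, restored));
    }
}
```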

Other Conversion Options

TEX TO WORDML (Microsoft Word 2003 WordprocessingML)
TEX TO DOTM (Microsoft Word 2007+ Template File)
TEX TO DOT (Microsoft Word Template Files)
TEX TO ODT (OpenDocument Text File Format)
TEX TO OTT (OpenDocument Text Template)
TEX TO DOTX (Microsoft Word Template File)
TEX TO PCL (Printer Command Language Document)
TEX TO PS (PostScript File)
TEX TO RTF (Rich Text Format)

What is TEX File Format?

TeX is a typesetting language that combines programming and markup features. It was created by Donald Knuth of Stanford University. Authors and publishers around the world use TeX to produce high-quality technical documents, and it is particularly good at formatting complex mathematical expressions. Paired with a high-quality phototypesetter, TeX rivals the output of the best traditional typesetting systems, which is why it is regarded as one of the most refined digital typographical systems.
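
As an illustration of the two sides mentioned above, here is a small sample of what a .tex source can look like. It is written in LaTeX, a widely used macro package built on TeX. The macro name `\half` is made up for this example.

```latex
\documentclass{article}
% Programming side: a reusable macro that takes one argument.
\newcommand{\half}[1]{\frac{#1}{2}}
\begin{document}
% Markup side: running text with inline and display math.
The sum of the first $n$ integers is $\half{n(n+1)}$, and the roots of
$ax^2 + bx + c = 0$ are
\[
  x = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}.
\]
\end{document}
```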


What is MHTML File Format?

Files with the MHTML extension are web page archives that can be created by a number of different applications. The format is called an archive format because it saves a page's HTML code and its associated resources in a single file. These resources include anything linked to the page, such as images, applets, animations and audio files. MHTML files can be opened in a variety of applications, such as Internet Explorer and Microsoft Word. Microsoft Windows also uses the MHTML format to record the steps that reproduce a problem observed while using an application. MHTML encodes the page contents as a MIME message, in a manner similar to message/rfc822, the specification for plain-text email. The format itself is specified in RFC 2557.
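
To make the single-file structure concrete, the following minimal sketch assembles a tiny MHTML-style archive by hand: a multipart/related MIME envelope with one HTML part and one base64-encoded resource. It is an illustration of the layout only, not how Aspose.Words produces its output. The class `MhtmlSketch`, the example.com locations and the placeholder image bytes are all assumptions made for this example.

```java
import java.util.Base64;

// Hypothetical illustration of the MHTML layout; not an Aspose API.
final class MhtmlSketch {
    private static final String CRLF = "\r\n";

    private MhtmlSketch() {}

    // Builds a multipart/related message holding an HTML page and one image resource.
    static String build(String boundary, String html, byte[] image) {
        StringBuilder sb = new StringBuilder();
        // Top-level headers declare the archive as multipart/related rooted at HTML
        sb.append("MIME-Version: 1.0").append(CRLF);
        sb.append("Content-Type: multipart/related; type=\"text/html\"; boundary=\"")
          .append(boundary).append("\"").append(CRLF).append(CRLF);

        // First part: the HTML page itself
        sb.append("--").append(boundary).append(CRLF);
        sb.append("Content-Type: text/html; charset=\"utf-8\"").append(CRLF);
        sb.append("Content-Transfer-Encoding: 8bit").append(CRLF);
        sb.append("Content-Location: http://example.com/index.html").append(CRLF).append(CRLF);
        sb.append(html).append(CRLF);

        // Second part: a linked resource, base64-encoded, addressed by Content-Location
        sb.append("--").append(boundary).append(CRLF);
        sb.append("Content-Type: image/png").append(CRLF);
        sb.append("Content-Transfer-Encoding: base64").append(CRLF);
        sb.append("Content-Location: http://example.com/logo.png").append(CRLF).append(CRLF);
        sb.append(Base64.getMimeEncoder().encodeToString(image)).append(CRLF);

        // Closing delimiter ends the archive
        sb.append("--").append(boundary).append("--").append(CRLF);
        return sb.toString();
    }

    public static void main(String[] args) {
        String html = "<html><body><img src=\"http://example.com/logo.png\"></body></html>";
        byte[] placeholderImage = {1, 2, 3}; // not a real PNG, just sample bytes
        System.out.print(build("----=_SketchBoundary", html, placeholderImage));
    }
}
```

The HTML part refers to the image by the same URL that the image part declares in its Content-Location header, which is how a reader of the archive resolves the link without going to the network.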
