File Format Converter

This task will convert files of one format to another format during migration. Document processing often requires transforming documents from one format to another. For example, for long term preservation/archival, PDF/A is a desired format. All documents may not be in that format – and doing such conversion at scale is often hard, and the quality may at times be compromised. With Objective’s Universal Document Converter, any document can be converted into desired format without loss of quality. It will also support complex processing when a document cannot be directly converted into another format by converting it into a transient format before converting it into the desired format.

Features

  • Ability to convert any document format to desired format

  • Support intermediary conversion formats if direct conversion is not possible

  • Ensure great quality when converting document from one format to another

  • Ensure no loss during text extraction from converted format

  • Bulk conversion of document formats at scale

Supported Converters

Converter

Source ext.

Target ext.

Cad

DWG, DGN, DXF, DWF, DWT, SVG

PDF, PNG, BMP, TIFF, JPEG, JPG, GIF, DWG, DWF, DXF, SVG

Cells

CSV, XLS, XLSX, ODS, XLSB, JSON, XLT, XLTX, XLSM, XLTM

CSV, XLS, XLSX, XLSM, XLTX, XLTM, XLAM, TSV, TAB_DELIMITED, HTML, M_HTML, ODS, XLSB, PDF, XPS, TIFF, SVG, DIF, XLT, XML, MD, MARKDOWN, PPTX, DOCX, EMF, JPG, PNG, BMP, GIF, JSON, JPEG

Diagram

VSD, VSDX, VSS, VSSX, VTX, VDX, VDW, VST, VSTX, VSX

BMP, JPEG, JPG, TIFF, EMF, SVG, PNG, PDF, HTML, XPS, XAML

Email

MSG, EML, EMLX, VCF, ICS

HTML, MHT

Imaging

PNG, JPEG, BMP, TIFF, BIGTIFF, GIF, DICOM, SVG, ICO

PDF, JPEG, BMP, PNG, TIFF, GIF, SVG, ICO

Pdf

PDF, PDFXML

PDF, DOC, DOCX, XPS, XML, TEX, SVG, XLSX, MOBI, EPUB, PPTX, APS, PS, EPS, PDFXML, HTML, PDF_A_1A, PDF_A_1B, PDF_A_2A, PDF_A_2B, PDF_A_3A

Slides

PPT, PPS, PPTX, PPSX, ODP, POTX, PPTM, PPSM, POTM, POT

PPT, PDF, XPS, PPTX, PPSX, ODP, PPTM, PPSM, POTX, POTM, HTML, PPS, POT, GIF, HTML5, MD

Words

DOC, DOCX, DOCM, DOT, DOTX, DOTM, RTF, HTML, MHTML, ODT, XML, TXT

DOC, DOT, DOCX, DOCM, DOTX, DOTM, RTF, PDF, XPS, SVG, HTML, MHTML, ODT,  TXT, TEXT, MD, MARKDOWN, XLSX, TIFF, PNG, BMP, EMF, GIF, JPEG, JPG

Configuration

To use the File Format Converter task, go to the Task tab in your job. Select the task from the drop down and click the plus circle to configure the task. Click done after making any changes to save.

Condition check

It will execute the task when the condition's result is 'true', 't', 'on', '1', or 'yes' (case-insensitive), or run on all conditions if left empty. This condition is evaluated for each document, determining whether the task should be executed based on the specified values.

Example: If I only want to run this task for PDF documents I would use the expression: equals('#{rd.mimetype}',"application/pdf")

Conversion Timeout

Enter the number of minutes you want 3Sixty to wait before taking action on a file that cannot be converted.

Conversion Mapping

Select the file formats you want to convert to another format. You can either search for the file types or scroll down to select them from the list.

Note:  Note: Certain files can only be converted to certain file formats. For example EML and EMLX files can only be converted to HTML or MHT. And password protected files cannot be converted.

  • Source Extension: Select the file type you would like to convert from the drop down.

  • Target Extension: Select the file type you would like to convert the files to. Only the file types the source target can be converted to will be listed.

  • Add Another: Click this button to add another file type that you would like converted.

Action

Conversions may fail if the document is password protected or has restricted access. The action drop down is where you will chose what 3Sixty should do if a document fails the conversion.

  • Skip: Skip the file and continue migrating the job

  • Fail: Fail the job and stop migration

Examples

Example 1

You can add several document types and have them all be converted to a file type that the destination repository will allow during migration.

For example, we can convert all Microsoft office documents to PDF.

Example 2

Users can also use this task to convert images into a searchable pdf document. First convert the image to PDF then from PDF to PDF. And check the box to make PDF text searchable.

*For Linux and Mac: To convert PDF to searchable text users will need to have tesseract installed.

  • Mac Install: https://formulae.brew.sh/formula/tesseract (command line: brew install tesseract)

  • Ubuntu Install: https://ubuntuhandbook.org/index.php/2021/12/install-tesseract-ocr-5-ubuntu


Supported Converters