File Format Converter
This task will convert files of one format to another format during migration. Document processing often requires transforming documents from one format to another. For example, for long term preservation/archival, PDF/A is a desired format. All documents may not be in that format – and doing such conversion at scale is often hard, and the quality may at times be compromised. With Objective’s Universal Document Converter, any document can be converted into desired format without loss of quality. It will also support complex processing when a document cannot be directly converted into another format by converting it into a transient format before converting it into the desired format.
Features
-
Ability to convert any document format to desired format
-
Support intermediary conversion formats if direct conversion is not possible
-
Ensure great quality when converting document from one format to another
-
Ensure no loss during text extraction from converted format
-
Bulk conversion of document formats at scale
Supported Converters
Converter |
Source ext. |
Target ext. |
---|---|---|
Cad |
DWG, DGN, DXF, DWF, DWT, SVG |
PDF, PNG, BMP, TIFF, JPEG, JPG, GIF, DWG, DWF, DXF, SVG |
Cells |
CSV, XLS, XLSX, ODS, XLSB, JSON, XLT, XLTX, XLSM, XLTM |
CSV, XLS, XLSX, XLSM, XLTX, XLTM, XLAM, TSV, TAB_DELIMITED, HTML, M_HTML, ODS, XLSB, PDF, XPS, TIFF, SVG, DIF, XLT, XML, MD, MARKDOWN, PPTX, DOCX, EMF, JPG, PNG, BMP, GIF, JSON, JPEG |
Diagram |
VSD, VSDX, VSS, VSSX, VTX, VDX, VDW, VST, VSTX, VSX |
BMP, JPEG, JPG, TIFF, EMF, SVG, PNG, PDF, HTML, XPS, XAML |
|
MSG, EML, EMLX, VCF, ICS |
HTML, MHT |
Imaging |
PNG, JPEG, BMP, TIFF, BIGTIFF, GIF, DICOM, SVG, ICO |
PDF, JPEG, BMP, PNG, TIFF, GIF, SVG, ICO |
|
PDF, PDFXML |
PDF, DOC, DOCX, XPS, XML, TEX, SVG, XLSX, MOBI, EPUB, PPTX, APS, PS, EPS, PDFXML, HTML, PDF_A_1A, PDF_A_1B, PDF_A_2A, PDF_A_2B, PDF_A_3A |
Slides |
PPT, PPS, PPTX, PPSX, ODP, POTX, PPTM, PPSM, POTM, POT |
PPT, PDF, XPS, PPTX, PPSX, ODP, PPTM, PPSM, POTX, POTM, HTML, PPS, POT, GIF, HTML5, MD |
Words |
DOC, DOCX, DOCM, DOT, DOTX, DOTM, RTF, HTML, MHTML, ODT, XML, TXT |
DOC, DOT, DOCX, DOCM, DOTX, DOTM, RTF, PDF, XPS, SVG, HTML, MHTML, ODT, TXT, TEXT, MD, MARKDOWN, XLSX, TIFF, PNG, BMP, EMF, GIF, JPEG, JPG |
Configuration
To use the File Format Converter task, go to the Task tab in your job. Select the task from the drop down and click the plus circle to configure the task. Click done after making any changes to save.
Condition check
It will execute the task when the condition's result is 'true', 't', 'on', '1', or 'yes' (case-insensitive), or run on all conditions if left empty. This condition is evaluated for each document, determining whether the task should be executed based on the specified values.
Example: If I only want to run this task for PDF documents I would use the expression: equals('#{rd.mimetype}',"application/pdf")
Conversion Timeout
Enter the number of minutes you want 3Sixty to wait before taking action on a file that cannot be converted.
Conversion Mapping
Select the file formats you want to convert to another format. You can either search for the file types or scroll down to select them from the list.
Note: Note: Certain files can only be converted to certain file formats. For example EML and EMLX files can only be converted to HTML or MHT. And password protected files cannot be converted.
-
Source Extension: Select the file type you would like to convert from the drop down.
-
Target Extension: Select the file type you would like to convert the files to. Only the file types the source target can be converted to will be listed.
-
Add Another: Click this button to add another file type that you would like converted.
Action
Conversions may fail if the document is password protected or has restricted access. The action drop down is where you will chose what 3Sixty should do if a document fails the conversion.
-
Skip: Skip the file and continue migrating the job
-
Fail: Fail the job and stop migration
Examples
Example 1
You can add several document types and have them all be converted to a file type that the destination repository will allow during migration.
For example, we can convert all Microsoft office documents to PDF.
Example 2
Users can also use this task to convert images into a searchable pdf document. First convert the image to PDF then from PDF to PDF. And check the box to make PDF text searchable.
*For Linux and Mac: To convert PDF to searchable text users will need to have tesseract installed.
-
Mac Install: https://formulae.brew.sh/formula/tesseract (command line: brew install tesseract)
-
Ubuntu Install: https://ubuntuhandbook.org/index.php/2021/12/install-tesseract-ocr-5-ubuntu