Metadata Extraction - Path

This task extracts metadata from a file's path and adds it as fields to the repository document.


Configuration

To use this task, go to the task tab in your job. Select the task from the drop-down and click the plus circle to configure it. After making any changes, click Done to save.

Condition check

The task executes when the condition evaluates to 'true', 't', 'on', '1', or 'yes' (case-insensitive). If the condition is left empty, the task runs for every document. The condition is evaluated once per document to decide whether the task should run for that document.

Example: To run this task only for PDF documents, use the expression: equals('#{rd.mimetype}',"application/pdf")
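
The rule for accepted condition values can be sketched in a few lines of Python. This is an illustration only, not the product's implementation: the function name should_run is hypothetical, None stands in for a condition that was left empty, and trimming surrounding whitespace is an assumption.

```python
from typing import Optional

# Values that count as "true", per the description above (compared case-insensitively).
TRUTHY = {"true", "t", "on", "1", "yes"}


def should_run(condition_result: Optional[str]) -> bool:
    """Decide whether the task runs for one document.

    condition_result is the evaluated condition as a string, or None
    when no condition was configured (hypothetical convention).
    """
    if condition_result is None:
        # Empty condition: the task runs for every document.
        return True
    # Assumption: surrounding whitespace is ignored before comparing.
    return condition_result.strip().lower() in TRUTHY
```

For example, should_run("TRUE") and should_run(None) both return True, while should_run("false") returns False.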

Path Field

The field whose value is used as the path for metadata extraction. Options are:

  • Repository ID, for repos that use the entire path as an ID (such as Filesystem or Amazon S3).

  • Parent Path, for repos that don't use paths as IDs.

Path Rules

Comma-delimited list of extraction rules. Each rule has the format [New Field Name]=#{Location}

  • Location takes the form of an integer, starting at 0 for the root folder.

  • Additionally, 'filename', 'parent', and 'grandparent' can be used, with the quotes included.


Examples

For the file
/accounting/healthcare/userName.pdf
and the rules
department=#{0},type=#{'parent'}
the following fields are added to the document:
department=accounting
type=healthcare
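
The path rules format can be made concrete with a short Python sketch. It is not the task's actual code: the function name extract_path_metadata is hypothetical, and several behaviors are assumptions, namely that 'filename' includes the extension, that 'parent' and 'grandparent' are counted back from the file, and that a rule pointing past the end of the path is skipped.

```python
import re

# Named locations, counted from the end of the path (assumed interpretation).
NAMED_LOCATIONS = {"filename": -1, "parent": -2, "grandparent": -3}


def extract_path_metadata(path: str, rules: str) -> dict:
    """Apply comma-delimited rules of the form name=#{location} to a path."""
    # Drop empty segments so a leading slash does not shift the indexes;
    # index 0 is then the root folder.
    segments = [s for s in path.split("/") if s]
    fields = {}
    for rule in rules.split(","):
        name, _, location = rule.strip().partition("=")
        match = re.fullmatch(r"#\{(.+)\}", location.strip())
        if not name or match is None:
            raise ValueError(f"Malformed rule: {rule!r}")
        token = match.group(1).strip()
        if token.startswith("'") and token.endswith("'"):
            key = token.strip("'")
            if key not in NAMED_LOCATIONS:
                raise ValueError(f"Unknown location: {token}")
            index = NAMED_LOCATIONS[key]
        else:
            index = int(token)
        try:
            fields[name.strip()] = segments[index]
        except IndexError:
            # Assumption: a rule that points outside the path adds no field.
            continue
    return fields


result = extract_path_metadata(
    "/accounting/healthcare/userName.pdf",
    "department=#{0},type=#{'parent'}",
)
print(result)
```

Run against the file and rules from the example, the sketch yields department=accounting and type=healthcare, matching the result above.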


API Keys

Processor: extractMetadataFromPathTask

Key              Display Name                                    Type
use_condition    Check a condition before executing this task.   Boolean
task_condition   Condition                                       String
task_stop_proc   Stop Processing                                 Boolean
path_field       Path Field                                      String
path_rules       Path Rules                                      String