Metadata Extraction - Path
This task extracts metadata from a file's path and adds it as fields to the repository document.
Configuration
To use this task, go to the task tab in your job. Select the task from the drop-down and click the plus circle to configure it. After making any changes, click Done to save.
Condition check
The task runs when the condition evaluates to 'true', 't', 'on', '1', or 'yes' (case-insensitive). If the condition is left empty, the task runs for every document. The condition is evaluated separately for each document to decide whether the task should run on it.
Example: To run this task only for PDF documents, use the expression: equals('#{rd.mimetype}',"application/pdf")
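The truthiness rule described above can be illustrated with a minimal Python sketch. This is not the product's implementation: the function name `should_run`, the use of `None` to represent an empty condition, and the trimming of surrounding whitespace are all assumptions made for illustration.

```python
from typing import Optional

# Values the documentation lists as meaning "run the task" (compared case-insensitively).
TRUTHY = {"true", "t", "on", "1", "yes"}


def should_run(condition_result: Optional[str]) -> bool:
    """Hypothetical helper: decide whether the task runs for one document.

    condition_result is the evaluated condition for the document,
    or None when no condition was configured (assumption).
    """
    if condition_result is None:
        # Empty condition: the task runs for every document.
        return True
    # Assumption: surrounding whitespace is ignored before comparing.
    return condition_result.strip().lower() in TRUTHY
```

Under these assumptions, `should_run("Yes")` and `should_run(None)` both allow the task to run, while `should_run("false")` skips it.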
Path Field
Which field to extract metadata from. Options are:
- Repository ID, for repos that use the entire path as an ID (such as Filesystem or Amazon S3).
- Parent Path, for repos that don't use paths as IDs.
Path Rules
Comma-delimited list of rules to extract. Each rule has the format [New Field Name]=#{Location}
- Location is an integer index into the path's folders, starting at 0 for the root folder.
- Alternatively, 'filename', 'parent', and 'grandparent' can be used, with the quotes included.
Examples
For the file
/accounting/healthcare/userName.pdf
and the rules
department=#{0},type=#{'parent'}
the task adds the following fields:
department=accounting
type=healthcare
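To make the rule semantics concrete, here is a minimal Python sketch of how such rules could be applied to a path. It is an illustration only, not the task's actual code. The function name `extract_from_path`, the regular expression, the choice to skip rules whose location does not exist in the path, and the assumption that 'filename' includes the file extension are all hypothetical.

```python
import re

# One rule looks like: name=#{0} or name=#{'parent'} (quotes included for named locations).
RULE_RE = re.compile(r"^\s*([^=\s]+)\s*=\s*#\{\s*('?\w+'?)\s*\}\s*$")


def extract_from_path(path: str, rules: str) -> dict:
    """Hypothetical helper: apply comma-delimited path rules to a file path."""
    segments = [s for s in path.split("/") if s]
    if not segments:
        return {}
    # Everything before the last segment is treated as a folder.
    folders, filename = segments[:-1], segments[-1]
    named = {
        "'filename'": filename,  # assumption: extension is kept
        "'parent'": folders[-1] if len(folders) >= 1 else None,
        "'grandparent'": folders[-2] if len(folders) >= 2 else None,
    }
    fields = {}
    for rule in rules.split(","):
        match = RULE_RE.match(rule)
        if match is None:
            raise ValueError(f"Malformed rule: {rule!r}")
        name, location = match.groups()
        if location in named:
            value = named[location]
        else:
            index = int(location)  # 0 is the root folder
            value = folders[index] if index < len(folders) else None
        if value is not None:
            # Assumption: rules pointing past the end of the path are skipped.
            fields[name] = value
    return fields


print(extract_from_path("/accounting/healthcare/userName.pdf",
                        "department=#{0},type=#{'parent'}"))
```

With the file and rules from the example above, this sketch produces the fields department=accounting and type=healthcare.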
API Keys
Processor: extractMetadataFromPathTask
Key | Display Name | Type |
---|---|---|
use_condition | Check a condition before executing this task. | Boolean |
task_condition | Condition | String |
task_stop_proc | Stop Processing | Boolean |
path_field | Path Field | String |
path_rules | Path Rules | String |