The following table lists the automated processors currently available in Taskmonk.

The first column lists the processor type, organizing the available processors into pre-processing and post-processing tools.
The second column lists the processor subtype, which indicates when the processor can be used.
The third column lists the processor’s area of function.
The fourth column lists the name of the processor. Click the name of the processor to view details on using it.

Processor Type	Processor Subtype	Processor Function	Processor Name
Task Pre-processing: These are processes that must be executed on the input data before they are sent to labelers.
	On Task Creation: These are processes that are executed as soon as the tasks are first created in Taskmonk.
		Named Entity Recognition: Seeks to locate and classify named entities mentioned in unstructured text into pre-defined categories such as person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages, etc. Source: Wikipedia
			PDF Data Extraction
			Named Entity Recognition
		Text	Grammar Check
			Web Scraping
			Text Translation
			Text Transliteration
			NLP
		Transcribe Speech	Extract Audio
			Audio Transcription
			Video Type Convertor
		Image	Load Annotations
			Image Details Extraction
			Object Detection
			Color Extraction
			TifftoPNG Convertor
		Others	Search LinkedIn
			Merge Fields
			Code Runner
		Optical Character Recognition	Optical Character Recognition
	On Level Change: These are processes that are executed when a task moves from one level to another. For example, after the labeler labels or skips a task.
		Named Entity Recognition	Named Entity Recognition
		Text	Text Translation
		Image	Load Annotations
			Color Extraction
		Others	Code Runner
	UI	No processor available here.
	On Task Allocation Format: These are processes that are executed when a task is allocated to a labeler.
		Image	Pose Detector
Task Post-processing
	On Task Completion: These are processes that are executed after a labeler completes a task.
		Others	Code Runner
			Output Formatter
	Output Format: These are processes that are executed after the task is complete, and before the task is uploaded to the destination.
		Others	Code Runner
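
The table above can also be read as a lookup from trigger (subtype) to processor. The following Python sketch encodes only the rows listed in the table. The names PROCESSORS_BY_SUBTYPE and subtypes_for are illustrative and are not part of Taskmonk.

```python
# Processor names per subtype, transcribed from the table above.
# The variable and function names are illustrative, not a Taskmonk API.
PROCESSORS_BY_SUBTYPE = {
    "On Task Creation": [
        "PDF Data Extraction", "Named Entity Recognition", "Grammar Check",
        "Web Scraping", "Text Translation", "Text Transliteration", "NLP",
        "Extract Audio", "Audio Transcription", "Video Type Convertor",
        "Load Annotations", "Image Details Extraction", "Object Detection",
        "Color Extraction", "TifftoPNG Convertor", "Search LinkedIn",
        "Merge Fields", "Code Runner", "Optical Character Recognition",
    ],
    "On Level Change": [
        "Named Entity Recognition", "Text Translation", "Load Annotations",
        "Color Extraction", "Code Runner",
    ],
    "UI": [],
    "On Task Allocation Format": ["Pose Detector"],
    "On Task Completion": ["Code Runner", "Output Formatter"],
    "Output Format": ["Code Runner"],
}


def subtypes_for(processor_name):
    """Return every subtype under which a processor is listed."""
    return [
        subtype
        for subtype, names in PROCESSORS_BY_SUBTYPE.items()
        if processor_name in names
    ]


print(subtypes_for("Code Runner"))
```

For example, Code Runner is listed under four subtypes, while Pose Detector is listed only under On Task Allocation Format.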

PDF Data Extraction

To configure data extraction from input PDF files:

Select field to extract data from PDF: Click to select a field that exists in the PDF file and contains the data that you want to use while labeling.
Select field to store the output: Click to select the field into which the PDF Data Extraction tool must write the data extracted from the data field in the PDF file.
Click Add to include this capability into your project.

Named Entity Recognition

To configure the Named Entity Recognition tool:

Click to Select the input field with the text from which to extract entities. This is the field that contains the entities that you want to include in your labeling.
Click to Select the provider of named entity recognition.
Click to Select the output field to store the extracted entities.
Click Add to include this capability into your project.

Grammar Check

The Grammar Check tool checks the spelling of words in the field that you specify.

To configure the Grammar Check tool:

Click to Select the field on which to run spell-check.
Click to Select the field to store the spell-check output.
Click Add to include this capability into your project.

Web Scraping

Use the Web Scraping tool to extract data from online sources.

To configure the Web Scraping tool:

Click to Select the field with the scrape URL.
Click to Select the field to store the output.
Click Add to include this capability into your project.
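
As a rough illustration of what scraping a page into an output field can involve, the Python sketch below pulls the visible text out of an HTML string with the standard library. It is not Taskmonk's implementation. The names TextExtractor and scrape_text are hypothetical, and fetching the page from the scrape URL is left out.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def scrape_text(html):
    """Return the visible text of an HTML document as one string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


page = "<html><head><style>p{}</style></head><body><h1>Title</h1><p>Hello <b>world</b></p></body></html>"
print(scrape_text(page))
```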

Text Translation

Use the Text Translation tool to automatically translate input text into the target language.

Click to Select the Translation Provider that you want to use.
Click to Select the Field to Store the Source Language.
Click to Select the input language code. This is the language in which the source data is expected to be written.
Click to Select the language code into which the input must be translated.
Click to Select the field to store the translated text.
Click Add to include this capability into your project.

Text Transliteration

Use the Text Transliteration tool to configure how you want to transliterate text from one language to another.

Click to Select the Transliteration Provider.
Click to Select the field that contains the source text. This is the text that must be transliterated.
Click to Select the source language. This is the language of the source text.
Click to Select the source script of the input text. This is the script in which the input text is written. For example, the input could be Hindi text written in English script.
Click to Select the target script into which the input text will be converted.
Click to Select the field that must store the transliteration output.
Click Add to include this capability into your project.

NLP

Use the NLP tool to tag and extract entities from text.

Click to Select the input field. This is the field that contains the untagged text.
Click to Select the annotations field. This is the field that will contain the automatically tagged text.
Click to specify whether you want to Filter common nouns.
Enter the Comma-separated list of entities that you want to tag in the input text.
Click to Select the provider of the NLP service.
Click Add to include this capability into your project.
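
The comma-separated entity list is easiest to picture as a filter over tagged output. The Python sketch below shows one plausible reading, under the assumption that each annotation carries an entity label. The function names, the annotation shape, and the labels PERSON, ORG, and DATE are hypothetical and are not taken from Taskmonk.

```python
def parse_entity_list(raw):
    """Split a comma-separated string into clean entity labels."""
    return [part.strip() for part in raw.split(",") if part.strip()]


def filter_annotations(annotations, wanted):
    """Keep only annotations whose entity label is in the wanted list."""
    wanted_set = set(wanted)
    return [a for a in annotations if a["entity"] in wanted_set]


# Hypothetical tagged output from an NLP provider.
tagged = [
    {"text": "Ada Lovelace", "entity": "PERSON"},
    {"text": "London", "entity": "LOCATION"},
    {"text": "1843", "entity": "DATE"},
]

wanted = parse_entity_list("PERSON, DATE ,")
print(filter_annotations(tagged, wanted))
```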

Field Dictionary Matcher

Use the Field Dictionary Matcher to connect to your dictionary in Taskmonk and configure its input, output, and matching settings.

Click to Select the input field that contains the text that you want to run through the dictionary.
Click to Select the output field that must contain the dictionary’s output.
Enter the Key for the Dictionary. This key enables you to access your dictionary. Once your dictionary is created, you should receive this key from Taskmonk.
Click Case Sensitive Match and specify whether you want the matching to be case sensitive.
Click Add to include this capability into your project.
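
To show how the Case Sensitive Match setting can change results, here is a minimal Python sketch of a word-level dictionary lookup. It assumes the dictionary maps terms to output values, which the steps above do not specify. The function match_dictionary and the sample data are hypothetical.

```python
def match_dictionary(text, dictionary, case_sensitive=False):
    """Return dictionary values for whitespace-separated words found in text."""
    words = text.split()
    if not case_sensitive:
        # Compare everything in lower case when case is ignored.
        words = [word.lower() for word in words]
        dictionary = {key.lower(): value for key, value in dictionary.items()}
    return [dictionary[word] for word in words if word in dictionary]


brands = {"Apple": "BRAND_APPLE", "Nike": "BRAND_NIKE"}
text = "apple and Nike shoes"

print(match_dictionary(text, brands, case_sensitive=True))
print(match_dictionary(text, brands, case_sensitive=False))
```

With case-sensitive matching, only "Nike" matches. With case-insensitive matching, "apple" matches as well. The sketch splits on whitespace only, so punctuation attached to a word would prevent a match.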

Extract Audio

Use the Extract Audio tool to extract audio content from video files.

Click to Select the field that contains the video URL.
Click to Select the field that must store the extracted audio.
Click Add to include this capability into your project.

Audio Transcription

Use the Audio Transcription tool to automatically create transcriptions from audio files.

Click to Select the field that stores the URL to the input audio file.
Click to Select the provider of the audio transcription.
Click to Select the field that must store the extracted text results.
Click Add to include this capability into your project.

Video Type Convertor

Use the Video Type Convertor tool to convert video files of different formats into a format that Taskmonk uses.

Click to Select the field that contains the video URL.
Click to Select the field that must store the processed video file.
Click Add to include this capability into your project.

Load Annotations

Use the Load Annotations tool to load annotation data that you already have from a separate file onto an image. This simplifies the labelers' task: they only need to adjust the annotation edges until they accurately surround the object concerned.

Click to Select the data format of the annotation source file.
Click to Select the data format to which the source annotation must be converted.
Click to Select the field that contains the source annotations.
Click to Select the field that must store the converted annotations.
Click Add to include this capability into your project.
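
The source and target formats depend on your data and are not listed here. As a hedged illustration of what converting between two annotation formats means, the Python sketch below turns corner-style boxes into x, y, width, and height. Both formats, the field names, and the functions are assumptions made for this example.

```python
def corners_to_xywh(box):
    """Convert [x_min, y_min, x_max, y_max] to an x/y/width/height dict."""
    x_min, y_min, x_max, y_max = box
    return {
        "x": x_min,
        "y": y_min,
        "width": x_max - x_min,
        "height": y_max - y_min,
    }


def convert_annotations(task, source_field, target_field):
    """Read annotations from source_field and store converted ones in target_field."""
    task[target_field] = [
        {"label": annotation["label"], **corners_to_xywh(annotation["box"])}
        for annotation in task[source_field]
    ]
    return task


# Hypothetical task record with one pre-existing annotation.
task = {"source_annotations": [{"label": "car", "box": [10, 20, 110, 70]}]}
convert_annotations(task, "source_annotations", "converted_annotations")
print(task["converted_annotations"])
```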

Image Details Extraction

Use the Image Details Extraction tool to extract image-related details from input images and store them in fields.

Click to Select the field that contains the source image path.
Click to Select the field that must store the image’s height.
Click to Select the field that must store the image’s width.
Click to Select the field that must store the RGB value of the background color. This is an optional field.
Click to Select the field that must store the object-to-background pixel ratio.
Click Add to include this capability into your project.
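
The four output values can be pictured with a small Python sketch. It assumes that the background is the most frequent pixel color and that the ratio is the count of non-background pixels divided by the count of background pixels. Taskmonk may define these differently. The function image_details and its result keys are hypothetical.

```python
from collections import Counter


def image_details(pixels):
    """Compute size, background color, and object-to-background ratio.

    pixels is a list of rows, each row a list of (R, G, B) tuples.
    """
    if not pixels or not pixels[0]:
        raise ValueError("image must contain at least one pixel")
    height = len(pixels)
    width = len(pixels[0])
    counts = Counter(color for row in pixels for color in row)
    # Assumption: the most frequent color is the background.
    background, background_count = counts.most_common(1)[0]
    object_count = height * width - background_count
    return {
        "height": height,
        "width": width,
        "background_rgb": background,
        "object_to_background_ratio": object_count / background_count,
    }


WHITE = (255, 255, 255)
RED = (255, 0, 0)
image = [
    [WHITE, WHITE, WHITE, WHITE, WHITE],
    [WHITE, RED, RED, WHITE, WHITE],
]
print(image_details(image))
```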

Object Detection

Use the Object Detection tool to automatically detect objects in images when they load onto the labeling UI.

Click to Select the field that contains URLs to the images that must be labeled.
Click to Select the object detection service provider.
Click to Select the field that must store the generated annotations.
Use the Set External URL field to provide the URL to the custom object detection service provider. If you selected Google as the service provider, you do not need to provide any detail here.
Click Add to include this capability into your project.

Color Extraction

Use the Color Extraction tool to extract primary and secondary color information from images.

Click to Select the input field that contains the image URL.
Click to Select the provider of the color extraction software.
Click to Select the output field that must store the extracted color information.
Click Add to include this capability into your project.

TifftoPNG Convertor

Use the TIFF-to-PNG convertor to convert images in TIFF format to other image formats.

Click to Select the output format to which the image file should be converted.
Click to Select the input field in which to save the original URL.
Click Add to include this capability into your project.

Pose Detector

Use the Pose Detector to detect human beings in images and identify their poses.

Click to Select the input field that contains the image that must be processed.
Click to Select the field containing the annotations.
Click Add to include this capability into your project.

Search LinkedIn

Use the Search LinkedIn tool to automatically search LinkedIn for details associated with specific people and organizations.

Click to Select the field containing the input URL. This is the field that contains a document listing out the people and organizations that you want to search on LinkedIn.
Click to Select the field to store the search result.
Click Add to include this capability into your project.

Merge Fields

Use the Merge Fields tool to merge the contents of two or more fields into a single string.

Click to Select the fields that you want to merge.
Click to Select the field into which the merged output must be stored.
Click Add to include this capability into your project.
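
Merging fields amounts to joining their values into one string and writing it to the output field. The Python sketch below shows this under two assumptions that the steps above do not state: values are joined with a single space, and empty fields are skipped. The function merge_fields and the field names are hypothetical.

```python
def merge_fields(task, source_fields, target_field, separator=" "):
    """Join the non-empty values of source_fields into target_field."""
    values = [
        str(task[name])
        for name in source_fields
        if task.get(name) not in (None, "")
    ]
    task[target_field] = separator.join(values)
    return task


# Hypothetical task record.
task = {"first_name": "Ada", "middle_name": "", "last_name": "Lovelace"}
merge_fields(task, ["first_name", "middle_name", "last_name"], "full_name")
print(task["full_name"])
```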

Code Runner

Use the Code Runner tool to run custom code when a specific event occurs, such as on task creation or on level change.

Click to Select the code language that you want to use and enter the Code to be run in the field provided.
Click Add to include this capability into your project.
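
The interface that Code Runner exposes to your code is not described here, so the following is only a sketch of the kind of logic you might run on an event. It assumes Python is one of the available languages and that the code receives the task as a dictionary. Both are assumptions. The function name on_task_created and the fields text and word_count are hypothetical.

```python
def on_task_created(task):
    """Hypothetical hook: normalize whitespace and count words in 'text'."""
    cleaned = dict(task)  # Work on a copy so the input record stays unchanged.
    cleaned["text"] = " ".join(str(task.get("text", "")).split())
    cleaned["word_count"] = len(cleaned["text"].split())
    return cleaned


print(on_task_created({"text": "  Hello   world \n"}))
```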

Optical Character Recognition

Use the Optical Character Recognition tool to automatically read text in input images.

Click to Select the input field containing the path to the image.
Click to Select the OCR service provider from the drop-down menu.
Click Add to include this capability into your project.

Output Formatter

Use the Output Formatter tool to convert annotations created in Taskmonk to another format. Click to Specify the output format and click Add to include this capability into your project.
