~4 min11 / 14

PDF & Image / OCR Activities

Extract text from PDFs (digital or scanned), manipulate PDF files (merge, extract pages, password-protect), find images on screen using template matching, and run Tesseract-based OCR on screenshots or image files.

Image (4)

Click ImageClick Image

Finds a template image on screen and clicks at its center position.

ParameterTypeRequiredDefaultDescription
ImagePathStringYesPath to the template image to find and click
RegionStringNoLimit search to region: x,y,width,height
ClickTypeStringNoSingleType of click to perform Allowed: Single, Double, Right
ConfidenceThresholdInt32No80Minimum match confidence (0-100)
OffsetXInt32No0Horizontal pixel offset from image center
OffsetYInt32No0Vertical pixel offset from image center

Find Image on ScreenFind Image On Screen

Searches for a template image on screen and returns its coordinates and match confidence.

ParameterTypeRequiredDefaultDescription
ImagePathStringYesPath to the template image to find on screen
RegionStringNoLimit search to region: x,y,width,height (e.g. '100,200,400,300')
ConfidenceThresholdInt32No80Minimum match confidence (0-100)
MultiScaleBooleanNoTrueTry matching at different scales for DPI-aware detection
OutputXStringNoVariable to store the X coordinate of the match center
OutputYStringNoVariable to store the Y coordinate of the match center
OutputConfidenceStringNoVariable to store the match confidence (0-100)
OutputFoundStringNoVariable to store whether the image was found (Boolean)

Read Text with OCRRead Text With OCR

Extracts text from a screen region or image file using Tesseract OCR engine.

ParameterTypeRequiredDefaultDescription
ImagePathStringNoPath to image file. Leave empty to capture from screen.
RegionStringNoScreen region to capture: x,y,width,height (e.g. '100,200,400,300'). Leave empty for full screen.
LanguageStringNoengTesseract language code Allowed: eng, spa, fra, deu, ita, por, jpn, kor, chi_sim, chi_tra, ara, hin
TessDataPathStringNoPath to tessdata folder. Auto-detected if empty.
ConfidenceThresholdInt32No60Minimum confidence percentage (0-100)
OutputTextStringNoVariable to store the extracted text
OutputConfidenceStringNoVariable to store the confidence score (0-100)

Wait for ImageWait For Image

Waits until a template image appears on screen, polling at configurable intervals.

ParameterTypeRequiredDefaultDescription
ImagePathStringYesPath to the template image to wait for
RegionStringNoLimit search to region: x,y,width,height
ConfidenceThresholdInt32No80Minimum match confidence (0-100)
PollIntervalInt32No500How often to check for the image (milliseconds, min 100)
FailOnTimeoutBooleanNoTrueWhen true, timeout returns failure. When false, continues with Found=false.
OutputFoundStringNoVariable to store whether the image was found (Boolean)
OutputXStringNoVariable to store the X coordinate of the match center
OutputYStringNoVariable to store the Y coordinate of the match center
OutputElapsedStringNoVariable to store elapsed time in milliseconds

PDF (9)

Export PDFExport PDF

Exports PDF pages to Image (PNG/JPEG) files or converts text content to a Word (DOCX) document.

ParameterTypeRequiredDefaultDescription
FilePathStringYesPath to the source PDF file
FormatStringYesImageOutput format Allowed: Image, Word
OutputPathStringYesOutput folder (Image) or .docx file path (Word)
PageNumberInt32No0Page to export (0 = all pages)
ImageFormatStringNojpgImage format (when Format=Image) Allowed: jpg, png
DpiInt32No150Render DPI for image export (72-600)
PasswordStringNoPassword for encrypted PDF

Extract PDF ImagesExtract PDF Images

Extracts embedded images from a PDF and saves them as individual image files.

ParameterTypeRequiredDefaultDescription
FilePathStringYesPath to the PDF file
PageNumberInt32No0Page to extract from (0 = all pages)
OutputFolderStringYesFolder to save extracted images
PasswordStringNoPassword for encrypted PDF
ResultInt32NoVariable to store the number of images extracted

Extract PDF Page RangeExtract PDF Page Range

Extracts a range of pages from a PDF document and saves them as a new PDF file.

ParameterTypeRequiredDefaultDescription
FilePathStringYesPath to the source PDF file
StartPageInt32Yes1Starting page number (1-based)
EndPageInt32Yes1Ending page number (1-based, inclusive)
OutputFilePathStringYesPath for the extracted pages PDF
PasswordStringNoPassword for encrypted source PDF

Get PDF Page CountGet PDF Page Count

Returns the total number of pages in a PDF document.

ParameterTypeRequiredDefaultDescription
FilePathStringYesPath to the PDF file
PasswordStringNoPassword for encrypted PDF
ResultInt32YesVariable to store the page count

Manage PDF PasswordManage PDF Password

Add, remove, or change password protection on a PDF document.

ParameterTypeRequiredDefaultDescription
FilePathStringYesPath to the source PDF file
OperationStringYesAddPassword operation to perform Allowed: Add, Remove, Change
PasswordStringNoCurrent password (for Remove/Change)
NewPasswordStringNoNew password to set (for Add/Change)
OutputFilePathStringYesPath for the output PDF

Merge PDFsMerge PDFs

Combines multiple PDF files into a single merged PDF document.

ParameterTypeRequiredDefaultDescription
InputFilesStringYesSemicolon-separated paths to PDF files (e.g. file1.pdf;file2.pdf)
OutputFilePathStringYesPath for the merged output PDF

Read PDF TableRead PDF Table

Extracts tabular data from a PDF page into a DataTable by analyzing borders and word positions.

ParameterTypeRequiredDefaultDescription
FilePathStringYesPath to the PDF file
PageNumberInt32No1Page number to extract table from (1-based)
HasHeaderBooleanNoTrueFirst row contains column headers
PasswordStringNoPassword for encrypted PDF
ResultDataTableYesVariable to store extracted DataTable

Read PDF TextRead PDF Text

Extracts text from a PDF file. Works with digitally-created PDFs that contain selectable text.

ParameterTypeRequiredDefaultDescription
FilePathStringYesPath to the PDF file
PageNumberInt32No0Page number to read (0 = all pages)
PreserveFormatBooleanNoTrueReconstruct text layout with proper line breaks, indentation, and spacing as it appears in the PDF
PasswordStringNoPassword for encrypted PDF
ResultStringYesVariable to store extracted text

Read PDF With OCRRead PDF With OCR

Extracts text from scanned/image-based PDFs by rendering pages and applying Tesseract OCR.

ParameterTypeRequiredDefaultDescription
FilePathStringYesPath to the PDF file
PageNumberInt32No0Page number (0 = all pages)
LanguageStringNoengTesseract language code Allowed: eng, spa, fra, deu, ita, por, jpn, kor, chi_sim, chi_tra, ara, hin
DpiInt32No200DPI for rendering pages (72-600, default 200)
TessDataPathStringNoPath to tessdata folder. Auto-detected if empty.
PasswordStringNoPassword for encrypted PDF
ResultStringYesVariable to store OCR-extracted text
Was this helpful?