PDF & Image / OCR Activities
Extract text from PDFs (digital or scanned), manipulate PDF files (merge, extract pages, password-protect), find images on screen using template matching, and run Tesseract-based OCR on screenshots or image files.
Image (4)
Click ImageClick Image
Finds a template image on screen and clicks at its center position.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
ImagePath | String | Yes | — | Path to the template image to find and click |
Region | String | No | — | Limit search to region: x,y,width,height |
ClickType | String | No | Single | Type of click to perform Allowed: Single, Double, Right |
ConfidenceThreshold | Int32 | No | 80 | Minimum match confidence (0-100) |
OffsetX | Int32 | No | 0 | Horizontal pixel offset from image center |
OffsetY | Int32 | No | 0 | Vertical pixel offset from image center |
Find Image on ScreenFind Image On Screen
Searches for a template image on screen and returns its coordinates and match confidence.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
ImagePath | String | Yes | — | Path to the template image to find on screen |
Region | String | No | — | Limit search to region: x,y,width,height (e.g. '100,200,400,300') |
ConfidenceThreshold | Int32 | No | 80 | Minimum match confidence (0-100) |
MultiScale | Boolean | No | True | Try matching at different scales for DPI-aware detection |
OutputX | String | No | — | Variable to store the X coordinate of the match center |
OutputY | String | No | — | Variable to store the Y coordinate of the match center |
OutputConfidence | String | No | — | Variable to store the match confidence (0-100) |
OutputFound | String | No | — | Variable to store whether the image was found (Boolean) |
Read Text with OCRRead Text With OCR
Extracts text from a screen region or image file using Tesseract OCR engine.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
ImagePath | String | No | — | Path to image file. Leave empty to capture from screen. |
Region | String | No | — | Screen region to capture: x,y,width,height (e.g. '100,200,400,300'). Leave empty for full screen. |
Language | String | No | eng | Tesseract language code Allowed: eng, spa, fra, deu, ita, por, jpn, kor, chi_sim, chi_tra, ara, hin |
TessDataPath | String | No | — | Path to tessdata folder. Auto-detected if empty. |
ConfidenceThreshold | Int32 | No | 60 | Minimum confidence percentage (0-100) |
OutputText | String | No | — | Variable to store the extracted text |
OutputConfidence | String | No | — | Variable to store the confidence score (0-100) |
Wait for ImageWait For Image
Waits until a template image appears on screen, polling at configurable intervals.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
ImagePath | String | Yes | — | Path to the template image to wait for |
Region | String | No | — | Limit search to region: x,y,width,height |
ConfidenceThreshold | Int32 | No | 80 | Minimum match confidence (0-100) |
PollInterval | Int32 | No | 500 | How often to check for the image (milliseconds, min 100) |
FailOnTimeout | Boolean | No | True | When true, timeout returns failure. When false, continues with Found=false. |
OutputFound | String | No | — | Variable to store whether the image was found (Boolean) |
OutputX | String | No | — | Variable to store the X coordinate of the match center |
OutputY | String | No | — | Variable to store the Y coordinate of the match center |
OutputElapsed | String | No | — | Variable to store elapsed time in milliseconds |
PDF (9)
Export PDFExport PDF
Exports PDF pages to Image (PNG/JPEG) files or converts text content to a Word (DOCX) document.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
FilePath | String | Yes | — | Path to the source PDF file |
Format | String | Yes | Image | Output format Allowed: Image, Word |
OutputPath | String | Yes | — | Output folder (Image) or .docx file path (Word) |
PageNumber | Int32 | No | 0 | Page to export (0 = all pages) |
ImageFormat | String | No | jpg | Image format (when Format=Image) Allowed: jpg, png |
Dpi | Int32 | No | 150 | Render DPI for image export (72-600) |
Password | String | No | — | Password for encrypted PDF |
Extract PDF ImagesExtract PDF Images
Extracts embedded images from a PDF and saves them as individual image files.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
FilePath | String | Yes | — | Path to the PDF file |
PageNumber | Int32 | No | 0 | Page to extract from (0 = all pages) |
OutputFolder | String | Yes | — | Folder to save extracted images |
Password | String | No | — | Password for encrypted PDF |
Result | Int32 | No | — | Variable to store the number of images extracted |
Extract PDF Page RangeExtract PDF Page Range
Extracts a range of pages from a PDF document and saves them as a new PDF file.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
FilePath | String | Yes | — | Path to the source PDF file |
StartPage | Int32 | Yes | 1 | Starting page number (1-based) |
EndPage | Int32 | Yes | 1 | Ending page number (1-based, inclusive) |
OutputFilePath | String | Yes | — | Path for the extracted pages PDF |
Password | String | No | — | Password for encrypted source PDF |
Get PDF Page CountGet PDF Page Count
Returns the total number of pages in a PDF document.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
FilePath | String | Yes | — | Path to the PDF file |
Password | String | No | — | Password for encrypted PDF |
Result | Int32 | Yes | — | Variable to store the page count |
Manage PDF PasswordManage PDF Password
Add, remove, or change password protection on a PDF document.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
FilePath | String | Yes | — | Path to the source PDF file |
Operation | String | Yes | Add | Password operation to perform Allowed: Add, Remove, Change |
Password | String | No | — | Current password (for Remove/Change) |
NewPassword | String | No | — | New password to set (for Add/Change) |
OutputFilePath | String | Yes | — | Path for the output PDF |
Merge PDFsMerge PDFs
Combines multiple PDF files into a single merged PDF document.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
InputFiles | String | Yes | — | Semicolon-separated paths to PDF files (e.g. file1.pdf;file2.pdf) |
OutputFilePath | String | Yes | — | Path for the merged output PDF |
Read PDF TableRead PDF Table
Extracts tabular data from a PDF page into a DataTable by analyzing borders and word positions.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
FilePath | String | Yes | — | Path to the PDF file |
PageNumber | Int32 | No | 1 | Page number to extract table from (1-based) |
HasHeader | Boolean | No | True | First row contains column headers |
Password | String | No | — | Password for encrypted PDF |
Result | DataTable | Yes | — | Variable to store extracted DataTable |
Read PDF TextRead PDF Text
Extracts text from a PDF file. Works with digitally-created PDFs that contain selectable text.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
FilePath | String | Yes | — | Path to the PDF file |
PageNumber | Int32 | No | 0 | Page number to read (0 = all pages) |
PreserveFormat | Boolean | No | True | Reconstruct text layout with proper line breaks, indentation, and spacing as it appears in the PDF |
Password | String | No | — | Password for encrypted PDF |
Result | String | Yes | — | Variable to store extracted text |
Read PDF With OCRRead PDF With OCR
Extracts text from scanned/image-based PDFs by rendering pages and applying Tesseract OCR.
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
FilePath | String | Yes | — | Path to the PDF file |
PageNumber | Int32 | No | 0 | Page number (0 = all pages) |
Language | String | No | eng | Tesseract language code Allowed: eng, spa, fra, deu, ita, por, jpn, kor, chi_sim, chi_tra, ara, hin |
Dpi | Int32 | No | 200 | DPI for rendering pages (72-600, default 200) |
TessDataPath | String | No | — | Path to tessdata folder. Auto-detected if empty. |
Password | String | No | — | Password for encrypted PDF |
Result | String | Yes | — | Variable to store OCR-extracted text |