PDF & Image / OCR Activities
Extract text from PDFs (digital or scanned), manipulate PDF files, capture screenshots, find images on screen using template matching, and run OCR. This section covers 9 PDF activities and 4 Image/OCR activities.
Read PDF Text
Extracts digital text from a PDF file. Fast and accurate for PDFs created by Word, Excel, or other digital tools.
| Parameter | Type | Required | Description |
|---|---|---|---|
| FilePath | String | Yes | Absolute path to the PDF file |
| Pages | String | No | Page range to extract, e.g. 1-3 or 2. Leave empty for all pages. |
| OutputVariable | String | Yes | String variable to store the extracted text |
Read PDF Table
Extracts a table from a PDF page into a DataTable variable.
| Parameter | Type | Required | Description |
|---|---|---|---|
| FilePath | String | Yes | PDF file path |
| Page | Int32 | Yes | Page number (1-based) containing the table |
| TableIndex | Int32 | No (default: 0) | Zero-based index of the table on that page |
| OutputVariable | String | Yes | DataTable variable to store the extracted table |
Merge PDFs
Combines multiple PDF files into a single PDF in the specified order.
| Parameter | Type | Required | Description |
|---|---|---|---|
| InputPaths | StringArray or comma-separated String | Yes | Ordered list of source PDF file paths |
| OutputPath | String | Yes | Path for the merged PDF to create |
| Overwrite | Boolean | No (default: false) | Replace output file if it exists |
Extract PDF Page Range
Extracts a range of pages from a PDF into a new PDF file.
| Parameter | Type | Required | Description |
|---|---|---|---|
| InputPath | String | Yes | Source PDF file |
| Pages | String | Yes | Page range, e.g. 1-5 or 3,7,10 |
| OutputPath | String | Yes | Output PDF path |
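
The `Pages` formats shown above (`1-5`, `3,7,10`, or a single page such as `2`) can be read as a small grammar. The sketch below is one way such a string might be expanded into 1-based page numbers; it is an illustration, not the activity's actual parser. The function name `parse_pages`, the handling of mixed forms like `1-3,7`, and the error raised for out-of-range pages are all assumptions. Treating an empty string as "all pages" mirrors the Read PDF Text description only.

```python
def parse_pages(spec: str, page_count: int) -> list[int]:
    """Expand a page spec such as "1-5" or "3,7,10" into 1-based page numbers."""
    if not spec.strip():
        # Assumption: empty means "all pages", as documented for Read PDF Text.
        return list(range(1, page_count + 1))
    pages: list[int] = []
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            # A range such as "1-5" includes both endpoints.
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            # A single page such as "2".
            start = end = int(part)
        if not 1 <= start <= end <= page_count:
            # Assumption: out-of-range pages are rejected rather than clamped.
            raise ValueError(f"page range {part!r} is outside 1-{page_count}")
        pages.extend(range(start, end + 1))
    return pages


print(parse_pages("1-5", 12))     # [1, 2, 3, 4, 5]
print(parse_pages("3,7,10", 12))  # [3, 7, 10]
```
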
Get PDF Page Count
Returns the total number of pages in a PDF file.
| Parameter | Type | Required | Description |
|---|---|---|---|
| FilePath | String | Yes | PDF file path |
| OutputVariable | String | Yes | Int32 variable to store the page count |
Read PDF With OCR
Extracts text from a scanned PDF using Optical Character Recognition. Slower than Read PDF Text but handles image-based PDFs.
| Parameter | Type | Required | Description |
|---|---|---|---|
| FilePath | String | Yes | Scanned PDF file path |
| Language | String | No (default: en) | OCR language code |
| OutputVariable | String | Yes | String variable to store the extracted text |
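
Because Read PDF Text is faster and Read PDF With OCR handles image-based files, a common pattern is to try digital extraction first and fall back to OCR only when little or no text comes back. The sketch below shows that decision in isolation. The two reader callables are hypothetical stand-ins for the two activities, and the `min_chars` threshold is an assumption chosen for illustration.

```python
from typing import Callable


def extract_text(path: str,
                 read_digital: Callable[[str], str],
                 read_ocr: Callable[[str], str],
                 min_chars: int = 20) -> tuple[str, str]:
    """Return (text, method), preferring digital extraction over OCR."""
    text = read_digital(path)
    if len(text.strip()) >= min_chars:
        return text, "digital"
    # Little or no digital text usually means the pages are scanned images.
    return read_ocr(path), "ocr"


# Stand-in readers so the sketch runs without a PDF library (hypothetical).
def fake_digital(path: str) -> str:
    return "" if "scan" in path else "Invoice 1042, total due 310.00 EUR"


def fake_ocr(path: str) -> str:
    return "Text recognised from the scanned page"


print(extract_text("invoice.pdf", fake_digital, fake_ocr)[1])       # digital
print(extract_text("scan-invoice.pdf", fake_digital, fake_ocr)[1])  # ocr
```
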
Extract PDF Images
Extracts all embedded images from a PDF and saves them to a folder.
| Parameter | Type | Required | Description |
|---|---|---|---|
| FilePath | String | Yes | PDF file path |
| OutputFolder | String | Yes | Folder to save extracted images into |
| OutputVariable | String | No | StringArray variable holding the saved image file paths |
Export PDF
Exports a PDF to another format (e.g., converts to images).
| Parameter | Type | Required | Description |
|---|---|---|---|
| FilePath | String | Yes | PDF file path |
| OutputFolder | String | Yes | Folder for the exported files |
| Format | PNG \| JPEG \| TIFF | No (default: PNG) | Output image format |
| Dpi | Int32 | No (default: 150) | Resolution in DPI |
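
To get a feel for what the `Dpi` setting means for file size, the arithmetic below converts a page size in inches to pixel dimensions. It assumes the usual definition of DPI (pixels per inch of page); the helper name `pixel_size` is hypothetical, and the exact rounding the exporter applies is not documented here.

```python
def pixel_size(width_in: float, height_in: float, dpi: int = 150) -> tuple[int, int]:
    """Pixel dimensions of a page rendered at the given DPI."""
    return round(width_in * dpi), round(height_in * dpi)


# US Letter is 8.5 x 11 inches.
print(pixel_size(8.5, 11))       # (1275, 1650) at the default 150 DPI
print(pixel_size(8.5, 11, 300))  # (2550, 3300)
```

Doubling the DPI quadruples the pixel count, so higher values help OCR accuracy on small print at the cost of larger files and slower exports.
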
Manage PDF Password
Encrypts a PDF with a password, or decrypts a password-protected PDF.
| Parameter | Type | Required | Description |
|---|---|---|---|
| InputPath | String | Yes | Source PDF |
| OutputPath | String | Yes | Output PDF path |
| Action | Encrypt \| Decrypt | Yes | Whether to add or remove password protection |
| Password | String | Yes | Password to apply (Encrypt) or to unlock the PDF with (Decrypt) |
Find Image On Screen
Searches the screen (or a captured screenshot) for a template image using pixel-level template matching. Returns the screen coordinates if found.
| Parameter | Type | Required | Description |
|---|---|---|---|
| TemplatePath | String | Yes | File path of the template image to search for |
| Confidence | Double (0–1) | No (default: 0.9) | Minimum match score (lower = more lenient) |
| FoundVariable | String | Yes | Boolean variable — true if the template was found |
| LocationVariable | String | No | Variable holding screen coordinates {X, Y} of the centre of the match |

    Find Image On Screen:
      TemplatePath: "templates\submit-button.png"
      Confidence: 0.85
      FoundVariable: "btnFound"
      LocationVariable: "btnPoint"

    If: Condition="${btnFound}"
      Then:
        Click: Selector=... // use the coordinates to click
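
To make the roles of `Confidence` and the returned centre point concrete, here is a toy template matcher in plain Python. It is a sketch of the general technique, not the engine this activity uses: the score here is one minus the mean absolute pixel difference, whereas production matchers typically use normalised cross-correlation. The names `match_score` and `find_image` and the grayscale list-of-lists image format are assumptions for illustration.

```python
from typing import Optional

Image = list[list[int]]  # grayscale pixels, 0-255, indexed [row][column]


def match_score(screen: Image, template: Image, left: int, top: int) -> float:
    """Similarity in [0, 1] between the template and the screen region at (left, top)."""
    total = 0
    for r, row in enumerate(template):
        for c, value in enumerate(row):
            total += abs(screen[top + r][left + c] - value)
    pixels = len(template) * len(template[0])
    return 1.0 - total / (255 * pixels)


def find_image(screen: Image, template: Image,
               confidence: float = 0.9) -> Optional[tuple[int, int]]:
    """Return the (X, Y) centre of the best match, or None if it scores below confidence."""
    t_h, t_w = len(template), len(template[0])
    best_score, best_pos = -1.0, None
    # Slide the template over every position where it fits entirely on screen.
    for top in range(len(screen) - t_h + 1):
        for left in range(len(screen[0]) - t_w + 1):
            score = match_score(screen, template, left, top)
            if score > best_score:
                best_score, best_pos = score, (left, top)
    if best_pos is None or best_score < confidence:
        return None
    left, top = best_pos
    return left + t_w // 2, top + t_h // 2


screen = [[0] * 6 for _ in range(5)]
screen[2][3] = 255
screen[3][4] = 255

print(find_image(screen, [[255, 0], [0, 255]]))      # (4, 3)
print(find_image(screen, [[255, 255], [255, 255]]))  # None
```

Lowering `confidence` makes the second call succeed on a partial match, which is the trade-off the parameter controls: tolerance for rendering differences versus the risk of false positives.
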
Click Image
Finds a template image on screen and clicks its centre. Combines Find Image On Screen and Click into one activity.
| Parameter | Type | Required | Description |
|---|---|---|---|
| TemplatePath | String | Yes | Template image file path |
| Confidence | Double | No (default: 0.9) | Minimum match score, from 0 to 1 |
| ClickType | Single \| Double \| Right | No (default: Single) | Click type |
| TimeoutMs | Int32 | No | Maximum time in milliseconds to wait for the image to appear |
Wait For Image
Waits until a template image appears on screen, then continues execution.
| Parameter | Type | Required | Description |
|---|---|---|---|
| TemplatePath | String | Yes | Template image to wait for |
| Confidence | Double | No (default: 0.9) | Minimum match score, from 0 to 1 |
| TimeoutMs | Int32 | No (default: 30000) | Maximum time in milliseconds to wait before an error is thrown |
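
The wait-then-throw behaviour described by `TimeoutMs` is a polling loop with a deadline. The sketch below shows that pattern under stated assumptions: `is_visible` is a hypothetical callable standing in for one template-matching attempt, the `poll_ms` interval is invented for illustration, and the real activity may raise a different error type than `TimeoutError`.

```python
import time
from typing import Callable


def wait_for(is_visible: Callable[[], bool],
             timeout_ms: int = 30000,
             poll_ms: int = 250) -> None:
    """Poll until is_visible() returns True, or raise once timeout_ms has elapsed."""
    deadline = time.monotonic() + timeout_ms / 1000
    while True:
        if is_visible():
            return
        if time.monotonic() >= deadline:
            raise TimeoutError(f"image not found within {timeout_ms} ms")
        time.sleep(poll_ms / 1000)


# Simulate an image that appears on the third check.
attempts = iter([False, False, True])
wait_for(lambda: next(attempts), timeout_ms=2000, poll_ms=10)
print("image appeared")
```
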
Read Text With OCR
Runs Optical Character Recognition on an image file or captured screenshot and returns the extracted text.
| Parameter | Type | Required | Description |
|---|---|---|---|
| ImagePath | String | Yes | Path to the image file (.png, .jpg, .tiff, .bmp) |
| Language | String | No (default: en) | Tesseract OCR language code |
| OutputVariable | String | Yes | String variable to store the extracted text |