PDF & Image / OCR Activities

Extract text from PDFs (digital or scanned), manipulate PDF files (merge, extract pages, password-protect), find images on screen using template matching, and run Tesseract-based OCR on screenshots or image files.

Image (4)

Click Image`Click Image`

Finds a template image on screen and clicks at its center position.

Parameter	Type	Required	Default	Description
`ImagePath`	String	Yes	—	Path to the template image to find and click
`Region`	String	No	—	Limit search to region: x,y,width,height
`ClickType`	String	No	`Single`	Type of click to perform Allowed: Single, Double, Right
`ConfidenceThreshold`	Int32	No	`80`	Minimum match confidence (0-100)
`OffsetX`	Int32	No	`0`	Horizontal pixel offset from image center
`OffsetY`	Int32	No	`0`	Vertical pixel offset from image center

Find Image on Screen`Find Image On Screen`

Searches for a template image on screen and returns its coordinates and match confidence.

Parameter	Type	Required	Default	Description
`ImagePath`	String	Yes	—	Path to the template image to find on screen
`Region`	String	No	—	Limit search to region: x,y,width,height (e.g. '100,200,400,300')
`ConfidenceThreshold`	Int32	No	`80`	Minimum match confidence (0-100)
`MultiScale`	Boolean	No	`True`	Try matching at different scales for DPI-aware detection
`OutputX`	String	No	—	Variable to store the X coordinate of the match center
`OutputY`	String	No	—	Variable to store the Y coordinate of the match center
`OutputConfidence`	String	No	—	Variable to store the match confidence (0-100)
`OutputFound`	String	No	—	Variable to store whether the image was found (Boolean)

Read Text with OCR`Read Text With OCR`

Extracts text from a screen region or image file using Tesseract OCR engine.

Parameter	Type	Required	Default	Description
`ImagePath`	String	No	—	Path to image file. Leave empty to capture from screen.
`Region`	String	No	—	Screen region to capture: x,y,width,height (e.g. '100,200,400,300'). Leave empty for full screen.
`Language`	String	No	`eng`	Tesseract language code Allowed: eng, spa, fra, deu, ita, por, jpn, kor, chi_sim, chi_tra, ara, hin
`TessDataPath`	String	No	—	Path to tessdata folder. Auto-detected if empty.
`ConfidenceThreshold`	Int32	No	`60`	Minimum confidence percentage (0-100)
`OutputText`	String	No	—	Variable to store the extracted text
`OutputConfidence`	String	No	—	Variable to store the confidence score (0-100)

Wait for Image`Wait For Image`

Waits until a template image appears on screen, polling at configurable intervals.

Parameter	Type	Required	Default	Description
`ImagePath`	String	Yes	—	Path to the template image to wait for
`Region`	String	No	—	Limit search to region: x,y,width,height
`ConfidenceThreshold`	Int32	No	`80`	Minimum match confidence (0-100)
`PollInterval`	Int32	No	`500`	How often to check for the image (milliseconds, min 100)
`FailOnTimeout`	Boolean	No	`True`	When true, timeout returns failure. When false, continues with Found=false.
`OutputFound`	String	No	—	Variable to store whether the image was found (Boolean)
`OutputX`	String	No	—	Variable to store the X coordinate of the match center
`OutputY`	String	No	—	Variable to store the Y coordinate of the match center
`OutputElapsed`	String	No	—	Variable to store elapsed time in milliseconds

PDF (9)

Export PDF`Export PDF`

Exports PDF pages to Image (PNG/JPEG) files or converts text content to a Word (DOCX) document.

Parameter	Type	Required	Default	Description
`FilePath`	String	Yes	—	Path to the source PDF file
`Format`	String	Yes	`Image`	Output format Allowed: Image, Word
`OutputPath`	String	Yes	—	Output folder (Image) or .docx file path (Word)
`PageNumber`	Int32	No	`0`	Page to export (0 = all pages)
`ImageFormat`	String	No	`jpg`	Image format (when Format=Image) Allowed: jpg, png
`Dpi`	Int32	No	`150`	Render DPI for image export (72-600)
`Password`	String	No	—	Password for encrypted PDF

Extract PDF Images`Extract PDF Images`

Extracts embedded images from a PDF and saves them as individual image files.

Parameter	Type	Required	Default	Description
`FilePath`	String	Yes	—	Path to the PDF file
`PageNumber`	Int32	No	`0`	Page to extract from (0 = all pages)
`OutputFolder`	String	Yes	—	Folder to save extracted images
`Password`	String	No	—	Password for encrypted PDF
`Result`	Int32	No	—	Variable to store the number of images extracted

Extract PDF Page Range`Extract PDF Page Range`

Extracts a range of pages from a PDF document and saves them as a new PDF file.

Parameter	Type	Required	Default	Description
`FilePath`	String	Yes	—	Path to the source PDF file
`StartPage`	Int32	Yes	`1`	Starting page number (1-based)
`EndPage`	Int32	Yes	`1`	Ending page number (1-based, inclusive)
`OutputFilePath`	String	Yes	—	Path for the extracted pages PDF
`Password`	String	No	—	Password for encrypted source PDF

Get PDF Page Count`Get PDF Page Count`

Returns the total number of pages in a PDF document.

Parameter	Type	Required	Default	Description
`FilePath`	String	Yes	—	Path to the PDF file
`Password`	String	No	—	Password for encrypted PDF
`Result`	Int32	Yes	—	Variable to store the page count

Manage PDF Password`Manage PDF Password`

Add, remove, or change password protection on a PDF document.

Parameter	Type	Required	Default	Description
`FilePath`	String	Yes	—	Path to the source PDF file
`Operation`	String	Yes	`Add`	Password operation to perform Allowed: Add, Remove, Change
`Password`	String	No	—	Current password (for Remove/Change)
`NewPassword`	String	No	—	New password to set (for Add/Change)
`OutputFilePath`	String	Yes	—	Path for the output PDF

Merge PDFs`Merge PDFs`

Combines multiple PDF files into a single merged PDF document.

Parameter	Type	Required	Default	Description
`InputFiles`	String	Yes	—	Semicolon-separated paths to PDF files (e.g. file1.pdf;file2.pdf)
`OutputFilePath`	String	Yes	—	Path for the merged output PDF

Read PDF Table`Read PDF Table`

Extracts tabular data from a PDF page into a DataTable by analyzing borders and word positions.

Parameter	Type	Required	Default	Description
`FilePath`	String	Yes	—	Path to the PDF file
`PageNumber`	Int32	No	`1`	Page number to extract table from (1-based)
`HasHeader`	Boolean	No	`True`	First row contains column headers
`Password`	String	No	—	Password for encrypted PDF
`Result`	DataTable	Yes	—	Variable to store extracted DataTable

Read PDF Text`Read PDF Text`

Extracts text from a PDF file. Works with digitally-created PDFs that contain selectable text.

Parameter	Type	Required	Default	Description
`FilePath`	String	Yes	—	Path to the PDF file
`PageNumber`	Int32	No	`0`	Page number to read (0 = all pages)
`PreserveFormat`	Boolean	No	`True`	Reconstruct text layout with proper line breaks, indentation, and spacing as it appears in the PDF
`Password`	String	No	—	Password for encrypted PDF
`Result`	String	Yes	—	Variable to store extracted text

Read PDF With OCR`Read PDF With OCR`

Extracts text from scanned/image-based PDFs by rendering pages and applying Tesseract OCR.

Parameter	Type	Required	Default	Description
`FilePath`	String	Yes	—	Path to the PDF file
`PageNumber`	Int32	No	`0`	Page number (0 = all pages)
`Language`	String	No	`eng`	Tesseract language code Allowed: eng, spa, fra, deu, ita, por, jpn, kor, chi_sim, chi_tra, ara, hin
`Dpi`	Int32	No	`200`	DPI for rendering pages (72-600, default 200)
`TessDataPath`	String	No	—	Path to tessdata folder. Auto-detected if empty.
`Password`	String	No	—	Password for encrypted PDF
`Result`	String	Yes	—	Variable to store OCR-extracted text

PDF & Image / OCR Activities

Image (4)

Click ImageClick Image

Find Image on ScreenFind Image On Screen

Read Text with OCRRead Text With OCR

Wait for ImageWait For Image

PDF (9)

Export PDFExport PDF

Extract PDF ImagesExtract PDF Images

Extract PDF Page RangeExtract PDF Page Range

Get PDF Page CountGet PDF Page Count

Manage PDF PasswordManage PDF Password

Merge PDFsMerge PDFs

Read PDF TableRead PDF Table

Read PDF TextRead PDF Text

Read PDF With OCRRead PDF With OCR