cancel
Showing results for 
Search instead for 
Did you mean: 
Reply
cjibb02
New Member

Extract Text with OCR - Use another language to specify "trained" data

Hi,

 

I'm using the "Extract text with OCR" action to read text from a grid in an application running under Citrix. I'm getting OK-ish results after following the suggestions in this post - https://powerusers.microsoft.com/t5/Power-Automate-Desktop/Move-mouse-to-text-on-Screen-OCR-worked-y...but still not good enough.

 

The text that i need to read (and then compare to an input variable so it needs to be exact) aren't real words in any language.  It also has lots of slashes e.g. C560XL/XLS/IR. I was thinking that if i could train the engine with this text then i would get better results. Power Automate Desktop has the "Use Other Language" option and allows you to set the language data path. I can't find any instructions on how this works and i keep getting "Failed to create the OCR engine" error.

 

Can anyone give me more information on this feature? Will it do what i want it to do? How do i specify the data file?

 

I've tried downloading the language data from here - https://github.com/tesseract-ocr/langdata but i'm clearly doing something wrong ...

 

cjibb02_0-1635373445893.png

 

Any help will be much appreciated!

2 REPLIES 2
fraenK
Super User
Super User

For the additional language did you try this? https://docs.microsoft.com/en-us/power-automate/desktop-flows/how-to/ocr-multilingual-documents

BUT the built-in OCR functionality based on Tesseract is not that great.

Did you try the Microsoft cognitive action for OCR?

https://docs.microsoft.com/en-us/power-automate/desktop-flows/actions-reference/microsoftcognitive#o...

 

Or would there be a chance to export the grid content as text from the application itself or take a screenshot and run it through a more advanced 3rd party OCR tool install PAD within Citrix?

 

Unfortunately other RPA product are more advanced with Citrix based automation.

Thanks, yes I've followed the steps in that link but i always get "Failed to create OCR engine" even when i use the language files downloaded from github - https://github.com/tesseract-ocr/langdata. I have a support ticket open with Microsoft for this and will update with what they come back with. 

 

I haven't tried the Microsoft OCR Cognitive action yet but will take some screen shots and run these through that service. There will be challenges identifying and cropping the part of the screen i want to run the OCR on, saving as an image and then parsing the results. So I'm not sure if it will be practical but will be interesting how it compares to Tesseract over the whole screen image.

Helpful resources

Announcements
UG GA Amplification 768x460.png

Launching new user group features

Learn how to create your own user groups today!

Community Connections 768x460.jpg

Community & How To Videos

Check out the new Power Platform Community Connections gallery!

M365 768x460.jpg

Microsoft 365 Collaboration Conference | December 7–9, 2021

Join us, in-person, December 7–9 in Las Vegas, for the largest gathering of the Microsoft community in the world.

Users online (1,918)