cancel
Showing results for 
Search instead for 
Did you mean: 
Reply
Anonymous
Not applicable

Extract Text with OCR

Hi all,

 

I would like to extract text from a website login page after entering the password.

This text is dynamic when the website is loaded each time.

 

Is there a way to identify the specific subregion of text that the OCR needs to read?

 

Regards

Hidayat

 

 

1 ACCEPTED SOLUTION

Accepted Solutions
geavgous
Microsoft
Microsoft

Hi Hidayat, 

 

Thanks for exploring the potential of Power Automate Desktop! Instead of OCR, you could use the Web Automation action , called "Extract Data from Web". This action allows you to select specific web element and get anything you want from it. 

You can learn more by checking the documentation page in Web automation - Power Automate | Microsoft Docs

 

In addtion, you could use the Image based recording for using OCR. You can find more details in Recording in a desktop flow - Power Automate | Microsoft Docs

 

Let me know how it goes!

 

Thanks, 

George

View solution in original post

9 REPLIES 9
DanielOlsson
Helper II
Helper II

On action Extract text with OCR you can choose "Search mode" and add X and Y cords of the area.Screenshot_9.png

Anonymous
Not applicable

Hi @DanielOlsson ,

 

Thanks for the feedback.

I tried this but received an error, Failed to extract text with OCR.

 

For the image to select, I tried both with the text and without text.

 

Image without text to capture

Without Text.PNG

 

Image with text to capture

With Text.PNG

 

Regards

Hidayat

 

geavgous
Microsoft
Microsoft

Hi Hidayat, 

 

Thanks for exploring the potential of Power Automate Desktop! Instead of OCR, you could use the Web Automation action , called "Extract Data from Web". This action allows you to select specific web element and get anything you want from it. 

You can learn more by checking the documentation page in Web automation - Power Automate | Microsoft Docs

 

In addtion, you could use the Image based recording for using OCR. You can find more details in Recording in a desktop flow - Power Automate | Microsoft Docs

 

Let me know how it goes!

 

Thanks, 

George

View solution in original post

MichaelAnnis
Memorable Member
Memorable Member

I like using the "Create Tesseract OCR engine" and using the multipliers before using "Extract text with OCR"

Mess with the multipliers until you get the result you need.  The higher the multiplier, the slower the bot; however, it greatly increases the accuracy of the OCR.

 

Best of luck.

Anonymous
Not applicable

Hi @DanielOlsson@geavgous , @MichaelAnnis ,

Thank you for pointing me into the intended direction.

I manage to get the solution by combining all of you guys feedback 😀

And reading the documentation should instead be the first step. 😅

Awesome work all!

Overview of Workflow

Capture.PNG

Extract text with OCR step

1) Search mode: Selected subregion relative to image

2) Tolerance: Increased to 10

 

Capture2.PNG

 

 

Regards

Hidayat

Hi Hidayat,

1. How you actually identify the X1, X2, Y1, Y2 coordinates? Can we capture the coordinates from the position of mouse?
2. I use Capture Image in the OCR action but it's display is blurred. Or you upload clear image instead?

JAWL_0-1636078327530.png

 

Hello, you can use the move mouse action to determine the X and Y cords of your picture. Not as an active action but while you develop your flow.  Your picture might be blurred as its zooms in when it take the snapshot but the quality is as well determined by the resolution of your screen as well as how you connected to the device you run your flow on, is it local or do you make some kind of remote connection? 

Hello Jawl, you can use move mouse to image activity>Advanced>Search Mode>Search on Specified subregion of screen or foreground Window.Then you will be getting option to select the region.By double tapping and dragging the area X1,X2,Y1,Y2 Positions will be reflecting.These positions you can note down and use in other OCR actions.Please Check the below image.

 

Thanks Regards

Salman

Screenshot (15).png

 

Is it possible to use OCR action move mouse to text found based on the whole multipage pdf file (and not the active screen)?

Helpful resources

Announcements
UG GA Amplification 768x460.png

Launching new user group features

Learn how to create your own user groups today!

Community Connections 768x460.jpg

Community & How To Videos

Check out the new Power Platform Community Connections gallery!

M365 768x460.jpg

Microsoft 365 Collaboration Conference | December 7–9, 2021

Join us, in-person, December 7–9 in Las Vegas, for the largest gathering of the Microsoft community in the world.

Users online (1,392)