Dear community,
How can I extract data from PDFs and store it in Excel using PAD?
My flow is failing at the step 'Extract text with OCR' with the error message: Failed to extract text with OCR.
1-Create Tesseract OCR engine
2-Extract text with OCR
3-Write text to file (just for testing); eventually it will be an Excel sheet.
Please let me know if I need to do any specific configuration.
Right now there is no ability to extract text or images from a PDF file.
The appropriate group of actions will be available in Power Automate Desktop in the near future.
This sounds very interesting and will surely be useful!
When you say it will be available really soon, could that be before the end of 2020 or at the beginning of 2021?
Keep up the good work!
I can see actions like extract from PDF now, which is great!
Could you please advise: if I extract a table on the first page, along with headers containing useful information, how do I pull that into Excel as separate pieces of information?
Right now it takes all the content from the PDF page and just dumps it into a single cell. Can I break that down further into useful information, and how?
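One way to think about breaking the dumped page text into cells is sketched below in Python, outside of PAD. It is only an illustration: it assumes the extracted text has one table row per line and that columns are separated by a tab or by two or more spaces, which may not hold for every PDF. The function names and the sample text are made up for this example.

```python
import csv
import io
import re


def split_table_text(page_text):
    """Split extracted page text into rows of cells.

    Assumption: one table row per line, columns separated by a tab
    or by two or more whitespace characters.
    """
    rows = []
    for line in page_text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between rows
        rows.append(re.split(r"\s{2,}|\t", line))
    return rows


def rows_to_csv(rows):
    """Serialize rows as CSV text, which Excel can open directly."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


# Made-up sample standing in for the text extracted from one PDF page.
sample_text = """Item        Qty   Price
Widget A    2     10.00
Widget B    5     4.50
"""

table = split_table_text(sample_text)
print(rows_to_csv(table))
```

The same idea carries over to a flow: split the text into lines first, then split each line into columns, and write each piece to its own cell instead of writing the whole page to one cell.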
Hi, @JamesP_MSFT ! Good evening! 🙂
Any news on this CV working with PDF files? I'm wondering if you know of a site where I can track future PAD updates. 🙂 I am able to work with the alternative, "Extract text from PDF", and just use RegEx with some extra steps. But I would love to implement this option as soon as it is released!
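For anyone curious what the RegEx step on the extracted text might look like, here is a minimal sketch in Python rather than in PAD itself. The field labels (Invoice No, Date, Total), the patterns, and the sample text are all hypothetical and would need to match the wording in your own PDF.

```python
import re

# Hypothetical labels; adjust each pattern to the text in your document.
FIELD_PATTERNS = {
    "invoice_number": re.compile(r"Invoice\s*No\.?\s*[:#]?\s*(\S+)", re.IGNORECASE),
    "date": re.compile(r"\bDate\s*:?\s*(\d{4}-\d{2}-\d{2})", re.IGNORECASE),
    "total": re.compile(r"\bTotal\s*:?\s*([\d.,]+)", re.IGNORECASE),
}


def extract_fields(text):
    """Return the first match for each labeled field, or None if absent."""
    result = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        result[name] = match.group(1) if match else None
    return result


# Made-up sample standing in for text returned by a PDF text extraction.
sample = "Invoice No: INV-0042\nDate: 2020-11-05\nTotal: 1,250.00\n"
fields = extract_fields(sample)
print(fields)
```

Each captured value can then go into its own Excel cell. Fields that are not found come back as None, so a missing label does not stop the rest of the extraction.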