02-16-2023 07:29 AM
This flow takes a PDF that has multiple documents in it – for example, multiple invoices in a single PDF – and uses a delimiter word you provide to know where the PDF should be split or processed. It uses AI Builder text recognition (OCR) to read all the text from the PDF to then obtain the page ranges for the different documents in the PDF.
You can customize this flow to use a connector that will do the actual splitting of the PDF like Adobe PDF Services, Encodian, Plumsail among others. Or you can directly specify the page range in supported AI Builder actions like Invoice Processing and Form Processing. To use this flow:
Don't hesitate to ask questions in the comments section below! 💬
HI @JoeF-MSFT
Hi @lucascroxatto - thanks for the heads up.
Hello,
I am fairly new to Power Automate and i am having trouble uploading the flow.
Hi @JayJayRiv - thanks for the question. You will need to click on + Create new and provide your credentials. Hope this helps! 🙂
How can we rename the file as per the user name and save the file to other location instead of dataverse? Above flow works fine but i want to send the splitted files to onedrive instead of Dataverse. And also want to save the file name as ..
User ID abc.com
...........
User ID xuz.com...
So it will extract the User ID and save the pdf named as abc.com or xuz.com
Hi @rishabhgupta - this cookbook from @plarrue can help for your scenario: Renaming files in OneDrive after extracting a fiel... - Power Platform Community (microsoft.com)
Thanks for sharing the workflow.
It works for me.
Encodian has a simple action that does all of this with no set up...its called: Split PDF by Text
You can choose whether to use a set string as a split character or use a regular expression for more flexibility.
PDFs have to be searchable, otherwise you can OCR a PDF Document first.
Hi @RookAils, thanks for the question. You can use the Create file action in your flow, from the Google Drive connector.