04-15-2022 09:10 AM
This flow takes a PDF that contains multiple documents (for example, several invoices in a single PDF) and uses a delimiter word that you provide to determine where the PDF should be split or processed. It uses AI Builder text recognition (OCR) to read all the text in the PDF and then derives the page range of each document.
You can customize this flow to use a connector that does the actual splitting of the PDF, such as Adobe PDF Services, Encodian, or Plumsail, among others. Alternatively, you can specify the page range directly in supported AI Builder actions such as Invoice Processing and Form Processing. To use this flow:
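The page-range logic described above can be sketched outside of Power Automate. This is only an illustration in Python, not the flow's actual implementation: it assumes the OCR step yields one text string per page, and the function name `page_ranges` is hypothetical.

```python
from typing import List, Tuple


def page_ranges(page_texts: List[str], delimiter: str) -> List[Tuple[int, int]]:
    """Return 1-based inclusive (start, end) page ranges, one per document.

    Assumption: a page whose text contains the delimiter starts a new document.
    Pages before the first delimiter hit are not assigned to any range.
    """
    # Case-insensitive search for the delimiter on every page.
    starts = [
        page_no
        for page_no, text in enumerate(page_texts, start=1)
        if delimiter.lower() in text.lower()
    ]
    ranges = []
    for idx, start in enumerate(starts):
        # A document ends on the page before the next start, or on the last page.
        end = starts[idx + 1] - 1 if idx + 1 < len(starts) else len(page_texts)
        ranges.append((start, end))
    return ranges


pages = ["INVOICE 001", "continued", "INVOICE 002", "INVOICE 003", "more lines"]
print(page_ranges(pages, "invoice"))  # [(1, 2), (3, 3), (4, 5)]
```

Each resulting range could then be passed to a splitting connector or to the page range input of an AI Builder action.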
Don't hesitate to ask questions in the comments section below! 💬
Hello,
Thank you for this flow.
I'm trying to import it but I'm getting the following error:
Hi @Iro_ ! Thanks for the question.
How are you importing the flow? You should import it from the My flows section in Power Automate.
Hello @JoeF-MSFT ,
I'm dealing with a scenario where my PDF contains different forms, each with a different layout. I've also trained the different forms in different collections (one form per collection). Will it work in this case as well?
Also, each form starts with a unique form number. We have 90 form numbers that are already known to us. Can we use these form numbers as delimiters?
Hi @JoeF-MSFT, the attached zip file gives an error when importing. Power Automate requires an XML file, and this zip contains JSON files.
I'd appreciate your help, either by re-uploading the file or by describing the flow in more detail.
Thanks in advance and best regards.
Hi @JoeF-MSFT, I didn't know about that option. I'm going to try it. Thanks so much for your help and fast reply.
Best regards!
Is there a solution for when the delimiter also appears in the body of the document, i.e. INVOICE appears in both the header and the body?
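One possible approach, sketched here in Python rather than as flow actions, is to look for the delimiter only in the first few lines of each page. This assumes the OCR text preserves top-to-bottom line order; the function name `starts_document` and the `header_lines` parameter are hypothetical.

```python
def starts_document(page_text: str, delimiter: str, header_lines: int = 3) -> bool:
    """Return True if the delimiter appears in the first non-empty lines of a page."""
    # Keep only non-empty lines so blank OCR lines do not use up the header window.
    lines = [line for line in page_text.splitlines() if line.strip()]
    header = "\n".join(lines[:header_lines])
    return delimiter.lower() in header.lower()


print(starts_document("INVOICE\nNo. 1001\nCustomer: A", "invoice"))
print(starts_document("continued\nitem 4\nitem 5\nsee INVOICE 1001", "invoice"))
```

The first call matches because the word is in the header window; the second does not because the word only appears further down the page. The right value for `header_lines` depends on the layout of the documents.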
I am trying to split a PDF in which some invoices have more than one page. However, this solution only works for splitting every page of a large PDF into separate PDF files.
The invoice PDFs use a page marker as the delimiter: Page 1/1 for a single-page invoice, and Page 1/1, Page 1/2, ... for a multi-page invoice.
How can I do that? I also tried in Power Automate Desktop but was unable to solve it.
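A rough sketch of one way to group pages by such a marker, written in Python for illustration only. It assumes the marker reads "Page <current>/<total>" on every page, which may not match the actual documents; the pattern and the function name `ranges_from_page_markers` are hypothetical.

```python
import re
from typing import List, Tuple

# Assumed marker format: "Page <current>/<total>", e.g. "Page 2/3".
PAGE_MARKER = re.compile(r"Page\s+(\d+)\s*/\s*(\d+)", re.IGNORECASE)


def ranges_from_page_markers(page_texts: List[str]) -> List[Tuple[int, int]]:
    """Return 1-based inclusive page ranges, one per invoice."""
    ranges = []
    start = None
    for page_no, text in enumerate(page_texts, start=1):
        match = PAGE_MARKER.search(text)
        if match is None:
            continue  # no marker found: leave the current invoice open
        current, total = int(match.group(1)), int(match.group(2))
        if current == 1:
            start = page_no  # first page of a new invoice
        if current == total and start is not None:
            ranges.append((start, page_no))  # last page closes the invoice
            start = None
    return ranges


pages = ["Page 1/1 ...", "Page 1/2 ...", "Page 2/2 ...", "Page 1/1 ..."]
print(ranges_from_page_markers(pages))  # [(1, 1), (2, 3), (4, 4)]
```

In a flow, the same idea would mean starting a new range whenever the current page number is 1 and closing it when the current page number equals the total.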
Hi @JoeF-MSFT, I have a model with 10 collections. Some collections may have PDFs with multiple invoices, and each collection uses a different text to delimit the beginning of a new document.
What would you suggest I do?