We have a situation where vendors are sending us multiple vendor invoices within one PDF file. Unfortunately the AI to extract info from invoices only gets the first invoice and ignores the rest of the data within the same file. When you look at the Extracted data directly we noticed that things like "vendorName:" or "invoiceID" only show up once, yet there are several. How can we get it to check for things like "Invoice Number" or Invoice ID more than once?
We were thinking about maybe seperating out the PDF pages and processing them page by page, but then we have instances where invoices are spread out across more than one page and the vendor name is on one page and the invoice total is on another. We still may be a way to figure this out. But it would be great if this connector could processes multiple invoices from the same file.
Hi @technusgirl - thank you for the feedback!
Indeed, today the connector can process one invoice at a time. If multiple invoices are in the same PDF, you will need to split them. Here's a possible approach: Know where to split a PDF with multiple documents ... - Power Platform Community (microsoft.com)
We are working on improving the connector so it can process multiple invoices in a same PDF in a few months from now.
I hope this clarifies.