AI Builder Forms processing currently does not support multiple pages of PDF. It only picks up fields and tables in the first page.
Please improve it so that its able to pick up at minimum, tables over multiple PDF pages.
Good news! The ability to specify that a table spans across multiple pages is planned to be released this summer.
Thanks for posting this idea. Today when you are training a Form Processing model you can tag fields and tables in more than one page. In the case of tables however today you can't specify that a table can continue beyond one page. If you train with documents that have tables across multiple pages, what you can do is tag the table in each page as a different table - you will need to define a table per page. Hope this helps. Feel free to provide additional info where this is not currently working for you.
@Rayson79 and @JoeF-MSFT I can confirm that using a table per page is indeed working but can become tedious depending on how many pages a tables spans over, hence why it would still be great if the form processing model could recognise a table across multiple pages.
Totally agree with @HarmVan. This tool needs to do better at picking up multiple pages intuitively. It's a basic feature of most OCR tools out in the market. @JoeF-MSFT Can we please add this as a feature?
Thanks @Rayson79, @HarmVan for the comments. The ability to specify that a table spans across multiple pages is indeed in our roadmap. 🙂
Great to hear! Thanks @JoeF-MSFT. Please keep us updated.
That's great to hear @JoeF-MSFT do you have any time line for it so we can keep that in mind?
I wanted to let you know that this week we are making available, in private preview, the capability to extract tables than span multiple pages. If you, or anybody reading this idea, would like to try out this preview feature and share about your experience directly to the product group, you can fill out this form.Also, now the Invoice Processing prebuilt model supports extracting line items than span multiple pages: https://docs.microsoft.com/en-us/ai-builder/prebuilt-invoice-processing#model-output
Let me know if there is any question. Thanks!
That's great news @JoeF-MSFT I have completed the form and I'm looking forward testing the functionality!
Hello, just wanted to check how is testing of the capability to extract tables over multiple pages is going? Any Production release date that you have in mind?
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.