cancel
Showing results for 
Search instead for 
Did you mean: 
Reply
TFranchina
Frequent Visitor

Troubles with multipage table data recognition

Hi,

 

I have created a model in order to extract 2 fields and one table of 2 columns. (537d99bf-3e12-428e-87d7-29061712abd4)

 

This table can be composed of one single line up to spreading on 4-5 pages. In such case layout of the first page is different form the successive ones. I am facing issues to extract information beyond page 2. 

 

TFranchina_0-1664288467273.png

 


As part of the training of the moment i have created collections of document with table on 1, 2, 3, 4+ pages.

 

When tagging the table on pages 3 and 4, i get a notification in the model that tagging the multipage table on more than 2 pages can affect its capacity. Can that be related to the difficulty to extract information ?

 

In addition, this model is supposed to be linked with a Flow to manipulate the extracted table and send it via email. At the moment, we are trying to encapsulate the model in a Do-until loop to force it to process each pages and aggregate an array variable but this is creating some issues as well in the Flow.

 

Would you have any recommendation in the model creation, set-up, training in order to ensure it process all pages ?

 

Thank you in advance for your support

Thomas Franchina

5 REPLIES 5
JoeF-MSFT
Power Apps
Power Apps

Hi @TFranchina - thank you for sharing this.

 

Today multipage table extraction is an experimental feature and we're aware of some cases where the extraction might not work well as it seems to be your case. Processing page by page as you have done using this template is our recommendation https://learn.microsoft.com/en-us/ai-builder/form-processing-multipage#use-a-cloud-flow-to-process-a... 

 

The good news is that in the second half of October we'll be releasing an improved version of multipage tables that should give you better results. Extract content from multipage tables in document processing | Microsoft Learn Stay tuned. 🙂 

TFranchina
Frequent Visitor

Hi @JoeF-MSFT ,

 

Wishing you the best for this starting new year, i'd like to reactivate this discussion as i'm still facing issues with the multipage table processing with AI Builder.

 

Improvements released in Oct-Nov 2022 really improved the stability of the system (👍), i'm still having difficulties to reach reliable extract quality i need for my use case.

 

I'm now at the v5 of my model (ID: 2c786418-ba41-4c16-afbf-c419eadb57ce) trained with 30+ examples of structured document where i want to extract:

  • one date field (99% accuracy)
  • one text field (99% accuracy) 
  • 2-column table with variable lenght -> that's where the troubles starts

i can't predict in advance the length of the table in the file. it can be 2 lines (1 header, 1 content row) or up to X pages full of rows with repeated headers. usually X = 2-3-4 but can go up to 9...

 

From the tests i've made, the table is recognized correctly down to 3-4 pages without issues but after that i loose significant accuracy and extract quality. 

 

Today i've added to the collection  the latest example of 9 pages-table i have received, tagging all lines manually. I launched the model training over lunch break and when back, i checked the performance doing a quick test with the exact same file. It resulted with only the first 3 pages of table recognized, nothing more as if it stopped... While it has many examples of 3+ tables in its collection...

 

Any ideas for improvement i could try ? 

 

As i can't transfer a collection from one model to the other i'd like to avoid creating a v6 and re-starting of the collection tagging again.

 

Thanks in advance for the support

T. Franchina

 
JoeF-MSFT
Power Apps
Power Apps

Hi @TFranchina - thank you for the nice wishes. Cheers to a fresh start and endless possibilities in 2023!

 

Let's try the following, edit your existing model and set the document type on the first step as Unstructured documents. This uses a newer AI technology that might give you better results. It works for both structured and unstructured document types. 

 

JoeFMSFT_0-1672842332345.png


Let us know how this goes!

TFranchina
Frequent Visitor

Hi @JoeF-MSFT ,

 

Thank you for the prompt feedback. I modified the model accordingly and trained it again.

 

After several "quick tests" with the examples giving me the worst results so far, it seems indeed this is solving the main issue resulting in undetected cell content. 👍

 

I will continue to monitor the tests but so far, no more empty cells. Actually it would be the opposite, it now results the repeated table headers while not needed... 

 

Keeping you posted of the progress.

All the best

T.Franchina

TFranchina
Frequent Visitor

Hi @JoeF-MSFT , 

I hope 2024 year has started well for you !

I'd like to restart our conversation here as i now again face issues with AI module to process all the pages i want (model id: 293d577d-6499-4346-a4c9-d005fb046796)


As a reminder my model is trained to recognized text fields with fixed locations (doing it with 100% accuracy 👍) but also a 2-column table that is spanned accross multiple pages and i can't predict in advance how long the table will be.

I had modified the model in 2023 to Unstructured version and got better results but recently i've noticed that again the model stops after 3-4 pages while the table is longer.

In the training collection, i've manually mapped 60+ examples with tables from 1 row to 8 pages of lines.

 

As a side note,  some of the recognized text fields are located before the table but also after it and those are recognized perfectly each time.


Any tips on what i could do to force the model to process all pages ? 

Thanks in advance

Helpful resources

Announcements

Update! June 13th, Community Ambassador Call for User Group Leaders and Super Users

Calling all Super Users & User Group Leaders   UPDATE:  We just wrapped up June's Community Ambassador monthly calls for Super Users and User Group Leaders. We had a fantastic call with lots of engagement. We are excited to share some highlights with you!    Big THANK YOU to our special guest Thomas Verhasselt, from the Copilot Studio Product Team for sharing how to use Power Platform Templates to achieve next generation growth.     A few key takeaways: Copilot Studio Cookbook Challenge:  Week 1 results are posted, Keep up the great work!Summer of Solutions:  Starting on Monday, June 17th. Just by providing solutions in the community, you can be entered to win tickets to Power Platform Community Conference.Super User Season 2: Coming SoonAll communities moving to the new platform end of July We also honored two different community members during the call, Mohamed Amine Mahmoudi and Markus Franz! We are thankful for both leaders' contributions and engagement with their respective communities. 🎉   Be sure to mark your calendars and register for the meeting on July 11th and stay up to date on all of the changes that are coming. Check out the Super User Forum boards for details.   We're excited to connect with you and continue building a stronger community together.   See you at the call!

Copilot Cookbook Challenge | Week 2 Results | Win Tickets to the Power Platform Conference

We are excited to announce the "The Copilot Cookbook Community Challenge is a great way to showcase your creativity and connect with others. Plus, you could win tickets to the Power Platform Community Conference in Las Vegas in September 2024 as an amazing bonus.   Two ways to enter: 1. Copilot Studio Cookbook Gallery:  https://aka.ms/CS_Copilot_Cookbook_Challenge 2. Power Apps Copilot Cookbook Gallery: https://aka.ms/PA_Copilot_Cookbook_Challenge   There will be 5 chances to qualify for the final drawing: Early Bird Entries: March 1 - June 2Week 1: June 3 - June 9Week 2: June 10 - June 16Week 3: June 17 - June 23Week 4: June 24 - June 30     At the end of each week, we will draw 5 random names from every user who has posted a qualifying Copilot Studio template, sample or demo in the Copilot Studio Cookbook or a qualifying Power Apps Copilot sample or demo in the Power Apps Copilot Cookbook. Users who are not drawn in a given week will be added to the pool for the next week. Users can qualify more than once, but no more than once per week. Four winners will be drawn at random from the total qualifying entrants. If a winner declines, we will draw again at random for the next winner.  A user will only be able to win once. If they are drawn multiple times, another user will be drawn at random. Prizes:  One Pass to the Power Platform Conference in Las Vegas, Sep. 18-20, 2024 ($1800 value, does not include travel, lodging, or any other expenses) Winners are also eligible to do a 10-minute presentation of their demo or solution in a community solutions showcase at the event. To qualify for the drawing, templates, samples or demos must be related to Copilot Studio or a Copilot feature of Power Apps, Power Automate, or Power Pages, and must demonstrate or solve a complete unique and useful business or technical problem. Power Automate and Power Pagers posts should be added to the Power Apps Cookbook. Final determination of qualifying entries is at the sole discretion of Microsoft. Weekly updates and the Final random winners will be posted in the News & Announcements section in the communities on July 29th, 2024. Did you submit entries early?  Early Bird Entries March 1 - June 2:  If you posted something in the "early bird" time frame complete this form: https://aka.ms/Copilot_Challenge_EarlyBirds if you would like to be entered in the challenge.   Week 1 Results:  Congratulations to the Week 1 qualifiers, you are being entered in the random drawing that will take place at the end of the challenge. Copilot Cookbook Gallery:Power Apps Cookbook Gallery:1.  @Mathieu_Paris 1.   @SpongYe 2.  @Dhanush 2.   @Deenuji 3.  n/a3.   @Nived_Nambiar  4.  n/a4.   @ManishSolanki 5.  n/a5.    n/a   Week 2 Results:  Congratulations to the Week 2 qualifiers, you are being entered in the random drawing that will take place at the end of the challenge. Copilot Cookbook Gallery:Power Apps Cookbook Gallery:1. Kasun_Pathirana1. ManishSolanki2. cloudatica2. madlad3. n/a3. SpongYe4. n/a4. n/a5. n/a5. n/a

Win free tickets to the Power Platform Conference | Summer of Solutions

We are excited to announce the Summer of Solutions Challenge!    This challenge is kicking off on Monday, June 17th and will run for (4) weeks.  The challenge is open to all Power Platform (Power Apps, Power Automate, Copilot Studio & Power Pages) community members. We invite you to participate in a quest to provide solutions to as many questions as you can. Answers can be provided in all the communities.    Entry Period: This Challenge will consist of four weekly Entry Periods as follows (each an “Entry Period”)   - 12:00 a.m. PT on June 17, 2024 – 11:59 p.m. PT on June 23, 2024 - 12:00 a.m. PT on June 24, 2024 – 11:59 p.m. PT on June 30, 2024 - 12:00 a.m. PT on July 1, 2024 – 11:59 p.m. PT on July 7, 2024 - 12:00 a.m. PT on July 8, 2024 – 11:59 p.m. PT on July 14, 2024   Entries will be eligible for the Entry Period in which they are received and will not carryover to subsequent weekly entry periods.  You must enter into each weekly Entry Period separately.   How to Enter: We invite you to participate in a quest to provide "Accepted Solutions" to as many questions as you can. Answers can be provided in all the communities. Users must provide a solution which can be an “Accepted Solution” in the Forums in all of the communities and there are no limits to the number of “Accepted Solutions” that a member can provide for entries in this challenge, but each entry must be substantially unique and different.    Winner Selection and Prizes: At the end of each week, we will list the top ten (10) Community users which will consist of: 5 Community Members & 5 Super Users and they will advance to the final drawing. We will post each week in the News & Announcements the top 10 Solution providers.  At the end of the challenge, we will add all of the top 10 weekly names and enter them into a random drawing.  Then we will randomly select ten (10) winners (5 Community Members & 5 Super Users) from among all eligible entrants received across all weekly Entry Periods to receive the prize listed below. If a winner declines, we will draw again at random for the next winner.  A user will only be able to win once overall. If they are drawn multiple times, another user will be drawn at random.  Individuals will be contacted before the announcement with the opportunity to claim or deny the prize.  Once all of the winners have been notified, we will post in the News & Announcements of each community with the list of winners.   Each winner will receive one (1) Pass to the Power Platform Conference in Las Vegas, Sep. 18-20, 2024 ($1800 value). NOTE: Prize is for conference attendance only and any other costs such as airfare, lodging, transportation, and food are the sole responsibility of the winner. Tickets are not transferable to any other party or to next year’s event.   ** PLEASE SEE THE ATTACHED RULES for this CHALLENGE**

Celebrating the June Super User of the Month: Markus Franz

Markus Franz is a phenomenal contributor to the Power Apps Community. Super Users like Markus inspire others through their example, encouragement, and active participation.    The Why: "I do this to help others achieve what they are trying to do. As a total beginner back then without IT background I know how overwhelming things can be, so I decided to jump in and help others. I also do this to keep progressing and learning myself." Thank you, Markus Franz, for your outstanding work! Keep inspiring others and making a difference in the community! 🎉  Keep up the fantastic work! 👏👏 Markus Franz | LinkedIn  Power Apps: mmbr1606  

Your Moment to Shine: 2024 PPCC’s Got Power Awards Show

For the third year, we invite you, our talented community members, to participate in the grand 2024 Power Platform Community Conference's Got Power Awards. This event is your opportunity to showcase solutions that make a significant business impact, highlight extensive use of Power Platform products, demonstrate good governance, or tell an inspirational story. Share your success stories, inspire your peers, and show off some hidden talents.  This is your time to shine and bring your creations into the spotlight!  Make your mark, inspire others and leave a lasting impression. Sign up today for a chance to showcase your solution and win the coveted 2024 PPCC’s Got Power Award. This year we have three categories for you to participate in: Technical Solution Demo, Storytelling, and Hidden Talent.      The Technical solution demo category showcases your applications, automated workflows, copilot agentic experiences, web pages, AI capabilities, dashboards, and/or more. We want to see your most impactful Power Platform solutions!  The Storytelling category is where you can share your inspiring story, and the Hidden Talent category is where your talents (such as singing, dancing, jump roping, etc.) can shine! Submission Details:  Fill out the submission form https://aka.ms/PPCCGotPowerSignup by July 12th with details and a 2–5-minute video showcasing your Solution impact. (Please let us know you're coming to PPCC, too!)After review by a panel of Microsoft judges, the top storytellers will be invited to present a virtual demo presentation to the judges during early August. You’ll be notified soon after if you have been selected as a finalist to share your story live at PPCC’s Got Power!  The live show will feature the solution demos and storytelling talents of the top contestants, winner announcements, and the opportunity to network with your community.  It's not just a showcase for technical talent and storytelling showmanship, show it's a golden opportunity to make connections and celebrate our Community together! Let's make this a memorable event! See you there!   Mark your calendars! Date and Time: Thursday, Sept 19th Location: PPCC24 at the MGM Grand, Las Vegas, NV 

Tuesday Tip | Accepting Solutions

It's time for another TUESDAY TIPS, your weekly connection with the most insightful tips and tricks that empower both newcomers and veterans in the Power Platform Community! Every Tuesday, we bring you a curated selection of the finest advice, distilled from the resources and tools in the Community. Whether you’re a seasoned member or just getting started, Tuesday Tips are the perfect compass guiding you across the dynamic landscape of the Power Platform Community.   To enhance our collaborative environment, it's important to acknowledge when your question has been answered satisfactorily. Here's a quick guide on how to accept a solution to your questions: Find the Helpful Reply: Navigate to the reply that has effectively answered your question.Accept as Solution: Look for the "Accept as Solution" button or link, usually located at the bottom of the reply.Confirm Your Selection: Clicking this button may prompt you for confirmation. Go ahead and confirm that this is indeed the solution.Acknowledgment: Once accepted, the reply will be highlighted, and the original post will be marked as "Solved". This helps other community members find the same solution quickly. By marking a reply as an accepted solution, you not only thank the person who helped you but also make it easier for others with similar questions to find answers. Let's continue to support each other by recognizing helpful contributions. 

Top Solution Authors
Users online (2,884)