Solved: Extracting PDF data with Form Recognizer and savin...

Anonymous · ‎12-18-2019

Hi all,

I am currently trying to extract data from a PDF file with Microsoft Form Recognizer whenever a file is uploaded to my SharePoint site. The extracted data should be added to the original file as metadata.

Everything works fine except for the template language expression that uses the JSON output from Form Recognizer to find the right key-value pairs and add their data to the PDF.

The JSON output from Form Recognizer looks like this:

My Flow looks like this:

In "Update file properties" I try to get the data from the JSON via this expression:

body('Parse_JSON')?['pages'][0]?['Datum:']?['value']

It seems to work, because I no longer get an error, but the metadata field is empty.

If I try any index other than [0], I get this error, and I do not understand why:

To be honest, I do not really understand template language expressions or how to iterate through an array.

Additionally, fixed indexes do not make much sense for my purpose, because I want to be able to analyze documents with other formats as well. Whenever the key-value pair I want to extract is on a different line, an index-based approach would not work.

I would appreciate any help.

Best regards,

AdriP

manuelstgomes · ‎01-02-2020

Hi @Anonymous

You need to put the expression "body('Parse_JSON')?['pages'][0]?['Datum:']?['value']" inside an "Apply to each" action and then perform your actions there.

Something like this:

The error message indicates that you are indexing into an array that has no element at that position: the expression tries to access position 1, but that position is not available.
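To illustrate the idea outside of Flow, here is a minimal Python sketch of looping over the pages and looking a key up by name instead of by a fixed index. The payload shape (a "pages" array whose items hold a "keyValuePairs" list of "key"/"value" entries), the sample values, and the helper name find_value are assumptions made for illustration only; the actual Form Recognizer output in your flow may be shaped differently.

```python
from typing import Optional

# Hypothetical sample payload, NOT the real Form Recognizer output.
# Only the key name "Datum:" comes from the question; the rest is made up.
sample_output = {
    "pages": [
        {
            "keyValuePairs": [
                {"key": "Name:", "value": "Example"},
                {"key": "Datum:", "value": "18.12.2019"},
            ]
        }
    ]
}


def find_value(output: dict, wanted_key: str) -> Optional[str]:
    """Return the value of the first pair whose key matches, else None."""
    # Loop over every page instead of hard-coding pages[0].
    for page in output.get("pages", []):
        # Loop over every pair instead of relying on its position.
        for pair in page.get("keyValuePairs", []):
            if pair.get("key") == wanted_key:
                return pair.get("value")
    # No match on any page: report "not found" rather than failing.
    return None


print(find_value(sample_output, "Datum:"))
```

With this sample, asking directly for sample_output["pages"][1] would raise an IndexError because there is only one page, which is the same kind of failure as the out-of-range index in the flow. Looping avoids that, and it also keeps working when the key sits on a different line or page in another document layout.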

If I have answered your question, please mark your post as Solved.
If you like my response, please give it a Thumbs Up.

Cheers
Manuel
