cancel
Showing results for 
Search instead for 
Did you mean: 
Reply
NandorR
Frequent Visitor

Failed to extract data (web page error while extracting data).

Hi!

 

I am currently trying to do a webscraping flow to copy data in a table to an excel file:

  1. I used Recorder to identify which data to copy and I selected to extract HTML Table
  2. However when the flow completes, there is no data in the excel file.
  3. When i look at the Flow Variable itself, there is no values at all refer to pict below.
    image.png

    Has anybody ever encountered such an issue before, and how do I resolve it?

21 REPLIES 21
Ankesh_49
Super User
Super User

Could you please share the PAD script you are using?

Hi Ankesh_49,

 

Thank you for the response! I am pretty new to PAD, so may I know what is the script you are referring to? Is it the screenshot below?image.png

Works fine.

Ankesh_49_0-1655889411013.png

Could you please share the Web url, you are using for data extraction.

Hi Ankesh,

 

I am unable to share the URLas it is a company intranet link. Is it possible that the intranet website is the reason why PAD is unable to pull anything?

Not at all.

 

Could you please try adding a delay after step 11 in your script.

Also, could you please confirm if you are getting below option while doing a record

Ankesh_49_0-1655891783597.png

 

Cheers,

Ankesh

--------------------------------

If this post helps answer your question, please click on “Accept as Solution” to help other members find it more quickly. If you thought this post was helpful, please give it a Thumbs Up.

 

VJR
Super User
Super User

@NandorR 

 

Instead of the recorder use the "Extract data from web page" action.

Also see on which Html tag are you selecting the "Extract entire html table"?

 

For example in this case I am doing it on the <th> tag and then able to see the columns in the preview section on the right.

 

Likewise play around and see which one works for you. 

Sometimes it is also the <table> tag.

VJR_0-1655892032608.png

Hi,

 

My answer to your comments in blue:

 

  • Could you please try adding a delay after step 11 in your script. Performed but still got the same results)

image.png

  • Also, could you please confirm if you are getting below option while doing a record (i did get that same option when i used the Recorder function. However when i clicked it, and ran the flow, the variables were still empty)

image.png

Ankesh_49
Super User
Super User

@NandorR Could you please check if you are using correct browser instance? %browser2% or %browser1%.

 

Cheers,

Ankesh

--------------------------------

If this post helps answer your question, please click on “Accept as Solution” to help other members find it more quickly. If you thought this post was helpful, please give it a Thumbs Up.

NandorR
Frequent Visitor

Hi VJR,

 

Thank you for your response. My answer to your comments in blue:

 

  • Instead of the recorder use the "Extract data from web page" action. (For some reason this doesn't seem to work for me, when i select "Extract data from web page" it cannot seem to recognize any of the elements in the web page, though could it be due to lag as I am using it on MS Edge?)
  • Also see on which Html tag are you selecting the "Extract entire html table"? (For this case i selected <td> it still returned a blank table when i ran the code, when i originally selected <table> it still yielded the same issue. though i will try clicking on other types of tags to see if it works. Also it should be noted, I originally selected specific elements in the web page, the flow variables still showed blanks.)

image.png

Ankesh_49
Super User
Super User

@NandorR Is this webtable inside some webframe?

NandorR
Frequent Visitor

Hi VJR,

 

  • I managed to try with "Extract data from web page" however the results was the same. In the "Extractor Preview" I was able to see the values i wanted (censored for confidentiality reasons)

image.png

  • But when i ran the flow, the flow variable i got was blank. While the columns were the correct quantity (8 columns), the number of rows generated was incorrect (total data rows was definitely more than 3 rows)

image.png

  • Additionally when i look at the web page, the only tags i see are <td> (the data i want is in in this tag), <a>, <b>.<table> does not appear unless i select "extract entire HTML Table". Could this be the issue?
NandorR
Frequent Visitor

Hi Ankesh,

 

To my limited knowledge in html, I don't think the table is in a webframe? As when i view page source, I am unable to find any mention of "frame" when i Ctr+f "frame" in the page source. Additionally, the only tags i see by default when i use Data Extractor are <td> (the data i want is in in this tag), <a>, and <b>.<table> does not appear unless i select "extract entire HTML Table" after clicking an element.

Ankesh_49
Super User
Super User

@NandorR  Could you please check it on other websites, if table data is getting extracted.

Hi Ankesh,

 

It works on other websites. I was able to pull data into a table.

image.png

However under "Advanced Settings" i noticed there was a difference in the CSS Selector description.

 

  • Website where data unable to be extracted: html > body > table
  • Website where data able to be extracted: html > body > form > main > div:eq(1) > div > div > div:eq(1) > div > div:eq(0) > div > div > div > div:eq(0) > table

 

Would this be what is causing no data to be extracted?

Ankesh_49
Super User
Super User

@NandorR  Could you please try this:

1. Add the HTML table in ui element

Open selector builder and see how is it getting identified by PAD

Ankesh_49_0-1656104487972.png

and try using that attribute while creating custom selector

Ankesh_49_1-1656104645412.png

 

Hope it helps!!

 

Cheers,

Ankesh

--------------------------------

If this post helps answer your question, please click on “Accept as Solution” to help other members find it more quickly. If you thought this post was helpful, please give it a Thumbs Up.

NandorR
Frequent Visitor

Hi Ankesh,

 

I did like you suggested.

 

  • I identified the UI of some sample values in the webpage (circled in green)

image.png

  • I then entered it into the respective "CSS Selector" and "Attribute" (e.g. i changed the initial CSS selector from "html > body > form > table:eq(1) > tbody > tr:eq(0) > td > input:eq(1)" to "html > body > form > table:eq(1) > tbody > tr:eq(0) > td > input[Id="styleSmall"]" and Attribute "Own Text" to "id".

image.png

  • I ran the flow, which ran without any errors, but when I opened the variables, it was still blank

image.png

  •  When i look back at the CSS selector and attribute, I can see that it was successfully saved. Howver I am still not getting any results. Do you have any other suggestions as to what may be wrong?

image.png

Ankesh_49
Super User
Super User

@NandorR  Only thing which I can think of now, if you could share a similar website so that people here can look into it.

Thank you

momlo
Super User
Super User

Hi @NandorR 

Apologies if that was tested already, but did you test your extract data from web page in isolation, not depended on the send keys actions that happen earlier in your code?

I'm asking as perhaps your web page navigation does not reach to the point where table is displayed, hence extraction fails.

 

What I would do is to deactivate all actions except extract data (or create fresh flow with just this action), navigate to the page manually and test the action. If this works, your flow has issue with prior actions.

Helpful resources

Announcements

Announcing | Super Users - 2023 Season 1

Super Users – 2023 Season 1    We are excited to kick off the Power Users Super User Program for 2023 - Season 1.  The Power Platform Super Users have done an amazing job in keeping the Power Platform communities helpful, accurate and responsive. We would like to send these amazing folks a big THANK YOU for their efforts.      Super User Season 1 | Contributions July 1, 2022 – December 31, 2022  Super User Season 2 | Contributions January 1, 2023 – June 30, 2023    Curious what a Super User is? Super Users are especially active community members who are eager to help others with their community questions. There are 2 Super User seasons in a year, and we monitor the community for new potential Super Users at the end of each season. Super Users are recognized in the community with both a rank name and icon next to their username, and a seasonal badge on their profile.    Power Apps  Power Automate  Power Virtual Agents  Power Pages  Pstork1*  Pstork1*  Pstork1*  OliverRodrigues  BCBuizer  Expiscornovus*  Expiscornovus*  ragavanrajan  AhmedSalih  grantjenkins  renatoromao    Mira_Ghaly*  Mira_Ghaly*      Sundeep_Malik*  Sundeep_Malik*      SudeepGhatakNZ*  SudeepGhatakNZ*      StretchFredrik*  StretchFredrik*      365-Assist*  365-Assist*      cha_cha  ekarim2020      timl  Hardesh15      iAm_ManCat  annajhaveri      SebS  Rhiassuring      LaurensM  abm      TheRobRush  Ankesh_49      WiZey  lbendlin      Nogueira1306  Kaif_Siddique      victorcp  RobElliott      dpoggemann  srduval      SBax  CFernandes      Roverandom  schwibach      Akser  CraigStewart      PowerRanger  MichaelAnnis      subsguts  David_MA      EricRegnier  edgonzales      zmansuri  GeorgiosG      ChrisPiasecki  ryule      AmDev  fchopo      phipps0218  tom_riha      theapurva  takolota     Akash17  momlo     BCLS776  Shuvam-rpa     rampprakash   ScottShearer     Rusk   ChristianAbata     cchannon   Koen5     a33ik        AaronKnox        Matren        Alex_10        Jeff_Thorpe        poweractivate        Ramole        DianaBirkelbach        DavidZoon        AJ_Z        PriyankaGeethik        BrianS        StalinPonnusamy        HamidBee        CNT        Anonymous_Hippo        Anchov        KeithAtherton        alaabitar        Tolu_Victor        KRider        sperry1625        IPC_ahaas      zuurg     rubin_boer     cwebb365       If an * is at the end of a user's name this means they are a Multi Super User, in more than one community. Please note this is not the final list, as we are pending a few acceptances.  Once they are received the list will be updated. 

Microsoft Power Platform | March 2023 Newsletter

Welcome to our March 2023 Newsletter, where we'll be highlighting the great work of our members within our Biz Apps communities, alongside the latest news, video releases, and upcoming events. If you're new to the community, be sure to subscribe to the News & Announcements and stay up to date with the latest news from our ever-growing membership network who find real "Power in the Community".        LATEST NEWS Power Platform Connections Check out Episode Five of Power Platform Connections, as David Warner II and Hugo Bernier chat with #PowerAutomate Vice President, Stephen Siciliano, alongside reviewing out the great work of Vesa Juvonen, Waldek Mastykarz, Maximilian Müller, Kristine Kolodziejski, Danish Naglekar, Cat Schneider, Victor Dantas, and many more.       Use the hashtag #PowerPlatformConnects on social media for a chance to have your work featured on the show!   Did you miss an episode?  Catch up now in the Community Connections Galleries Power Apps, Power Automate, Power Virtual Agents, Power Pages     Power Platform leading a new era of AI-generated low-code development.   **HOT OFF THE PRESS** Fantastic piece here by Charles Lamanna on how we're reinventing software development with Copilot in Power Platform to help you can build apps, flows, and bots with just a simple description! Click here to see the Product Blog         Copilot for Power Apps - Power CAT Live To follow on from Charles' blog, check out #PowerCATLive as Phil Topness gives Clay Wesener Wesner a tour of the capabilities of Copilot in Power Apps.           UPCOMING EVENTS   Modern Workplace Conference Check out the Power Platform and Microsoft 365 Modern Workplace Conference that returns face-to-face at the Espace St Martin in Paris on 27-28th March. The #MWCP23 will feature a wide range of expert speakers, including Nadia Yahiaoui, Amanda Sterner, Pierre-Henri, Chirag Patel, Chris Hoard, Edyta Gorzoń, Erika Beaumier, Estelle Auberix, Femke Cornelissen, Frank POIREAU, Gaëlle Moreau, Gilles Pommier, Ilya Fainberg, Julie Ecolivet, Mai-Lynn Lien, Marijn Somers, Merethe Stave, Nikki Chapple, Patrick Guimonet, Penda Sow, Pieter Op De Beéck, Rémi Riche, Robin Doudoux, Stéphanie Delcroix, Yves Habersaat and many more.  Click here to find out more and register today!     Business Applications Launch 2023 Join us on Tuesday 4th April 2023 for an in-depth look into the latest updates across Microsoft Power Platform and Microsoft Dynamics 365 that are helping businesses overcome their biggest challenges today. Find out about new features, capabilities, and best practices for connecting data to deliver exceptional customer experiences, collaborating and creating using AI-powered capabilities, driving productivity with automation, and building future growth with today’s leading technology. Click Here to Register Today!       Power Platform Conference 2023 We are so excited to see you for the Microsoft Power Platform Conference in Las Vegas October 3-5th, 2023! But first, let's take a look below at some fun moments from MPPC 2022 in Orlando Florida. 2023 sees guest speakers such as Charles Lamanna, Heather Cook, Julie Strauss, Nirav Shah, Ryan Cunningham, Sangya Singh, and many more taking part, so why not click the link below to register for the #PowerPlatformConf today! Vegas, baby! Click Here to Register Today!      COMMUNITY HIGHLIGHTS Check out our top Super and Community Users reaching new levels!  These hardworking members are posting, answering questions, kudos, and providing top solutions in their communities.   Power Apps:  Super Users:  @WarrenBelz  |  @iAm_ManCat  Community Users: @LaurensM | @Rusk | @RJM07    Power Automate:   Super Users: @abm  | @Expiscornovus | @RobElliott  Community Users:  @grantjenkins | @Chriddle    Power Virtual Agents:   Super Users: @Expiscornovus | @Pstork1  Community Users: @MisterBates | @Jupyter123 | Kunal K   Power Pages: Super Users:  @OliverRodriguesOliverRodrigues | @Mira_Ghaly  Community Users: @FubarFubar | @ianwukianwuk  LATEST PRODUCT BLOG ARTICLES  Power Apps Community Blog  Power Automate Community Blog  Power Virtual Agents Community Blog  Power Pages Community Blog  Check out 'Using the Community' for more helpful tips and information:  Power Apps, Power Automate, Power Virtual Agents, Power Pages 

Register now for the Business Applications Launch Event | Tuesday, April 4, 2023

Join us for an in-depth look into the latest updates across Microsoft Dynamics 365 and Microsoft Power Platform that are helping businesses overcome their biggest challenges today.   Find out about new features, capabilities, and best practices for connecting data to deliver exceptional customer experiences, collaborating, and creating using AI-powered capabilities, driving productivity with automation—and building towards future growth with today’s leading technology.   Microsoft leaders and experts will guide you through the full 2023 release wave 1 and how these advancements will help you: Expand visibility, reduce time, and enhance creativity in your departments and teams with unified, AI-powered capabilities.Empower your employees to focus on revenue-generating tasks while automating repetitive tasks.Connect people, data, and processes across your organization with modern collaboration tools.Innovate without limits using the latest in low-code development, including new GPT-powered capabilities.    Click Here to Register Today!    

Power Platform Connections - Episode 5 | March 16, 2023

Episode Five of Power Platform Connections sees David Warner and Hugo Bernier talk to Vice President of Power Automate, Stephen Siciliano, alongside the latest news, product reviews, and community blogs.     Use the hashtag #PowerPlatformConnects on social media for a chance to have your work featured on the show!      Show schedule in this episode:  0:00 Cold Open  0:34 Show Intro  01:09 Stephen Siciliano Interview  30:42 Blogs & Articles  31:06 PnP Weekly Ep 200  32:51 SharePoint Custom Form Backup  33:38 Power Apps Extreme Makeover  34:56 ChatGPT Control  35:35 Color Data  37:17 Top 7 Features on Dynamics 365 2023 Release Wave 1  38:30 Outro & Bloopers  Check out the blogs and articles featured in this week’s episode:    https://pnp.github.io/blog/microsoft-365-pnp-weekly/episode-200/​ (no tags)   https://grazfuchs.net/post/sharepoint-customform-backup/ @Maximilian Müllerhttps://www.fromzerotoheroes.com/ @Kristine Kolodziejski​ https://github.com/Power-Maverick/PCF-Controls/tree/master/ChatGPTControl @DanzMaverick https://yerawizardcat.com/color/ ​ @CatSchneider https://events.powercommunity.com/dynamics-power-israel/ @VictorDantas  Action requested: Feel free to provide feedback on how we can make our community more inclusive and diverse.  This episode premiered live on our YouTube at 12pm PST on Thursday, 16th March 2023.  Video series available at Power Platform Community YouTube channel.    Upcoming events:  Business Applications Launch – April 4th – Free and Virtual! M365 Conference - May 1-5th - Las Vegas Power Apps Developers Summit – May 19-20th - London European Power Platform conference – Jun. 20-22nd - Dublin Microsoft Power Platform Conference – Oct. 3-5th - Las Vegas  Join our Communities:  Power Apps Community Power Automate Community Power Virtual Agents Community Power Pages Community  If you’d like to hear from a specific community member in an upcoming recording and/or have specific questions for the Power Platform Connections team, please let us know. We will do our best to address all your requests or questions.     

Check out the new Power Platform Communities Front Door Experience!

We are excited to share the ‘Power Platform Communities Front Door’ experience with you!   Front Door brings together content from all the Power Platform communities into a single place for our community members, customers and low-code, no-code enthusiasts to learn, share and engage with peers, advocates, community program managers and our product team members. There are a host of features and new capabilities now available on Power Platform Communities Front Door to make content more discoverable for all power product community users which includes ForumsUser GroupsEventsCommunity highlightsCommunity by numbersLinks to all communities Users can see top discussions from across all the Power Platform communities and easily navigate to the latest or trending posts for further interaction. Additionally, they can filter to individual products as well.   Users can filter and browse the user group events from all power platform products with feature parity to existing community user group experience and added filtering capabilities.     Users can now explore user groups on the Power Platform Front Door landing page with capability to view all products in Power Platform.      Explore Power Platform Communities Front Door today. Visit Power Platform Community Front door to easily navigate to the different product communities, view a roll up of user groups, events and forums.

Microsoft Power Platform Conference | Registration Open | Oct. 3-5 2023

We are so excited to see you for the Microsoft Power Platform Conference in Las Vegas October 3-5 2023! But first, let's take a look back at some fun moments and the best community in tech from MPPC 2022 in Orlando, Florida.   Featuring guest speakers such as Charles Lamanna, Heather Cook, Julie Strauss, Nirav Shah, Ryan Cunningham, Sangya Singh, Stephen Siciliano, Hugo Bernier and many more.   Register today: https://www.powerplatformconf.com/   

Users online (2,279)