Hi!
I am currently trying to do a webscraping flow to copy data in a table to an excel file:
Has anybody ever encountered such an issue before, and how do I resolve it?
Could you please share the PAD script you are using?
Hi Ankesh_49,
Thank you for the response! I am pretty new to PAD, so may I know what is the script you are referring to? Is it the screenshot below?
Works fine.
Could you please share the Web url, you are using for data extraction.
Hi Ankesh,
I am unable to share the URLas it is a company intranet link. Is it possible that the intranet website is the reason why PAD is unable to pull anything?
Not at all.
Could you please try adding a delay after step 11 in your script.
Also, could you please confirm if you are getting below option while doing a record
Cheers,
Ankesh
--------------------------------
If this post helps answer your question, please click on “Accept as Solution” to help other members find it more quickly. If you thought this post was helpful, please give it a Thumbs Up.
Instead of the recorder use the "Extract data from web page" action.
Also see on which Html tag are you selecting the "Extract entire html table"?
For example in this case I am doing it on the <th> tag and then able to see the columns in the preview section on the right.
Likewise play around and see which one works for you.
Sometimes it is also the <table> tag.
Hi,
My answer to your comments in blue:
@NandorR Could you please check if you are using correct browser instance? %browser2% or %browser1%.
Cheers,
Ankesh
--------------------------------
If this post helps answer your question, please click on “Accept as Solution” to help other members find it more quickly. If you thought this post was helpful, please give it a Thumbs Up.
Hi VJR,
Thank you for your response. My answer to your comments in blue:
Hi VJR,
Hi Ankesh,
To my limited knowledge in html, I don't think the table is in a webframe? As when i view page source, I am unable to find any mention of "frame" when i Ctr+f "frame" in the page source. Additionally, the only tags i see by default when i use Data Extractor are <td> (the data i want is in in this tag), <a>, and <b>.<table> does not appear unless i select "extract entire HTML Table" after clicking an element.
@NandorR Could you please check it on other websites, if table data is getting extracted.
Hi Ankesh,
It works on other websites. I was able to pull data into a table.
However under "Advanced Settings" i noticed there was a difference in the CSS Selector description.
Would this be what is causing no data to be extracted?
@NandorR Could you please try this:
1. Add the HTML table in ui element
Open selector builder and see how is it getting identified by PAD
and try using that attribute while creating custom selector
Hope it helps!!
Cheers,
Ankesh
--------------------------------
If this post helps answer your question, please click on “Accept as Solution” to help other members find it more quickly. If you thought this post was helpful, please give it a Thumbs Up.
Hi Ankesh,
I did like you suggested.
@NandorR Only thing which I can think of now, if you could share a similar website so that people here can look into it.
Thank you
Hi @NandorR
Apologies if that was tested already, but did you test your extract data from web page in isolation, not depended on the send keys actions that happen earlier in your code?
I'm asking as perhaps your web page navigation does not reach to the point where table is displayed, hence extraction fails.
What I would do is to deactivate all actions except extract data (or create fresh flow with just this action), navigate to the page manually and test the action. If this works, your flow has issue with prior actions.