I have a Web Extraction that pulls data from a job posting site. It works for most of the text I need extracted, but on 10%-15% of the rows, it extracts some of the text and the rest it extracts in its source/HTML format, rendering it useless. I need it to only extract the text, not half text then the rest as HTML.
Here is an example of what the problem looks like:
this is the web extraction selector:
my CSS selectors are
html > body > div:eq(0) > div > div > div:eq(1) > div > div:eq(2) > div
div:eq(0) > div > label
div:eq(0) > div > div
I think it must be something with the selector, because the problem shows up in the selector tool:
I need help extracting only the text, not half the text, then some HTML code.
Solved! Go to Solution.
Hello @Duck_Taper !
This is strange behavior, probably it is related to the HTML code of the page.
Could you please try to extract the specific area with a separate action? Either with 'Extract data from web page' or the 'Get details of element on web page'.
I've been using 'Extract data from web page' and have tried capturing it different ways, like extracting it as a table or list, Extracting a specific section and broadening out to extract a larger chunk of the web content. However, I still get the same html problem, no matter how I slice it.
I tried 'Get details of element on web page' and got the same results: text with the html showing in the middle
Agreed that it's strange behavior. Any help is appreciated.
Learn to digitize and optimize business processes and connect all your applications to share data in real time.
Read the latest about new experiences and capabilities in the Power Automate product blog.
If you are a small business ISV/Reseller, share your thoughts with our research team.
The AI Builder team invite you to ask questions and provide helpful answers at our next AMA.