cancel
Showing results for 
Search instead for 
Did you mean: 
Reply
PAuserFromFranc
Helper III
Helper III

New webflow from specific website not able to extract data

Hello Guys, 

 

Is someone able to build up a flow that can extract all the contacts informations listed in all the companies of this website?

i canno't find the right selector to grab it

https://www.energaia.fr/visiter/liste-des-exposants/

thanks for helping, 

 

regards, 

 

Fred

2 ACCEPTED SOLUTIONS

Accepted Solutions
Henrik_M
Super User
Super User

This should work for you. Paste the whole text into an empty PAD flow.

 

Look it through and see if it makes sense to you.

 

You must have this page open when you run it: https://exposants.energaia.fr/form/liste_exposant&lang=fr&session=EN22&langue_id=1

 

View solution in original post

https://regexone.com/ takes you through many of the basics, but (a lot of) practice is what makes... proficient, at some point 😅

 

But actually you shouldn't even need the regular expressions that much moving forward, since the Crop text action can do the whole "get text between two other texts" thing that I did with parse text.

 

By the way, the Replace text that I add, is just because I find new lines to be annoying when it comes to parsing, so I tend to reduce them to regular spaces.

Henrik_M_0-1664919298934.png

 

View solution in original post

18 REPLIES 18
PAuserFromFranc
Helper III
Helper III

@VJR @Henri @Henrik_M @Ankesh_49 

Hi Team, requesting your help to manage selector and build this flow i'm asking for...

May you help please?

thx fred

Ankesh_49
Super User
Super User

@PAuserFromFranc   Could you please share the flow you have developed? which selector are you using?

PAuserFromFranc
Helper III
Helper III

@Ankesh_49 at the moment nothing but i want to grab information from 

https://www.energaia.fr/visiter/liste-des-exposants/

for each company name found in the table, then open the little down arrow and get name, phone, email, adress and field of activity for thoses companies and also for each page (we need also pagination)

Usually i know how to do it but i can't here normally selector would be : 

body > main > section > div > div > form > div:eq(3) > table > tr.odd:nth-child(1) > td:nth-child(1) > span:nth-child(2)

something like this with attribute like tr[Class="odd"]

 

Pavel_NaNoi
Impactful Individual
Impactful Individual

From my limited research on this, most I can tell you is that you'll need to run a javascript in order for this to work on the webpage. Why? well because the company information is not actually on the webpage, but rather an imbued document, an iframe. Thus to access it you need to run javascript that can somehow switch the CSS selector from the main page to this iframe of the imbued document. I have honestly no idea how to do that and I hope someone more well versed in this will come along to help out with this.

Pavel_NaNoi
Impactful Individual
Impactful Individual

Figured it out, do the following:

-Launch the webpage

-Use Extract data from web page

-In the advanced options write the following css selector - > "iframe:eq(0)" with the following attribute "src"

this will get you the html link of the iframe, now simply launch a new chrome instance with that link and from there you can extract everything as normal.

 

Enjoy.

PAuserFromFranc
Helper III
Helper III

@Ankesh_49 @Pavel_NaNoi 

thank you i could step a bit but i'm still stuck with a javascript to execute to open the little arrow and get the datas

PAuserFromFranc_0-1664476023464.png

 

Shouldn't need javascript for that, does extract data from webpage not work?

Alright, I can see now why you were struggling on that extraction part, got some good news and some bad news,

Good news, I made an automation that does what you want, opens the arrow, extracts text and moves to the next.

Bad news? its 1 minute 30 seconds per page (in 1ms delay debug mode)

 

I don't see a way of improving that time other than maybe just using an API call method or something (I have no idea how to do that don't even ask)

 

However if you want this automation, private message me (just click my profile, should see the button on the right), I'll send it over to you, it will need some editing from your end though.

In summary it does the following:

- Creates a new Datatable 

- Gets the number of pages to go through

- Creates a loop based on the number of pages

- Extracts arrows count on page

- Goes through each arrow, extracting specific text (can be edited to extract w/e)

- Puts that information into the Datatable (will need editing if the above is changed)

- Once done, moves to next page and repeat till finished.

Henrik_M
Super User
Super User

Step one should be to enter the iframe directly:  https://exposants.energaia.fr/form/liste_exposant&lang=fr&session=EN22&langue_id=1 

Henrik_M_0-1664552145732.png

 

I thought about the program in my head, and it should be possible. I'll see if I have time to make it during the weekend, then I can share.

PAuserFromFranc
Helper III
Helper III

Enter the iframe, i did but the rest...i'm stuck, thanks again @Henrik_M 

Henrik_M
Super User
Super User

This should work for you. Paste the whole text into an empty PAD flow.

 

Look it through and see if it makes sense to you.

 

You must have this page open when you run it: https://exposants.energaia.fr/form/liste_exposant&lang=fr&session=EN22&langue_id=1

 

Wait, Henrik, how'd you put a zip file attachment in your message? I can't seem to do it, just tells me its no supported.

I might have more privileges because of the Super User status. I only got the "not supported" message when I tried uploading the .txt file 🤔

Henrik_M_0-1664648781703.png

 

Ah, fair enough.

PAuserFromFranc
Helper III
Helper III

Thank you so much @Henrik_M 

I'm now trying to understand the flow you made but too difficult. I don't get this for instance :

table[Id="exposant"] > tbody > tr > td > span[Class*="fa-chevron"]:eq(%LoopIndex_Chevron%)

loopIndex_Chevron is variable and you use it as attribute right to keep forward?

And what does mean the little * after Class?

The rest i get it i think but very complex for me to think the algorithmes this way...

thank for all

Fred

Correct. Since we know that there are 25 entries on each page, we count from index 0 to 24.

 

*= is the way to write the contains operator between an attribute (the class) and the value (fa-chevron)

 

So in this case, we are able to advance down through the list and open each description box, regardless of the chevron type.

Henrik_M_0-1664730000646.png

 

PAuserFromFranc
Helper III
Helper III

Hi @Henrik_M where can i learn Regex like you did (?<=Contact : ).+?(?=string) and so on? i don't get it and i'm not into code or regex so i can't understand it well in order to use it for similar flows which attend to be some others texts to parse

thanks

https://regexone.com/ takes you through many of the basics, but (a lot of) practice is what makes... proficient, at some point 😅

 

But actually you shouldn't even need the regular expressions that much moving forward, since the Crop text action can do the whole "get text between two other texts" thing that I did with parse text.

 

By the way, the Replace text that I add, is just because I find new lines to be annoying when it comes to parsing, so I tend to reduce them to regular spaces.

Henrik_M_0-1664919298934.png

 

Helpful resources

Announcements

Announcing Power Apps Copilot Cookbook Gallery

We are excited to share that the all-new Copilot Cookbook Gallery for Power Apps is now available in the Power Apps Community, full of tips and tricks on how to best use Microsoft Copilot as you develop and create in Power Apps. The new Copilot Cookbook is your go-to resource when you need inspiration--or when you're stuck--and aren't sure how to best partner with Copilot while creating apps.   Whether you're looking for the best prompts or just want to know about responsible AI use, visit Copilot Cookbook for regular updates you can rely on--while also serving up some of your greatest tips and tricks for the Community. Our team will be reviewing posts using the new "Copilot" label to ensure we highlight and amplify the most relevant and recent content, so you're assured of high-quality content every time you visit. If you share a post that gets featured in the curated gallery, you'll get a PM in the Community to let you know!The curated gallery is ready for you to experience now, so visit the new Copilot Cookbook for Power Apps today: Copilot Cookbook - Power Platform Community. We can't wait to see what you "cook" up!    

Tuesday Tips: Getting Started in the Community

TUESDAY TIPS is back!   This weekly series of posts is our way of sharing helpful things we've learned or shared that have helped members of the Community. Whether you're just getting started or you're a seasoned pro, Tuesday Tips will help you know where to go, what to look for, and navigate your way through the ever-growing--and ever-changing--world of the Power Platform Community! The original run of Tuesday Tips was a highlight of last year, and these all-new Tips will hopefully prove to be just as informative as helpful. We will cover some basics about the Community, a few "insider tips" to make your experience even better, and sharing best practices gleaned from our most active community members and Super Users. Make sure to watch the News & Announcements each week for the latest and greatest Tuesday Tips!   THIS WEEK: I'm Brand New! What Do I Do? The number of new community members we have each week is pretty amazing, and we are so glad to welcome all of you to the Community! You may be wondering. "What do I do? Where do I get started? Will anyone be willing to help me? What I have a question? Help!"   Let's start with this: Welcome to the low-code revolution, and more importantly, welcome to the Power Platform Community! This is a great place to start. Whether you're busy with Power Apps, getting familiar with Power Automate, engaging Copilot Studio, or building in Power Pages, there are a few key places you should check out as you begin your journey: FORUMS: The forums are THE place to ask questions, look at questions asked by other Community members—and see answers and solutions from our Super Users and other helpful people in the Community. Power Apps ForumsPower Automate ForumsCopilot Studio ForumsPower Pages Forums   NEWS & ANNOUNCEMENTS: Our News & Announcements section highlights the newest and greatest updates in the Community, news from the product team, and so much more. It’s updated a few times each week and will also help you find ways to connect with what’s going on in the ever-growing world of Power Platform. Power Apps News & AnnouncementsPower Automate News & AnnouncementsCopilot Studio News & AnnouncementsPower Pages News & Announcements   GALLERIES: The Galleries section of the Community features tons of tips and tricks, features and benefits, and more—through videos created by our Super Users, product teams, and other helpful members of the Community. Power Apps GalleriesPower Automate Galleries Copilot Studio GalleriesPower Pages Galleries BLOGS: The community blogs section is full of handy step-by-step tips from members of the Community—and some of them include detailed answers to some of the questions most frequently asked questions, as well as how they solved a problem they faced. Power Apps Community BlogPower Automate Community BlogCopilot Studio Community BlogPower Pages Community Blog POWER UP PROGRAM: If you’d like to really take a huge step forward in your journey, we recommend checking out the Power Up Program, a Microsoft-sponsored initiative that trains new Power Platform users and has been a huge success since it launched a little over a year ago. There’s a waiting list, so definitely apply soon if you’re interested! Find out more here: Microsoft Power Up Program for career switchers.   There's so much more you'll discover in your Power Platform experience, and this Community is here for YOU! We are glad you've discovered us and can't wait to see where you grow! If you're new to the Community and just getting started, make sure to give this post a kudo and introduce yourself so we can welcome you!

Super User of the Month | Drew Poggemann

As part of a new monthly feature in the Community, we are excited to share that Drew Poggemann is our featured Super User for the month of February 2024. If you've been in the Community for a while, we're sure Drew's name is familiar to you, as he is one of our most active contributors--he's been a Super User for five consecutive seasons!   Since authoring his first reply 5 years ago to his 514th solution authored, Drew has helped countless Community members with his insights and expertise. In addition to being a Super User, Drew is also a User Group leader and a Microsoft MVP. His contributions to our Super User sessions and to the new SUIT program are always welcome--as well as his sense of humor and fun-loving way of sharing what he knows with others.   When Drew is not solving problems and authoring solutions, he's busy overseeing the Solution Architecture team at HBS, specializing in application architecture and business solution strategy--something he's been doing for over 30 years. We are grateful for Drew and the amazing way he has used his talent and skills to help so many others in the Community. If you are part of the SUIT program, you got to hear some great tips from Drew at the first SUIT session--and we know he still has much more to share!You can find him in the Community and on LinkedIn. Thank you for all you do, Drew!

Super Users 2024 Season One is Here!

   We are excited to announce the first season of our 2024 Super Users is here! Our kickoff to the new year welcomes many returning Super Users and several new faces, and it's always exciting to see the impact these incredible individuals will have on the Community in 2024! We are so grateful for the daily difference they make in the Community already and know they will keep staying engaged and excited for all that will happen this year.   How to Spot a Super User in the Community:Have you ever written a post or asked for help in the Community and had it answered by a user with the Super User icon next to their name? It means you have found the actual, real-life superheroes of the Power Platform Community! Super Users are our heroes because of the way they consistently make a difference in the Community. Our amazing Super Users help keep the Community a safe place by flagging spam and letting the Community Managers know about issues. They also make the Community a great place to find answers, because they are often the first to offer solutions and get clarity on questions. Finally, Super Users share valuable insights on ways to keep the Community growing, engaging, and looking ahead!We are honored to reveal the new badges for this season of Super Users! Congratulations to all the new and returning Super Users!     To better answer the question "What is a Super User?" please check out this article: Power Apps: What is A Super User? - Power Platform CommunityPower Automate: What is A Super User? - Power Platform Community Copilot Studio: What is A Super User? - Power Platform Community Power Pages: What is A Super User? - Power Platform Community

Microsoft Power Platform | 2024 Release Wave 1 Plan

Check out the latest Microsoft Power Platform release plans for 2024!   We have a whole host of exciting new features to help you be more productive, enhance delegation, run automated testing, build responsive pages, and so much more.    Click the links below to see not only our forthcoming releases, but to also try out some of the new features that have recently been released to market across:     Power Apps  Power Automate  Copilot Studio   We can’t wait to share with you all the upcoming releases that will help take your Power Platform experience to the next level!    Check out the entire Release Wave: Power Platform Complete Release Planner 

It's Time to S.U.I.T. Up! Season One Begins This Week

        After its initial announcement met with such resounding success at the Microsoft Power Platform Conference last fall, the Super User In Training Program's first season is about to kick off! We are so excited to welcome hundreds of potential new Super Users who have signed up for our inaugural program.Not sure what a Super User is yet? Keep reading:Have you ever been exploring the community and come across a user with this unique icon next to their name? It means you have found the actual, real-life superheroes of the Power Platform Community! Super Users are our heroes because of the way they are consistently helpful with everything from solutions to flagging spam, offering insight on the community, and so much more!If you've ever wondered exactly what a Super User does and how they earn that special badge, please check out this article: Power Apps: What is A Super User? - Power Platform CommunityPower Virtual Agents: What is A Super User? - Power Platform Community Power Automate: What is A Super User? - Power Platform CommunityPower Pages: What is A Super User? - Power Platform CommunityIf you missed this season of S.U.I.T., don't worry! We have a whole new season coming soon, so watch our News & Announcements for updates on when you can sign up and get your Super User SUIT on!

Top Solution Authors
Top Kudoed Authors
Users online (3,213)