Hello
I am trying to extract data from a webpage (see below sample html) to an excel.
Some "Details" have multiple lines so I cannot handpick values.
What is the best way to achieve that?
I am quite confused with the CSS tags etc (suggested reading welcome)
<!DOCTYPE html>
<html>
<head>
<title>Title</title>
</head>
<body>
<div class="logo">
<img src="/images/Banner.png"
alt="alt name" title="ALT NAME" />
</div>
</body>
</html>
<div id="mainContent">
<div> </div>
<h1 id="infoHeader">Information about </h1>
<div>
<a >xxxxx</a>
</div>
<div class="clearBoth"></div>
<div class="Info">
<fieldset>
<legend>Details 1</legend>
<table class="table">
<tbody>
<tr>
<th>Label 1</th> <td>Data 1</td>
<th>Label 2</th> <td>Data 2</td>
</tr>
<tr>
<th>Label 3</th> <td>Data 3</td>
<th>Label 4</th> <td>Data 4</td>
</tr>
<tr>
<th>Label 5</th> <td>Data 5</td>
<th>Label 6</th> <td>Data 6</td>
</tr>
<tr>
<th>Label 7</th> <td>Data 7</td>
<th>Label 8</th> <td>Data 8</td>
</tr>
<tr>
<th>Label 9</th> <td colspan="3">Data 9</td>
</tr>
</tbody>
</table>
</fieldset>
<fieldSet>
<legend>Details 2</legend>
<ul>
<li><span>Label 10</span><span>Label 11</span><span>Label 12</span><span>Label 13</span></li>
</ul>
<ol>
<li><span> Data 10</span><span> Data 11</span><span> Data 12</span><span> Data 13</span></li>
</ol>
</fieldSet>
<fieldSet>
<legend>Details 3</legend>
<ul>
<li><span>Label 14</span><span>Label 15</span><span>Label 16</span><span>Label 17</span><span>Label 18</span><span>Label 19</span></li>
</ul>
<ol>
<li><span> Data14-1</span><span> Data 15-1</span><span> Data 16-1</span><span> Data 17-1</span><span> Data 18-1</span><span> Data 19-1</span></li>
<li><span> Data14-2</span><span> Data 15-2</span><span> Data 16-2</span><span> Data 17-2</span><span> Data 18-2</span><span> Data 19-2</span></li>
</ol>
</fieldSet>
</div>
</body>
</html>
Hi @lazy check out this youtube video if it can help you in any way to extract data from a web page
https://www.youtube.com/watch?v=QllyIdxm4H0
Hope this helps !
If this resolves your issue please mark this post as answered and hit me a thumps up.
Thanks and Regards,
Vidit
What do you want to extract?
Basically everything that is Data1, Data2, Data3 etc...
While the above can be handpicked values to be placed in cells on an excel, I have a particular problem with the part "Details 3": this is a dynamic table which can contain n number of lines (Data14-1, 14-2 ... 14-n) and this is where I am particulary stuck.
Given that this webpage seems legacy as it does not use proper HTML table, the live helper does not recognise it as such
TIA
Thanks for the reply - I have already gone over that video and while it helped for certain parts (e.g. for handpicking selected values) I am a bit stuck with the second part - see my reply to @GeoffRen below.
What format can the Data1, Data2, etc strings take? If they're all the same format you can use a text parser (probably regex) to get all the dynamic data. Or if the structure of the html is always going to be the same then you can parse out what you want depending on the format of the tags.
Fieldset 1 contains a table
Fieldset 2 & 3 contains two lists
The easiest would be to extract label (th, ul > li > span) and data cells (td, ol > li > span) both as lists and then merge them into the desired table, depending on the output.
Thanks
I'll give this a try and revert
Format you mean data type?
They are either text or numbers and the columns are not dynamic only the rows are
Have you tried the extract data from webpage action to fetch the details. This will return you your data in many form like, list,Data table and variable.
Later you can manipulate these as per your need and then can write the data in excel. If you need more information then please let me know.
Episode Seven of Power Platform Connections sees David Warner and Hugo Bernier talk to Microsoft MVP Dian Taylor, alongside the latest news, product reviews, and community blogs. Use the hashtag #PowerPlatformConnects on social media for a chance to have your work featured on the show! Show schedule in this episode: 0:00 Cold Open 00:30 Show Intro 01:02 Dian Taylor Interview 18:03 Blogs & Articles 26:55 Outro & Bloopers Check out the blogs and articles featured in this week’s episode: https://francomusso.com/create-a-drag-and-drop-experience-to-upload-case-attachments @crmbizcoach https://www.youtube.com/watch?v=G3522H834Ro/ @pranavkhuranauk https://github.com/pnp/powerapps-designtoolkit/tree/main/materialdesign%20components @MMe2K https://2die4it.com/2023/03/27/populate-a-dynamic-microsoft-word-template-in-power-automate-flow/ @StefanS365 https://d365goddess.com/viva-sales-administrator-settings/ @D365Goddess https://marketplace.visualstudio.com/items?itemName=megel.mme2k-powerapps-helper#Visualize_Dataverse_Environments @MMe2K Action requested: Feel free to provide feedback on how we can make our community more inclusive and diverse. This episode premiered live on our YouTube at 12pm PST on Thursday 30th March 2023. Video series available at Power Platform Community YouTube channel. Upcoming events: Business Applications Launch – April 4th – Free and Virtual! M365 Conference - May 1-5th - Las Vegas Power Apps Developers Summit – May 19-20th - London European Power Platform conference – Jun. 20-22nd - Dublin Microsoft Power Platform Conference – Oct. 3-5th - Las Vegas Join our Communities: Power Apps Community Power Automate Community Power Virtual Agents Community Power Pages Community If you’d like to hear from a specific community member in an upcoming recording and/or have specific questions for the Power Platform Connections team, please let us know. We will do our best to address all your requests or questions.
Super Users – 2023 Season 1 We are excited to kick off the Power Users Super User Program for 2023 - Season 1. The Power Platform Super Users have done an amazing job in keeping the Power Platform communities helpful, accurate and responsive. We would like to send these amazing folks a big THANK YOU for their efforts. Super User Season 1 | Contributions July 1, 2022 – December 31, 2022 Super User Season 2 | Contributions January 1, 2023 – June 30, 2023 Curious what a Super User is? Super Users are especially active community members who are eager to help others with their community questions. There are 2 Super User seasons in a year, and we monitor the community for new potential Super Users at the end of each season. Super Users are recognized in the community with both a rank name and icon next to their username, and a seasonal badge on their profile. Power Apps Power Automate Power Virtual Agents Power Pages Pstork1* Pstork1* Pstork1* OliverRodrigues BCBuizer Expiscornovus* Expiscornovus* ragavanrajan AhmedSalih grantjenkins renatoromao Mira_Ghaly* Mira_Ghaly* Sundeep_Malik* Sundeep_Malik* SudeepGhatakNZ* SudeepGhatakNZ* StretchFredrik* StretchFredrik* 365-Assist* 365-Assist* cha_cha ekarim2020 timl Hardesh15 iAm_ManCat annajhaveri SebS Rhiassuring LaurensM abm TheRobRush Ankesh_49 WiZey lbendlin Nogueira1306 Kaif_Siddique victorcp RobElliott dpoggemann srduval SBax CFernandes Roverandom schwibach Akser CraigStewart PowerRanger MichaelAnnis subsguts David_MA EricRegnier edgonzales zmansuri GeorgiosG ChrisPiasecki ryule AmDev fchopo phipps0218 tom_riha theapurva takolota Akash17 momlo BCLS776 Shuvam-rpa rampprakash ScottShearer Rusk ChristianAbata cchannon Koen5 a33ik Heartholme AaronKnox okeks Matren David_MA Alex_10 Jeff_Thorpe poweractivate Ramole DianaBirkelbach DavidZoon AJ_Z PriyankaGeethik BrianS StalinPonnusamy HamidBee CNT Anonymous_Hippo Anchov KeithAtherton alaabitar Tolu_Victor KRider sperry1625 IPC_ahaas zuurg rubin_boer cwebb365 Dorrinda G1124 Gabibalaban Manan-Malhotra jcfDaniel WarrenBelz Waegemma drrickryp GuidoPreite If an * is at the end of a user's name this means they are a Multi Super User, in more than one community. Please note this is not the final list, as we are pending a few acceptances. Once they are received the list will be updated.
Join us for an in-depth look into the latest updates across Microsoft Dynamics 365 and Microsoft Power Platform that are helping businesses overcome their biggest challenges today. Find out about new features, capabilities, and best practices for connecting data to deliver exceptional customer experiences, collaborating, and creating using AI-powered capabilities, driving productivity with automation—and building towards future growth with today’s leading technology. Microsoft leaders and experts will guide you through the full 2023 release wave 1 and how these advancements will help you: Expand visibility, reduce time, and enhance creativity in your departments and teams with unified, AI-powered capabilities.Empower your employees to focus on revenue-generating tasks while automating repetitive tasks.Connect people, data, and processes across your organization with modern collaboration tools.Innovate without limits using the latest in low-code development, including new GPT-powered capabilities. Click Here to Register Today!
We are excited to share the ‘Power Platform Communities Front Door’ experience with you! Front Door brings together content from all the Power Platform communities into a single place for our community members, customers and low-code, no-code enthusiasts to learn, share and engage with peers, advocates, community program managers and our product team members. There are a host of features and new capabilities now available on Power Platform Communities Front Door to make content more discoverable for all power product community users which includes ForumsUser GroupsEventsCommunity highlightsCommunity by numbersLinks to all communities Users can see top discussions from across all the Power Platform communities and easily navigate to the latest or trending posts for further interaction. Additionally, they can filter to individual products as well. Users can filter and browse the user group events from all power platform products with feature parity to existing community user group experience and added filtering capabilities. Users can now explore user groups on the Power Platform Front Door landing page with capability to view all products in Power Platform. Explore Power Platform Communities Front Door today. Visit Power Platform Community Front door to easily navigate to the different product communities, view a roll up of user groups, events and forums.
We are so excited to see you for the Microsoft Power Platform Conference in Las Vegas October 3-5 2023! But first, let's take a look back at some fun moments and the best community in tech from MPPC 2022 in Orlando, Florida. Featuring guest speakers such as Charles Lamanna, Heather Cook, Julie Strauss, Nirav Shah, Ryan Cunningham, Sangya Singh, Stephen Siciliano, Hugo Bernier and many more. Register today: https://www.powerplatformconf.com/
User | Count |
---|---|
24 | |
10 | |
9 | |
6 | |
5 |
User | Count |
---|---|
37 | |
27 | |
17 | |
15 | |
15 |