cancel
Showing results for 
Search instead for 
Did you mean: 
Reply
OllyL
New Member

Recursive Sharepoint archiving

Hi there,

 

I'm trying to build a flow that does the following:

 

  • Find all items in Library #1 older than # Months (recursively into sub directories)
  • Move to Library #2, recreating source folder structure from Library #1

I've managed to create a basic flow using get past time followed by Get files (properties only) with nested items included and using a filter Query on the 'Modified' property.  I'm a bit stuck on the next bit though. How could I go about build the folder structure in Library #2 then moving the file into the corresponding location?

 

Thanks very much in advance

 

Olly

1 ACCEPTED SOLUTION

Accepted Solutions
OllyL
New Member

Just as an FYI I have something which seems to be working though I'm not sure if it works in every scenario (I've only tested it so much). It was easier that expected as "Create new folder" does the hard work for you and creates the whole folder tree. My process is just:

 

  • Get past time (which controls the age of documents to be archived)
  • Get items
    • For some reason using the option "Limit entries to folder" returns zero results for me so don't set this option
    • Filter Query is (Modified lt '@{body('Get_past_time')}') and (ContentType ne 'Folder')
    • Top Count - Make sure this is set. I think Apply to each (below) has a 5000 item limit so I set this that
    • Limit Columns by view - not sure if this is 100% required but I created a view with only the basic columns in there
    • As a side note I also enabled a new index on the Library in Sharepoint for Modified. Again not 100% sure this is necessary but I had to try a lot of things in order to get any output from this step - Our document library is pretty big hence the need to archive. 
  • Apply to Each - operate on 'value' from Get items step
  • [AtE] Create new folder
    • Specify archive destination and use "Folder Path" value from Get items as Folder Path
  • [AtE] Get file content
    • Specify source and use "Identifier" from Get items as File Identifier
  • [AtE] Create file
    • Specify archive destination for site
    • Folder path is 'test/@{items('Apply_to_each')?['{Path}']}' where 'test' is the name of my Archive library
    • File name is "File name with extension" from Get items
    • File Content is "File Content" from Get file content
  • (Optional) [Ate] Delete File
    • As we're moving out of the source location, I use a delete here using the source site and the Identifier again to remove it from source library

 

So far in my testing this seems to work. It mirrors the source folder structure and seems to be able to deal with some partial folder structure already existing in the destination. The major drawback is that you can't target this at a specific folder in the source sharepoint but I think that's more an issue with us exceeding the max view list item threshold and all that pain rather than a particular issue with Power Automate. 

 

It produces errors too - Create File / Get File Content have a file size limit so it won't work with large files and other bits will also error out. For our purposes though the errors don't matter, so long as we are able to archive out some files it's still a success and we would plan to keep running the job on a loop to maintain the library. 

 

Hope this helps

 

Olly

 

 

View solution in original post

7 REPLIES 7
AnthonyAmador
Solution Sage
Solution Sage

Hi @OllyL 

 

Can you explain what you mean by building the folder structure? I do not understand. 

I was thinking that you can use "Move File" or "Move Folder" actions.

 

Let me know if you need more help.

If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

 

 

 

Anthony Amador
Power Platform Specialist.
OllyL
New Member

Hi Anthony,

 

Thanks for getting back to me. Sure:

 

So if we I have a file : /Library1/Some/Folder/Structure/file.docx

 

This is mirrored to: /Library2/Some/Folder/Structure/movedfile.docx

 

The flow would build any of the folder structure that is missing, leaving existing files/folders intact. So in some cases the folder structure would already exist, sometimes it might just have to make the top folder ("Structure"), sometimes it might have to create all the folders.

 

Does this make sense?

Thanks for the explanation. 

 

I was thinking on how to it, but the scenario seems complex, what happens is that you can use the file path of the file to be copied, save it in a variable and split it to know the names of the folders, but you have to compare them with the names in the new folders to make the decision to create a new one or not, the problem is I have no idea how to get those folder names and I do not see any action that can help us here.

 

Kind regards. 

 

 

 

 

Anthony Amador
Power Platform Specialist.
OllyL
New Member

Just as an FYI I have something which seems to be working though I'm not sure if it works in every scenario (I've only tested it so much). It was easier that expected as "Create new folder" does the hard work for you and creates the whole folder tree. My process is just:

 

  • Get past time (which controls the age of documents to be archived)
  • Get items
    • For some reason using the option "Limit entries to folder" returns zero results for me so don't set this option
    • Filter Query is (Modified lt '@{body('Get_past_time')}') and (ContentType ne 'Folder')
    • Top Count - Make sure this is set. I think Apply to each (below) has a 5000 item limit so I set this that
    • Limit Columns by view - not sure if this is 100% required but I created a view with only the basic columns in there
    • As a side note I also enabled a new index on the Library in Sharepoint for Modified. Again not 100% sure this is necessary but I had to try a lot of things in order to get any output from this step - Our document library is pretty big hence the need to archive. 
  • Apply to Each - operate on 'value' from Get items step
  • [AtE] Create new folder
    • Specify archive destination and use "Folder Path" value from Get items as Folder Path
  • [AtE] Get file content
    • Specify source and use "Identifier" from Get items as File Identifier
  • [AtE] Create file
    • Specify archive destination for site
    • Folder path is 'test/@{items('Apply_to_each')?['{Path}']}' where 'test' is the name of my Archive library
    • File name is "File name with extension" from Get items
    • File Content is "File Content" from Get file content
  • (Optional) [Ate] Delete File
    • As we're moving out of the source location, I use a delete here using the source site and the Identifier again to remove it from source library

 

So far in my testing this seems to work. It mirrors the source folder structure and seems to be able to deal with some partial folder structure already existing in the destination. The major drawback is that you can't target this at a specific folder in the source sharepoint but I think that's more an issue with us exceeding the max view list item threshold and all that pain rather than a particular issue with Power Automate. 

 

It produces errors too - Create File / Get File Content have a file size limit so it won't work with large files and other bits will also error out. For our purposes though the errors don't matter, so long as we are able to archive out some files it's still a success and we would plan to keep running the job on a loop to maintain the library. 

 

Hope this helps

 

Olly

 

 

@OllyL thanks for sharing your findings, this is valuable information, please mark your own answer as a solution to help more users find it more easily in the future. 

Cheers. 

Anthony Amador
Power Platform Specialist.

Any chance you can share screenshots of this Flow? It is exactly what I need but, I have never built a Flow before.

OllyL
New Member

Hi Boaby,

 

The Flow I had didn't actually work so well over large lists because SharePoint was failing internally and only returning some of the results, not all.

 

My solution in the end was to use Powershell and I have script now which I use to perform this kind of archiving. see script below (provided with no guarantees and full disclaimer, use at your own risk). You'll need the pnp.powershell module and it doesn't currently support 2FA on the login. I'd recommend setting up a new login for this as it will cause throttling over a large document library which stops you being able to use the account for other SharePoint admin tasks.

 

<#
    -----------------------------------------------------------
    SharePoint Archiving
    -----------------------------------------------------------

    Arguments:
    
    -Calling with zero arguments ./ArchiveDocumentLibrary.ps1 will run from root of doc libray
    -Use additional arguments to run from sub directories e.g. ./ArchiveDocumentLibrary.ps1 Directory1 ChildDirectory2 GrandChildDirectory3
    -Note that directories must be specified individually, in order, and with a blank space between each one

    Notes:

    - Moves files older than $ArchiveDate out of the source location into the Archive location
    - Deletes empty folders in the source location as it goes if $CleanupDirectories is set
    - Files are stored under a folder with $ListName in the archive location. In this way, you can archive multiple source lists to the same archive location (so long as they have different names). 
    - Liable to throttling by Microsoft so be patient, it will get there in the end!
#>

#Required Module for script
Update-Module -Name PnP.PowerShell

#-------------Setings----------------------

#Root Site Url
$SiteUrl = "https://<tenancy>.sharepoint.com"
$username = "<o365 user>"
$password = "<pwd>"

#Source Settings
$RelativeUrl = "/sites/<sitename>"
$ListName = "<documentlibrary>"

#Archive Settings
$ArchiveRelativeUrl = "/sites/<archivesite>"
$ArchiveListName = "<archivelist>"

#Job Settings
$ArchiveDate = get-date "2019-01-01" #Anything older than this will be archived
$FileLimit = -1 # Limits number of files processed at a time. use -1 to turn this off
$CleanupDirectories = $true # Whether to remove empty directories or not
$Debug = $false

#-------------------------------------------

function Recurse-Files($Folder, $Stack)
{        	
	#Write-Output $Folder
    $items = Get-PnPFolderItem -FolderSiteRelativeUrl $Folder
    if ($Debug -eq $true)
    {
        Write-Output ("Folder: " + $Folder)
        Write-Output ("Count: " + $items.count)
    }

        		
    foreach($item in $items)
    {         
        if ($Debug -eq $true)
        {
            Write-Output ("Item: " + $item.Name)
        }

        if ($item.Name -eq "Forms") { continue }                                        
        if ($item.Name -eq "_catalogs") { continue }                                                
        		
	    if ($item.TypedObject.ToString() -ne 'Microsoft.SharePoint.Client.Folder') 
	    { 			
		    $file = Get-PnPFile -AsListItem -Url $item.ServerRelativeUrl			 				
		    if ((get-date $file.FieldValues.Modified) -lt $ArchiveDate)
		    {				               
			    Archive-File $item $file $Folder $Stack
                $FileLimit = $FileLimit - 1
                if ($FileLimit -eq 0) { Exit }
		    }			
	    }		
	    else { 
		    $NewPath = ($Folder + "/" + $item.Name) 
                        
            $TempStack = $Stack.PSObject.Copy()            
            $TempStack.Push($item.Name)

		    Recurse-Files $NewPath $TempStack
	    }							    
    }
     
    if ($CleanupDirectories)
    {
        $items = Get-PnPFolderItem -FolderSiteRelativeUrl  $Folder
        if ($items.count -gt 0) {} else {
        Cleanup-Empty-Folder $Stack
        }
    }                  
}

function Archive-File($Item, $FileItem, $ParentFolder, $Stack)
{	
    Write-Output ("Archiving:  " + $Item.Name + " - " + $FileItem.FieldValues.Modified)    
    
    #Build Destination Path
    $temps = $Stack.PSObject.Copy()           
    $TargetUrl = ""
    while ($temps.count -gt 0) {  $TargetUrl = ("/" + $temps.Pop() + $TargetUrl) }    
    $TargetUrl = ($ArchiveListName + $TargetUrl)  

    #Generate Destination Folder if it doesn't exist
    if ($Debug -eq $false) 
    {
        Resolve-PnPFolder -SiteRelativePath $TargetUrl -Connection $ArchiveConnection | Out-Null
    }
    		
    #Move the File to Archive	    
    if ($Debug -eq $false) 
    {
        Move-PnPFile -SourceUrl ($RelativeUrl + "/" + $ParentFolder + "/" + $Item.Name) -TargetUrl ($ArchiveRelativeUrl + "/" + $TargetUrl) -Overwrite -AllowSchemaMismatch -IgnoreVersionHistory -Force
    }
}

function Cleanup-Empty-Folder($Stack)
{   
    $temps = $Stack.PSObject.Copy()           

    $Name = $temps.Pop()

    $TargetUrl = ""
    while ($temps.count -gt 1) {  $TargetUrl = ("/" + $temps.Pop() + $TargetUrl) }        
    $TargetUrl = ($ListName + $TargetUrl)

    Write-Output ("Clean-up:  " + $Name + " in " + $TargetUrl)    
    if ($Debug -eq $false) 
    {
        Remove-PnPFolder -Name $Name -Folder $TargetUrl -Force
    }
}

$encpassword = convertto-securestring -String $password -AsPlainText -Force
$credentials = new-object -typename System.Management.Automation.PSCredential -argumentlist $username, $encpassword

#Get Connection for connecting to the archive (need to do this before the other connection as it also connects interactive)
$ArchiveConnection = Connect-PnPOnline -Url ($SiteUrl + $ArchiveRelativeUrl) -ReturnConnection -Credentials $credentials

#Start (Main Method)
Connect-PnPOnline -Url ($SiteUrl + $RelativeUrl) -Credentials $credentials

#Prep stack
$StartStack = new-object system.collections.stack
$StartStack.Push($ListName)

#Load Path from args
$StartPath = $ListName
for ( $i = 0; $i -lt $args.count; $i++ ) {
    $StartPath = ($StartPath + "/" + $args[$i])
    $StartStack.Push($args[$i]) 
}


if ($Debug -eq $true) { Get-PnPList }

#Go!
Recurse-Files $StartPath $StartStack    

 

Helpful resources

Announcements

Exclusive LIVE Community Event: Power Apps Copilot Coffee Chat with Copilot Studio Product Team

It's time for the SECOND Power Apps Copilot Coffee Chat featuring the Copilot Studio product team, which will be held LIVE on April 3, 2024 at 9:30 AM Pacific Daylight Time (PDT).     This is an incredible opportunity to connect with members of the Copilot Studio product team and ask them anything about Copilot Studio. We'll share our special guests with you shortly--but we want to encourage to mark your calendars now because you will not want to miss the conversation.   This live event will give you the unique opportunity to learn more about Copilot Studio plans, where we’ll focus, and get insight into upcoming features. We’re looking forward to hearing from the community, so bring your questions!   TO GET ACCESS TO THIS EXCLUSIVE AMA: Kudo this post to reserve your spot! Reserve your spot now by kudoing this post.  Reservations will be prioritized on when your kudo for the post comes through, so don't wait! Click that "kudo button" today.   Invitations will be sent on April 2nd.Users posting Kudos after April 2nd at 9AM PDT may not receive an invitation but will be able to view the session online after conclusion of the event. Give your "kudo" today and mark your calendars for April 3, 2024 at 9:30 AM PDT and join us for an engaging and informative session!

Tuesday Tip: Unlocking Community Achievements and Earning Badges

TUESDAY TIPS are our way of communicating helpful things we've learned or shared that have helped members of the Community. Whether you're just getting started or you're a seasoned pro, Tuesday Tips will help you know where to go, what to look for, and navigate your way through the ever-growing--and ever-changing--world of the Power Platform Community! We cover basics about the Community, provide a few "insider tips" to make your experience even better, and share best practices gleaned from our most active community members and Super Users.   With so many new Community members joining us each week, we'll also review a few of our "best practices" so you know just "how" the Community works, so make sure to watch the News & Announcements each week for the latest and greatest Tuesday Tips!     THIS WEEK'S TIP: Unlocking Achievements and Earning BadgesAcross the Communities, you'll see badges on users profile that recognize and reward their engagement and contributions. These badges each signify a different achievement--and all of those achievements are available to any Community member! If you're a seasoned pro or just getting started, you too can earn badges for the great work you do. Check out some details on Community badges below--and find out more in the detailed link at the end of the article!       A Diverse Range of Badges to Collect The badges you can earn in the Community cover a wide array of activities, including: Kudos Received: Acknowledges the number of times a user’s post has been appreciated with a “Kudo.”Kudos Given: Highlights the user’s generosity in recognizing others’ contributions.Topics Created: Tracks the number of discussions initiated by a user.Solutions Provided: Celebrates the instances where a user’s response is marked as the correct solution.Reply: Counts the number of times a user has engaged with community discussions.Blog Contributor: Honors those who contribute valuable content and are invited to write for the community blog.       A Community Evolving Together Badges are not only a great way to recognize outstanding contributions of our amazing Community members--they are also a way to continue fostering a collaborative and supportive environment. As you continue to share your knowledge and assist each other these badges serve as a visual representation of your valuable contributions.   Find out more about badges in these Community Support pages in each Community: All About Community Badges - Power Apps CommunityAll About Community Badges - Power Automate CommunityAll About Community Badges - Copilot Studio CommunityAll About Community Badges - Power Pages Community

Tuesday Tips: Powering Up Your Community Profile

TUESDAY TIPS are our way of communicating helpful things we've learned or shared that have helped members of the Community. Whether you're just getting started or you're a seasoned pro, Tuesday Tips will help you know where to go, what to look for, and navigate your way through the ever-growing--and ever-changing--world of the Power Platform Community! We cover basics about the Community, provide a few "insider tips" to make your experience even better, and share best practices gleaned from our most active community members and Super Users.   With so many new Community members joining us each week, we'll also review a few of our "best practices" so you know just "how" the Community works, so make sure to watch the News & Announcements each week for the latest and greatest Tuesday Tips!   This Week's Tip: Power Up Your Profile!  🚀 It's where every Community member gets their start, and it's essential that you keep it updated! Your Community User Profile is how you're able to get messages, post solutions, ask questions--and as you rank up, it's where your badges will appear and how you'll be known when you start blogging in the Community Blog. Your Community User Profile is how the Community knows you--so it's essential that it works the way you need it to! From changing your username to updating contact information, this Knowledge Base Article is your best resource for powering up your profile.     Password Puzzles? No Problem! Find out how to sync your Azure AD password with your community account, ensuring a seamless sign-in. No separate passwords to remember! Job Jumps & Email Swaps Changed jobs? Got a new email? Fear not! You'll find out how to link your shiny new email to your existing community account, keeping your contributions and connections intact. Username Uncertainties Unraveled Picking the perfect username is crucial--and sometimes the original choice you signed up with doesn't fit as well as you may have thought. There's a quick way to request an update here--but remember, your username is your community identity, so choose wisely. "Need Admin Approval" Warning Window? If you see this error message while using the community, don't worry. A simple process will help you get where you need to go. If you still need assistance, find out how to contact your Community Support team. Whatever you're looking for, when it comes to your profile, the Community Account Support Knowledge Base article is your treasure trove of tips as you navigate the nuances of your Community Profile. It’s the ultimate resource for keeping your digital identity in tip-top shape while engaging with the Power Platform Community. So, dive in and power up your profile today!  💪🚀   Community Account Support | Power Apps Community Account Support | Power AutomateCommunity Account Support | Copilot Studio  Community Account Support | Power Pages

Super User of the Month | Chris Piasecki

In our 2nd installment of this new ongoing feature in the Community, we're thrilled to announce that Chris Piasecki is our Super User of the Month for March 2024. If you've been in the Community for a while, we're sure you've seen a comment or marked one of Chris' helpful tips as a solution--he's been a Super User for SEVEN consecutive seasons!   Since authoring his first reply in April 2020 to his most recent achievement organizing the Canadian Power Platform Summit this month, Chris has helped countless Community members with his insights and expertise. In addition to being a Super User, Chris is also a User Group leader, Microsoft MVP, and a featured speaker at the Microsoft Power Platform Conference. His contributions to the new SUIT program, along with his joyous personality and willingness to jump in and help so many members has made Chris a fixture in the Power Platform Community.   When Chris isn't authoring solutions or organizing events, he's actively leading Piasecki Consulting, specializing in solution architecture, integration, DevOps, and more--helping clients discover how to strategize and implement Microsoft's technology platforms. We are grateful for Chris' insightful help in the Community and look forward to even more amazing milestones as he continues to assist so many with his great tips, solutions--always with a smile and a great sense of humor.You can find Chris in the Community and on LinkedIn. Thanks for being such a SUPER user, Chris! 💪 🌠  

Tuesday Tips: Community Ranks and YOU

TUESDAY TIPS are our way of communicating helpful things we've learned or shared that have helped members of the Community. Whether you're just getting started or you're a seasoned pro, Tuesday Tips will help you know where to go, what to look for, and navigate your way through the ever-growing--and ever-changing--world of the Power Platform Community! We cover basics about the Community, provide a few "insider tips" to make your experience even better, and share best practices gleaned from our most active community members and Super Users.   With so many new Community members joining us each week, we'll also review a few of our "best practices" so you know just "how" the Community works, so make sure to watch the News & Announcements each week for the latest and greatest Tuesday Tips!This Week: Community Ranks--Moving from "Member" to "Community Champion"   Have you ever wondered how your fellow community members ascend the ranks within our community? What sets apart an Advocate from a Helper, or a Solution Sage from a Community Champion? In today’s #TuesdayTip, we’re unveiling the secrets and sharing tips to help YOU elevate your ranking—and why it matters to our vibrant communities. Community ranks serve as a window into a member’s role and activity. They celebrate your accomplishments and reveal whether someone has been actively contributing and assisting others. For instance, a Super User is someone who has been exceptionally helpful and engaged. Some ranks even come with special permissions, especially those related to community management. As you actively participate—whether by creating new topics, providing solutions, or earning kudos—your rank can climb. Each time you achieve a new rank, you’ll receive an email notification. Look out for the icon and rank name displayed next to your username—it’s a badge of honor! Fun fact: Your Community Engagement Team keeps an eye on these ranks, recognizing the most passionate and active community members. So shine brightly with valuable content, and you might just earn well-deserved recognition! Where can you see someone’s rank? When viewing a post, you’ll find a member’s rank to the left of their name.Click on a username to explore their profile, where their rank is prominently displayed. What about the ranks themselves? New members start as New Members, progressing to Regular Visitors, and then Frequent Visitors.Beyond that, we have a categorized system: Kudo Ranks: Earned through kudos (teal icons).Post Ranks: Based on your posts (purple icons).Solution Ranks: Reflecting your solutions (green icons).Combo Ranks: These orange icons combine kudos, solutions, and posts. The top ranks have unique names, making your journey even more exciting! So dive in, collect those kudos, share solutions, and let’s see how high you can rank!  🌟 🚀   Check out the Using the Community boards in each of the communities for more helpful information!  Power Apps, Power Automate, Copilot Studio & Power Pages

Find Out What Makes Super Users So Super

We know many of you visit the Power Platform Communities to ask questions and receive answers. But do you know that many of our best answers and solutions come from Community members who are super active, helping anyone who needs a little help getting unstuck with Business Applications products? We call these dedicated Community members Super Users because they are the real heroes in the Community, willing to jump in whenever they can to help! Maybe you've encountered them yourself and they've solved some of your biggest questions. Have you ever wondered, "Why?"We interviewed several of our Super Users to understand what drives them to help in the Community--and discover the difference it has made in their lives as well! Take a look in our gallery today: What Motivates a Super User? - Power Platform Community (microsoft.com)

Users online (6,486)