Products

Solutions

Resources

Partners

Community

Blog

About

QA

Ideas Test

New Community Website

Ordinarily, you'd be at the right spot, but we've recently launched a brand new community website... For the community, by the community.

Yay... Take Me to the Community!

Welcome to the DNN Community Forums, your preferred source of online community support for all things related to DNN.
In order to participate you must be a registered DNNizen

HomeHomeUsing DNN Platf...Using DNN Platf...Performance and...Performance and...How to identify document files that are not used in any page?How to identify document files that are not used in any page?
Previous
 
Next
New Post
1/28/2010 1:47 PM
 

you will need to work on a copy anyways, why not setting it up locally on windows Vista/Win7?


Cheers from Germany,
Sebastian Leupold

dnnWerk - The DotNetNuke Experts   German Spoken DotNetNuke User Group

Speed up your DNN Websites with TurboDNN
 
New Post
1/28/2010 3:27 PM
 

It could be useful to have a small utility to do this cleanup / unreferenced resources report. The main challenge is being able to access all content from the website, in order to locate the resources that are used. What would be the best approach?

- Use an external spider
   -> difficulty to access all content: restricted content, private content for one user

- Use the module export functionality, call it on all active modules
   -> not all modules support export / import

- Use the search index
   -> not all modules support search indexing

- Scan entire database data for urls
   -> problem with data that has been deleted but still in database (such as a 'deleted' flag on the row)

Seems like the best approach would be to scan the entire database data. Thoughts?

 
New Post
1/28/2010 9:41 PM
 

I was about to manually scan the database for every of each file name. However, when I looked at the table Files (there are over 3000 records), I thought of creating a job to do it. Then I was not sure if this is a good approach since as you mentioned, there is “data that has been deleted but still in database.”  I think Sebastian’s suggestion of “removing all files from portals directory, then scan all pages” will guarantee to remove all unreferenced files, but I am still weighing the effort against the result; and I am still listening to more advices.  I wish I could use the IIS7 tool, but I don’t even have any box with vista or Win7.
 

 
New Post
1/28/2010 11:31 PM
 

The IIS7 SEO toolkit acts as a spider that will browse your website anonymously and follow all links it finds. It will not browse content that is visible only to certain roles or certain users. That may be important or not to you, depending on your website's content. If you have such content, not placed in a specific location, you risk deleting referenced content.

If you scan the entire database content, you may may find links to resources that have been deleted. If you empty the recycle bin, it should help. Other than that, it depends on the modules you are using and if any of these modules do not delete data but instead keep a deleted flag. I don't think there are many modules that actually do this. But it can, and you risk keeping too many resources (but these resources are still referenced, so it may not be a bad idea to keep them).

 
New Post
1/29/2010 4:17 AM
 

just to demonstrate the caveats of the situation: there might be a number of database entries referencing the files by id, either using Linkclick.aspcx handler with appropriate parameter or resolving path from ID during display. they may save the reference just by storing file ID in a column, fileid=..., or linkclick handler reference. File references can be stored in a dedicated column (documents module), in a shared column (user defined table) or within a text block (HTML module).

Permission issue mentioned before may be solved in local installation by granting unlimited access to anonymous users on the test installation.


Cheers from Germany,
Sebastian Leupold

dnnWerk - The DotNetNuke Experts   German Spoken DotNetNuke User Group

Speed up your DNN Websites with TurboDNN
 
Previous
 
Next
HomeHomeUsing DNN Platf...Using DNN Platf...Performance and...Performance and...How to identify document files that are not used in any page?How to identify document files that are not used in any page?


These Forums are dedicated to discussion of DNN Platform and Evoq Solutions.

For the benefit of the community and to protect the integrity of the ecosystem, please observe the following posting guidelines:

  1. No Advertising. This includes promotion of commercial and non-commercial products or services which are not directly related to DNN.
  2. No vendor trolling / poaching. If someone posts about a vendor issue, allow the vendor or other customers to respond. Any post that looks like trolling / poaching will be removed.
  3. Discussion or promotion of DNN Platform product releases under a different brand name are strictly prohibited.
  4. No Flaming or Trolling.
  5. No Profanity, Racism, or Prejudice.
  6. Site Moderators have the final word on approving / removing a thread or post or comment.
  7. English language posting only, please.
What is Liquid Content?
Find Out
What is Liquid Content?
Find Out
What is Liquid Content?
Find Out