Products

Solutions

Resources

Partners

Community

Blog

About

QA

Ideas Test

New Community Website

Ordinarily, you'd be at the right spot, but we've recently launched a brand new community website... For the community, by the community.

Yay... Take Me to the Community!

Welcome to the DNN Community Forums, your preferred source of online community support for all things related to DNN.
In order to participate you must be a registered DNNizen

HomeHomeUsing DNN Platf...Using DNN Platf...Administration ...Administration ...robots.txt and search enginesrobots.txt and search engines
Previous
 
Next
New Post
2/18/2007 4:20 PM
 

Hi All, am having a bit of trouble with this and hope to get some help. I have a robots.txt in the root of my domain, but Google says it can't find it - it's tried only once, so I'll be patient and give this problem another few days.

But, I'm trying to get the syntax of it right to disallow crawling to all of the /ctl/ pages, i.e. terms, privacy and login. Have tried the syntax used in DNN's own file at http://www.dotnetnuke.com/robots.txt of Disallow: /*/ctl/ in the Google robots.txt testing/diagnostics thing but it still finds a path to the test /ctl/ url I entered.

Has anyone solved this? My domain, in case it matters, is set up as below:

default.htm - redirects to /prod
/prod - IIS virtual folder with separate DNN install and db
/test  - IIS virtual folder with separate DNN install and db

Thanks and kind regards, JP

 
New Post
2/18/2007 6:03 PM
 

If you actually have a line like "Disallow: /*/ctl/", I think that is a syntax error.  It should be:

User-agent: *
Disallow: /ctl


Also, are you saying that you have only one defined domain and that you access your Prod and Test with a subdirectory or virtual directory?  If that is the case, bear in mind that for the Bots, your two DNN installations are actually the same site.

See http://www.robotstxt.org/wc/robots.html for more info.

Carlos

 

 
New Post
2/18/2007 6:53 PM
 

Many thanks Carlos. I actually had a typo in the Google testing thing and the /prod/*/ctl/ does actually pass the test (i.e. disallows access).

Yep, my set up is one domain with two sub-dirs that are set up as virtual directories in IIS - each with a completely separate DNN installation and SQL database - seems to work well.

I have included Disallow: /test/ in the robots.txt, so hopefully those pages won't be indexed - have yet to see if this works - although it does pass the google robots.txt test.

 
Previous
 
Next
HomeHomeUsing DNN Platf...Using DNN Platf...Administration ...Administration ...robots.txt and search enginesrobots.txt and search engines


These Forums are dedicated to discussion of DNN Platform and Evoq Solutions.

For the benefit of the community and to protect the integrity of the ecosystem, please observe the following posting guidelines:

  1. No Advertising. This includes promotion of commercial and non-commercial products or services which are not directly related to DNN.
  2. No vendor trolling / poaching. If someone posts about a vendor issue, allow the vendor or other customers to respond. Any post that looks like trolling / poaching will be removed.
  3. Discussion or promotion of DNN Platform product releases under a different brand name are strictly prohibited.
  4. No Flaming or Trolling.
  5. No Profanity, Racism, or Prejudice.
  6. Site Moderators have the final word on approving / removing a thread or post or comment.
  7. English language posting only, please.
What is Liquid Content?
Find Out
What is Liquid Content?
Find Out
What is Liquid Content?
Find Out