Vanilla 1 is no longer supported or maintained. If you need a copy, you can get it here.
HackerOne users: Testing against this community violates our program's Terms of Service and will result in your bounty being denied.
Robots.txt disallow entries for Vanilla forums
Hello community,
I have searched here looking for some disallow entries to put on my robots.txt for Vanilla. I have found nothing but I'm sure you must have something as this is essential now a days to be properly indexed on search engines.
I noticed Google indexed one of my Vanilla pages that has this in the URL: action=search
I will disallow it:
Disallow: *action=search*
Does anyone know which other entries I must disallow?
I appreciate your input on this.
I have searched here looking for some disallow entries to put on my robots.txt for Vanilla. I have found nothing but I'm sure you must have something as this is essential now a days to be properly indexed on search engines.
I noticed Google indexed one of my Vanilla pages that has this in the URL: action=search
I will disallow it:
Disallow: *action=search*
Does anyone know which other entries I must disallow?
I appreciate your input on this.
0
This discussion has been closed.
Comments
Disallow: /forum/search.php Disallow: /forum/people.php Disallow: /forum/post.php DIsallow: /forum/settings.php
Of course, if you are using Friendly URLs, the format should be different:
Disallow: /forum/search/ Disallow: /forum/people/ Disallow: /forum/post/ Disallow: /forum/settings/ Disallow: /forum/discussions/
I've added /discussions/ to the Friendly URLs example since both
example.com/forum/discussons/
andexample.com/forum/
both point to the same content and may cause a duplicate content or canonical URL issue.I'm also not entirely convinced having forum search results appear in a search engine is all that bad.
thank you for your input. yes I'm using Friendly URLs so I go woth the second one, I have some doubts here:
1) This works well if I'm using the forum file as a subdomain?
Disallow: /forum/search/
Disallow: /forum/people/
Disallow: /forum/post/
Disallow: /forum/settings/
Disallow: /forum/discussions/
2) Does the code above include: login pages and the start a new discussion pages?
3) What about: View=ParticipatedThreads and other actions in PHP ?
i.e. if your vanilla is installed at forum.example.com, your robots.txt should be forum.example.com/robots.txt and the contents should be something like:
user-agent: * Disallow: /search/ Disallow: /people/ Disallow: /post/ Disallow: /settings/ Disallow: /discussions/
Those other actions shouldn't matter as robots won't see the links unless someone happened to post one in a comment. If they did, most of them only work for people signed in anyway. Search engine bots don't usually sign in with accounts.
robotstxt.org is the main reference about the robots.txt conventions. There are some other checker utilities around that you can run on it also.