Feature #33658
closedrobots.txt: disallow crawling login, register, and lost password form
0%
Description
The attached patch adds robots exclusion tags (<meta name="robots" content="noindex,follow,noarchive" />
) to login, register, and lost password form.
Currently, Redmine allows web crawlers such as Googlebot and Baiduspider to index those forms. I think it is undesirable for the following reasons:
- There is no meaningful information there
- If the Redmine instance requires login, probably the admin doesn't want to publish any information to the internet. It is so bad to be indexed by Google
That can be resolved by the attached patch by adding robots exclusion tags to the pages.
Files
Updated by Go MAEDA almost 4 years ago
- File 33658.patch 33658.patch added
- Target version set to Candidate for next major release
Added tests to the patch.
Updated by Go MAEDA almost 4 years ago
- Target version changed from Candidate for next major release to 4.2.0
Setting the target version to 4.2.0.
Updated by Go MAEDA almost 4 years ago
- Subject changed from Prevent web crawlers from indexing login, register, and lost password form to Robots meta tag to prevent web crawlers from indexing login, register, and lost password form
Updated by Go MAEDA almost 4 years ago
- Status changed from New to Closed
- Assignee set to Go MAEDA
- Resolution set to Fixed
Committed the patch.
Updated by Holger Just almost 4 years ago
- Status changed from Closed to Reopened
Redmine already generates a robots.txt
in welcome#robots
. Wouldn't this be better suited there, specifically in source:trunk/app/views/welcome/robots.text.erb so that we have a single location where we define behavior for robots?
Updated by Go MAEDA almost 4 years ago
- File 33658-use-robots_txt-instead-of-robots-meta-tag.patch 33658-use-robots_txt-instead-of-robots-meta-tag.patch added
Holger Just wrote:
Redmine already generates a
robots.txt
inwelcome#robots
. Wouldn't this be better suited there, specifically in source:trunk/app/views/welcome/robots.text.erb so that we have a single location where we define behavior for robots?
You are absolutely right. I forgot about the existence of robots.txt for some reason.
The attached new patch uses robots.txt instead of robots meta tag.
Updated by Go MAEDA almost 4 years ago
- Subject changed from Robots meta tag to prevent web crawlers from indexing login, register, and lost password form to robots.txt: disallow crawling login, register, and lost password form
- Status changed from Reopened to Closed
Committed patch #33658#note-6.