Project

General

Profile

Actions

Feature #33658

closed

robots.txt: disallow crawling login, register, and lost password form

Added by Go MAEDA over 4 years ago. Updated about 4 years ago.

Status:
Closed
Priority:
Normal
Assignee:
Category:
SEO
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:
Resolution:
Fixed

Description

The attached patch adds robots exclusion tags (<meta name="robots" content="noindex,follow,noarchive" />) to login, register, and lost password form.

Currently, Redmine allows web crawlers such as Googlebot and Baiduspider to index those forms. I think it is undesirable for the following reasons:

  • There is no meaningful information there
  • If the Redmine instance requires login, probably the admin doesn't want to publish any information to the internet. It is so bad to be indexed by Google

That can be resolved by the attached patch by adding robots exclusion tags to the pages.


Files

Actions #1

Updated by Go MAEDA about 4 years ago

Added tests to the patch.

Actions #2

Updated by Go MAEDA about 4 years ago

  • Target version changed from Candidate for next major release to 4.2.0

Setting the target version to 4.2.0.

Actions #3

Updated by Go MAEDA about 4 years ago

  • Subject changed from Prevent web crawlers from indexing login, register, and lost password form to Robots meta tag to prevent web crawlers from indexing login, register, and lost password form
Actions #4

Updated by Go MAEDA about 4 years ago

  • Status changed from New to Closed
  • Assignee set to Go MAEDA
  • Resolution set to Fixed

Committed the patch.

Actions #5

Updated by Holger Just about 4 years ago

  • Status changed from Closed to Reopened

Redmine already generates a robots.txt in welcome#robots. Wouldn't this be better suited there, specifically in source:trunk/app/views/welcome/robots.text.erb so that we have a single location where we define behavior for robots?

Actions #6

Updated by Go MAEDA about 4 years ago

Holger Just wrote:

Redmine already generates a robots.txt in welcome#robots. Wouldn't this be better suited there, specifically in source:trunk/app/views/welcome/robots.text.erb so that we have a single location where we define behavior for robots?

You are absolutely right. I forgot about the existence of robots.txt for some reason.

The attached new patch uses robots.txt instead of robots meta tag.

Actions #7

Updated by Holger Just about 4 years ago

Looks good to me, thanks!

Actions #8

Updated by Go MAEDA about 4 years ago

  • Subject changed from Robots meta tag to prevent web crawlers from indexing login, register, and lost password form to robots.txt: disallow crawling login, register, and lost password form
  • Status changed from Reopened to Closed

Committed patch #33658#note-6.

Actions

Also available in: Atom PDF