Commit

Simplify robots.txt
anjackson committed Feb 22, 2023
1 parent 3bbd506 commit 94a5359
Showing 1 changed file with 5 additions and 7 deletions.
12 changes: 5 additions & 7 deletions static/robots.txt
@@ -1,4 +1,4 @@
-# Generally, block infinite traps;
+# Generally, block infinite traps, and avoid archives copies of websites interfering with live sites search presence:
 User-agent: *
 Disallow: /wayback/
 Disallow: /ukwa/search
@@ -7,15 +7,13 @@ Disallow: /cy/ukwa/search
 Disallow: /gd/ukwa/search
 Disallow: /datasets/
 Disallow: /shine/search
+# Allow search engines to index specific sites:
+# As requested in https://github.com/ukwa/ukwa-services/issues/96
+Allow: /wayback/archive/*/http://www.europeandialogue.org/

 # Allow Twitterbot so social cards work:
 User-agent: Twitterbot
 Allow: /wayback/

 User-agent: facebookexternalhit
-Allow: /wayback/
-
-# Allow search engines to index specific sites:
-User-agent: *
-# As requested in https://github.com/ukwa/ukwa-services/issues/96
-Allow: /wayback/archive/*/http://www.europeandialogue.org/
+Allow: /wayback/
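
The change moves the europeandialogue.org `Allow` rule into the first `User-agent: *` group, next to `Disallow: /wayback/`. Whether that `Allow` wins depends on how a crawler resolves overlapping rules. The sketch below assumes the longest-match rule from RFC 9309 (the most specific pattern wins, and `Allow` wins a tie) with `*` treated as a wildcard; individual crawlers may behave differently. The helper names `pattern_to_regex` and `is_allowed` and the timestamp in the sample path are hypothetical, and the rule list contains only the rules visible in the diff, so lines of the file that the diff does not show are missing from it.

```python
import re

# Rules in the "User-agent: *" group after this commit, copied from the diff.
# Assumption: lines of robots.txt not shown in the diff are not included here.
RULES = [
    ("disallow", "/wayback/"),
    ("disallow", "/ukwa/search"),
    ("disallow", "/cy/ukwa/search"),
    ("disallow", "/gd/ukwa/search"),
    ("disallow", "/datasets/"),
    ("disallow", "/shine/search"),
    ("allow", "/wayback/archive/*/http://www.europeandialogue.org/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match semantics."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    # Hypothetical capture timestamp, for illustration only.
    sample = "/wayback/archive/20230101000000/http://www.europeandialogue.org/"
    print(sample, is_allowed(sample))
    print("/wayback/archive/20230101000000/http://example.org/",
          is_allowed("/wayback/archive/20230101000000/http://example.org/"))
```

Under these assumptions the archived europeandialogue.org path is crawlable because its `Allow` pattern is longer than `/wayback/`, while other `/wayback/` paths remain blocked. Python's standard `urllib.robotparser` is not used here because it does not expand `*` inside paths.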
