Commit 78a83cb

abollini authored and atarix83 committed
Merged in task/dspace-cris-2023_02_x/DSC-1660 (pull request DSpace#2673)

DSC-1660 improve default robots.txt

Approved-by: Giuseppe Digilio
2 parents 6eaf646 + 69a0ddf commit 78a83cb

1 file changed

Lines changed: 10 additions & 8 deletions

src/robots.txt.ejs

@@ -16,25 +16,27 @@ Disallow: /submit
 Disallow: /workspaceitems
 Disallow: /profile
 Disallow: /workflowitems
+# Crawlers should be able to access the main entity page, but not the individual tabs
+Disallow: /entities/*/*/*
 # Crawlers should be able to access entity pages, but not the facet search links present on entity pages
-Disallow: /entities/*?f
+Disallow: /entities/*/*/*?*
 
 # Optionally uncomment the following line ONLY if sitemaps are working
 # and you have verified that your site is being indexed correctly.
-# Disallow: /browse/*
+Disallow: /browse/*
 #
 # If you have configured DSpace (Solr-based) Statistics to be publicly
 # accessible, then you may not want this content to be indexed
-# Disallow: /statistics
+Disallow: /statistics
 #
 # You also may wish to disallow access to the following paths, in order
 # to stop web spiders from accessing user-based content
-# Disallow: /contact
-# Disallow: /feedback
-# Disallow: /forgot
-# Disallow: /login
-# Disallow: /register
+Disallow: /feedback
+Disallow: /forgot
+Disallow: /login
+Disallow: /register
 
+Disallow: /server/opensearch
 
 ##############################
 # Section for misbehaving bots
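
To see which paths the newly enabled rules cover, here is a minimal sketch of a path matcher. It assumes the crawler follows the RFC 9309 convention, where `*` matches any run of characters, a trailing `$` anchors the end of the path, and every rule is otherwise a prefix match. The helper names `rule_to_regex` and `is_disallowed` are hypothetical and not part of DSpace, the sample paths are made up for illustration, and `Allow` rules and per-agent groups are ignored.

```python
import re
from typing import Iterable

# Disallow rules enabled by this commit (copied from the diff above).
DISALLOW_RULES = [
    "/entities/*/*/*",
    "/entities/*/*/*?*",
    "/browse/*",
    "/statistics",
    "/feedback",
    "/forgot",
    "/login",
    "/register",
    "/server/opensearch",
]


def rule_to_regex(rule: str) -> "re.Pattern[str]":
    """Translate one robots.txt path rule into a regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end;
    every other character (including '?') is matched literally.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules: Iterable[str] = DISALLOW_RULES) -> bool:
    """Return True if any rule matches the path from its start (prefix match)."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    # Hypothetical sample paths, not taken from a real DSpace site.
    for sample in (
        "/entities/publication/1234",
        "/entities/publication/1234/details",
        "/browse/author",
        "/login",
        "/home",
    ):
        print(sample, "->", "disallowed" if is_disallowed(sample) else "allowed")
```

Under these assumptions the main entity page stays crawlable, because `/entities/*/*/*` only matches paths that contain at least two more `/` characters after `/entities/`, which is the case for tab URLs but not for the entity page itself.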
