Should we change robots.txt at https://library.kiwix.org? #232
Comments
Do we want to bring search engines' attention to the library? It's basically a copy of multiple other sources (something search engines are known to dislike), and it doesn't bring people to Kiwix because there's no mention of Kiwix there, nor of the readers, the format, or anything else. It also pollutes search engines with outdated data, and finally it increases load on our machine for traffic we're not interested in.
At this stage, IMO, the catalog part of library.kiwix.org should be crawled, but not the demo part.
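To make the "crawl the catalog, not the demo" idea concrete, here is a minimal sketch that compares a selective policy with a block-everything one, using Python's standard `urllib.robotparser`. The `/content/` prefix and the example page URL are assumptions made for illustration; they are not confirmed to match the actual URL layout of library.kiwix.org, and neither policy is the one currently deployed.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical selective policy: block only the served content (demo) pages.
# The /content/ prefix is an assumption, not the confirmed site layout.
SELECTIVE = """\
User-agent: *
Disallow: /content/
"""

# Policy that forbids crawling of every path on the host.
BLOCK_ALL = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, url: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    home = "https://library.kiwix.org/"
    page = "https://library.kiwix.org/content/example/A/Page"  # hypothetical URL
    for name, policy in (("selective", SELECTIVE), ("block-all", BLOCK_ALL)):
        print(name, "home:", is_allowed(policy, home), "page:", is_allowed(policy, page))
```

Note that Python's parser does simple prefix matching and does not understand the `*` or `$` extensions that some crawlers accept, so a check like this is only a rough approximation of how a given search engine would read the file.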
What do you mean by catalog part? The homepage or If
Anyway, why would we drive people towards library.kiwix.org if they are not told where they are, what Kiwix is, etc.?
I know for a fact that the zim files get crawled already as we regularly receive spam-like emails that are pretty much always like
They are basically trying to replace a random link with theirs, I guess as a form of SEO. They never seem to realize that they're pointing at a Wikipedia page, and the text barely ever changes, so I suspect a fully automated operation. Long story short, we could do without these, and I see no material advantage for us in driving traffic to content pages (as opposed to letting people access the source material, or driving folks to the more generic library.kiwix.org landing page).
This forbids everything, and this may not be the best way to advertise our library?!