download statistics inflated by travis #79

Open
ygrek opened this issue Nov 27, 2013 · 6 comments

Comments

@ygrek

ygrek commented Nov 27, 2013

Is it possible to account for downloads from Travis CI and subtract them from the download stats?

@avsm
Contributor

avsm commented Nov 27, 2013

Is it really worth the trouble? That number will also be inflated by web crawlers. If the downloads are that low, the number is well within the error margins...

@ygrek
Author

ygrek commented Nov 27, 2013

Crawlers will not usually download random archives, and the majority of them can be diverted with robots.txt. Considering that every build of every reverse dependency on Travis downloads the package several times, I guess it can substantially inflate the number.

@AltGr
Contributor

AltGr commented Nov 29, 2013

If we have a list of Travis IPs, it should be quite straightforward to filter them out during log parsing.
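
A minimal sketch of that kind of filter, assuming the download logs use the common Apache/nginx access-log layout (client address first, request line in quotes). The address ranges below are reserved documentation networks standing in for a real Travis list, which this thread does not have, and the names `CI_NETWORKS`, `is_ci_address` and `count_downloads` are made up for illustration.

```python
import ipaddress
import re
from collections import Counter

# Placeholder ranges (RFC 5737 documentation networks), NOT real Travis addresses.
CI_NETWORKS = [
    ipaddress.ip_network(n) for n in ("203.0.113.0/24", "198.51.100.16/28")
]

# Assumed log layout: '<ip> - - [<timestamp>] "GET <path> ...'
REQUEST_RE = re.compile(r'^(\S+) \S+ \S+ \[[^\]]*\] "GET (\S+)')


def is_ci_address(addr):
    """Return True if addr parses as an IP inside one of the CI networks."""
    try:
        ip = ipaddress.ip_address(addr)
    except ValueError:
        return False
    return any(ip in net for net in CI_NETWORKS)


def count_downloads(log_lines):
    """Count GET requests per path, skipping requests from CI addresses."""
    counts = Counter()
    for line in log_lines:
        match = REQUEST_RE.match(line)
        if match is None:
            continue  # not a GET request, or not in the assumed layout
        addr, path = match.groups()
        if is_ci_address(addr):
            continue
        counts[path] += 1
    return counts
```

Running the same count with and without the `is_ci_address` check would also show how large the inflation actually is before deciding whether the filter is worth keeping.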

@ygrek
Author

ygrek commented Nov 29, 2013

Maybe they use a specific user-agent?
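
If they do, the check could look roughly like the sketch below. It assumes the logs are in the combined log format, where the user-agent is the last quoted field. The marker string `example-ci-bot` is purely hypothetical: nothing in this thread says which user-agent Travis builds send, and `LOG_RE`, `is_ci_agent` and `count_non_ci_downloads` are illustrative names.

```python
import re

# Assumed combined log format:
# <ip> - - [<time>] "<method> <path> <proto>" <status> <size> "<referer>" "<agent>"
LOG_RE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]*\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'(?P<status>\d{3}) \S+ "[^"]*" "(?P<agent>[^"]*)"'
)

# Hypothetical marker; replace with whatever the real logs show, if anything.
CI_AGENT_MARKERS = ("example-ci-bot",)


def is_ci_agent(agent):
    """Return True if the user-agent contains one of the CI markers."""
    lowered = agent.lower()
    return any(marker in lowered for marker in CI_AGENT_MARKERS)


def count_non_ci_downloads(log_lines):
    """Count successful requests whose user-agent is not flagged as CI."""
    total = 0
    for line in log_lines:
        match = LOG_RE.match(line)
        if match is None or match.group("status") != "200":
            continue
        if is_ci_agent(match.group("agent")):
            continue
        total += 1
    return total
```

This only helps if CI builds send a distinguishable user-agent. If they fetch archives with a stock download tool, their requests look the same as any other client's and address-based filtering is the remaining option.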

@avsm
Contributor

avsm commented Nov 29, 2013

This is a total waste of time. Travis makes about 6 requests per pull request, and it doesn't do so regularly at all. Is there any evidence of this inflation other than that?

@ygrek
Author

ygrek commented Nov 29, 2013

I don't really know. I could try to investigate with the logs. I was just looking at the extlib download numbers (for 1.6.0) and saw them grow rapidly in the first day after release, which was quite unexpected, so I am trying to find an explanation.
