Comments (3)
First of all, analyzing stats numbers alone is probably a bad idea. Traffic fluctuates and Kiwix is widely known projects so except major events (incoming traffic or technical issue), one can only make variably-informed guesses based on context.
That being said, here's we I (now) know:
- prior to k8s, we were running the import script many times a day, over the same log file. Each entry in the log would be uploaded as many times as the script is launched after visit. Matomo is not verbose about how non-JS visits are counted. Doc indicate that User-agent + IP is considered (over a 30mn time frame) but does this applies to such uploads? Upload script itself doesn't do any kind of uniqueness guessing (at least the version we're using). It does filter out many requests (Errors, bots, search engines, static files, etc). I've tried to re-run a full day log ; there's a significant increase (10%+) but it's not crazy.
- we had a long period during which the proxy was not forwarding the client's IP, resulting in all requests being assign to the same IP. Those are not all counted as a single unique visitor but grouped based on other criteria (probably time and user-agent). For Sunday April 17th for instance, we have a unique visitor (no user-agent) with 17,435 hits and another one (same IP) on android with 2 hits. April 21st is when this was fixed and the bump is clearly visible in the graph.
- We failed to upload stats for download.kiwix.org four Sunday in a row (10/04, 17/04, 24/04 and 30/04). This is linked to the db issue every Monday morning and I have re-imported them manually ; except for April 10th which is past the logrotate limit.
- Stats look in par with 2022's pre-war values now ; mostly since Apr 21st so I'd say that the IP issue and the missing Sunday's were the main cause ; especially as traffic was still high due to East-European traffic.
from container-images.
@rgaudin All of this let me tend to think that web server logs are not always uploaded properly.
from container-images.
@rgaudin Does the sunday problem is fixed... or this is a duplicate of an other ticket?
from container-images.
Related Issues (20)
- Too many download.kiwix.org page views HOT 2
- download.kiwix.org/dev should be replaced with a drive HOT 3
- Rename repository HOT 2
- Consider installing fail2ban on the dropbox HOT 1
- watcherbot is not built and does not work anymore HOT 1
- Clean-up repository HOT 1
- [REGRESSION] OPDS stream is not HTTP compressed anymore HOT 1
- torrent tracker not working via IPv6 HOT 6
- Replace MirrorBrain by MirrorCache or Mirrorbits HOT 18
- Torrent Tracker Not Working HOT 6
- Report error in scanMirrors
- Magnet links not working HOT 2
- Surfer doesn't build anymore HOT 1
- Update MirrorBrain base image HOT 4
- Mirrorbrain: stop scanning dirs too many times HOT 1
- Mirrorbrain is not reporting IP location appropriately HOT 6
- Add debian/maintenance image HOT 4
- Build images conditionally HOT 1
- Release permalink for libkiwix_xcframework.tar.gz don't work HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from container-images.