Giter Site home page Giter Site logo

Comments (3)

rgaudin avatar rgaudin commented on August 18, 2024 1

First of all, analyzing stats numbers alone is probably a bad idea. Traffic fluctuates and Kiwix is widely known projects so except major events (incoming traffic or technical issue), one can only make variably-informed guesses based on context.

That being said, here's we I (now) know:

  • prior to k8s, we were running the import script many times a day, over the same log file. Each entry in the log would be uploaded as many times as the script is launched after visit. Matomo is not verbose about how non-JS visits are counted. Doc indicate that User-agent + IP is considered (over a 30mn time frame) but does this applies to such uploads? Upload script itself doesn't do any kind of uniqueness guessing (at least the version we're using). It does filter out many requests (Errors, bots, search engines, static files, etc). I've tried to re-run a full day log ; there's a significant increase (10%+) but it's not crazy.
  • we had a long period during which the proxy was not forwarding the client's IP, resulting in all requests being assign to the same IP. Those are not all counted as a single unique visitor but grouped based on other criteria (probably time and user-agent). For Sunday April 17th for instance, we have a unique visitor (no user-agent) with 17,435 hits and another one (same IP) on android with 2 hits. April 21st is when this was fixed and the bump is clearly visible in the graph.
  • We failed to upload stats for download.kiwix.org four Sunday in a row (10/04, 17/04, 24/04 and 30/04). This is linked to the db issue every Monday morning and I have re-imported them manually ; except for April 10th which is past the logrotate limit.
  • Stats look in par with 2022's pre-war values now ; mostly since Apr 21st so I'd say that the IP issue and the missing Sunday's were the main cause ; especially as traffic was still high due to East-European traffic.

from container-images.

kelson42 avatar kelson42 commented on August 18, 2024

@rgaudin All of this let me tend to think that web server logs are not always uploaded properly.

from container-images.

kelson42 avatar kelson42 commented on August 18, 2024

@rgaudin Does the sunday problem is fixed... or this is a duplicate of an other ticket?

from container-images.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.