
yonathan06 / cassandra-glam-tools


A usage analysis tool for GLAM institutes to follow free contents contributed to Wikimedia projects

Home Page: https://glamwikidashboard.org

License: MIT License

JavaScript 44.04% CSS 9.90% Smarty 2.17% Python 6.32% PLpgSQL 0.47% Dockerfile 0.21% Handlebars 36.85% Shell 0.04%

cassandra-glam-tools's People

Contributors

alemela, chirale, davidhaskiya, diegmonti, dsalza, francescocretti, jpdevelop, lokal-profil, loredanamaran, maugenta, michal-josef-spacek, moriyanadav, talktonight, yonathan06


cassandra-glam-tools's Issues

README.md and documentation shortcomings

Does the README.md provide enough documentation to install the system from scratch? For example:

"Copy the file config/config.example.json to config/config.json and modify it as required."

This step seems to hide a lot of work, particularly database setup, and I'm not sure what the right directions would be. Thanks.
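The one step the README does spell out can be sketched in Python. This is only an illustration: the helper name `ensure_config` is hypothetical and not part of the project, the paths are the ones quoted from the README, and the sketch does not cover the database setup this issue asks about.

```python
import shutil
from pathlib import Path


def ensure_config(repo_root: str) -> Path:
    """Copy config.example.json to config.json unless config.json already exists.

    Hypothetical helper; the paths follow the README sentence quoted above.
    """
    config_dir = Path(repo_root) / "config"
    example = config_dir / "config.example.json"
    target = config_dir / "config.json"
    if not example.is_file():
        raise FileNotFoundError(f"missing example config: {example}")
    if not target.exists():
        # Never overwrite an existing config.json, so local edits survive.
        shutil.copyfile(example, target)
    return target
```

After the copy, `config/config.json` still has to be edited by hand; which values it needs is exactly what the README leaves open.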

more detailed explanation of view stats

Hi, a detailed explanation of how the view stats are compiled in this tool, compared with Magnus Manske's tools BaGLAMa2 and Glamorgan, would be really helpful.

[Image: screenshot, 2021-11-22 12:07:56]

As you can see from the graph, the stats differ a bit between the three tools (this is Nordic Museum data). I think (but am not 100% certain) that some possible sources of the differences are:

  • Is only the top-level category used, or are subcategories included, and if so, how deep?
  • How often is the category checked, i.e. if an image is added to or removed from the category, when does this affect the stats?
  • Are views taken from the daily Wikipedia pageview stats, or are they calculated from averages?
  • Are Wikipedia pages that display a file through a template (not in the actual page content) counted?
  • If multiple images from the set are on the same Wikipedia page, are that page's views counted once or once per image?
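To show how much the last question alone can matter, here is a small Python sketch with made-up numbers. The page names, file names and view counts are hypothetical, and neither function is claimed to be what any of the three tools actually does.

```python
# Hypothetical data: which category files appear on which page,
# and how many views each page received on one day.
page_files = {
    "Page_A": {"File1.jpg", "File2.jpg"},  # two category files on one page
    "Page_B": {"File3.jpg"},
}
page_views = {"Page_A": 100, "Page_B": 40}


def views_once_per_page(page_files, page_views):
    """Count each page's views once, however many category files it shows."""
    return sum(page_views[page] for page in page_files)


def views_once_per_image(page_files, page_views):
    """Count each page's views once for every category file on that page."""
    return sum(page_views[page] * len(files) for page, files in page_files.items())


print(views_once_per_page(page_files, page_views))   # 140
print(views_once_per_image(page_files, page_views))  # 240
```

With the same underlying traffic, the two conventions already differ by 100 views here, so a difference in convention could plausibly explain part of the gap in the graph.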

current description from each tool:

Cassandra:

This line chart shows the sum of the views for the files that belong to the main category, over a 24h span.
Generally the difference in visits between workdays and weekends is quite visible. Occasional spikes may be linked to a popular event. A growing trend does not always mean growing popularity of the category, because it can also result from file additions.
This statistic is updated every day (max lag time 24h). The data collected here are taken from https://dumps.wikimedia.org/other/mediacounts/daily/. A visit is recorded every time a device downloads a specific file from a Wikimedia server. So it counts both actual visits and visits to pages where the file is present (but only if the file is loaded on the device). This is the most precise statistic released by Wikimedia servers.
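Read literally, the description above amounts to a per-day sum over the category's files. A minimal Python sketch of that aggregation follows, assuming the daily counts have already been extracted into `(date, file_name, count)` tuples. That tuple layout, the function name and the sample values are assumptions for illustration; the sketch does not parse the mediacounts dump format and is not taken from the Cassandra code.

```python
from collections import defaultdict


def daily_category_views(records, category_files):
    """Sum per-file daily counts over the files that belong to one category.

    records: iterable of (date, file_name, count) tuples (assumed layout).
    category_files: set of file names in the category.
    """
    totals = defaultdict(int)
    for date, file_name, count in records:
        if file_name in category_files:
            totals[date] += count
    # Sort by date so the result can feed a line chart directly.
    return dict(sorted(totals.items()))


# Hypothetical sample: Other.jpg is outside the category and is ignored.
records = [
    ("2021-11-20", "A.jpg", 10),
    ("2021-11-20", "B.jpg", 5),
    ("2021-11-20", "Other.jpg", 99),
    ("2021-11-21", "A.jpg", 7),
]
print(daily_category_views(records, {"A.jpg", "B.jpg"}))
# {'2021-11-20': 15, '2021-11-21': 7}
```

Because the sum runs over whatever files are in the category on a given day, adding files raises the total even if no single file becomes more popular, which matches the caveat about growing trends.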

BaGLAMa2:

BaGLAMa shows you page view numbers for pages on Wikipedia (and other Wikimedia sites) containing Commons files in a specific category. Since February 2014, new software has been used to aggregate page views, so there may be minute differences.
The new Wikimedia pageview API (human views only, no bots) is used starting 2015-12!

(https://glamtools.toolforge.org/baglama2/)
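For comparison with the file-download counts above, a page-based count would query page views per article. The sketch below builds a request URL for the per-article endpoint of the Wikimedia Pageviews REST API, with `agent="user"` selecting human views only. Whether BaGLAMa2 calls this exact endpoint is an assumption, the function name is hypothetical, and the article title is only an example.

```python
from urllib.parse import quote

BASE = "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article"


def pageviews_url(project, article, start, end,
                  access="all-access", agent="user", granularity="daily"):
    """Build a per-article pageviews URL; start and end are YYYYMMDD strings."""
    # Titles use underscores instead of spaces and must be percent-encoded.
    title = quote(article.replace(" ", "_"), safe="")
    return f"{BASE}/{project}/{access}/{agent}/{title}/{granularity}/{start}/{end}"


print(pageviews_url("sv.wikipedia.org", "Nordiska museet", "20211101", "20211122"))
```

The sketch only builds the URL and makes no network call. A real client would also need to send a descriptive User-Agent header and sum the returned daily values over every page that embeds a category file.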

Glamorgan:

This tool is a variant of baGLAMa2. It can show the view number of pages that include files from a specific Commons category. Human views only, article namespace only. 30K files max in category tree. This tool is run "live", so it may take a while to run.

(https://glamtools.toolforge.org/glamorgan.html)
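The "category tree" and "30K files max" wording suggests a bounded walk over subcategories. A minimal Python sketch of such a walk over an in-memory category graph follows. The graph, the function name and the choice to truncate at the cap are all assumptions; the description does not say how Glamorgan behaves when the limit is reached.

```python
from collections import deque


def collect_files(root, subcats, files, max_depth, max_files=30_000):
    """Breadth-first walk of a category tree, bounded by depth and file count.

    subcats: dict mapping a category to its direct subcategories.
    files: dict mapping a category to the files directly in it.
    """
    seen = {root}  # guards against category cycles
    queue = deque([(root, 0)])
    collected = set()
    while queue:
        category, depth = queue.popleft()
        for name in files.get(category, []):
            collected.add(name)
            if len(collected) >= max_files:
                return collected  # assumed behaviour: stop at the cap
        if depth < max_depth:
            for sub in subcats.get(category, []):
                if sub not in seen:
                    seen.add(sub)
                    queue.append((sub, depth + 1))
    return collected


# Hypothetical graph with a cycle (Sub1 links back to Root).
subcats = {"Root": ["Sub1"], "Sub1": ["Sub2", "Root"]}
files = {"Root": ["a.jpg"], "Sub1": ["b.jpg"], "Sub2": ["c.jpg"]}
print(sorted(collect_files("Root", subcats, files, max_depth=0)))  # ['a.jpg']
print(sorted(collect_files("Root", subcats, files, max_depth=2)))  # ['a.jpg', 'b.jpg', 'c.jpg']
```

A depth of 0 corresponds to using only the top-level category, so the depth setting alone changes which files, and therefore which views, enter the total.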
