Giter Site home page Giter Site logo

similar-manga / similar Goto Github PK

View Code? Open in Web Editor NEW
20.0 2.0 2.0 8.98 MB

Finding recommendations between all MangaDex manga

License: GNU General Public License v3.0

Go 99.60% Python 0.40%
manga neko tfidf recomendation-algorithm mangadex mangadex-scraper manga-recommendations

similar's Introduction

Similar Manga Recommendations

This repo has both the scraping and matching utilities to find mangas which are close in content to others. The idea is to create a recommendation system outside MangaDex since there isn't one and thus allow for users to discover other content. Right now it is pretty rudimentary, but seems to give ok results most of the time. If you want to use it, please download Neko and enable recommendations inside the app or use the Chrome Extension.

Setup / Dependencies

Client was generated using swagger. You will need to setup a golang workspace, and then run the following commands. Only manga need to be downloaded / scraped from mangadex to be able to perform similar manga identification.

go get golang.org/x/oauth2
go get github.com/antihax/optional
go get github.com/james-bowman/nlp
go get github.com/caneroj1/stemmer
go get github.com/PuerkitoBio/goquery

Runtime Instructions

  1. Run ./1_cache_mangas to get local list of all mangas on the site (28 hours for 73k manga)
  2. Run ./2_calc_similar to generate similarity index between manga (6 hours)
  3. Run ./3_calc_mappings to export external ID to mangadex UUID mapping files (2 minutes)
  4. Run ./4_calc_mu_ids convert MangaUpdate ids into uniform "new" ids for their new api (27 hours)
  5. Run python utils/create_sqlite.py to generate mapping file for Neko

Manga Links Data

Key Related site URL URL details
al anilist https://anilist.co/manga/`{id}` Stored as id
ap animeplanet https://www.anime-planet.com/manga/`{slug}` Stored as slug
bw bookwalker.jp https://bookwalker.jp/`{slug}` Stored has "series/{id}"
mu mangaupdates https://www.mangaupdates.com/series.html?id=`{id}` Stored has id
nu novelupdates https://www.novelupdates.com/series/`{slug}` Stored has slug
kt kitsu.io https://kitsu.io/api/edge/manga/`{id}` or https://kitsu.io/api/edge/manga?filter[slug]={slug} If integer, use id version of the URL, otherwise use slug one
amz amazon N/A Stored as full URL
ebj ebookjapan N/A Stored as full URL
mal myanimelist https://myanimelist.net/manga/{id} Store as id
raw N/A N/A Stored as full URL, untranslated stuff URL (original language)
engtl N/A N/A Stored as full URL, official english licenced URL

similar's People

Contributors

goldbattle avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

similar's Issues

standalone usage

nice project , is there any way to use it without neko ?
maybe with a wraper to make a cli or something like that, it would be very useful for daily usage

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.