Giter Site home page Giter Site logo

gunjannandy / comicmaker Goto Github PK

View Code? Open in Web Editor NEW
5.0 1.0 4.0 13.29 MB

comicMaker is a command line tool to download Comics and Manga from various Manga and Comic sites easily. This is for educational purpose only. Supported websites : readcomiconline, mangalike, comicextra and more. For further information visit

Home Page: https://gunjan933.github.io/comicMaker/

License: MIT License

Python 100.00%
comic-downloader python manga-downloader multiprocessing image-to-pdf pdf-merger readcomiconline mangalike proxy-configuration proxy-list

comicmaker's Introduction

comicMaker project app icon

comicMaker

Build Status Known Vulnerabilities start with why contributions welcome python >=3.5 Gitter

comicMaker is a command line tool which downloads Comics and Manga from various Manga and Comic sites and converts them into (lossless) pdf easily. This is for educational purpose only.

Don't overuse this script. It puts loads on their servers.
Support the developers of those websites by disabling your adblock on their site.
Advertisments pay for the website servers.

TL;DR :
step 1: Use "config.json" to enter desired comic book names, and from which chapter you want to download.  
step 2: Then enter following commands:  
        pip install -r requirements.txt
        python comicMaker.py

Table of Content

Getting Started

Supported Sites

URL can be any URL of the supported websites.

Supported Sites Compatiability
https://readcomiconline.to 100%
https://readcomicsonline.ru 100%
https://www.comicextra.com 100%
https://mangalike.net 90%

How to find the book name

Go to the required book website and see the URL. Find the book name, usually just after the "http://mangalike.net/manga/" or "https://readcomiconline.to/Comic/" part. If you want to download from start put 1 in the chapter part in the config.json file.

Configuring download list

Find the file config.json. Inside there you can enter names of any books you want to download. And specify from which chapter you want to download. Here is an example:

+ Below is the content of "config.json" ->

{
    "mangaLike": {
        "chromosome-47" : "1",
        "haomen-tianjia-qianqi" : "35",
        "pulse" : "63"
        },
    "readComicOnlineTo": {
        "Civil-War-2006": "1",
        "Batman-The-Dark-Knight-Returns": "2",
        "X-Men-Origins-Jean-Grey": "1"
    },
    "readComicsOnlineRu": {
    
    },
    "comicExtra": {

    },
    "temporary library": {
        "tamen-de-gushi": "187",
        "otome-no-teikoku": "108.5"
    }
}  


+ This will download 6 books:
+ chromosome-47 , where downloads starts from chapter 1.
+ haomen-tianjia-qianqi , where download starts from chapter 35.
+ pulse , where download starts from chapter 63.
+ Civil-War-2006 , where download starts from chapter 1.
+ Batman-The-Dark-Knight-Returns , where download starts from chapter 2.
+ X-Men-Origins-Jean-Grey , where download starts from chapter 1.

- temporary library contains books that won't be downloaded by the program.

- Don't forget to put ',' between two books.

Dependencies Installation

This script can run on multiple Operating Systems. You need Node.js in your system's path for this script to work (You need this on each and every Operating System, even on WINDOWS :/).
Download the Node.Js from Node.js official website. Doesn't matter which operating system you're on, this is a must. Follow the instructions mentioned below, according to your OS.

Linux/Debian :

Since most (if not all) Linux/Debian OS come with python pre-installed, you don't have to install python manually. Make sure you're using python >= 3.5 though.

We need pip to install any external dependenc(ies). So, open any terminal and type in pip list and if it shows some data, then it is fine. But, if it shows error, like pip not found or something along this line, then you need to install pip. Just type this command in terminal :

sudo apt-get install python-pip

If you're on Fedora, CentOS/RHEL, openSUSE, Arch Linux, then you simply need to follow THIS TUTORIAL to install pip.

If this still doesn't work, then you'll manually need to install pip. Doing so is an easy one time job and you can follow THIS TUTORIAL to do so.

  • Download this requirements.text file and put it in some directory/folder.
  • Open terminal again and browse to the directory where you downloaded your requiremenets.txt file and run this command :
pip install -r requirements.txt
  • It should install the required external libraries.

Windows :

If you're on windows, then follow these steps :

  • Install Python >= 3.5. Download the desired installer from download Python.
  • Add it in the system path (if not already added).
  • Download this requirements.text file and put it in some directory/folder.
  • Open Command Prompt and browse to the directory where you downloaded your requiremenets.txt file and run this command :
pip install -r requirements.txt
  • It should install the required external libraries.

Now, install Node.Js as well and make sure it's in your path.

Well, if everything came up good without any error(s), then you're good to go!

Mac OS X :

Mac OS X users will have to fetch their version of Python and Pip.

After downloading and installing these, you need to add PIP & Python in your path. Follow THIS LITTLE GUIDE to install both, Python & pip successfully.

Python Support

Supports python >= 3.5

Usage

Follow the instructions according to your OS :

Windows

After you've saved this script in a directory/folder, you need to open command prompt and browse to that directory and then execute the script. Let's do it step by step :

  • Open the folder where you've downloaded the files of this repository.
  • Hold down the SHIFT key and while holding down the SHIFT key, RIGHT CLICK and select Open Command Prompt Here from the options that show up.
  • Now, in the command prompt, type this :
python comicMaker.py

Linux/Debian

After you've saved this script in a directory/folder, you need to open command prompt and browse to that directory and then execute the script. Let's do it step by step :

  • Open a terminal, Ctrl + Alt + T is the shortcut to do so (if you didn't know).
  • Now, change the current working directory of the terminal to the one where you've downloaded this repository.
  • Now, in the Terminal, type this :
python comicMaker.py

Features

  • This will download all the images from a chapter in a folder.
  • Progressbar added for getting realtime downloading status.
  • This will also create pdf for each chapters.
  • This will merge all chapter pdfs into a single one as a full book after completing downloading each book.
  • Added proxy support from Free Proxy List for anonymous downloading and to avoid getting banned.
  • Used multithreading for parallel downloading.

Save Location

Comics will be saved on the same directory you clone this repository. Here is how:

-     --SomeDirectory (Where you cloned the repository)
        |--comicMaker
-       |  |--comicMaker
        |  |  |--__init__.py
        |  |  |--(and some necessary modules the program needs)
        |  |--requirements.txt
        |  |--.gitignore
        |  |--_config.yml
        |  |--comicMaker.py
        |  |--config.json
        |  |--readme.md
-       |--comicDownloads
        |  |--chromosome-47
        |  |--haomen-tianjia-qianqi
        |  |--pulse 

How the program works

  • The comic pages are not hosted in "https://www.mangalike.net/". So the program goes in each chapter, looks into all of the image sources, gets all the links, saves them in location.
  • To convert to pdfs, any image should not contain any alpha channel. So every image is converted in RGB after downloading.
  • In system the sorted page numbers looks like (if page range is-(0-111)) then [0,1,10,11,100,101,110,111,2,20,21,3..,99]. We can't create pdf with this order, so we have to sort them by standard [0,1,2,3,..,110,111], and then make pdf.
  • After completing every book, all the pdfs under each chapters, are merged into a single one, so that merged pdf will contain the whole book.

Why the program uses this much resources

  • This Download iterates to over 2000 pages average for each book. (Approximately each chapter contains 15 pages, and there are over 150 chapters each book). So it will take ages to download every file at each iteration.
  • Here comes multithreading. With this, every files are downloaded simultaneously -
    • Minimum : It you have a 4 core CPU, you can start downloading 8 files at once. ( In cases where the number of files is less than 2 x cpu-core-count )
    • Maximum : It can download maximum of 90 files at once.

Future plans

Short term

  • Download only selected chapters from each book (config.json file should take that data)
  • Right now it can only download books from Manga Like which are divided only in chapters, not volumes. It should also be able to download if the book contains "Volumes".

Long term

  • Add more websites to download from.
  • Add command line arguments for ease of use.
  • Add documentation of how each code snippet works.

Bugs

  • (Non-Problamatic) The free proxies from FreeProxyList are anonymous and elite but limited and slow.

Changelog

  • [Added] New website Comic Extra
  • [Added] New website Read Comics Online
  • [Added] Proxy support from Free Proxy List for mass downloading without getting banned.
  • [Added] New website Read Comic Online
  • [Added] Optimize downloading process for taking less resources.
  • [Fixed] (Non-Problamatic) The pdf merging continues, even if the book is completely downloaded.
  • [Fixed] (Problamatic) Image to pdf fails if images are not downloaded completely.
  • [Fixed] (Problamatic) Can't convert RGBA images in pdf.
  • [Fixed] (Problamatic) If connection fails, the total downloading fails.
  • [Fixed] (Problamatic) Pdfs are merged out of order.
  • [Added] New website Manga Like

Opening An Issue/Requesting A Site

If your're planning to open an issue for the script or ask for a new feature or anything that requires opening an Issue, then please do keep these things in mind.

Reporting Issues

If you're going to report an issue, please follow this syntax :
Command You Gave : What was the command that you used to invoke the issue?
Expected Behaviour : After giving the above command, what did you expect shoud've happened?
Actual Behaviour : What actually happened?
Error Log : Error Log is mandatory.

Suggesting A Feature

If you're here to make suggestions, please follow the basic syntax to post a request :
Subject : Something that briefly tells us about the feature.
Long Explanation : Describe in details what you want and how you want.

License

MIT

comicmaker's People

Contributors

gunjannandy avatar imgbotapp avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

comicmaker's Issues

Proxy - Never found / Timeout

Command I Gave : Python3 comicMaker.py

Expected Behaviour : Searching for comics then download them while printing the progression bar.

Actual Behaviour :

The script kept searching for a proxy, writing :

- 1 Working proxies found.
Finding Proxy Lists...Please wait...
XXX---Bad proxy---XXX

in loop, finding proxies such as "47.74.152.29:8888 ", but never started downloading anything
Ending in a timeout from remote.

Error Log : No error besides a timeout after searching infinitely

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.