Giter Site home page Giter Site logo

Comments (8)

rautamiekka avatar rautamiekka commented on May 14, 2024 1

Unless someone has a better idea, that'd mandate using a file containing the links to the previously downloaded files. For example: https://github.com/rg3/youtube-dl/ can use a file which lists the vid IDs of already fetched vids -after- you DL, which is way easier to implement and use compared to a general pic ripper.

The JSON format I can imagine right now that'll enable matching multiple items from a single address:

{
    "ORIGINAL LINK SUCH AS FOLDER OR ALBUM": {
        "ITEM IN FOLDER OR ALBUM": [
            "DOWNLOAD LINK USED",
            ...
        ], ...
    }, ...
}

For example:

{
    "http://rautamiekka.deviantart.com/gallery/?catpath=/": {
        "http://rautamiekka.deviantart.com/art/Princess-Luna-BIG-COLLAB-colored-0-672156533": [
            "http://img09.deviantart.net/ed89/i/2016/335/4/2/_sleeping_on_the_job___finnish_by_rautamiekka-daq4v9l.png"
        ]
    }, ...
}

That allows for very fine (by folder/album address, item address in the folder/album, the DL link used, the filename used) matching of already-downloaded content, assuming the same format works for everything.

Otherwise we'll need ripper-specific formats built into the specific ripper and an unified API to use the format.

from ripme.

 avatar commented on May 14, 2024 1

Having this addition to the app would drastically reduce download times.

from ripme.

cyian-1756 avatar cyian-1756 commented on May 14, 2024 1

This was added, to use it click the "Remember URL history" button in the config menu

from ripme.

rautamiekka avatar rautamiekka commented on May 14, 2024

^ Re-download times, mind you.

from ripme.

Davrial avatar Davrial commented on May 14, 2024

^ *Re-RIP times, mind you. It would be explicitly preventing re-downloads.

Also as for your overspecialization of folder/file details, it wouldnt need to be nearly that complicated. You would just have the ripper document the last time it ran a rip and then have that compare to the metadata on a site which shows its age/timestamp. It would be extra easy with sites like Tumblr where all images are stored with a string name, because then the ripper could just be told to only rip images with a newer string than the most recent image currently saved/ripped for each specific url/folder.

from ripme.

metaprime avatar metaprime commented on May 14, 2024

@rautamiekka That's exactly the solution I imagined. This request has been opened and discussed multiple times in the past, and at least one of those issues is open (probably aggregating links to the duplicates as well).

from ripme.

 avatar commented on May 14, 2024

I support this enhancement as well. I opened an issue for deduplication a week or two ago, but it got closed since "ripme doesn't have deduplication by design".

from ripme.

metaprime avatar metaprime commented on May 14, 2024

@ANonCoder123 This is a distinct issue from deduplication. This is an issue of not downloading URLs already downloaded. It mostly would solve the problem of deduplication (except for duplicates downloaded from different URLs)

from ripme.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.