Comments (8)
Unless someone has a better idea, that'd mandate using a file containing the links to the previously downloaded files. For example: https://github.com/rg3/youtube-dl/ can use a file which lists the vid IDs of already fetched vids -after- you DL, which is way easier to implement and use compared to a general pic ripper.
The JSON format I can imagine right now that'll enable matching multiple items from a single address:
{
"ORIGINAL LINK SUCH AS FOLDER OR ALBUM": {
"ITEM IN FOLDER OR ALBUM": [
"DOWNLOAD LINK USED",
...
], ...
}, ...
}
For example:
{
"http://rautamiekka.deviantart.com/gallery/?catpath=/": {
"http://rautamiekka.deviantart.com/art/Princess-Luna-BIG-COLLAB-colored-0-672156533": [
"http://img09.deviantart.net/ed89/i/2016/335/4/2/_sleeping_on_the_job___finnish_by_rautamiekka-daq4v9l.png"
]
}, ...
}
That allows for very fine (by folder/album address, item address in the folder/album, the DL link used, the filename used) matching of already-downloaded content, assuming the same format works for everything.
Otherwise we'll need ripper-specific formats built into the specific ripper and an unified API to use the format.
from ripme.
Having this addition to the app would drastically reduce download times.
from ripme.
This was added, to use it click the "Remember URL history" button in the config menu
from ripme.
^ Re-download times, mind you.
from ripme.
^ *Re-RIP times, mind you. It would be explicitly preventing re-downloads.
Also as for your overspecialization of folder/file details, it wouldnt need to be nearly that complicated. You would just have the ripper document the last time it ran a rip and then have that compare to the metadata on a site which shows its age/timestamp. It would be extra easy with sites like Tumblr where all images are stored with a string name, because then the ripper could just be told to only rip images with a newer string than the most recent image currently saved/ripped for each specific url/folder.
from ripme.
@rautamiekka That's exactly the solution I imagined. This request has been opened and discussed multiple times in the past, and at least one of those issues is open (probably aggregating links to the duplicates as well).
from ripme.
I support this enhancement as well. I opened an issue for deduplication a week or two ago, but it got closed since "ripme doesn't have deduplication by design".
from ripme.
@ANonCoder123 This is a distinct issue from deduplication. This is an issue of not downloading URLs already downloaded. It mostly would solve the problem of deduplication (except for duplicates downloaded from different URLs)
from ripme.
Related Issues (20)
- Ripme Error JSON Video not found HOT 1
- Ceare a ripper for AI image generator website
- Imagefap Ripper Gallery Broken?? HOT 3
- Youtube ripper creation tutorial
- Instagram Downloads greeted with Instant Error HOT 1
- Unable to download twitter accounts HOT 15
- ImageFap gallery URLs changed, ripper requires update to match new format HOT 2
- Please add xhamster proxies like xhwide1.com HOT 3
- Can someone add bunkr.su to the program
- 0678 HOT 1
- 0678
- ImageFap not working HOT 3
- Failing to download sub reddits HOT 1
- Add pholder.com reddit mirror and "waybackmachine" HOT 4
- is there a way to rip from ex hentai? HOT 1
- u must be stupid to request anything here, its obvious that project is abandoned 2yrs ago HOT 1
- [SOLVED] RipMe Fails on Startup (Linux Kubuntu 22.04) HOT 1
- Olรก HOT 1
- URL Format change, site Myhentaicomics.com HOT 1
- Video download from Motherless not working HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ripme.