Giter Site home page Giter Site logo

discord-scraper's Introduction

Discord Scraper [Beta]

Release Date: October 3, 2017 @ 10:13 UTC-0

Tutorial

Step 1: Open/Login to your Discord app and press CTRL + SHIFT + I to open the Developer Tools panel. Take the Server ID and Channel ID data from the URL in the title of the Developer Tools panel. x74PJvp.png

Step 2: Go to the Application tab in the Developer Tools panel. Expand the Local Storage area on the left-side of the Developer Tools panel. Select the discordapp URL in the list. Highlight the data between the quotation marks in the token value and press CTRL + C to copy the token. hTsoSGd.png

Step 3: Open the Discord.py script in any Python IDE of your choosing (IDLE was used in this example). Replace the text '' with the token value you grabbed from Step 2. Replace the text '' with the Server ID data that you grabbed from Step 1. Replace the text '' with the Channel ID data that you grabbed from Step 1. Alter the Query value as necessary (the default is set to grab all NSFW and non-NSFW images and videos). dqsYX8q.png

Step 4: Make sure that you put the Discord.py script in its own folder since all contents will be placed in the same folder that the Discord.py file resides. Run the Discord.py script and it should prompt you for the number of pages to grab (this coincides with the page number from the search query) the general rule is 25 contents per page, so 2 pages would equate to about 50 items. Let the script run (it may take a while to finish) the beta version of this script is designed to halt upon error, but the final release of this script will ignore errors and continue grabbing data as demanded.

Warning

Do not run these scripts if you meet any of these limitations:

  • A CPU that is prone to overheating
  • A metered internet connection (monthly bandwidth caps)
  • A storage device that is low on available space

Changelog (DD-MM-YYYY)

07-04-2018 - Beta Fix #3:

  • Fixed threading issue (too many concurrent threads)
  • Fixed filename issues when grabbing files with similar filenames (still a potential issue with large amounts of files but significantly less issues)

07-04-2018 - Beta Fix #2:

  • Fixed problems when downloading from channels with less than 25 images/videos as the older scripts assumed more than 25 images/videos in the channel.
  • I will incorporate a better method of grabbing images where there's less corruptions and less missing photos.

21-02-2018 - Beta Update #1:

  • Updated this readme to include warning information
  • Created a version for those running Python 3
  • Updated the Python 2 version to match the Python 3 version with threading support

03-10-2017 - Beta Fix #1:

  • Fixed issue with URL appending offset query information ad infinitum.
  • Fixed issue with uninitialized opener data when grabbing multiple pages of JSON data.
  • Added new function to allow for the resetting of opener data when grabbing JSON data.

03-10-2017 - Beta Release:

  • The first release of the script.
  • Not meant for production use.
  • Still has bugs to fix and features to implement.

discord-scraper's People

Stargazers

Drago avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.