Giter Site home page Giter Site logo

dpn-flickr's Introduction

Scrape flickr

Installation

  1. Download this repo

  2. Check you have python3 installed on your machine

python3 -v

If not, download and install it from here: https://www.python.org/downloads/

  1. Open the repository with a text editor. Get an API key from Flickr and add it to the file called credentials.json which has the following text in it (replace the credentials with your own):
{"KEY":"YOUR_API_KEY", "SECRET":"YOUR_API_SECRET"}
  1. Install software requirements for this script by typing this command into the terminal. You may choose to use a virtual environment (see below).

pip3 install -r requirements.txt

  1. Then you can run the program to download with flickr. Check it works. I've used my name as there aren't too many results for it. For common search terms you will get 1000s of results and it will take a long time to download, and a lot of space.

python3 scraper.py --search "Tega Brain"

Note. If you want to stop the scraper running (without closing the terminal window). Press CNTRL + C (this shortcut stops terminal processes)

Usage

To scrape for a particular search term:

python scraper.py --search "SEARCH TERM"

You can also add a lat/lng coordinates to specify a geographic bounding box:

python scraper.py --search "SEARCH TERM" --bbox "minimum_longitude minimum_latitude maximum_longitude maximum_latitude"

To scrape for a particular group:

python scraper.py --group "GROUP URL"

Where GROUP URL is something like https://www.flickr.com/groups/scenery/pool/

Large-sized (1024px width) will be downloaded by default. You can download the original images by passing the flag --original.

Limit the number of pages of results downloaded by passing --max-pages N where N is pages of 500 results each.

python scraper.py --search "cicada" --max-pages 1

Use a Virtual Environment

python3 -m venv env
source env/bin/activate
pip install -r requirements.txt

dpn-flickr's People

Contributors

tegacodes avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.