Giter Site home page Giter Site logo

sniff-paste's Introduction

Sniff-Paste: OSINT Pastebin Harvester

Multithreaded pastebin scraper, scrapes to mysql database, then reads pastes for noteworthy information.

Use sniff-paste.py to go through the entire process of collection, logging, and harvest automatically. The scraper can be set to a paste limit of 0 to scrape indefinitely. If scraped indefinitely, press ctrl + c to stop scraping, any useful information will be in the database, along with a link back to the original paste it was found in.

Installation

sudo apt install libxslt-dev python3-lxml python3-nmap xsltproc mysql-server

pip3 install -r requirements.txt

  • Create database named sniff_paste in mysql server
  • Fill in settings.ini

python3 sniff-paste.py

This will scrape pastebin for the latest number of pastes, then run analysis for ip addresses, emails, and phone numbers. It filters out duplicates and runs scans on some of the harvested data.

Database Structure

  • sniff_paste -- root db
    • pastes -- stores paste with full text, date, link, title, and language
    • emails -- stores emails with extension to paste
    • links -- stores urls with extension to paste
    • ip -- stores ip with connectivity and extension to paste
    • phones -- stores phone numbers with extension to paste
    • secrets -- stores secret type with extension to paste
    • ports -- stores port scan info (port, status, service, version, ip)
    • cryptos -- stores cryptocurrency findings with extension to paste

Crypto findings are not certain to be valid, consider them low probability findings

Notes

  • Please contribute! If there's an error let me know -- even better if you can fix it :)
    • Regex Contributions would be very helpful, and should be pretty easy to add!
  • This tool is in the process of a bigger update, where the scraper can send all new pastes to my new project needmorecowbell/Funnel. I'm trying to consolidate all of my osint tools into one streamlined solution.

sniff-paste's People

Contributors

needmorecowbell avatar dmuhs avatar mend-bolt-for-github[bot] avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.