Giter Site home page Giter Site logo

darshannere / internship-scraper Goto Github PK

View Code? Open in Web Editor NEW

This project forked from masterspin/internship-scraper

0.0 0.0 0.0 2.29 MB

SWE/QUANT Internship Scraper & Manager for 2024-2025

Home Page: https://internship-scraper-lime.vercel.app

JavaScript 0.38% Python 24.17% TypeScript 74.43% CSS 1.02%

internship-scraper's Introduction

Internship Scraper & Manager for SWE/Quant/Business Internships/Co-ops (2024-2025 School Year)

Overview

This project is a comprehensive solution for scraping and managing internship and co-op listings for the 2024-2025 school year. It focuses on positions in software engineering (SWE), quantitative trading (Quant), and business domains. The scraper collects listings from LinkedIn, the Pitt CSC & Simplify GitHub repository, and the Ouckah & CSCareers GitHub repository. The platform also incorporates Google OAuth for seamless user management.

Features

  • Multi-source scraping: Collects job listings from LinkedIn, PittCSC GitHub, and Ouckah GitHub.
  • User authentication: Utilizes Google OAuth for secure and easy user management.
  • Internship management: Enables users to save, filter, and track internship applications.
  • Responsive design: Ensures a seamless experience across devices.

Tech Stack

  • Frontend: React, TypeScript, Next.js, TailwindCSS
  • Database: PostgreSQL, Supabase
  • Scraping Tools: Beautiful Soup, asyncio, aiohttp
  • Authentication: Google OAuth 2.0

Installation

Prerequisites

  • Node.js
  • Python
  • Supabase account
  • Google Cloud project for OAuth

Steps

  1. Clone the repository:

    git clone https://github.com/masterspin/internship-scraper.git
    cd internship-scraper
  2. Install dependencies:

    npm install
  3. Setup environment variables: Create a .env file in the root directory and add the following variables:

    NEXT_PUBLIC_SUPABASE_URL=your_supabase_url
    NEXT_PUBLIC_SUPABASE_ANON_KEY=your_supabase_anon_key
    NEXT_PUBLIC_SERVICE_ROLE_KEY=your_supabase_anon_key
    
  4. Run the Python scrapers: Ensure you have Python and the required libraries installed. Then run:

    python3 linkedinScraper.py
    python3 githubScraper.py
  5. Start the development server:

    npm run dev

Usage

  1. Log in with Google:

    • Navigate to the homepage.
    • Click on the "Sign In" button to authenticate.
  2. Scrape internships:

    • Use the provided options to initiate scraping from LinkedIn, PittCSC GitHub, and Ouckah GitHub.
    • Filter and manage the scraped listings.
  3. Save and track applications:

    • Keep track of your applications directly on the platform.
    • Add your own personal job postings.

Contributing

We welcome contributions from the community! To contribute, follow these steps:

  1. Fork the repository.
  2. Create a new branch (git checkout -b feature-branch).
  3. Make your changes and commit them (git commit -m 'Add new feature').
  4. Push to the branch (git push origin feature-branch).
  5. Create a new Pull Request.

License

This project is licensed under the MIT License.

Contact

For any inquiries or feedback, please contact us at [email protected].


This README provides a comprehensive guide to setting up, using, and contributing to the Internship Scraper & Manager. It also highlights the key features and technologies used in the project, making it easy for users and contributors to get started.

internship-scraper's People

Contributors

darshannere avatar masterspin avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.