manimozaffar / linkedin-scraper

A Playwright bot that scrapes LinkedIn and stores job advertisement data in a database and a Telegram channel

Python 99.40% Dockerfile 0.60%
fastapi linkedin linkedin-bot playwright chatgpt chatgpt-api browser-fingerprint browser-fingerprinting python scraper scraping sqlalchemy bot cralwer spider

linkedin-scraper's Issues

Add Turkey as a location

I would appreciate it if you could also add Python job ads for the Turkey location to the list of filters.

How to scrape posts that people make about job openings instead of the job openings themselves on LinkedIn?

  1. Is there a way to modify the scraping to be done on publicly shared posts about job openings? It would be great if we could specify which keywords to search for in the feed posts. Perhaps we could also scrape the content of the link shared in the post itself.

  2. Is there a way to generate an RSS feed instead of using a Telegram bot?

Many job openings, even those not published within LinkedIn's own system, are shared by people in their feeds. This would allow us to not only access recent job openings (even outside of LinkedIn), but also have access to the contact information of the person who shared the opening (the user who shared the post on LinkedIn).

And if there is an RSS feed, it would be possible to integrate it into a news app.
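As a rough illustration of the RSS idea in point 2, here is a minimal sketch that serializes already-scraped jobs into an RSS 2.0 document using only the Python standard library. The function name `build_rss` and the job dictionary keys (`title`, `url`, `body`, `posted_at`) are assumptions made for this example; they are not taken from the linkedin-scraper codebase.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone
from email.utils import format_datetime


def build_rss(jobs, channel_title, channel_link):
    """Serialize a list of job dicts into an RSS 2.0 document string.

    Each job dict is assumed (hypothetically) to have the keys
    "title", "url", "body" and "posted_at" (a datetime).
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = channel_title
    ET.SubElement(channel, "link").text = channel_link
    ET.SubElement(channel, "description").text = "Scraped job advertisements"

    for job in jobs:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = job["title"]
        ET.SubElement(item, "link").text = job["url"]
        # The job URL doubles as a stable identifier for feed readers.
        ET.SubElement(item, "guid").text = job["url"]
        # ElementTree escapes markup in text, so raw ad text is safe here.
        ET.SubElement(item, "description").text = job["body"]
        # RSS 2.0 expects RFC 822 style dates; format_datetime produces them.
        ET.SubElement(item, "pubDate").text = format_datetime(job["posted_at"])

    return ET.tostring(rss, encoding="unicode")
```

The returned string could be written to a file or served from an HTTP endpoint so that a news app can subscribe to it; how it would be wired into the existing bot is left open here.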

bug

Jobs posted by the bot contain information such as {java} that does not exist on the original job page.

Different Phases Of Development

  1. Leverage async more to speed up crawling
  2. SOLID implementation of crawling core, for flexibility in changing core or having multiple cores
  3. Ada core implementation instead of ThebAI
  4. SOLID implementation of FastAPI backend service for flexibility in changing cores or having multiple cores for filter queries
  5. Admin authentication backend
  6. Direct query ability backend
  7. Deploy to a server, so that information can be retrieved by third party
  8. Use a Lua script to evaluate filter queries with less connection overhead, as a new core (may need help)
  9. Telegram bot's feature to integrate with backend query ability
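For phase 1, one common way to get more out of async is to run many page fetches concurrently while capping how many are in flight at once. The sketch below shows that pattern with `asyncio.Semaphore` and `asyncio.gather`. The names `crawl_all`, `fetch` and `max_concurrency` are hypothetical; `fetch` stands in for whatever coroutine the crawler uses to load and parse one page.

```python
import asyncio


async def crawl_all(urls, fetch, max_concurrency=5):
    """Run `fetch(url)` for every URL with at most `max_concurrency` in flight."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def bounded(url):
        # Wait for a free slot before starting the fetch.
        async with semaphore:
            return await fetch(url)

    # gather returns results in input order. With return_exceptions=True,
    # a failed fetch is returned as an exception object instead of being raised.
    return await asyncio.gather(
        *(bounded(url) for url in urls), return_exceptions=True
    )
```

A caller would run it with something like `asyncio.run(crawl_all(urls, fetch))` and then separate the exception objects from the successful results. A sensible value for the limit depends on how aggressively the target site rate-limits.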

ChatGPT-exported ad vs. real ad on LinkedIn

Hello, I saw an ad in the Telegram bot whose requirements and text were about certain technologies, but none of them appeared in the real LinkedIn ad.

Translated from Persian:
The content of the ad in Telegram bore no resemblance to the content of the original ad on LinkedIn. For example, the LinkedIn ad was for a user experience researcher, but the channel post said that it required 3 years of Python experience and one of the frameworks, required one of the databases, and offered visa sponsorship. None of these were stated in the original ad.
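One possible guard against this kind of mismatch is to check every term the language model extracts against the original ad text before posting. The following is only a sketch under that assumption: `unsupported_terms` is a hypothetical helper, not part of the project, and plain token matching will miss paraphrases (for example "3 yrs" vs. "three years").

```python
import re


def unsupported_terms(extracted_terms, ad_text):
    """Return the extracted terms that never occur in the original ad text."""
    haystack = ad_text.lower()
    missing = []
    for term in extracted_terms:
        # Match the term as a whole token so that "java" does not match
        # "javascript" and "c" does not match "c++" or "c#".
        pattern = r"(?<![\w+#])" + re.escape(term.lower()) + r"(?![\w+#])"
        if not re.search(pattern, haystack):
            missing.append(term)
    return missing
```

If the returned list is non-empty, the bot could drop those fields or skip the post, rather than publish requirements that the original ad never mentions.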
