Giter Site home page Giter Site logo

wutongyuonce / news_recommend Goto Github PK

View Code? Open in Web Editor NEW

This project forked from luochana/news_recommend

0.0 0.0 0.0 17.46 MB

基于Spark的新闻推荐系统,包含爬虫项目、web网站以及spark推荐系统

Shell 0.10% C++ 1.58% C 0.11% PHP 0.18% Java 0.61% Scala 96.33% XSLT 0.05% HTML 0.98% VBScript 0.02% Classic ASP 0.05%

news_recommend's Introduction

News_recommend

本次项目是基于大数据计算引擎的新闻推荐系统--"今日小站",包含了爬虫,新闻网站(前端和后端),推荐系统(Spark)。

前端效果:

image
image
image

一.爬虫

开发环境:

pycharm+python3

软件架构:

mysql+scrapy+splash

项目描述:

爬虫项目负责重复爬取今日头条首页新闻,去除重复新闻,存入mysql。

二.新闻网站:

开发环境:

IntelliJ IDEA + maven + git + linux

软件架构:

mysql + springboot

项目描述:

今日小站是基于springboot框架搭建的web项目,用户在网站完成注册登录后,网站会记录用户的浏览行为。同时网站也会把推荐结果呈现给用户。

三.推荐系统:

开发环境:

IntelliJ IDEA + maven + git + linux

软件架构:

zookeeper + flume + kafka + spark + mysql

项目描述:

推荐系统计算两两文章间的相似度,并将与之最相似的10条数据id存入mysql。实时处理用户行为日志,将结果存入mysql。根据用户行为数据对用户进行推荐,并将结果存入mysql。

image

服务器规划:

spark1:192.168.56.101
spark2:192.168.56.102
spark3:192.168.56.103

项目持续更新中....

news_recommend's People

Contributors

luochana avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.