makaidong Goto Github PK
Name: 马开东
Type: User
Company: baidu
Bio: makaidong.com
Location: ** 北京
Blog: http://www.makaidong.com
Name: 马开东
Type: User
Company: baidu
Bio: makaidong.com
Location: ** 北京
Blog: http://www.makaidong.com
这个项目是一个基本包.封装了大多数nlp项目中常用工具
自然语言处理理论与实战
Mirror of Apache Nutch
各种爬虫---大众点评,安居客,58,人人贷,拍拍贷, IT桔子,拉勾网,豆瓣,搜房网,ASO100,气象数据,猫眼电影,链家,PM25.in...
阿里巴巴分布式数据库同步系统(解决中美异地机房)
PArallel Distributed Deep LEarning
python进阶 python高阶函数,底层理解和一些分布式计算
百度AI开放平台 Python SDK
58同城 智联招聘 hao123 网易云课堂 **大学排名 等 的python的一些爬虫
The code of book: Python Scraping
最新IP地址数据库-多语言解析以及导入数据库脚本
QuestionAnsweringSystem是一个Java实现的人机问答系统,能够自动分析问题并给出候选答案。
scikit-learn: machine learning in Python
Scrapy, a fast high-level web crawling & scraping framework for Python.
利用scrapy框架爬取网易博客
Redis-based components for Scrapy.
利用自动化测试工具selenium和无界面浏览器phantomjs爬取拉钩网数据
Simplehbase is a lightweight ORM framework between java app and hbase.
hbase web viewer
新浪微博爬虫(Scrapy、Redis)
DNN based hotword and wake word detection toolkit
Mirror of Apache Spark
使用java+httpclient+httpcleaner,多线程、分布式爬去电商网站商品信息,数据存储在hbase上,并使用solr对商品建立索引,使用redis队列存储一个共享的url仓库;使用zookeeper对爬虫节点生命周期进行监视等。
利用HttpClient4+实现网络小说爬虫,可动态添加热门的小说网站
爬虫项目源码整理,使用redis进行url缓存,hbase进行详细信息的存储。使用zookeeper进行爬虫线程的状态监控。
爬虫练习1 Python抓取静态网站信息
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.