sushengyang
Name: Billy
Type: User
Process Migration - CMU 15640 Distributed System
RMI - CMU 15640 Distributed System Lab2
MapReduce
MPI Cluster - CMU 15640
Open Source, Distributed, RESTful Search Engine
My individual project (dissertation) was on ensemble methods. I first did a background study of different ensemble methods and then implemented boosting, AdaBoost, bagging, and random forest techniques on top of underlying machine learning algorithms. I used boosting to improve the performance of weak learners such as decision stumps, implemented bagging for decision trees (both regression and classification problems) and for a KNN classifier, and used random forests for classification trees. I also implemented AdaBoost, a specific boosting algorithm, on logistic regression using different threshold values. I then plotted graphs such as the error rate as a function of boosting, bagging, and random forest iterations, compared the results of bagging with those of boosting, and analysed classifier performance before and after applying ensemble methods. To estimate the performance of the ensemble techniques, I used model evaluation techniques including cross-validation, MSE, PRSS, ROC curves, confusion matrices, and out-of-bag error estimation.
interview questions
Augmenting Facebook's graph search with functionality for full-text search of a user's posts.
The goal of this project is to implement a program that parses an ASCII text-based CSV data file, transforms the same information into a memory-efficient binary format, and writes it to a binary data file. The new binary data file also has multiple index files for efficient retrieval of records by the values of fields other than the primary key. The program should operate entirely from the command line (no GUI).
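The description above can be illustrated with a minimal Python sketch. It is not the project's actual code: the three-field record layout (`id`, `name`, `score`), the function names, and the in-memory index are all hypothetical, chosen only to show fixed-width binary records plus a secondary index on a non-key field. For brevity it builds the data and index in memory rather than writing them to files.

```python
import csv
import io
import struct

# Hypothetical fixed-width layout (an assumption, not the project's schema):
# 4-byte little-endian int id, 16-byte ASCII name, 8-byte float score.
RECORD = struct.Struct("<i16sd")

def csv_to_binary(csv_text):
    """Pack CSV rows into fixed-width records.

    Returns (data, index), where index maps each name (a non-key field)
    to the byte offsets of the records that contain it.
    """
    data = bytearray()
    index = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        offset = len(data)
        name = row["name"].encode("ascii")[:16]  # truncate to the field width
        data += RECORD.pack(int(row["id"]), name, float(row["score"]))
        index.setdefault(row["name"], []).append(offset)
    return bytes(data), index

def read_record(data, offset):
    """Unpack one record at the given byte offset."""
    rec_id, name, score = RECORD.unpack_from(data, offset)
    return rec_id, name.rstrip(b"\x00").decode("ascii"), score

data, index = csv_to_binary("id,name,score\n1,alice,3.5\n2,bob,4.0\n")
for offset in index["bob"]:
    print(read_record(data, offset))
```

Because every record has the same size, an index only needs to store byte offsets, so a lookup by `name` can seek straight to the matching records instead of scanning the whole file.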
Developed a project under the course "Database Management System"
The Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning
Free Learning Resources In Chinese
A roundup of front-end interview resources
A collection of front-end development interview questions and answers
Document clustering
Accessing MongoDB Data with REST :: Learn how to work with RESTful, hypermedia-based data persistence using Spring Data REST.
My solutions to HackerRank problems (https://www.hackerrank.com)
This repository contains Hadoop projects
IE second-term project: a prototype application based on Telefonica Mobility and BBVA credit card payments. Provider data is not disclosed, but you can use the code for any purpose you wish. It is an MVC stack built with the Python Django framework, with API integrations for Expedia and the Twitter Streaming API, and substantial work on TripAdvisor web scraping. It includes NLP (NLTK) for topic-based sentiment analysis of TripAdvisor reviews, time series forecasting, a recommendation engine, Leaflet data visualization, and NetworkX SNA (Python and JS). BBVA data is omitted because it lacks data integrity and the necessary categories. I hope this work is helpful to practitioners of the Django framework and analytics. The application was developed in a very short time with Agile methodology, so problems and inconsistencies in code quality are to be expected. For example, we tried to use mongoengine and Django framework document models as a common data source, but we ran into difficulties from time to time because accurate documentation was hard to find on the web. Whenever we resolved such issues, we followed proper coding practice. Please follow the model usage practice in the last view in views.py to comply with MVC; do not use pymongo directly. Mongoengine provides features like DBConnectionPooling that facilitate a scalable architecture.
A small university project for optimal color segmentation of an RGB image
Smarter implementation of algorithms
Basic system for ranked information retrieval that indexes RSS feeds.
This project aims to build a Wikipedia indexer
Projects under Information Retrieval course
Assignments and projects related to Information Retrieval