khaltar's Projects
🔤 calculate average word embeddings (word2vec) from documents for transfer learning
Model to predict if a student will pass or fail
Implementation and use of decision trees to classify congress members based on how they voted on bills, or predicting voting on bills based on party membership. This is a primitive version that does not generalize to new voting issues with the use of nltk's semantic analysis. Plus, extensive use and optimization are yet to be done. This is a work in progress.
한국어 임베딩 (Sentence Embeddings Using Korean Corpora)
Project to create machine learning models to predict the outcome of a football match
In this assignment we're going to use information like a person's age, sex, BMI, no. of children and smoking habit to predict the price of yearly medical bills. This kind of model is useful for insurance companies to determine the yearly insurance premium for a person. The dataset for this problem is taken from: https://www.kaggle.com/mirichoi0218/insurance We will create a model with the following steps: Download and explore the dataset Prepare the dataset for training Create a linear regression model Train the model to fit the data Make predictions using the trained model
A machine learning system using Decision Trees and SVM's which predicts the votes of members of congress at 80% acurracy.
With Barry Chen and Vivian Hu: Using topic models to predict congressional bill pass / fail
A general notebook outlining an example of how to process and build prediction models for NLP classification tasks, in this case, a subjectivity dataset. Also includes word embeddings and deep learning models + interpretation tutorial using pytorch.
Repo for small projects that fit inside a Jupyter Notebook. Inside: word2vec news analysis and regression, tutorial on working with survey data, and more.
Creating 2 Classification models including Logistic Regression & Decision Tree to predict pass or fail of students in a class
Course Notes for Introduction to Data Science, Fall 2020
The goal of the model is to predict whether a passenger survived the Titanic disaster, given their age, class and a few other features.
Problem Statement: To predict the quality of wine to a range 1-10 from given 11 attributes.