Giter Site home page Giter Site logo

m-taghizadeh / bigdata_projects Goto Github PK

View Code? Open in Web Editor NEW
4.0 3.0 3.0 41.33 MB

Projects related to Big Data course will be implemented in this repository.

Jupyter Notebook 99.98% Python 0.02%
big-data computer-vision fake-news-detection image-captioning machine-learning transformer vision-transformer dna-sequencing

bigdata_projects's Introduction

Implementation of big data and data mining projects

In this repository, various projects in the field of big data and data mining are implemented using different approaches of machine learning and deep learning. This repository is very suitable for people interested in implementing different applications of artificial intelligence and machine learning in the real world. Below you can see the list of implementations we have done so far.

Project Title Project Description
Image Captioning In this project, we use deep learning and the architecture of Vision Transformers, and we implemented the task of image captioning with great precision and BLEU score. Vision Transformer architecture is the implementation of Google's Transformer architecture in the world of computer vision, the Transformer architecture was initially proposed by Google in the article Attention is all you need in 2017. In this implementation, trabsformers python library and hugging face are used.
Fake News Detection As we know, in today's world, we are faced with a lot of information and news, many of which are fake news due to the interests of people. In this project, using natural language processing techniques and using PassiveAggressiveClassifier and TFIDF Tokenizer, the operation of distinguishing fake news from real news. We reached 93.13% accuracy.
DNA Sequencing Machine learning is widely used and interested by researchers in bioinformatics and natural sciences. In this project, we used the Naive Bayese classifier to classify the DNA sequence. Kmers technique is used in this project. We reached more than 98% accuracy.
Diabetes Analysis In this project, we used diabetes as a case study. First, we visualized and analyzed the dataset and then applied dimension reduction techniques such as PCA on it. Finally, using the KNN classifier, we classified healthy people and people with diabetes with the parameters in the dataset.
Predicting if a person likes a song or not In this project, we used people's interest in music as a case study. First, we visualized and analyzed the dataset data and then applied dimension reduction techniques such as PCA on it. Finally, using the KNN classifier, we classified whether a person likes this song with these features or not.
Handling Imbalanced Data Handling Imbalanced Data with SMOTE and Near Miss Algorithm in Python
Dimensionality reduction Dimensionality reduction using PCA technique in Python using scikit learn library

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.