I am a 4th year PhD student at Rochester Institute of Technology (RIT).
I work at Mining Lab, under the supervision of Dr. Qi Yu.
Name: Mahsa Mozaffari
Type: User
Bio: Ph.D. Student at Rochester Institute of Technology Machine Learning and Data Intensive Computing (MINING Lab)
Location: Rochester, NY
I am a 4th year PhD student at Rochester Institute of Technology (RIT).
I work at Mining Lab, under the supervision of Dr. Qi Yu.
Anomaly detection related books, papers, videos, and toolboxes
A self-supervised learning framework for audio-visual speech
π₯π₯π₯Latest Papers, Codes and Datasets on Vid-LLMs.
A curated list of awesome self-supervised methods
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.
Contains source code for the CVPR2022 paper titled "Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection"
Material for my Caltech tutorial on deep learning and tensor methods
Config files for my GitHub profile.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383
Deep Learning in Javascript. Train Convolutional Neural Networks (or ordinary ones) in your browser.
Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
A collection of various deep learning architectures, models, and tips
Anomaly based Network Intrusion Detection
NVIDIA's Deep Imagination Team's PyTorch Library
Learning Sparse Neural Networks through L0 regularization
This repository contains alternating optimization algorithms for L1-norm tucker decomposition
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
Empowers LLMs with the ability to see and draw.
A PyTorch-based Speech Toolkit
VIP cheatsheets for Stanford's CS 230 Deep Learning
The IPython notebooks for my talks.
tensor decomposition
A declarative, efficient, and flexible JavaScript library for building user interfaces.
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. πππ
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google β€οΈ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.