Rohit Rajesh's Projects
Contrastive Language-Audio Pretraining
Computer Vision Stack for autonomous machines, that include algorithms that can find the locate, find distance and velocity of objects from data captured from a 2D monocular camera.
Monocular depth estimation
Google chrome Dino game bot powered by Computer Vision and Deep Learning.
DSPy: The framework for programming—not prompting—foundation models
Official implementation of the paper "Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis" in ICLR 2021
Florence with a custom image encoder
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Program that makes chooing an organisantion easier by picking the ones that have projects in your skillset. It also checks if the organisation has participated in previous editions of GSoC.
Human Essential Gene Classification using DL
Analysis on the topic "Major factors that will determine the factors that could influence residential home prices across the United States over the next 10 years".
🏅 Collection of Kaggle Solutions and Ideas 🏅
Repo that holds all the scripts used to curate the KaggleCode dataset from Kaggle Meta Code dataset.
LLM code generation pipeline
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
Train LLMs by just modifying config files!
LLMDet is a text detection tool that can identify which generated sources the text came from (e.g. large language model or human-write).