I am a research scientist at NVIDIA's Applied Deep Learning Research (ADLR) group. I received my PhD from HKUST under the supervision of Prof. Pascale Fung. My research focuses on Large Vision-Language (Multimodal) Models, LLMs, and more. Feel free to contact me.
wenliangdai / blip
This project is forked from salesforce/blip
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
License: BSD 3-Clause "New" or "Revised" License