  • 👋 Hi, I’m @XenoZLH
  • 👀 I’m interested in a wide range of AI and computer vision research, especially multimodal LMs and AIGC-related tasks. I also follow new deep learning frameworks such as self-supervised learning and world models.
  • 🌱 I’m currently learning object detection and semi-supervised learning.

xenozlh's Projects

bunny

A family of lightweight multimodal models.

cvinw_readings

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

lisa

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

llava

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

monkey

[CVPR 2024 Highlight] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

psalm

This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"

qwen-vl

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

xenozlh

Config files for my GitHub profile.
