Giter Site home page Giter Site logo

minguinho26 / prefix_aac_icassp2023 Goto Github PK

View Code? Open in Web Editor NEW
23.0 2.0 2.0 101.29 MB

Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"

Home Page: https://prefixaac.github.io

Python 18.04% Jupyter Notebook 81.88% Shell 0.08%
audio-captioning deep-learning icassp2023 pytorch-implementation

prefix_aac_icassp2023's Introduction

πŸ› 

Β  Β  Β  Β 


Β NewsπŸŽ‰πŸŽ‰

My paper was accepted to the 2024 International Conference on Machine Learning (ICML)! [arXiv]


Education

2017.03 ~ 2023.02 : Electrical & Electronics Engineering at Chung-Ang University (Cum Laude)
2023.02 ~ present : Graduate School of AI at POSTECH(Advisor : Prof. Jaeho Lee)


Publication

(* equal contribution)

Hagyeong Lee*, Minkyu Kim*, Jun-Hyuk Kim, Seungeon Kim, Dokwan Oh, Jaeho Lee, "Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity", ICML, 2024

Minkyu Kim*, Kim Sung-Bin*, Tae-Hyun Oh, "Prefix tuning for automated audio captioning", ICASSP [Oral], 2023


Experience

Researcher, EffL(Efficient Learning) Lab at POSTECH, 2023.02 ~ present
Undergraduate Research Fellowship, Algorithmic Machine Intelligence Lab (AMI Lab.) at POSTECH, 2022.07 ~ 2022.08
Research Intern, Computer Vision Lab at Korea Univ, 2021.09 ~ 2021.12
CAU-CVML Summer Seminar 2021, 2021.07 ~ 2021.08
CUAI(Chung-Ang University Artificial Intelligence, μ€‘μ•™λŒ€ν•™κ΅ 인곡지λŠ₯ ν•™νšŒ), 2021.03 ~ 2023.02


Services

Peer review, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, 2024

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.