Welcome to Yiwen Wang's personal website.

PhD Candidate, EECS, Peking University

Advisors: Xihong Wu, Tianshu Qu

Email: [email protected]

Scholar: Google Scholar

Biography

Sep. 2022 - July 2025

Advised by Xihong Wu

Ph.D. student, Speech and Hearing Research Center, School of Intelligence Science and Technology, Peking University.

Sep. 2019 - July 2022

Advised by Tianshu Qu

Master's student, Speech and Hearing Research Center, School of Intelligence Science and Technology, Peking University.

Sep. 2015 - July 2019

Bachelor's student, Electronics Engineering and Computer Science, Peking University.

Research Interests

  • speech enhancement and universal sound separation

  • direction of arrival estimation

  • sound field analysis

  • higher-order ambisonic analysis

Publications

  • Wang, Y. and Wu, X., 2024. TSE-PI: Target Sound Extraction under Reverberant Environments with Pitch Information. arXiv preprint arXiv:2406.08716. (Accepted by Interspeech 2024)

  • Wang, Y., Lan, Z., Wu, X. and Qu, T., 2023, June. TT-Net: Dual-path transformer based sound field translation in the spherical harmonic domain. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1-5). IEEE.

  • Wang, Y., Wu, X. and Qu, T., 2022, May. UP-WGAN: Upscaling ambisonic sound scenes using Wasserstein generative adversarial networks. In Audio Engineering Society Convention 152. Audio Engineering Society.

  • Wang, Y., Wu, X. and Qu, T., 2020, May. Direction of arrival estimation based on transfer function learning using autoencoder network. In Audio Engineering Society Convention 148. Audio Engineering Society.

  • Li, X., Wang, Y., Sun, Y., Wu, X. and Chen, J., 2023, June. PGSS: pitch-guided speech separation. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 37, No. 11, pp. 13130-13138).

  • Ma, D., Wang, Y., He, L., Jin, M., Su, D. and Yu, D., 2022, May. DP-DWA: Dual-Path Dynamic Weight Attention Network With Streaming DFSMN-SAN For Automatic Speech Recognition. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 7692-7696). IEEE.

  • Peng, C., Wang, Y., Wu, X. and Qu, T., 2022, November. A Multi-channel Speech Separation System for Unknown Number of Multiple Speakers. In 2022 5th International Conference on Information Communication and Signal Processing (ICICSP) (pp. 158-162). IEEE.

Internship Experience

Sep. 2020 - Aug. 2021

Tencent AI Lab, Speech Group II, Beijing, China.

Speech recognition, speech enhancement

Mar. 2020 - Jun. 2020

Beijing Momo Technology

Pitch extraction, autotune

Feb. 2019 - Aug. 2019

Bytedance, Beijing, China

Recommendation Algorithm Intern

Yiwen Wang's Projects

dense

ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction

doa

Direction of Arrival Estimation for Microphone Arrays

hosts

Mirror: https://coding.net/u/scaffrey/p/hosts/git

mcp

pku-mcp-problem on sssx class

spherical-harmonic-transform

A collection of MATLAB routines for the Spherical Harmonic Transform and related manipulations in the spherical harmonic spectrum.

tse_pi

Official Code for Target Sound Extraction under Reverberant Environments with Pitch Information (Interspeech 2024)

vis

Visualization homework: learning D3
