
Welcome to Yiwen Wang's personal website.

PhD Candidate, EECS, Peking University

Advisors: Xihong Wu, Tianshu Qu

Email: [email protected]

Scholar: Google Scholar

Biography

Sep. 2022 - July 2025

Advised by Xihong Wu

Ph.D., Speech and Hearing Research Center, School of Intelligence Science and Technology, Peking University.

Sep. 2019 - July 2022

Advised by Tianshu Qu

Master's, Speech and Hearing Research Center, School of Intelligence Science and Technology, Peking University.

Sep. 2015 - July 2019

Bachelor's student, Electronics Engineering and Computer Science, Peking University.

Research Interests

  • speech enhancement and universal sound separation

  • direction of arrival estimation

  • sound field analysis

  • higher order ambisonic analysis

Publications

  • Wang, Y. and Wu, X., 2024. TSE-PI: Target Sound Extraction under Reverberant Environments with Pitch Information. arXiv preprint arXiv:2406.08716. (Accepted by Interspeech 2024)

  • Wang, Y., Lan, Z., Wu, X. and Qu, T., 2023, June. TT-Net: Dual-path transformer based sound field translation in the spherical harmonic domain. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1-5). IEEE.

  • Wang, Y., Wu, X. and Qu, T., 2022, May. UP-WGAN: Upscaling ambisonic sound scenes using Wasserstein generative adversarial networks. In Audio Engineering Society Convention 152. Audio Engineering Society.

  • Wang, Y., Wu, X. and Qu, T., 2020, May. Direction of arrival estimation based on transfer function learning using autoencoder network. In Audio Engineering Society Convention 148. Audio Engineering Society.

  • Li, X., Wang, Y., Sun, Y., Wu, X. and Chen, J., 2023, June. PGSS: pitch-guided speech separation. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 37, No. 11, pp. 13130-13138).

  • Ma, D., Wang, Y., He, L., Jin, M., Su, D. and Yu, D., 2022, May. DP-DWA: Dual-Path Dynamic Weight Attention Network With Streaming DFSMN-SAN For Automatic Speech Recognition. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 7692-7696). IEEE.

  • Peng, C., Wang, Y., Wu, X. and Qu, T., 2022, November. A Multi-channel Speech Separation System for Unknown Number of Multiple Speakers. In 2022 5th International Conference on Information Communication and Signal Processing (ICICSP) (pp. 158-162). IEEE.
