default search action
Zeyu Jin
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j5]Zeyu Jin, Wenjiao Zai:
Audiovisual emotion recognition based on bi-layer LSTM and multi-head attention mechanism on RAVDESS dataset. J. Supercomput. 81(1): 31 (2025) - 2024
- [c40]Xiaohan Li, Qixin Wang, Zishan Wang, Zeyu Jin, Jia Jia:
SoulSkipper: A Voice-Controlled Emotional Adaptive Game to Complement Therapy for Social Anxiety Disorder. CHI Extended Abstracts 2024: 298:1-298:7 - [c39]Ke Chen, Jiaqi Su, Zeyu Jin:
MDX-GAN: Enhancing Perceptual Quality in Multi-Class Source Separation Via Adversarial Training. ICASSP 2024: 741-745 - [c38]Patrick O'Reilly, Zeyu Jin, Jiaqi Su, Bryan Pardo:
Maskmark: Robust Neuralwatermarking for Real and Synthetic Speech. ICASSP 2024: 4650-4654 - [c37]Yunyun Wang, Jiaqi Su, Adam Finkelstein, Zeyu Jin:
GR0: Self-Supervised Global Representation Learning for Zero-Shot Voice Conversion. ICASSP 2024: 10786-10790 - [c36]Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Ramaneswaran S., Deepali Aneja, Zeyu Jin, Ramani Duraiswami, Dinesh Manocha:
A Closer Look at the Limitations of Instruction Tuning. ICML 2024 - [c35]Yixuan Zhou, Xiaoyu Qin, Zeyu Jin, Shuoyi Zhou, Shun Lei, Songtao Zhou, Zhiyong Wu, Jia Jia:
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling. ACM Multimedia 2024: 554-563 - [c34]Zeyu Jin, Jia Jia, Qixin Wang, Kehan Li, Shuoyi Zhou, Songtao Zhou, Xiaoyu Qin, Zhiyong Wu:
SpeechCraft: A Fine-Grained Expressive Speech Dataset with Natural Language Description. ACM Multimedia 2024: 1255-1264 - [i23]Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Ramaneswaran S., Deepali Aneja, Zeyu Jin, Ramani Duraiswami, Dinesh Manocha:
A Closer Look at the Limitations of Instruction Tuning. CoRR abs/2402.05119 (2024) - [i22]Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha:
VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap. CoRR abs/2405.15683 (2024) - [i21]Zeyu Jin, Jia Jia, Qixin Wang, Kehan Li, Shuoyi Zhou, Songtao Zhou, Xiaoyu Qin, Zhiyong Wu:
SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description. CoRR abs/2408.13608 (2024) - [i20]Yixuan Zhou, Xiaoyu Qin, Zeyu Jin, Shuoyi Zhou, Shun Lei, Songtao Zhou, Zhiyong Wu, Jia Jia:
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling. CoRR abs/2408.15676 (2024) - [i19]Ke Chen, Jiaqi Su, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Zeyu Jin:
Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation. CoRR abs/2408.16126 (2024) - [i18]Patrick O'Reilly, Prem Seetharaman, Jiaqi Su, Zeyu Jin, Bryan Pardo:
Code Drift: Towards Idempotent Neural Audio Codecs. CoRR abs/2410.11025 (2024) - [i17]Yingahao Aaron Li, Rithesh Kumar, Zeyu Jin:
DMDSpeech: Distilled Diffusion Model Surpassing The Teacher in Zero-shot Speech Synthesis via Direct Metric Optimization. CoRR abs/2410.11097 (2024) - 2023
- [c33]Oriol Nieto, Zeyu Jin, Franck Dernoncourt, Justin Salamon:
Efficient Spoken Language Recognition via Multilabel Classification. INTERSPEECH 2023: 506-510 - [c32]Yuting Yang, Zeyu Jin, Connelly Barnes, Adam Finkelstein:
White Box Search Over Audio Synthesizer Parameters. ISMIR 2023: 190-196 - [c31]Zeyu Jin, Zixuan Wang, Qixin Wang, Jia Jia, Ye Bai, Yi Zhao, Hao Li, Xiaorui Wang:
HoloSinger: Semantics and Music Driven Motion Generation with Octahedral Holographic Projection. ACM Multimedia 2023: 9393-9395 - [i16]Oriol Nieto, Zeyu Jin, Franck Dernoncourt, Justin Salamon:
Efficient Spoken Language Recognition via Multilabel Classification. CoRR abs/2306.01945 (2023) - 2022
- [j4]Zeyu Jin, Ruo Li:
High-order Numerical Homogenization for Dissipative Ordinary Differential Equations. Multiscale Model. Simul. 20(1): 583-617 (2022) - [j3]Jiaqi Zhang, Zeyu Jin, Bo Jiang, Zaiwen Wen:
Stochastic Augmented Projected Gradient Methods for the Large-Scale Precoding Matrix Indicator Selection Problem. IEEE Trans. Wirel. Commun. 21(11): 9553-9565 (2022) - [c30]Pranay Manocha, Zeyu Jin, Adam Finkelstein:
SQAPP: No-Reference Speech Quality Assessment Via Pairwise Preference. ICASSP 2022: 891-895 - [c29]Nikhil Kandpal, Oriol Nieto, Zeyu Jin:
Music Enhancement via Image Translation and Vocoding. ICASSP 2022: 3124-3128 - [c28]Yunyun Wang, Jiaqi Su, Adam Finkelstein, Zeyu Jin:
Controllable Speech Representation Learning Via Voice Conversion and AIC Loss. ICASSP 2022: 6682-6686 - [c27]Pranay Manocha, Zeyu Jin, Adam Finkelstein:
Audio Similarity is Unreliable as a Proxy for Audio Quality. INTERSPEECH 2022: 3553-3557 - [c26]Ziyi Wang, Xingqi Wang, Zeyu Jin, Xiaohan Li, Shikun Sun, Jia Jia:
AI Carpet: Automatic Generation of Aesthetic Carpet Pattern. ACM Multimedia 2022: 6958-6960 - [c25]Bryan Wang, Zeyu Jin, Gautham J. Mysore:
Record Once, Post Everywhere: Automatic Shortening of Audio Stories for Social Media. UIST 2022: 14:1-14:11 - [i15]Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse H. Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk:
HEAR 2021: Holistic Evaluation of Audio Representations. CoRR abs/2203.03022 (2022) - [i14]Nikhil Kandpal, Oriol Nieto, Zeyu Jin:
Music Enhancement via Image Translation and Vocoding. CoRR abs/2204.13289 (2022) - [i13]Pranay Manocha, Zeyu Jin, Adam Finkelstein:
Audio Similarity is Unreliable as a Proxy for Audio Quality. CoRR abs/2206.13411 (2022) - 2021
- [c24]Pranay Manocha, Zeyu Jin, Richard Zhang, Adam Finkelstein:
CDPAM: Contrastive Learning for Perceptual Audio Similarity. ICASSP 2021: 196-200 - [c23]Jiaqi Su, Yunyun Wang, Adam Finkelstein, Zeyu Jin:
Bandwidth Extension is All You Need. ICASSP 2021: 696-700 - [c22]Max Morrison, Lucas Rencker, Zeyu Jin, Nicholas J. Bryan, Juan Pablo Cáceres, Bryan Pardo:
Context-Aware Prosody Correction for Text-Based Speech Editing. ICASSP 2021: 7038-7042 - [c21]Youchen Miao, Zeyu Jin, Yumeng Zhang, Yuchen Chen, Junren Lai:
Compare Machine Learning Models in Text Classification Using Steam User Reviews. ICSED 2021: 40-45 - [c20]Shuqi Dai, Zeyu Jin, Celso Gomes, Roger B. Dannenberg:
Controllable deep melody generation via hierarchical music structure representation. ISMIR 2021: 143-150 - [c19]Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse H. Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk:
HEAR: Holistic Evaluation of Audio Representations. NeurIPS (Competition and Demos) 2021: 125-145 - [c18]Jiaqi Su, Zeyu Jin, Adam Finkelstein:
HiFi-GAN-2: Studio-Quality Speech Enhancement via Generative Adversarial Networks Conditioned on Acoustic Features. WASPAA 2021: 166-170 - [i12]Zeyu Jin, Ruo Li:
High Order Numerical Homogenization for Dissipative Ordinary Differential Equations. CoRR abs/2102.03527 (2021) - [i11]Pranay Manocha, Zeyu Jin, Richard Zhang, Adam Finkelstein:
CDPAM: Contrastive learning for perceptual audio similarity. CoRR abs/2102.05109 (2021) - [i10]Max Morrison, Lucas Rencker, Zeyu Jin, Nicholas J. Bryan, Juan Pablo Cáceres, Bryan Pardo:
Context-Aware Prosody Correction for Text-Based Speech Editing. CoRR abs/2102.08328 (2021) - [i9]Shuqi Dai, Zeyu Jin, Celso Gomes, Roger B. Dannenberg:
Controllable deep melody generation via hierarchical music structure representation. CoRR abs/2109.00663 (2021) - [i8]Max Morrison, Zeyu Jin, Nicholas J. Bryan, Juan Pablo Cáceres, Bryan Pardo:
Neural Pitch-Shifting and Time-Stretching with Controllable LPCNet. CoRR abs/2110.02360 (2021) - 2020
- [c17]Emma Frid, Celso Gomes, Zeyu Jin:
Music Creation by Example. CHI 2020: 1-13 - [c16]Jongpil Lee, Nicholas J. Bryan, Justin Salamon, Zeyu Jin, Juhan Nam:
Disentangled Multidimensional Metric Learning for Music Similarity. ICASSP 2020: 6-10 - [c15]Jiaqi Su, Zeyu Jin, Adam Finkelstein:
Acoustic Matching By Embedding Impulse Responses. ICASSP 2020: 426-430 - [c14]Kaizhi Qian, Zeyu Jin, Mark Hasegawa-Johnson, Gautham J. Mysore:
F0-Consistent Many-To-Many Non-Parallel Voice Conversion Via Conditional Autoencoder. ICASSP 2020: 6284-6288 - [c13]Pranay Manocha, Adam Finkelstein, Richard Zhang, Nicholas J. Bryan, Gautham J. Mysore, Zeyu Jin:
A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences. INTERSPEECH 2020: 2852-2856 - [c12]Max Morrison, Zeyu Jin, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore:
Controllable Neural Prosody Synthesis. INTERSPEECH 2020: 4437-4441 - [c11]Jiaqi Su, Zeyu Jin, Adam Finkelstein:
HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks. INTERSPEECH 2020: 4506-4510 - [c10]Jongpil Lee, Nicholas J. Bryan, Justin Salamon, Zeyu Jin, Juhan Nam:
Metric learning vs classification for disentangled music representation learning. ISMIR 2020: 439-445 - [c9]Nora S. Willett, Hijung Valentina Shin, Zeyu Jin, Wilmot Li, Adam Finkelstein:
Pose2Pose: pose selection and transfer for 2D character animation. IUI 2020: 88-99 - [i7]Pranay Manocha, Adam Finkelstein, Zeyu Jin, Nicholas J. Bryan, Richard Zhang, Gautham J. Mysore:
A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences. CoRR abs/2001.04460 (2020) - [i6]Kaizhi Qian, Zeyu Jin, Mark Hasegawa-Johnson, Gautham J. Mysore:
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder. CoRR abs/2004.07370 (2020) - [i5]Jiaqi Su, Zeyu Jin, Adam Finkelstein:
HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks. CoRR abs/2006.05694 (2020) - [i4]Max Morrison, Zeyu Jin, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore:
Controllable Neural Prosody Synthesis. CoRR abs/2008.03388 (2020) - [i3]Jongpil Lee, Nicholas J. Bryan, Justin Salamon, Zeyu Jin, Juhan Nam:
Disentangled Multidimensional Metric Learning for Music Similarity. CoRR abs/2008.03720 (2020) - [i2]Jongpil Lee, Nicholas J. Bryan, Justin Salamon, Zeyu Jin, Juhan Nam:
Metric Learning vs Classification for Disentangled Music Representation Learning. CoRR abs/2008.03729 (2020)
2010 – 2019
- 2019
- [j2]Ohad Fried, Ayush Tewari, Michael Zollhöfer, Adam Finkelstein, Eli Shechtman, Dan B. Goldman, Kyle Genova, Zeyu Jin, Christian Theobalt, Maneesh Agrawala:
Text-based editing of talking-head video. ACM Trans. Graph. 38(4): 68:1-68:14 (2019) - [c8]Berthy Feng, Zeyu Jin, Jiaqi Su, Adam Finkelstein:
Learning Bandwidth Expansion Using Perceptually-motivated Loss. ICASSP 2019: 606-610 - [c7]Jiaqi Su, Adam Finkelstein, Zeyu Jin:
Perceptually-motivated Environment-specific Speech Enhancement. ICASSP 2019: 7015-7019 - [i1]Ohad Fried, Ayush Tewari, Michael Zollhöfer, Adam Finkelstein, Eli Shechtman, Dan B. Goldman, Kyle Genova, Zeyu Jin, Christian Theobalt, Maneesh Agrawala:
Text-based Editing of Talking-head Video. CoRR abs/1906.01524 (2019) - 2018
- [b1]Zeyu Jin:
Speech Synthesis for Text-Based Editing of Audio Narration. Princeton University, USA, 2018 - [c6]Zeyu Jin, Adam Finkelstein, Gautham J. Mysore, Jingwan Lu:
Fftnet: A Real-Time Speaker-Dependent Neural Vocoder. ICASSP 2018: 2251-2255 - 2017
- [j1]Zeyu Jin, Gautham J. Mysore, Stephen DiVerdi, Jingwan Lu, Adam Finkelstein:
VoCo: text-based insertion and replacement in audio narration. ACM Trans. Graph. 36(4): 96:1-96:13 (2017) - 2016
- [c5]Zeyu Jin, Adam Finkelstein, Stephen DiVerdi, Jingwan Lu, Gautham J. Mysore:
Cute: A concatenative method for voice conversion using exemplar-based unit selection. ICASSP 2016: 5660-5664 - 2015
- [c4]Zeyu Jin, Reid Oda, Adam Finkelstein, Rebecca Fiebrink:
Mallo: a distributed synchronized musical instrument designed for internet performance. NIME 2015: 293-298 - 2014
- [c3]Ohad Fried, Zeyu Jin, Reid Oda, Adam Finkelstein:
AudioQuilt: 2D Arrangements of Audio Samples using Metric Learning and Kernelized Sorting. NIME 2014: 281-286 - 2013
- [c2]Zeyu Jin, Roger B. Dannenberg:
Formal Semantics for Music Notation control Flow. ICMC 2013 - [c1]Zeyu Jin, Roger B. Dannenberg:
Formal Semantics for Music Notation control Flow. ICMC 2013
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-01 01:17 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint