Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2024
HyperLips: hyper control lips with high resolution decoder for talking face generation: HyperLips: hyper control lips with high resolution decoder for talking face generation
AbstractTalking face generation has a wide range of potential applications in the field of virtual digital humans. However, rendering high-fidelity facial video while ensuring lip synchronization remains challenging for existing audio-driven talking face ...
- ArticleDecember 2024
Audio-Driven Talking Face Generation with Stabilized Synchronization Loss
AbstractTalking face generation aims to create realistic videos with accurate lip synchronization and high visual quality, using given audio and reference video while preserving identity and visual characteristics. In this paper, we start by identifying ...
- ArticleSeptember 2024
Make Audio Solely Drive Lip in Talking Face Video Synthesis
Artificial Neural Networks and Machine Learning – ICANN 2024Pages 349–360https://doi.org/10.1007/978-3-031-72338-4_24AbstractIn this work, we investigate the problem of synthesizing a talking face video which should be synchronized with a target speech segment. Although there has been significant progress on this task, the most successful approaches are still those that ...
- research-articleMarch 2024
TellMeTalk: Multimodal-driven talking face video generation
Computers and Electrical Engineering (CENG), Volume 114, Issue Chttps://doi.org/10.1016/j.compeleceng.2023.109049AbstractIn this paper, we present TellMeTalk, an innovative approach for generating expressive talking face videos based on multimodal inputs. Our approach demonstrates robustness across various identities, languages, expressions, and head movements. It ...
- research-articleApril 2023
SATFace: Subject Agnostic Talking Face Generation with Natural Head Movement
Neural Processing Letters (NPLE), Volume 55, Issue 6Pages 7529–7542https://doi.org/10.1007/s11063-023-11272-7AbstractTalking face generation is widely used in education, entertainment, shopping, and other social practices. Existing methods focus on matching the speaker’s mouth shape with the speech content. Still, there is a lack of research on automatically ...
- research-articleOctober 2022
Expression-tailored talking face generation with adaptive cross-modal weighting
Neurocomputing (NEUROC), Volume 511, Issue CPages 117–130https://doi.org/10.1016/j.neucom.2022.09.025AbstractThe key of talking face generation is to synthesize the identity-preserving natural facial expressions with accurate audio-lip synchronization. To accomplish this, it requires to disentangle and fuse the latent features from multiple ...
- ArticleOctober 2022
Synthesizing Talking Face Videos with a Spatial Attention Mechanism
AbstractRecently, talking face generation has drawn considerable attention of researchers due to its wide applications. The lip synchronization accuracy and visual quality of the generated target speaker are very crucial for synthesizing photo-realistic ...
- research-articleJanuary 2022
Few-shot Adversarial Audio Driving Talking Face Generation
AISS '21: Proceedings of the 3rd International Conference on Advanced Information Science and SystemArticle No.: 7, Pages 1–6https://doi.org/10.1145/3503047.3503054Talking-face generation is an interesting and challenging problem in computer vision and has become a research focus. This project aims to generate real talking-face video sequences, especially lip synchronization and head motion. In order to create a ...