Conference Publications

W. Xie, A. Nagrani, J. S. Chung, A. Zisserman
Utterance-level Aggregation For Speaker Recognition In The Wild
International Conference on Acoustics, Speech, and Signal Processing, 2019.
 PDF

S.-W. Chung, J. S. Chung, H.-K. Kang
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation
International Conference on Acoustics, Speech, and Signal Processing, 2019.
 PDF |  Model

T. Afouras, J. S. Chung, A. Zisserman
The Conversation: Deep Audio-Visual Speech Enhancement
Interspeech, 2018.
 PDF |   Video

T. Afouras, J. S. Chung, A. Zisserman
Deep Lip Reading: a comparison of models and an online application
Interspeech, 2018.
 PDF

J. S. Chung*, A. Nagrani*, A. Zisserman
VoxCeleb2: Deep Speaker Recognition
Interspeech, 2018.
 PDF |   Dataset

J. S. Chung*, A. Jamaludin*, A. Zisserman
You said that?
British Machine Vision Conference, 2017.
Oral presentation.
 PDF |   Video |  Model

A. Nagrani*, J. S. Chung*, A. Zisserman
VoxCeleb: a large-scale speaker identification dataset
Interspeech, 2017.
Oral presentation. Best Student Paper Award.
 PDF |   Dataset

J. S. Chung, A. Senior, O. Vinyals, A. Zisserman
Lip Reading Sentences in the Wild
IEEE Conference on Computer Vision and Pattern Recognition, 2017.
Oral presentation.
 PDF |  Video |  Dataset 

J. S. Chung, A. Zisserman
Lip Reading in the Wild
Asian Conference on Computer Vision, 2016.
Oral presentation. Best Student Paper Award.
 PDF |  Dataset

J. S. Chung, A. Zisserman
Out of time: automated lip sync in the wild
Workshop on Multi-view Lip-reading, ACCV, 2016.
 PDF |  Model

J. S. Chung, A. Zisserman
Signs in time: Encoding human motion as a temporal image
Workshop on Brave New Ideas for Motion Representations, ECCV, 2016.
 PDF |  Video

* Equal contribution.

Journal Publications

A. Jamaludin*, J. S. Chung*, A. Zisserman
You Said That?: Synthesising talking faces from audio
International Journal of Computer Vision, 2019.
 PDF

T. Afouras*, J. S. Chung*, A. Senior, O. Vinyals, A. Zisserman
Deep Audio-Visual Speech Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019.
 PDF

J. S. Chung, A. Zisserman
Learning to Lip Read Words by Watching Videos
Computer Vision and Image Understanding, 2018.
 PDF