MathInstitutes Dev

Learning spoken concepts from unlabeled audio-visual data

Presenter

Karen Livescu

February 18, 2019

Keywords:

audio-visual training
cross-modal training
image captioning
speech search

Abstract

Abstract available at the link below.

Abstract