XFlow: Cross-modal Deep Neural Networks for Audiovisual Classification

In recent years, there have been numerous developments towards solving multimodal tasks, which aim to learn a stronger representation than a single modality can provide. Certain aspects of the data can be particularly useful in this case - for example, …