Audiovisual quality fusion based on relative multimodal complexity

Junyong You*, Jari Korhonen, Ulrich Reiter

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingPublished conference contribution

1 Citation (Scopus)


In multimodal presentations the perceived audiovisual quality assessment is significantly influenced by the content of both the audio and visual tracks. Based on our earlier subjective quality test for finding the optimal trade-off between audio and video quality, this paper proposes a novel method for relative multimodal complexity analysis to derive the fusion parameter in objective audiovisual quality metrics. Audio and video qualities are first estimated separately using advanced quality models, and then they are combined into the overall audiovisual quality using a linear fusion. Based on carefully designed auditory and visual features, the relative complexity analysis model across sensory modalities is proposed for deriving the fusion parameter. Experimental results have demonstrated that the content adaptive fusion parameter can improve the prediction accuracy of objective audiovisual quality metrics, compared to the fusion parameters obtained from the subjective quality tests using other known optimization methods.

Original languageEnglish
Title of host publicationICIP 2011
Subtitle of host publication2011 18th IEEE International Conference on Image Processing
Number of pages4
Publication statusPublished - 2011
Event2011 18th IEEE International Conference on Image Processing, ICIP 2011 - Brussels, Belgium
Duration: 11 Sept 201114 Sept 2011


Conference2011 18th IEEE International Conference on Image Processing, ICIP 2011


  • Audiovisual quality assessment
  • content analysis
  • multimodal complexity
  • quality fusion


Dive into the research topics of 'Audiovisual quality fusion based on relative multimodal complexity'. Together they form a unique fingerprint.

Cite this