Human interaction categorization by using audio-visual cues | Machine Vision and Applications