Statistics for Captioned movies and listening comprehension