Automatic Association of Chats and Video Tracks for Activity Learning and Recognition in Aerial Video Surveillance
We describe two advanced video analysis techniques, including Baseball Cards video-indexed by voice annotations (VIVA) and multi-media indexing and explorer (MINER).VIVA utilizes analyst call-outs (ACOs) in the form of chat messages (voice-to-text) to associate labels with video target tracks, to designate spatial-temporal activity boundaries and t