Semantic Scholar Open Access 2018 200 sitasi

MAC: Mining Activity Concepts for Language-Based Temporal Localization

Runzhou Ge J. Gao Kan Chen R. Nevatia

Abstrak

We address the problem of language-based temporal localization in untrimmed videos. Compared to temporal localization with fixed categories, this problem is more challenging as the language-based queries not only have no pre-defined activity list but also may contain complex descriptions. Previous methods address the problem by considering features from video sliding windows and language queries and learning a subspace to encode their correlation, which ignore rich semantic cues about activities in videos and queries. We propose to mine activity concepts from both video and language modalities by applying the actionness score enhanced Activity Concepts based Localizer (ACL). Specifically, the novel ACL encodes the semantic concepts from verb-obj pairs in language queries and leverages activity classifiers' prediction scores to encode visual concepts. Besides, ACL also has the capability to regress sliding windows as localization results. Experiments show that ACL significantly outperforms state-of-the-arts under the widely used metric, with more than 5% increase on both Charades-STA and TACoS datasets.

Topik & Kata Kunci

Penulis (4)

R

Runzhou Ge

J

J. Gao

K

Kan Chen

R

R. Nevatia

Format Sitasi

Ge, R., Gao, J., Chen, K., Nevatia, R. (2018). MAC: Mining Activity Concepts for Language-Based Temporal Localization. https://doi.org/10.1109/WACV.2019.00032

Akses Cepat

Lihat di Sumber doi.org/10.1109/WACV.2019.00032
Informasi Jurnal
Tahun Terbit
2018
Bahasa
en
Total Sitasi
200×
Sumber Database
Semantic Scholar
DOI
10.1109/WACV.2019.00032
Akses
Open Access ✓