Projects

Holistic Video Understanding

Video Action Recognition

 
 
  • Z. Zhang, Z. Kuang, P. Luo, L. Feng and W. Zhang.
    Temporal sequence distillation: Towards few-frame action recognition in videos.
    Proc. ACM Multimedia (MM), 2018. [paper]

    Video Analytics Software as a Service (VA SaaS) has been rapidly growing in recent years. VA SaaS is typically accessed by users using a lightweight client. Because the transmission bandwidth between the client and cloud is usually limited and expensive, it brings great benefits to design cloud video analysis algorithms with a limited data transmission requirement. As the first attempt in this direction, this work introduces a problem of few-frame action recognition, which aims at maintaining high recognition accuracy, when accessing only a few frames during both training and test.

Video Summarization

 
  • L. Feng, Z. Li, Z. Kuang and W. Zhang.
    Extractive video summarizer with memory augmented neural networks.
    Proc. ACM Multimedia (MM), 2018.

    Humans usually create a summary after viewing and understanding the whole video, and the global attention mechanism capturing information from all video frames plays a key role in the summarization process. Motivated by this observation, we proposed a memory augmented extractive video summarizer, which utilizes an external memory to record visual information of the whole video with high capacity. With the external memory, the video summarizer simply predicts the importance score of a video shot based on the global understanding of the video frames.

Shot Boundary Detection

 
  • S. Tang, L. Feng, Z. Kuang, Y. Chen and W. Zhang.
    Fast video shot transition localization with deep structured models.
    Proc. Asian Conf. on Computer Vision (ACCV), 2018. [paper]

    A deep learning approach to detect both cut transitions and gradual transitions for shot boundary detection.







Locations of visitors to this page

profile counter Stats

Copyright © 2019 Wei Zhang