TR2006-026

Blind Summarization: Content-Adaptive Video Summarization Using Time-Series Analysis


    •  Divakaran, A., Radhakrishnan, R., Peker, K.A., "Blind Summarization: Content-Adaptive Video Summarization Using Time-Series Analysis", SPIE Conference on Multimedia Content Analysis, Management and Retrieval, January 2006, vol. 6073, pp. 6-10.
      BibTeX TR2006-026 PDF
      • @inproceedings{Divakaran2006jan,
      • author = {Divakaran, A. and Radhakrishnan, R. and Peker, K.A.},
      • title = {Blind Summarization: Content-Adaptive Video Summarization Using Time-Series Analysis},
      • booktitle = {SPIE Conference on Multimedia Content Analysis, Management and Retrieval},
      • year = 2006,
      • volume = 6073,
      • pages = {6--10},
      • month = jan,
      • url = {https://www.merl.com/publications/TR2006-026}
      • }
Abstract:

Severe complexity constraints on consumer electronic devices motivate us to investigate general-purpose video summarization techniques that are able to apply a common hardware setup to multiple content genres. On the other hand, we know that high quality summaries can only be produced with domain-specific processing. In this paper, we present a time-series analysis based video summarization technique that provides a general core to which we are able to add small content-specific extensions for each genre. The proposed time-series analysis technique consists of unsupervised clustering of samples taken through sliding windows from the time series of features obtained from the content. We classify content into two broad categories, scripted content such as news and drama, and unscripted content such as sports and surveillance. The summarization problem then reduces to finding either finding semantic boundaries of the scripted content or detecting highlights in the unscripted content. The proposed technique is essentially and event detection technique and it thus best suited to unscripted content, however, we also find applications to scripted content. We thoroughly examine the trade-off between content-neutral and content-specific processing for effective summarization for a number of genres, and find that our core technique enables us to minimize the complexity of the content-specific processing and to postpone it to the final stage. We achieve the best results with unscripted content such as sports and surveillance video in terms of quality of summaries and minimizing content-specific processing. For other genres such as drama, we find that more content-specific processing is required. We also find that judicious choice of key audio-visual object detectors enables us to minimize the complexity of the content-specific processing while maintaining its applicability to a broad range of genres.

 

  • Related News & Events

    •  NEWS    SPIE Conference on Multimedia Content Analysis, Management and Retrieval 2006: 3 publications by Ajay Divakaran, Clifton Forlines and others
      Date: January 17, 2006
      Where: SPIE Conference on Multimedia Content Analysis, Management and Retrieval
      Brief
      • The papers "Blind Summarization: Content-Adaptive Video Summarization Using Time-Series Analysis" by Divakaran, A., Radhakrishnan, R. and Peker, K.A., "Subjective Assessment of Consumer Video Summarization" by Forlines, C., Peker, K.A. and Divakaran, A. and "Subjective Evaluation Criterion for Selecting Affective Features and Modeling Highlights" by Xing, L., Yu, H., Huang, Q., Ye, Q. and Divakaran, A. were presented at the SPIE Conference on Multimedia Content Analysis, Management and Retrieval.
    •