An Accurate Scene Segmentation Method Based on Graph Analysis Using Object Matching and Audio Feature
スポンサーリンク
概要
- 論文の詳細を見る
A method for accurate scene segmentation using two kinds of directed graph obtained by object matching and audio features is proposed. Generally, in audiovisual materials, such as broadcast programs and movies, there are repeated appearances of similar shots that include frames of the same background, object or place, and such shots are included in a single scene. Many scene segmentation methods based on this idea have been proposed; however, since they use color information as visual features, they cannot provide accurate scene segmentation results if the color features change in different shots for which frames include the same object due to camera operations such as zooming and panning. In order to solve this problem, scene segmentation by the proposed method is realized by using two novel approaches. In the first approach, object matching is performed between two frames that are each included in different shots. By using these matching results, repeated appearances of shots for which frames include the same object can be successfully found and represented as a directed graph. The proposed method also generates another directed graph that represents the repeated appearances of shots with similar audio features in the second approach. By combined use of these two directed graphs, degradation of scene segmentation accuracy, which results from using only one kind of graph, can be avoided in the proposed method and thereby accurate scene segmentation can be realized. Experimental results performed by applying the proposed method to actual broadcast programs are shown to verify the effectiveness of the proposed method.
- (社)電子情報通信学会の論文
- 2009-08-01
著者
-
YAMAMOTO Makoto
Graduate School of Environmental Studies, Tohoku University
-
Haseyama Miki
Graduate School Of Engineering Hokkaido University
-
Yamamoto Makoto
Graduate School Of Information Science And Technology Hokkaido University
-
Yamamoto Makoto
Graduate School Of Environmental Studies Tohoku University
-
Yamamoto Makoto
Graduate School Of Environmental Earth Science Hokkaido University
関連論文
- An ER Algorithm-Based Method for Removal of Adherent Water Drops from Images Obtained by a Rear View Camera Mounted on a Vehicle in Rainy Conditions
- A Kalman Filter-Based Method for Restoration of Images Obtained by an In-Vehicle Camera in Foggy Conditions
- POCS-Based Texture Reconstruction Method Using Clustering Scheme by Kernel PCA(Papers Selected from the 21st Symposium on Signal Processing)
- Hydrothermal alteration and TL study of the Waiotapu Geothermal Field (New Zealand)
- A Study on Adaptive Spatial-Temporal Error Concealment method for Wavelet based Video Coding in Wireless Networks
- Kalman Filter-Based Error Concealment for Video Transmission
- A New Fitness Function of a Genetic Algorithm for Routing Applications
- A Kalman Filter Using Texture for Noise Reduction in SAR Images
- A New Conic Section Extraction Approach and Its Applications(Pattern Recognition)
- Estimating Number of People Using Calibrated Monocular Camera Based on Geometrical Analysis of Surface Area
- Steady-State Properties of a CORDIC-Based Adaptive ARMA Lattice Filter(Digital Signal Processing)
- Convergence Properties of a CORDIC-Based Adaptive ARMA Lattice Filter(Digital Signal Processing)
- A Cost-Effective CORDIC-Based Architecture for Adaptive Lattice Filters(Audio/Speech Coding)(Applications and Implementations of Digital Signal Processing)
- A Transformation Method of a CORDIC ARMA Lattice Filter for Signal Synthesis (Special Section on VLSI for Digital Signal Processing)
- Error-Resilient 3-D Wavelet Video Coding with Duplicated Lowest Sub-Band Coefficients and Two-Step Error Concealment Method
- A MODEL-BASED APPROACH FOR SOCCER TEAM ADVANTAGE MEASUREMENT(International Workshop on Advanced Image Technology 2006)
- A Significant Property of Mapping Parameters for Signal Interpolation Using Fractal Interpolation Functions(Digital Signal Processing)
- A Novel Contour Description with Expansion Ability Using Extended Fractal Interpolation Functions(Image Processing, Image Pattern Recognition)
- A Simplification Method for Line Drawings which Retains the Shape by Using the Fractal Dimension
- GAおよびSAを用いたフラクタル画像符号化(画像処理)
- Video Frame Interpolation by Image Morphing Including Fully Automatic Correspondence Setting
- A SIMILAR IMAGE CLUSTERING METHOD INCLUDING AUTOMATIC SELECTION OF NUMBER OF CLUSTERS(International Workshop on Advanced Image Technology 2006)
- AN IMAGE ENLARGEMENT METHOD USING ITERATED FUNCTION SYSTEM(International Workshop on Advanced Image Technology 2006)
- A Novel Video Retrieval Method Based on Web Community Extraction Using Features of Video Materials
- A SIMPLE WORD SPOTTING METHOD BASED ON TEMPLATE MATCHING FOR SPEECH RETRIEVAL(International Workshop on Advanced Image Technology 2005)
- A study on adaptive spatial-temporal error concealment method for wavelet based video coding in wireless networks (画像工学)
- A Study on Adaptive Spatial-Temporal Error Concealment method for Wavelet based Video Coding in Wireless Networks
- Low complexity speaker identification in AAC domain (メディア工学)
- Adaptive Missing Texture Reconstruction Method Based on Kernel Canonical Correlation Analysis with a New Clustering Scheme
- POCS-Based Annotation Method Using Kernel PCA for Semantic Image Retrieval
- A REGION MERGING METHOD FOR IMAGE SEGMENTATION(International Workshop on Advanced Image Technology 2005)
- Players Clustering Based on Graph Theory for Tactics Analysis Purpose in Soccer Videos(Papers Selected from the 21st Symposium on Signal Processing)
- A Study on Adaptive Spatial-Temporal Error Concealment method for Wavelet based Video Coding in Wireless Networks
- An Accurate Scene Segmentation Method Based on Graph Analysis Using Object Matching and Audio Feature
- Audio-Based Shot Classification for Audiovisual Indexing Using PCA, MGD and Fuzzy Algorithm(Papers Selected from the 21st Symposium on Signal Processing)
- Cross Low-Dimension Pursuit for Sparse Signal Recovery from Incomplete Measurements Based on Permuted Block Diagonal Matrix
- A Novel Framework for Extracting Visual Feature-Based Keyword Relationships from an Image Database
- A Novel Framework for Extracting Visual Feature-Based Keyword Relationships from an Image Database
- DIFFERENTIAL EFFECTS OF AUXINS ON DEFECTS IN ROOT GRAVITROPISM OF AUXIN-RESISTANT MUTANTS OF ARABIDOPSIS
- Photoresponses in Gold Nanoparticle Single-Electron Transistors with Molecular Floating Gates
- Erratum: Error-Resilient 3-D Wavelet Video Coding with Duplicated Lowest Sub-Band Coefficients and Two-Step Error Concealment Method [IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E93.A (2010) , No. 11 pp.2173-218