A semantic-based video scene segmentation using a deep neural network

Ji, Hyesung; Hooshyar, Danial; Kim, Kuekyeng; Lim, Heuiseok

doi:10.1177/0165551518819964

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

A semantic-based video scene segmentation using a deep neural network

Full metadata record

DC Field	Value	Language
dc.contributor.author	Ji, Hyesung	-
dc.contributor.author	Hooshyar, Danial	-
dc.contributor.author	Kim, Kuekyeng	-
dc.contributor.author	Lim, Heuiseok	-
dc.date.accessioned	2021-08-31T22:43:04Z	-
dc.date.available	2021-08-31T22:43:04Z	-
dc.date.created	2021-06-19	-
dc.date.issued	2019-12	-
dc.identifier.issn	0165-5515	-
dc.identifier.uri	https://scholar.korea.ac.kr/handle/2021.sw.korea/61380	-
dc.description.abstract	Video scene segmentation is very important research in the field of computer vision, because it helps in efficient storage, indexing and retrieval of videos. Achieving this kind of scene segmentation cannot be done by just calculating the similarity of low-level features presented in the video; high-level features should also be considered to achieve a better performance. Even though much research has been conducted on video scene segmentation, most of these studies failed to semantically segment a video into scenes. Thus, in this study, we propose a Deep-learning Semantic-based Scene-segmentation model (called DeepSSS) that considers image captioning to segment a video into scenes semantically. First, the DeepSSS performs shot boundary detection by comparing colour histograms and then employs maximum-entropy-applied keyframe extraction. Second, for semantic analysis, using image captioning that benefits from deep learning generates a semantic text description of the keyframes. Finally, by comparing and analysing the generated texts, it assembles the keyframes into a scene grouped under a semantic narrative. That said, DeepSSS considers both low- and high-level features of videos to achieve a more meaningful scene segmentation. By applying DeepSSS to data sets from MS COCO for caption generation and evaluating its semantic scene-segmentation task results with the data sets from TRECVid 2016, we demonstrate quantitatively that DeepSSS outperforms other existing scene-segmentation methods using shot boundary detection and keyframes. What's more, the experiments were done by comparing scenes segmented by humans and scene segmented by the DeepSSS. The results verified that the DeepSSS' segmentation resembled that of humans. This is a new kind of result that was enabled by semantic analysis, which was impossible by just using low-level features of videos.	-
dc.language	English	-
dc.language.iso	en	-
dc.publisher	SAGE PUBLICATIONS LTD	-
dc.subject	IMAGE RETRIEVAL	-
dc.subject	FEATURES	-
dc.title	A semantic-based video scene segmentation using a deep neural network	-
dc.type	Article	-
dc.contributor.affiliatedAuthor	Lim, Heuiseok	-
dc.identifier.doi	10.1177/0165551518819964	-
dc.identifier.scopusid	2-s2.0-85059652131	-
dc.identifier.wosid	000501082000008	-
dc.identifier.bibliographicCitation	JOURNAL OF INFORMATION SCIENCE, v.45, no.6, pp.833 - 844	-
dc.relation.isPartOf	JOURNAL OF INFORMATION SCIENCE	-
dc.citation.title	JOURNAL OF INFORMATION SCIENCE	-
dc.citation.volume	45	-
dc.citation.number	6	-
dc.citation.startPage	833	-
dc.citation.endPage	844	-
dc.type.rims	ART	-
dc.type.docType	Article	-
dc.description.journalClass	1	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	ssci	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Information Science & Library Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Information Science & Library Science	-
dc.subject.keywordPlus	IMAGE RETRIEVAL	-
dc.subject.keywordPlus	FEATURES	-
dc.subject.keywordAuthor	Deep learning	-
dc.subject.keywordAuthor	image captioning	-
dc.subject.keywordAuthor	keyframe extraction	-
dc.subject.keywordAuthor	shot boundary detection	-
dc.subject.keywordAuthor	video scene segmentation	-

Files in This Item: There are no files associated with this item.

Appears in Collections: Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

Show simple item record

qrcode

Altmetrics

Total Views & Downloads

STATISTICS: Total View :8,742,162; Today View :25,081

RSS_1.0 RSS_2.0 ATOM_1.0

(02841) 서울특별시 성북구 안암로 14502-3290-1114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE