MULTI-MODAL UNDERSTANDING OF EMOTIONS IN VIDEO CONTENT
Abstract:
A method includes obtaining a video sequence having multiple video frames and audio data. The method also includes extracting video features associated with at least one face in the video frames and audio features associated with the audio data. The method further includes processing the video features and the audio features using a trained machine learning model. The trained machine learning model performs a multi-tiered fusion of the video features and different subsets of the audio features in order to identify at least one emotion expressed by at least one person in the video sequence. The multi-tiered fusion of the video features and the audio features may include (i) a first fusion of the video features and a first subset of the audio features and (ii) a second fusion of processed features and a second subset of the audio features, where the processed features are based on the first fusion.
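The two-tier fusion described above can be made concrete with a short sketch. The following is a minimal illustration, not the patented implementation: the layer sizes, the number of emotion classes, the use of concatenation as the fusion operator, and all module names are assumptions chosen for clarity.

```python
import torch
import torch.nn as nn


class TwoTierFusion(nn.Module):
    """Illustrative two-tier fusion: tier 1 fuses video features with a
    first subset of audio features; tier 2 fuses the resulting processed
    features with a second subset of audio features."""

    def __init__(self, video_dim=256, audio_a_dim=64, audio_b_dim=64,
                 hidden_dim=128, num_emotions=7):
        super().__init__()
        # Tier 1: first fusion (video features + first audio subset).
        self.tier1 = nn.Sequential(
            nn.Linear(video_dim + audio_a_dim, hidden_dim),
            nn.ReLU(),
        )
        # Tier 2: second fusion (tier-1 output + second audio subset).
        self.tier2 = nn.Sequential(
            nn.Linear(hidden_dim + audio_b_dim, hidden_dim),
            nn.ReLU(),
        )
        # Final classifier over emotion classes.
        self.classifier = nn.Linear(hidden_dim, num_emotions)

    def forward(self, video_feats, audio_subset_a, audio_subset_b):
        # First fusion: concatenate video features with the first
        # subset of audio features, then process.
        fused1 = self.tier1(torch.cat([video_feats, audio_subset_a], dim=-1))
        # Second fusion: the processed features from the first fusion
        # are combined with the second subset of audio features.
        fused2 = self.tier2(torch.cat([fused1, audio_subset_b], dim=-1))
        return self.classifier(fused2)  # emotion logits


# Example usage: a batch of 4 clips with assumed feature dimensions.
model = TwoTierFusion()
logits = model(torch.randn(4, 256), torch.randn(4, 64), torch.randn(4, 64))
print(logits.shape)  # torch.Size([4, 7])
```

Concatenation followed by a linear layer is only one plausible fusion operator; the abstract does not specify how the features are combined at each tier, and attention-based or gated fusion would fit the same two-tier structure.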