-
Publication number: US20180262798A1
Publication date: 2018-09-13
Application number: US15463573
Filing date: 2017-03-20
Applicant: Wipro Limited
Inventor: Manjunath RAMACHANDRA
IPC: H04N21/431 , H04N21/45 , H04N21/472 , H04N21/482
CPC classification number: H04N21/4318 , G06F17/30828 , H04N21/4532 , H04N21/47217 , H04N21/4828
Abstract: In one embodiment, a method for rendering multimedia content on a user device is disclosed. The method includes receiving, by a content rendering device, multimedia content from a multimedia server. The multimedia content is stored in the multimedia server in the form of chunks. The method includes transforming, by the content rendering device, at least one chunk of the multimedia content based on user preferences. The transforming comprises adjusting the time duration of the multimedia content based on a user-defined time duration. Further, the method includes rendering, by the content rendering device, the transformed multimedia content on the user device.
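Illustrative sketch (not part of the patent record): the abstract does not say how the duration is adjusted. One possible reading, assumed here, is that the rendering device keeps only the most relevant chunks that fit within the user-defined duration. The `Chunk` type, its `score` field, and the greedy selection rule are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chunk:
    index: int         # position of the chunk in the original stream
    duration_s: float  # playback length in seconds
    score: float       # hypothetical relevance to the user's preferences


def fit_to_duration(chunks, target_s):
    """Greedily keep the highest-scoring chunks whose total length fits target_s."""
    chosen = []
    total = 0.0
    for chunk in sorted(chunks, key=lambda c: c.score, reverse=True):
        if total + chunk.duration_s <= target_s:
            chosen.append(chunk)
            total += chunk.duration_s
    # Restore playback order before rendering.
    return sorted(chosen, key=lambda c: c.index)


chunks = [
    Chunk(0, 30.0, 0.9),
    Chunk(1, 60.0, 0.2),
    Chunk(2, 30.0, 0.7),
    Chunk(3, 45.0, 0.5),
]
selected = fit_to_duration(chunks, target_s=90.0)
```

A greedy pass is simple but not optimal; a knapsack-style selection or a change of playback speed would be equally valid readings of "adjusting time duration".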
-
Publication number: US20180336417A1
Publication date: 2018-11-22
Application number: US15638404
Filing date: 2017-06-30
Applicant: WIPRO LIMITED
Inventor: Adrita BARARI , Manjunath RAMACHANDRA , Ghulam MOHIUDDIN KHAN
CPC classification number: G06K9/00684 , G06F17/241 , G06K9/00744 , G06K9/00751
Abstract: Disclosed subject matter relates to paraphrasing multimedia content, including a method and system for generating a contextual summary of multimedia content. A contextual summary generator retrieves the multimedia content comprising scenes from a multimedia content database and generates scene descriptors, each describing a scene, for each scene. Further, an emotion factor is identified in each scene based on each scene descriptor, each speech descriptor and each textual descriptor associated with each of the one or more scenes. Upon identifying the emotion factor, a context descriptor indicating the context of each scene is generated for each scene based on analysis of each emotion factor and non-speech descriptors. Finally, the scene descriptors, textual descriptors and context descriptors are correlated based on a dynamically configured threshold value to generate the contextual summary of the multimedia content, saving the time and effort of watching or listening to the entire multimedia content, which may be redundant.
-
Publication number: US20170256262A1
Publication date: 2017-09-07
Application number: US15070827
Filing date: 2016-03-15
Applicant: Wipro Limited
Inventor: Manjunath RAMACHANDRA , Priyanshu SHARMA
IPC: G10L15/26 , G06F17/22 , G10L15/187 , G10L15/25 , G10L15/14
CPC classification number: G10L15/265 , G06F17/2288 , G10L15/14 , G10L15/187 , G10L15/25 , G10L15/26 , G10L2015/025
Abstract: This disclosure relates generally to speech recognition, and more particularly to a system and method for speech-to-text conversion using audio as well as video input. In one embodiment, a method is provided for performing speech-to-text conversion. The method comprises receiving audio data and video data of a user while the user is speaking, generating a first raw text based on the audio data via one or more audio-to-text conversion algorithms, generating a second raw text based on the video data via one or more video-to-text conversion algorithms, determining one or more errors by comparing the first raw text and the second raw text, and correcting the one or more errors by applying one or more rules. The one or more rules employ at least one of a domain-specific word database, a context of conversation, and a prior communication history.
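Illustrative sketch (not part of the patent record): the comparison and correction steps could look roughly like the following, assuming both raw texts are already available as strings. The word-level alignment uses Python's standard `difflib`, and the single rule shown (prefer the video-derived wording when all of its words appear in a domain-specific word set and the audio-derived wording does not) is a hypothetical stand-in for the rules the abstract mentions.

```python
import difflib


def disagreements(audio_text, video_text):
    """Yield (tag, audio_words, video_words) for each aligned span of the two transcripts."""
    a, v = audio_text.split(), video_text.split()
    matcher = difflib.SequenceMatcher(a=a, b=v, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        yield tag, a[i1:i2], v[j1:j2]


def in_domain(words, domain_words):
    """True if the span is non-empty and every word is in the domain word set."""
    return bool(words) and all(w.lower() in domain_words for w in words)


def correct(audio_text, video_text, domain_words):
    """Resolve each disagreement with a simple domain-vocabulary rule."""
    out = []
    for tag, audio_span, video_span in disagreements(audio_text, video_text):
        if tag != "equal" and in_domain(video_span, domain_words) \
                and not in_domain(audio_span, domain_words):
            out.extend(video_span)   # video wording matches the domain vocabulary
        else:
            out.extend(audio_span)   # otherwise keep the audio-derived wording
    return " ".join(out)


audio_raw = "the patient has a high fewer"
video_raw = "the patient has a high fever"
fixed = correct(audio_raw, video_raw, domain_words={"patient", "fever"})
```

Rules based on conversation context or prior communication history would slot in at the same decision point as the vocabulary check.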
-
-