-
公开(公告)号:US12032922B2
公开(公告)日:2024-07-09
申请号:US17318170
申请日:2021-05-12
Applicant: Microsoft Technology Licensing, LLC
Inventor: Ji Li , Konstantin Seleskerov , Huey-Ru Tsai , Muin Barkatali Momin , Ramya Tridandapani , Sindhu Vigasini Jambunathan , Amit Srivastava , Derek Martin Johnson , Gencheng Wu , Sheng Zhao , Xinfeng Chen , Bohan Li
IPC: G06F40/58 , G06F3/0481 , G06F16/2457 , G06F40/205 , G10L13/02
CPC classification number: G06F40/58 , G06F3/0481 , G06F16/24578 , G06F40/205 , G10L13/02
Abstract: Automatic generation of intelligent content is created using a system of computers including a user device and a cloud-based component that processes the user information. The system performs a process that includes receiving an input document and parsing the input document to generate inputs for a natural language generation model using a text analysis model. The natural language generation model generates one or more candidate presentation scripts based on the inputs. A presentation script is selected from the candidate presentation scripts and displayed. A text-to-speech model may be used to generate a synthesized audio presentation of the presentation script. A final presentation may be generated that includes a visual display of the input document and the corresponding audio presentation in sync with the visual display.
-
公开(公告)号:US11909922B2
公开(公告)日:2024-02-20
申请号:US18155918
申请日:2023-01-18
Applicant: Microsoft Technology Licensing, LLC
Inventor: Ji Li , Amit Srivastava , Derek Martin Johnson , Priyanka Vikram Sinha , Konstantin Seleskerov , Gencheng Wu
IPC: H04M3/56 , G06N3/08 , H04L12/18 , H04L65/401 , H04L65/403
CPC classification number: H04M3/568 , G06N3/08 , H04L12/1818 , H04L12/1822 , H04L65/403 , H04L65/4015
Abstract: The present disclosure relates to processing operations configured to provide processing that automatically analyzes acoustic signals from attendees of a live presentation and automatically triggers corresponding reaction indications from results of analysis thereof. Exemplary reaction indications provide feedback for live presentations that can be presented in real-time (or near real-time) without requiring a user to manually take action to provide any feedback. As a non-limiting example, reaction indications may be presented in a form that is easy to visualize and understand such as emojis or icons. Another example of a reaction indication is a graphical user interface (GUI) notification that provides a predictive indication of user intent derived from analysis of acoustic signals. Further examples described herein extend to training and application of artificial intelligence (AI) processing, in real-time (or near real-time), that is configured to automatically analyze acoustic features of audio streams and automatically generate exemplary reaction indications.
-
公开(公告)号:US11570307B2
公开(公告)日:2023-01-31
申请号:US16983649
申请日:2020-08-03
Applicant: Microsoft Technology Licensing, LLC
Inventor: Ji Li , Amit Srivastava , Derek Martin Johnson , Priyanka Vikram Sinha , Konstantin Seleskerov , Gencheng Wu
IPC: H04M3/56 , G06N3/08 , H04L12/18 , H04L65/401 , H04L65/403
Abstract: The present disclosure relates to processing operations configured to provide processing that automatically analyzes acoustic signals from attendees of a live presentation and automatically triggers corresponding reaction indications from results of analysis thereof. Exemplary reaction indications provide feedback for live presentations that can be presented in real-time (or near real-time) without requiring a user to manually take action to provide any feedback. As a non-limiting example, reaction indications may be presented in a form that is easy to visualize and understand such as emojis or icons. Another example of a reaction indication is a graphical user interface (GUI) notification that provides a predictive indication of user intent derived from analysis of acoustic signals. Further examples described herein extend to training and application of artificial intelligence (AI) processing, in real-time (or near real-time), that is configured to automatically analyze acoustic features of audio streams and automatically generate exemplary reaction indications.
-
公开(公告)号:US20220038580A1
公开(公告)日:2022-02-03
申请号:US16983649
申请日:2020-08-03
Applicant: Microsoft Technology Licensing, LLC
Inventor: Ji Li , Amit Srivastava , Derek Martin Johnson , Priyanka Vikram Sinha , Konstantin Seleskerov , Gencheng Wu
Abstract: The present disclosure relates to processing operations configured to provide processing that automatically analyzes acoustic signals from attendees of a live presentation and automatically triggers corresponding reaction indications from results of analysis thereof. Exemplary reaction indications provide feedback for live presentations that can be presented in real-time (or near real-time) without requiring a user to manually take action to provide any feedback. As a non-limiting example, reaction indications may be presented in a form that is easy to visualize and understand such as emojis or icons. Another example of a reaction indication is a graphical user interface (GUI) notification that provides a predictive indication of user intent derived from analysis of acoustic signals. Further examples described herein extend to training and application of artificial intelligence (AI) processing, in real-time (or near real-time), that is configured to automatically analyze acoustic features of audio streams and automatically generate exemplary reaction indications.
-
公开(公告)号:US12026948B2
公开(公告)日:2024-07-02
申请号:US17085755
申请日:2020-10-30
Applicant: Microsoft Technology Licensing, LLC
Inventor: Konstantin Seleskerov , Amit Srivastava , Derek Martin Johnson , Priyanka Vikram Sinha , Gencheng Wu , Brittany Elizabeth Mederos
CPC classification number: G06V20/41 , G06N20/00 , G06V20/46 , G06V40/176 , G06V40/23
Abstract: Techniques performed by a data processing system include establishing an online presentation session for conducting an online presentation, receiving first media streams comprising presentation content from the first computing device, receiving second media streams from the second computing devices of a subset of the plurality of participants, the second media streams including audio content, video content, or both of the subset of the plurality of participants, analyzing the first media streams using first machine learning models to generate feedback results, analyzing the set of second media streams to identify first reactions by the participants to obtain reaction information, automatically analyzing the feedback results and the reactions to identify discrepancies between the feedback results and the reactions, and automatically updating one or more parameters of the machine learning models based on the discrepancies to improve the suggestions for improving the online presentation.
-
公开(公告)号:US20190339820A1
公开(公告)日:2019-11-07
申请号:US15969538
申请日:2018-05-02
Applicant: Microsoft Technology Licensing, LLC
Inventor: Gencheng Wu , Lishan Yu
IPC: G06F3/0482 , G06F9/451 , G06F11/34 , G06F15/18
Abstract: Systems, methods, and software are disclosed herein to predict and display menu items based on a prediction of the next user-actions. In an implementation, a user interface is displayed to the application. The user interface comprises menu items displayed in sub-menus of a menu. In response to an occurrence of a user-action associated with a given item of a given sub-menu of the sub-menus, a set of user-actions likely to occur next is identified based on an identity of the user-action. A subset of the menu items is then identified corresponding to the set of the user-actions likely to occur next. The subset of the menu items is then displayed in the user interface.
-
-
-
-
-