CONTROL AND CAPTURE OF AUDIO DATA INTENDED FOR AN AUDIO ENDPOINT DEVICE OF AN APPLICATION EXECUTING ON A DATA PROCESSING DEVICE
    11.
    发明申请
    CONTROL AND CAPTURE OF AUDIO DATA INTENDED FOR AN AUDIO ENDPOINT DEVICE OF AN APPLICATION EXECUTING ON A DATA PROCESSING DEVICE 有权
    用于在数据处理设备上执行的应用的音频终端设备的音频数据的控制和捕获

    公开(公告)号:US20140371890A1

    公开(公告)日:2014-12-18

    申请号:US13919002

    申请日:2013-06-17

    Inventor: Ambrish Dantrey

    Abstract: A method includes implementing an audio framework to be executed on a data processing device with a virtual audio driver component and a User Mode Component (UMC) communicatively coupled to each other. The virtual audio driver component enables modifying an original default audio endpoint device of an application executing on the data processing device to an emulated audio device associated with a new audio endpoint in response to an initiation through the application in conjunction with the UMC. The virtual audio driver component also enables registering the new audio endpoint as the modified default audio endpoint with an operating system executing on the data processing device. Further, the virtual audio driver component enables capturing audio data intended for the original default audio endpoint device at the new audio endpoint following the registration thereof to enable control of the audio data.

    Abstract translation: 一种方法包括实现在数据处理设备上执行的音频框架,其中虚拟音频驱动器组件和通信地彼此耦合的用户模式组件(UMC)。 虚拟音频驱动器组件使得能够响应于通过与UMC结合的应用的启动,将在数据处理设备上执行的应用的原始默认音频端点设备修改为与新音频端点相关联的仿真音频设备。 虚拟音频驱动器组件还使得能够使用在数据处理设备上执行的操作系统将新音频端点注册为修改的默认音频端点。 此外,虚拟音频驱动器组件能够在其注册之后,在新的音频端点捕获旨在用于原始默认音频端点设备的音频数据,以便能够控制音频数据。

    Emergency response vehicle detection for autonomous driving applications

    公开(公告)号:US11816987B2

    公开(公告)日:2023-11-14

    申请号:US16951224

    申请日:2020-11-18

    Abstract: In various examples, audio alerts of emergency response vehicles may be detected and classified using audio captured by microphones of an autonomous or semi-autonomous machine in order to identify travel directions, locations, and/or types of emergency response vehicles in the environment. For example, a plurality of microphone arrays may be disposed on an autonomous or semi-autonomous machine and used to generate audio signals corresponding to sounds in the environment. These audio signals may be processed to determine a location and/or direction of travel of an emergency response vehicle (e.g., using triangulation). Additionally, to identify siren types—and thus emergency response vehicle types corresponding thereto—the audio signals may be used to generate representations of a frequency spectrum that may be processed using a deep neural network (DNN) that outputs probabilities of alert types being represented by the audio data. The locations, direction of travel, and/or siren type may allow an ego-vehicle or ego-machine to identify an emergency response vehicle and to make planning and/or control decisions in response.

    Emergency Response Vehicle Detection for Autonomous Driving Applications

    公开(公告)号:US20220157165A1

    公开(公告)日:2022-05-19

    申请号:US16951224

    申请日:2020-11-18

    Abstract: In various examples, audio alerts of emergency response vehicles may be detected and classified using audio captured by microphones of an autonomous or semi-autonomous machine in order to identify travel directions, locations, and/or types of emergency response vehicles in the environment. For example, a plurality of microphone arrays may be disposed on an autonomous or semi-autonomous machine and used to generate audio signals corresponding to sounds in the environment. These audio signals may be processed to determine a location and/or direction of travel of an emergency response vehicle (e.g., using triangulation). Additionally, to identify siren types—and thus emergency response vehicle types corresponding thereto—the audio signals may be used to generate representations of a frequency spectrum that may be processed using a deep neural network (DNN) that outputs probabilities of alert types being represented by the audio data. The locations, direction of travel, and/or siren type may allow an ego-vehicle or ego-machine to identify an emergency response vehicle and to make planning and/or control decisions in response.

    AUTOMATIC CLASSIFICATION AND REPORTING OF INAPPROPRIATE LANGUAGE IN ONLINE APPLICATIONS

    公开(公告)号:US20210370188A1

    公开(公告)日:2021-12-02

    申请号:US16884675

    申请日:2020-05-27

    Abstract: In various examples, game session audio data—e.g., representing speech of users participating in the game—may be monitored and/or analyzed to determine whether inappropriate language is being used. Where inappropriate language is identified, the portions of the audio corresponding to the inappropriate language may be edited or modified such that other users do not hear the inappropriate language. As a result, toxic behavior or language within instances of gameplay may be censored—thereby enhancing the user experience and making online gaming environments safer for more vulnerable populations. In some embodiments, the inappropriate language may be reported—e.g., automatically—to the game developer or game application host in order to suspend, ban, or otherwise manage users of the system that have a proclivity for toxic behavior.

    APPLICATION OF GEOMETRIC ACOUSTICS FOR IMMERSIVE VIRTUAL REALITY (VR)

    公开(公告)号:US20200293273A1

    公开(公告)日:2020-09-17

    申请号:US16890724

    申请日:2020-06-02

    Abstract: A virtual reality (VR) audio rendering system and method include spatializing microphone-captured real-world sounds according to a VR setting. In a game streaming system, when a player speaks through a microphone, the voice is processed by geometrical acoustic (GA) simulation configured for a virtual scene, and thereby spatialized audio effects specific to the scene are added. The GA simulation may include generating an impulse response using sound propagation simulation and dynamic HRTF-based listener directivity. When the GA-processed voice of the player is played, the local player or other fellow players can hear it as if the sound travels in the scenery and according to the geometries in the virtual scene. This mechanism can advantageously place the players' chatting in the same virtual world like built-in game audio, thereby advantageously providing enhanced immersive VR experience to users.

    Method and system for immersive virtual reality (VR) streaming with reduced geometric acoustic audio latency

    公开(公告)号:US10412529B1

    公开(公告)日:2019-09-10

    申请号:US16034203

    申请日:2018-07-12

    Abstract: A virtual reality (VR) audio rendering system and method using pre-computed impulse responses (IRs) to generate audio frames in a VR setting for rendering. Based on a current position of a user or a VR object, a set of possible motions are predicted and a set of IRs are pre-computed by using a Geometric Acoustic (GA) model of a virtual scene. Once a position change is actually detected, one of the pre-computed IRs is selected and convolved with a set of audio frames to generate modified audio frames for rendering. As the modified audio frames are generated by using pre-computed IR without requiring intensive ray tracing computations, the audio latency can be significantly reduced.

    Emergency response vehicle detection for autonomous driving applications

    公开(公告)号:US12283187B2

    公开(公告)日:2025-04-22

    申请号:US18462287

    申请日:2023-09-06

    Abstract: In various examples, audio alerts of emergency response vehicles may be detected and classified using audio captured by microphones of an autonomous or semi-autonomous machine in order to identify travel directions, locations, and/or types of emergency response vehicles in the environment. For example, a plurality of microphone arrays may be disposed on an autonomous or semi-autonomous machine and used to generate audio signals corresponding to sounds in the environment. These audio signals may be processed to determine a location and/or direction of travel of an emergency response vehicle (e.g., using triangulation). Additionally, to identify siren types—and thus emergency response vehicle types corresponding thereto—the audio signals may be used to generate representations of a frequency spectrum that may be processed using a deep neural network (DNN) that outputs probabilities of alert types being represented by the audio data.

    AUTOMATIC CLASSIFICATION AND REPORTING OF INAPPROPRIATE LANGUAGE IN ONLINE APPLICATIONS

    公开(公告)号:US20250032938A1

    公开(公告)日:2025-01-30

    申请号:US18918558

    申请日:2024-10-17

    Abstract: In various examples, game session audio data—e.g., representing speech of users participating in the game—may be monitored and/or analyzed to determine whether inappropriate language is being used. Where inappropriate language is identified, the portions of the audio corresponding to the inappropriate language may be edited or modified such that other users do not hear the inappropriate language. As a result, toxic behavior or language within instances of gameplay may be censored—thereby enhancing the user experience and making online gaming environments safer for more vulnerable populations. In some embodiments, the inappropriate language may be reported—e.g., automatically—to the game developer or game application host in order to suspend, ban, or otherwise manage users of the system that have a proclivity for toxic behavior.

    Application of geometric acoustics for immersive virtual reality (VR)

    公开(公告)号:US11809773B2

    公开(公告)日:2023-11-07

    申请号:US16890724

    申请日:2020-06-02

    CPC classification number: G06F3/165 H04L67/34 H04L67/52

    Abstract: A virtual reality (VR) audio rendering system and method include spatializing microphone-captured real-world sounds according to a VR setting. In a game streaming system, when a player speaks through a microphone, the voice is processed by geometrical acoustic (GA) simulation configured for a virtual scene, and thereby spatialized audio effects specific to the scene are added. The GA simulation may include generating an impulse response using sound propagation simulation and dynamic HRTF-based listener directivity. When the GA-processed voice of the player is played, the local player or other fellow players can hear it as if the sound travels in the scenery and according to the geometries in the virtual scene. This mechanism can advantageously place the players' chatting in the same virtual world like built-in game audio, thereby advantageously providing enhanced immersive VR experience to users.

Patent Agency Ranking