OBJECT RECOGNITION USING MULTI-MODAL MATCHING SCHEME
    Invention application
    Legal status: Active

    Publication number: US20130272548A1

    Publication date: 2013-10-17

    Application number: US13664295

    Filing date: 2012-10-30

    Abstract: Methods, systems and articles of manufacture for recognizing and locating one or more objects in a scene are disclosed. An image and/or video of the scene is captured. Audio recorded at the scene is used to narrow down an object search of the captured scene. For example, the direction of arrival (DOA) of a sound can be determined and used to limit the search area in a captured image/video. In another example, keypoint signatures may be selected based on types of sounds identified in the recorded audio. A keypoint signature corresponds to a particular object that the system is configured to recognize. Objects in the scene may then be recognized using a scale-invariant feature transform (SIFT) analysis comparing keypoints identified in the captured scene to the selected keypoint signatures.
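
The DOA example in the abstract can be illustrated with a minimal sketch that maps a sound's direction of arrival to a band of image columns to search. This is not the patent's method: the pinhole camera model, the alignment of the camera axis with 0 degrees of the microphone array, and the names `doa_to_column_range`, `hfov_deg` and `margin_deg` are all assumptions made for illustration.

```python
import math

def doa_to_column_range(doa_deg, image_width, hfov_deg, margin_deg=10.0):
    """Map a direction of arrival to a (first_col, last_col) search band.

    Assumption: a pinhole camera whose optical axis coincides with
    0 degrees of the microphone array, positive angles to the right.
    """
    half_fov = hfov_deg / 2.0
    # Focal length in pixels, derived from the horizontal field of view.
    focal_px = (image_width / 2.0) / math.tan(math.radians(half_fov))
    cx = image_width / 2.0

    # Clamp the angular window to the field of view before projecting.
    lo_deg = max(doa_deg - margin_deg, -half_fov)
    hi_deg = min(doa_deg + margin_deg, half_fov)
    if lo_deg > hi_deg:
        return None  # The sound source lies outside the camera's view.

    lo_col = int(math.floor(cx + focal_px * math.tan(math.radians(lo_deg))))
    hi_col = int(math.ceil(cx + focal_px * math.tan(math.radians(hi_deg))))
    return max(lo_col, 0), min(hi_col, image_width)
```

A keypoint detector would then be run only on the returned column band rather than on the full frame, which is one plausible way to realize the "limit the search area" step.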


    Active self-voice naturalization using a bone conduction sensor

    Publication number: US12063490B2

    Publication date: 2024-08-13

    Application number: US18167823

    Filing date: 2023-02-10

    CPC classification number: H04R3/04 H04R2460/13

    Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device may receive an input audio signal from one or more outer microphones, an input audio signal from one or more inner microphones, and a bone conduction signal from a bone conduction sensor based on the input audio signals. The wearable device may filter the bone conduction signal based on a set of frequencies of the input audio signals, such as a low frequency portion of the input audio signals. For example, the wearable device may apply a filter to the bone conduction signal that accounts for an error in the input audio signals. The wearable device may add a gain to the filtered bone conduction signal and may equalize the filtered bone conduction signal based on the gain. The wearable device may output an audio signal to a speaker.
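
The filter-then-gain chain described in the abstract can be sketched roughly as follows. The abstract does not specify the filter design, so a first-order low-pass stands in for it here; the cutoff, sample rate and gain values, and the function names, are hypothetical. The error-compensating filter and the equalization step are not modeled.

```python
import math

def one_pole_lowpass(samples, cutoff_hz, sample_rate_hz):
    """First-order IIR low-pass: y[n] = y[n-1] + a * (x[n] - y[n-1])."""
    # Standard RC-filter coefficient: a = dt / (RC + dt).
    dt = 1.0 / sample_rate_hz
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = dt / (rc + dt)
    out, y = [], 0.0
    for x in samples:
        y += alpha * (x - y)
        out.append(y)
    return out

def filter_and_gain(bone, cutoff_hz=800.0, sample_rate_hz=16000.0, gain_db=6.0):
    """Keep the low-frequency part of the bone signal, then apply a gain."""
    low = one_pole_lowpass(bone, cutoff_hz, sample_rate_hz)
    gain = 10.0 ** (gain_db / 20.0)  # Convert decibels to a linear factor.
    return [gain * s for s in low]
```

A slowly varying input passes through nearly unchanged apart from the gain, while content near the Nyquist frequency is strongly attenuated.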

    User voice activity detection using dynamic classifier

    Publication number: US11783809B2

    Publication date: 2023-10-10

    Application number: US17308593

    Filing date: 2021-05-05

    CPC classification number: G10L15/12 G10L15/22 H04R3/005

    Abstract: A device includes a memory configured to store instructions and one or more processors configured to execute the instructions. The one or more processors are configured to execute the instructions to receive audio data including first audio data corresponding to a first output of a first microphone and second audio data corresponding to a second output of a second microphone. The one or more processors are also configured to execute the instructions to provide the audio data to a dynamic classifier. The dynamic classifier is configured to generate a classification output corresponding to the audio data. The one or more processors are further configured to execute the instructions to determine, at least partially based on the classification output, whether the audio data corresponds to user voice activity.
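
As a rough illustration of the data flow in the abstract (two microphone signals, a classifier that adapts over time, a voice or non-voice decision), the sketch below uses an online two-centroid clustering of the inter-microphone level difference. The abstract does not say what the dynamic classifier is, so this clustering scheme, the chosen feature, and every name and default value here are assumptions.

```python
import math

def level_difference_db(frame1, frame2, eps=1e-12):
    """Energy ratio between two microphone frames, in decibels."""
    e1 = sum(s * s for s in frame1) + eps
    e2 = sum(s * s for s in frame2) + eps
    return 10.0 * math.log10(e1 / e2)

class TwoCentroidClassifier:
    """Online 1-D clustering with two centroids that adapt to the data."""

    def __init__(self, low=0.0, high=6.0, rate=0.1):
        self.centroids = [low, high]  # [non-voice, user voice], in dB
        self.rate = rate

    def classify(self, feature):
        # Pick the nearest centroid, then nudge it toward the new feature.
        idx = min((0, 1), key=lambda i: abs(feature - self.centroids[i]))
        self.centroids[idx] += self.rate * (feature - self.centroids[idx])
        return idx

def is_user_voice(classifier, frame1, frame2):
    """Treat membership in the higher-level cluster as user voice."""
    return classifier.classify(level_difference_db(frame1, frame2)) == 1
```

The intuition is that the wearer's own voice is much louder at a microphone near the mouth than at a second microphone, whereas ambient sound reaches both at similar levels; the centroids drift as the acoustic conditions change.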

    Wireless control of remote devices through intention codes over a wireless connection

    Publication number: US11290518B2

    Publication date: 2022-03-29

    Application number: US15717027

    Filing date: 2017-09-27

    Abstract: Various embodiments provide systems and methods involving a command device that can be used to establish a wireless connection, through one or more wireless channels, between the command device and a remote device. An intention code may be generated, prior to, or after, the establishment of the wireless connection, and the remote device may be selected based on the intention code. The command device may initiate a wireless transfer, through one or more wireless channels of the established wireless connection, of an intention code, and receive acknowledgement that the intention code was successfully transferred to the remote device. The command device may then control the remote device, based on the intention code sent to the remote device, through the one or more wireless channels of the established wireless connection between the command device and the remote device.

    Multiple microphone speech generative networks

    Publication number: US10964335B2

    Publication date: 2021-03-30

    Application number: US15948681

    Filing date: 2018-04-09

    Abstract: Methods, systems, and devices for auditory enhancement are described. A device may receive a respective auditory signal at each of a set of microphones, where each auditory signal includes a respective representation of a target auditory component and one or more noise artifacts. The device may identify a directionality associated with a source of the target auditory component (e.g., based on an arrangement of the multiple microphones). The device may determine a distribution function for the target auditory component based at least in part on the directionality associated with the source and on the received plurality of auditory signals. The device may generate an estimate of the target auditory component based at least in part on the distribution function and output the estimate of the target auditory component.
