Method and system for generating an HRTF for a user

    公开(公告)号:US11528577B2

    公开(公告)日:2022-12-13

    申请号:US16875115

    申请日:2020-05-15

    Abstract: A method of obtaining a head-related transfer function for a user is provided. The method comprises generating an audio signal for output by a handheld device and outputting the generated audio signal at a plurality of locations by moving the handheld device to those locations. The audio output by the handheld device is detected at left-ear and right-ear microphones. A pose of the handheld device relative to the user's head is determined for at least some of the locations. One or more personalised HRTF features are then determined based on the detected audio and corresponding determined poses of the handheld device. The one or more personalised HRTF features are then mapped to a higher-quality HRTF for the user, wherein the higher-quality HRTF corresponds to an HRTF measured in an anechoic environment. This mapping may be learned using machine learning, for example. A corresponding system is also provided.

    APPARATUS AND METHOD FOR ANALYSIS OF AUDIO RECORDINGS

    公开(公告)号:US20220148584A1

    公开(公告)日:2022-05-12

    申请号:US17452499

    申请日:2021-10-27

    Abstract: A data processing apparatus includes storage circuitry to store audio data for a plurality of respective dialogue recordings for a content and to store text data indicative of a sequence of respective words within the audio data for each of the plurality of respective dialogue recordings, analysis circuitry to compare the text data for a current dialogue recording with predetermined text data for the content and to output comparison data for the current dialogue recording, the comparison data indicative of one or more differences between the text data for the current dialogue recording and the predetermined text data, selection circuitry to select one or more candidate dialogue recordings from the plurality of respective dialogue recordings for the content in dependence upon the comparison data, and recording circuitry to modify at least a portion of the audio data for the current dialogue recording in dependence upon the audio data for one or more of the candidate dialogue recordings to obtain modified audio data and to store the modified audio data for the current dialogue recording.

    Method and system for estimating the geometry of a scene

    公开(公告)号:US11250618B2

    公开(公告)日:2022-02-15

    申请号:US17075243

    申请日:2020-10-20

    Abstract: A method of obtaining real world scale information for a scene includes obtaining at least one image of a plurality of objects in a scene; detecting at least some of the objects in the at least one image as corresponding to pre-determined objects; generating a 3D reconstruction of the scene based on the image content of the at least one image; determining a relative size of each object in the 3D reconstruction of the scene in at least one dimension, the relative size being defined in dimensions of the generated 3D reconstruction; where the relative size of each object is determined based on a distance between at least two points corresponding to that object as transformed into 3D space; obtaining a size probability distribution function for each object detected in the at least one image, each size probability distribution function defining a range of sizes in at least one dimension that a corresponding object is likely to possess in real world units; resealing the size probability distribution function for each detected object based on a corresponding relative size of that object in the 3D reconstruction; and estimating a geometry of the scene in real world units by combining the re-scaled probability distribution function for at least one detected object with the re-scaled probability distribution function for at least one other detected object.

    GAZE TRACKING APPARATUS AND SYSTEMS

    公开(公告)号:US20210392318A1

    公开(公告)日:2021-12-16

    申请号:US17340282

    申请日:2021-06-07

    Abstract: A gaze tracking system comprising a first camera operable to capture images of a user within an environment, a second camera, having a smaller field of view than the first camera, operable to capture images of at least one of the user's eyes, an eye identification unit operable to identify a location of at least one of the user's eyes from images captured by the first camera, a camera control unit operable to modify the position and/or orientation of the second camera in dependence upon the detected location of the at least one of the user's eyes, so as to cause the second camera to be able to capture images of at least one of the user's eyes, and a gaze direction identification unit operable to identify a gaze direction of the user from images captured by the second camera.

    METHOD AND SYSTEM FOR RECONSTRUCTING COLOUR AND DEPTH INFORMATION OF A SCENE

    公开(公告)号:US20210241495A1

    公开(公告)日:2021-08-05

    申请号:US17268856

    申请日:2019-08-06

    Abstract: A method of reconstructing colour and depth information of a scene includes receiving a colour image of a scene and obtaining depth information of the scene. The colour and depth images are used to generate a point cloud, which is then projected to an alternative viewpoint and converted to sparse colour and depth images. Colour information is then estimated for at least some parts of the sparse colour image, resulting in a reconstructed colour image. The reconstructed colour image is used with the existing depth information to estimate depth information for the sparse depth image. In this way, colour and depth information of the scene can be estimated, and used to generate colour and depth images of the scene from a desired viewpoint. A corresponding system for reconstructing colour and depth information is also provided.

    ENCODING AND DECODING APPARATUS
    90.
    发明申请

    公开(公告)号:US20210124996A1

    公开(公告)日:2021-04-29

    申请号:US17074827

    申请日:2020-10-20

    Abstract: An encoding apparatus is provided. The apparatus comprises an input unit operable to obtain a plurality of training images, said training images being for use in training a machine learning model. The apparatus also comprises a label unit operable to obtain a class label associated with the training images; and a key unit operable to obtain a secret key for use in encoding the training images. The apparatus further comprises an image noise generator operable to generate, based on the obtained secret key, noise for introducing into the training images. The image noise generator is configured to generate noise that correlates with the class label associated with the training images such that a machine learning model subsequently trained with the modified training images learns to associate the introduced noise with the class label for those images. A corresponding decoding apparatus is also provided.

Patent Agency Ranking