Generating three-dimensional virtual scene

    公开(公告)号:US11574438B2

    公开(公告)日:2023-02-07

    申请号:US17340865

    申请日:2021-06-07

    IPC分类号: G06T17/00 G06T19/20

    摘要: A method and system for generating a three-dimensional (3D) virtual scene are disclosed. The method includes: identifying a two-dimensional (2D) object in a 2D picture and the position of the 2D object in the 2D picture; obtaining the three-dimensional model of the 3D object corresponding to the 2D object; calculating the corresponding position of the 3D object corresponding to the 2D object in the horizontal plane of the 3D scene according to the position of the 2D object in the picture; and simulating the falling of the model of the 3D object onto the 3D scene from a predetermined height above the 3D scene, wherein the position of the landing point the model of the 3D object in the horizontal plane is the corresponding position of the 3D object in the horizontal plane of the 3D scene.

    Classifying images in overlapping groups of images using convolutional neural networks

    公开(公告)号:US11341370B2

    公开(公告)日:2022-05-24

    申请号:US16692356

    申请日:2019-11-22

    摘要: The present disclosure relates to training a machine learning model to classify images. An example method generally includes receiving a training data set including images in a first category and images in a second category. A convolutional neural network (CNN) is trained using the training data set, and a feature map is generated from layers of the CNN based on features of images in the training data set. A first area in the feature map including images in the first category and a second area in the feature map where images in the first category overlap with images in the second category are identified. The first category is split into a first subcategory corresponding to the first area and a second subcategory corresponding to the second area. The CNN is retrained based on the images in the first subcategory, images in the second subcategory, and images in the second category.

    Audio source identification
    5.
    发明授权

    公开(公告)号:US10748554B2

    公开(公告)日:2020-08-18

    申请号:US16249654

    申请日:2019-01-16

    IPC分类号: G10L25/51 G10L25/27 G10L25/18

    摘要: Embodiments facilitating audio source identification are provided. A computer-implemented method comprises: receiving, by a device operatively coupled to one or more processors, an audio signal under inspection; generating, by the device, an image of time-frequency spectrum of low frequency component and high frequency component of the audio signal; and identifying, by the device, a source of the audio signal based on the generated image and one or more patterns of time-frequency spectrum, wherein each of the one or more patterns is corresponding to low frequency feature and high frequency feature of a specific audio source.

    AUDIO SOURCE IDENTIFICATION
    6.
    发明申请

    公开(公告)号:US20200227068A1

    公开(公告)日:2020-07-16

    申请号:US16249654

    申请日:2019-01-16

    IPC分类号: G10L25/51 G10L25/18 G10L25/27

    摘要: Embodiments facilitating audio source identification are provided. A computer-implemented method comprises: receiving, by a device operatively coupled to one or more processors, an audio signal under inspection; generating, by the device, an image of time-frequency spectrum of low frequency component and high frequency component of the audio signal; and identifying, by the device, a source of the audio signal based on the generated image and one or more patterns of time-frequency spectrum, wherein each of the one or more patterns is corresponding to low frequency feature and high frequency feature of a specific audio source.

    Object detection
    7.
    发明授权

    公开(公告)号:US10706530B2

    公开(公告)日:2020-07-07

    申请号:US15700684

    申请日:2017-09-11

    摘要: This disclosure provides a method for object detection. The method comprises receiving a user input that specifies one or more first regions and one or more second regions in a template image. The one or more second regions include one or more objects of interest. The method further comprises for each of the one or more first regions discovering a third region in an image under detection corresponding to the first region in the template image by matching the image under detection with the template image. The method further comprises computing a transformation function based on the matching from each of the one or more first regions to its corresponding third region. The method further comprises applying the computed transformation function to the one or more second regions to localize one or more fourth regions in the image under detection for the object detection.

    FLEXIBLE VISUAL INSPECTION MODEL COMPOSITION AND MODEL INSTANCE SCHEDULING

    公开(公告)号:US20200151869A1

    公开(公告)日:2020-05-14

    申请号:US16185348

    申请日:2018-11-09

    IPC分类号: G06T7/00 G06F9/455 G06F9/48

    摘要: Scheduling automated visual inspection tasks includes capturing an image of a component to be inspected. A visual inspection model is formed with a model engine as a composite model of utility modules and functional modules to perform visual inspection of the image of the component. An abstract processing workflow of the visual inspection model is derived with a scheduler including dependencies between the utility modules and the functional modules. Performance of each of the functional modules is profiled with the scheduler by testing performance with available hardware resources to produce a performance profile. Parallel instances of each of the functional modules in a branch of the abstract processing workflow are scheduled with the scheduler according to the dependencies and the performance profiles. An indication of defects in the component is produced by processing the visual inspection model according to the scheduled functional modules.