METHODS AND SYSTEMS FOR AUTOMATICALLY IDENTIFY IN A DATASET INSUFFICIENT DATA FOR LEARNING, OR RECORDS WITH ANOMALOUS COMBINATIONS OF FEATURE VALUES

    公开(公告)号:US20230205847A1

    公开(公告)日:2023-06-29

    申请号:US17561951

    申请日:2021-12-26

    IPC分类号: G06K9/62

    CPC分类号: G06K9/6219 G06K9/6261

    摘要: Systems and methods for automatically identifying in a dataset insufficient data for learning, or records with anomalous combinations of feature values, by partition of numeric and/or categorical data space into human-interpretable regions are disclosed. The method comprises: receiving a dataset of numeric and/or categorical features with a plurality of observations.
    Calculating observation density for each observation according to a distance or anomaly based metric, and receiving a density measurement. Partitioning the dataset along the numeric and/or categorical features according to the density measurement of each observation by a perpendicular cut along the feature spaces, receiving a map of a plurality of hyper-rectangular shapes representing various levels of density including empty spaces. Displaying the received map, being human-interpretable regions on a Graphic user interface, GUI, wherein the plurality of hyper-rectangular shapes are selectable and present information about the selected hyper-rectangular shape level of density when selected by a user.

    System and method for merging clusters

    公开(公告)号:US11662469B2

    公开(公告)日:2023-05-30

    申请号:US16375052

    申请日:2019-04-04

    发明人: Zaydoun Rawashdeh

    摘要: A LiDAR point cloud that includes two candidate clusters for merging is received. At a first phase, a distance between the two clusters is determined. If the distance is greater than a threshold, the candidate clusters are not merged. Otherwise, an additional point cloud is received for each cluster at different times. A motion characteristic is determined for each cluster. If the motion characteristic for each cluster is close (indicating that the objects are moving at the same speed), then the clusters are merged. Otherwise the clusters are not merged. The motion characteristic for a cluster can be determined by performing an alignment operation using the point cloud received for the cluster, and using the error associated with the alignment operation as the motion characteristic for the cluster. The decision to merge clusters is based on raw point cloud data, which can take place early in the tracking cycle.

    METHODS FOR IDENTIFYING NOVEL GENE EDITING ELEMENTS

    公开(公告)号:US20180068062A1

    公开(公告)日:2018-03-08

    申请号:US15679619

    申请日:2017-08-17

    IPC分类号: G06F19/24 G06K9/62 G06N3/08

    摘要: Embodiments disclosed herein provide methods for identifying new CRISPR loci and effectors, as well as different CRISPR loci combinations found in various organisms. Class-II CRISPR systems contain single-gene effectors that have been engineered for transformative biological discovery and biomedical applications. Discovery of additional single-gene or multi-component CRISPR effectors may enhance existing CRISPR applications, such as precision genome engineering. Comprehensive characterization of CRISPR-loci may identify novel functional roles of CRISPR loci enabling new tools for biomedicine and biological discovery. CRISPR loci have enormous feature complexity, but classification of CRISPR loci has been focused on a small fraction of highly abundant features. Increased genome sequencing has enhanced the sampling of this feature complexity.