Method and device for sorting Chinese characters, searching Chinese characters and constructing dictionary

    公开(公告)号:US12118292B2

    公开(公告)日:2024-10-15

    申请号:US17304849

    申请日:2021-06-27

    申请人: John Zhongqi Wang

    发明人: John Zhongqi Wang

    摘要: The invention discloses a method and a device for sorting Chinese characters, searching for Chinese characters and constructing a dictionary, and relates to the technical field of computers. A specific implementation of the method includes: obtaining the first basic character-forming component of a Chinese character according to the stroke order as the First Character, and encoding the First Character to obtain the First Character code, where the First Character includes the first character-forming component and the first main stroke component of a Chinese character; obtaining the number of strokes included in each Chinese character, and obtaining the corresponding stroke string of each Chinese character; using the First Character code as the first and highest priority sorting field, the number of strokes as the second sorting field, and the stroke string as the third and the lowest priority sorting field to sort Chinese characters. This embodiment can solve the problem of difficulty in sorting and searching of Chinese characters caused by the unfixed definition and position of radicals.

    System and method for trustworthy internet whitelists

    公开(公告)号:US12088593B2

    公开(公告)日:2024-09-10

    申请号:US18327663

    申请日:2023-06-01

    IPC分类号: H04L9/40 G06F7/08 H04W12/60

    摘要: Information is received from a first networked device for a first user and from a second networked device for a second user. The first user and the second user are verified and registered. A first set of data for the first user and a second set of data for the second user that each specify one or more network parameters per network address that communicates with each user are received from a networked collector device. Addresses are selected from each of the first set and the second set where each of the one or more network parameters are above a first activity threshold level for that parameter. A first set and a second set of first level activity addresses are produced. A whitelist is generated for the first user from an intersection of the first set of first level activity addresses and the second set of first level activity addresses.

    A COMPUTER-IMPLEMENTED METHOD AND A DATA PROCESSING HARDWARE FOR PROCESSING SENSOR DATA POINTS

    公开(公告)号:US20240289091A1

    公开(公告)日:2024-08-29

    申请号:US18574047

    申请日:2022-06-14

    IPC分类号: G06F7/08

    CPC分类号: G06F7/08

    摘要: This disclosure relates to a computer-implemented method for processing sensor data points by means of a data processing hardware where a distributions buffer for storing an initial plurality of Gaussian distributions is initialized, each Gaussian distribution of the initial plurality of Gaussian distributions including an initial plurality of sensor data points received sequentially from at least one sensor and having an associated predetermined distribution distance threshold, and where the distributions buffer is sequentially updated for a number n of new sensor data points based on a distribution distance condition, generating an updated plurality of Gaussian distributions including either a new Gaussian distribution, or an updated single Gaussian distribution or an updated merged Gaussian distribution.

    COMPRESSION OF MATRICES FOR DIGITAL SECURITY

    公开(公告)号:US20240259185A1

    公开(公告)日:2024-08-01

    申请号:US18161729

    申请日:2023-01-30

    摘要: Systems and techniques are described herein for compressing data used in cryptographic operations. For example, a process may include obtaining a first data structure, wherein the first data structure comprises polynomials; generating a second data structure based on the first data structure, wherein the second data structure comprises coefficients of the polynomials; sorting the second data structure in an ascending order to obtain a sorted second data structure; updating the sorted second data structure based on differences between elements of the sorted second data structure to obtain a delta-encoded data structure; performing an entropy coding on the delta-encoded data structure to obtain an entropy-encoded output; recovering an updated first data structure using the entropy-encoded output, wherein the updated first data structure corresponds to the first data structure with a different order of first data structure elements; and performing a cryptographic operation using the updated first data structure.

    COMPRESSION OF AN EXCHANGE TRADED DERIVATIVE PORTFOLIO

    公开(公告)号:US20240233016A9

    公开(公告)日:2024-07-11

    申请号:US18486545

    申请日:2023-10-13

    IPC分类号: G06Q40/04 G06F7/08

    CPC分类号: G06Q40/04 G06F7/08

    摘要: An illustrative computing device may include a processor and a non-transitory memory device for storing a data structure capable of being compressed, where the data structure includes a plurality of data elements and each of the plurality of data elements includes a date field and a quantity field. The computing device may process instructions to arrange the plurality of data elements in a consecutive series in date order based on a value stored in the date field of each data element, determine whether a gap appears in the consecutive series of data elements based on a value stored in the quantity field of each element, remove the determined gaps in each of the data elements, and repeat the determining and removing steps until a predetermined criterion has been reached.

    METHOD, APPARATUS, AND COMPUTER-READABLE MEDIUM FOR EFFICIENTLY CLASSIFYING A DATA OBJECT OF UNKNOWN TYPE

    公开(公告)号:US20240232229A1

    公开(公告)日:2024-07-11

    申请号:US18543550

    申请日:2023-12-18

    申请人: INFORMATICA LLC

    发明人: Igor BALABINE

    IPC分类号: G06F16/28 G06F7/08 G06F16/22

    摘要: An apparatus, computer-readable medium, and computer-implemented method for efficiently classifying a data object, including representing the data object as a data object vector in a vector space, each dimension of the data object vector corresponding to a different feature of the data object, determining a distance between the data object vector and centroids of data domain clusters in the vector space, each data domain cluster comprising data domain vectors representing data domains, sorting the data domain clusters according to their respective distances to the data object vector, and iteratively applying data domain classifiers corresponding to data domains represented in a closest data domain cluster in the sorted data domain clusters to the data object.

    ANOMALY DETECTION DATA WORKFLOW FOR TIME SERIES DATA

    公开(公告)号:US20240220480A1

    公开(公告)日:2024-07-04

    申请号:US18608327

    申请日:2024-03-18

    IPC分类号: G06F16/23 G06F7/08

    CPC分类号: G06F16/2365 G06F7/08

    摘要: Methods and systems are described herein for improving anomaly detection in timeseries datasets. Different machine learning models may be trained to process specific types of timeseries data efficiently and accurately. Thus, selecting a proper machine learning model for identifying anomalies in a specific set of timeseries data may greatly improve accuracy and efficiency of anomaly detection. Another way to improve anomaly detection is to process a multitude of timeseries datasets for a time period (e.g., 90 days) to detect anomalies from those timeseries datasets and then correlate those detected anomalies by generating an anomaly timeseries dataset and identifying anomalies within the anomaly timeseries dataset. Yet another way to improve anomaly detection is to divide a dataset into multiple datasets based on a type of anomaly detection requested.

    Anomaly detection data workflow for time series data

    公开(公告)号:US11977536B2

    公开(公告)日:2024-05-07

    申请号:US18189174

    申请日:2023-03-23

    IPC分类号: G06F16/23 G06F7/08

    CPC分类号: G06F16/2365 G06F7/08

    摘要: Methods and systems are described herein for improving anomaly detection in timeseries datasets. Different machine learning models may be trained to process specific types of timeseries data efficiently and accurately. Thus, selecting a proper machine learning model for identifying anomalies in a specific set of timeseries data may greatly improve accuracy and efficiency of anomaly detection. Another way to improve anomaly detection is to process a multitude of timeseries datasets for a time period (e.g., 90 days) to detect anomalies from those timeseries datasets and then correlate those detected anomalies by generating an anomaly timeseries dataset and identifying anomalies within the anomaly timeseries dataset. Yet another way to improve anomaly detection is to divide a dataset into multiple datasets based on a type of anomaly detection requested.