METHOD OF COMPARING DOCUMENTS, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM

    Publication No.: US20220108556A1

    Publication date: 2022-04-07

    Application No.: US17552149

    Filing date: 2021-12-15

    Abstract: A method of comparing documents, an electronic device, and a readable storage medium are provided, which relate to the field of data processing technology, and specifically to the field of big data technology. In the present disclosure, each of the two documents to be compared is divided into areas according to its document layout, so as to obtain at least two sets of comparison units. Each set comprises one comparison unit from each of the two documents, and the two comparison units in a set correspond to each other. A content comparison may then be performed between the comparison units of each of the at least two sets, and the content comparison results for the sets together serve as the comparison result for the two documents.
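
The unit-by-unit comparison described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes the layout-based area division has already been done and that each document arrives as a mapping from a layout label (such as "title" or "body") to the text of that area. The function names `pair_units` and `compare_documents` are hypothetical, and `difflib.SequenceMatcher` stands in for whatever content comparison the disclosure actually uses.

```python
from difflib import SequenceMatcher


def pair_units(doc_a, doc_b):
    """Pair areas that share a layout label into sets of comparison units.

    Assumption: each document is a dict {layout_label: text}; areas present
    in only one document are ignored in this sketch.
    """
    return {label: (doc_a[label], doc_b[label]) for label in doc_a if label in doc_b}


def compare_documents(doc_a, doc_b):
    """Return a similarity ratio in [0, 1] for each set of comparison units."""
    result = {}
    for label, (text_a, text_b) in pair_units(doc_a, doc_b).items():
        # Stand-in content comparison: character-level similarity ratio.
        result[label] = SequenceMatcher(None, text_a, text_b).ratio()
    return result


if __name__ == "__main__":
    old = {"title": "Annual Report", "body": "Revenue grew by 5 percent."}
    new = {"title": "Annual Report", "body": "Revenue grew by 7 percent."}
    for label, score in compare_documents(old, new).items():
        print(label, round(score, 3))
```

Comparing per unit rather than over the whole document keeps a change in one area (here the body) from affecting the result reported for another (the title).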

    DATA PROCESSING METHOD
    Invention application

    Publication No.: US20230097986A1

    Publication date: 2023-03-30

    Application No.: US18058640

    Filing date: 2022-11-23

    Abstract: A data processing method is provided. The method includes: determining fusion information based on a text to be processed and a plurality of reference text fragments; executing the following matching operation for each of the plurality of reference text fragments: determining a first coefficient of each feature vector of the fusion information respectively; determining a second coefficient of each feature vector of the fusion information respectively; determining a result feature vector of the reference text fragment using each feature vector included in the fusion information and a weight of the feature vector; and determining a matching degree of the reference text fragment and the text to be processed based on the result feature vector.
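
The abstract does not say how the two coefficients are computed or how they are combined into a weight, so the following is only a sketch under loudly stated assumptions: the weight of each feature vector is taken to be the product of its first and second coefficients, normalized to sum to 1; the result feature vector is the weighted sum of the feature vectors; and the matching degree is a logistic function of the dot product with a hypothetical scoring vector `scorer`. All names here are illustrative and none come from the patent.

```python
import math


def result_vector(features, first_coeffs, second_coeffs):
    """Weighted sum of feature vectors.

    Assumption: weight_i = first_i * second_i, normalized to sum to 1.
    The patent abstract does not specify this combination.
    """
    raw = [a * b for a, b in zip(first_coeffs, second_coeffs)]
    total = sum(raw)
    weights = [r / total for r in raw]
    dim = len(features[0])
    return [sum(w * f[i] for w, f in zip(weights, features)) for i in range(dim)]


def matching_degree(result, scorer):
    """Assumption: logistic of the dot product with a hypothetical scoring vector."""
    z = sum(r * s for r, s in zip(result, scorer))
    return 1.0 / (1.0 + math.exp(-z))


if __name__ == "__main__":
    features = [[1.0, 0.0], [0.0, 1.0]]  # feature vectors of the fusion information
    vec = result_vector(features, first_coeffs=[1.0, 1.0], second_coeffs=[3.0, 1.0])
    print(vec, matching_degree(vec, scorer=[2.0, -1.0]))
```

In the method as described, this matching operation would be repeated once per reference text fragment, yielding one matching degree per fragment.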

    INFORMATION EXTRACTION METHOD AND APPARATUS, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM

    Publication No.: US20230005283A1

    Publication date: 2023-01-05

    Application No.: US17577531

    Filing date: 2022-01-18

    Abstract: The present disclosure provides an information extraction method and apparatus, an electronic device and a readable storage medium, and relates to the field of natural language processing technologies. The information extraction method includes: acquiring a to-be-extracted text; acquiring a sample set, the sample set including a plurality of sample texts and labels of sample characters in the plurality of sample texts; determining a prediction label of each character in the to-be-extracted text according to a semantic feature vector of each character in the to-be-extracted text and a semantic feature vector of each sample character in the sample set; and extracting, according to the prediction label of each character, a character meeting a preset requirement from the to-be-extracted text as an extraction result of the to-be-extracted text. The present disclosure can simplify steps of information extraction, reduce costs of information extraction and improve flexibility and accuracy of information extraction.
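
The labeling step in this abstract can be sketched as a nearest-neighbor lookup, with the caveat that the abstract does not state how the two sets of semantic feature vectors are compared. The sketch assumes cosine similarity, assumes the feature vectors are supplied by some upstream encoder (not shown), and treats "meeting a preset requirement" as "carrying a wanted label". The names `predict_labels`, `extract`, and the label values are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def predict_labels(char_vectors, sample_vectors, sample_labels):
    """Give each character the label of its most similar sample character.

    Assumption: nearest neighbor by cosine similarity; the patent abstract
    only says both sets of semantic feature vectors are used.
    """
    labels = []
    for vec in char_vectors:
        best = max(range(len(sample_vectors)),
                   key=lambda i: cosine(vec, sample_vectors[i]))
        labels.append(sample_labels[best])
    return labels


def extract(text, labels, wanted="NAME"):
    """Keep the characters whose predicted label meets the preset requirement."""
    return "".join(ch for ch, lab in zip(text, labels) if lab == wanted)


if __name__ == "__main__":
    text = "ab"
    char_vectors = [[1.0, 0.0], [0.0, 1.0]]      # one vector per character
    sample_vectors = [[0.9, 0.1], [0.1, 0.9]]    # labeled sample characters
    sample_labels = ["NAME", "O"]
    labels = predict_labels(char_vectors, sample_vectors, sample_labels)
    print(labels, extract(text, labels))
```

Because the labels come from a sample set rather than from a trained classifier head, changing what is extracted would, under these assumptions, only require swapping the samples.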
