Method and system for ideogram character analysis

    公开(公告)号:US12153624B2

    公开(公告)日:2024-11-26

    申请号:US17713074

    申请日:2022-04-04

    Abstract: Ideogram character analysis includes partitioning an original ideogram character into strokes and mapping each stroke to a corresponding stroke identifier (id) to create an original stroke id sequence that includes stroke identifiers. A candidate ideogram character that has a candidate stroke id sequence within a threshold distance to the original stroke id sequence is selected. One or more embodiments may create a new phrase by replacing the original ideogram character with the candidate ideogram character in a search phrase. One or more embodiments perform a search using the search phrase and the new phrase to obtain a result and present the result. One or more embodiments may replace an original ideogram character in a character recognized document with the candidate ideogram character and store the character recognized document.

    METHOD AND SYSTEM FOR IDEOGRAM CHARACTER ANALYSIS

    公开(公告)号:US20220222292A1

    公开(公告)日:2022-07-14

    申请号:US17713074

    申请日:2022-04-04

    Abstract: Ideogram character analysis includes partitioning an original ideogram character into strokes and mapping each stroke to a corresponding stroke identifier (id) to create an original stroke id sequence that includes stroke identifiers. A candidate ideogram character that has a candidate stroke id sequence within a threshold distance to the original stroke id sequence is selected. One or more embodiments may create a new phrase by replacing the original ideogram character with the candidate ideogram character in a search phrase. One or more embodiments perform a search using the search phrase and the new phrase to obtain a result and present the result. One or more embodiments may replace an original ideogram character in a character recognized document with the candidate ideogram character and store the character recognized document.

    Method and system for ideogram character analysis

    公开(公告)号:US11321384B2

    公开(公告)日:2022-05-03

    申请号:US15033309

    申请日:2015-09-30

    Abstract: Ideogram character analysis includes partitioning an original ideogram character into strokes, and mapping each stroke to a corresponding stroke identifier (id) to create an original stroke id sequence that includes stroke identifiers. A candidate ideogram character that has a candidate stroke id sequence within a threshold distance to the original stroke id sequence is selected. One or more embodiments may create a new phrase by replacing the original ideogram character with the candidate ideogram character in a search phrase. One or more embodiments perform a search using the search phrase and the new phrase to obtain a result, and present the result. One or more embodiments may replace an original ideogram character in a character recognized document with the candidate ideogram character and store the character recognized document.

    Method and system for document similarity analysis

    公开(公告)号:US10572544B1

    公开(公告)日:2020-02-25

    申请号:US14968421

    申请日:2015-12-14

    Abstract: A method for document similarity analysis. The method includes generating a reference document content identifier for a reference document, including identifying frequently occurring terms in reference document content, encoding each frequently occurring term in a term identifier and combining the term identifiers to form the reference document content identifier associated with the reference document. The method also includes obtaining at least one document similarity value by comparing the reference document content identifier to a set of archived document content identifiers stored in a document repository.

    METHOD AND SYSTEM FOR ASSESSING SIMILARITY OF DOCUMENTS

    公开(公告)号:US20180068183A1

    公开(公告)日:2018-03-08

    申请号:US15811118

    申请日:2017-11-13

    CPC classification number: G06K9/00483 G06F16/335 G06F16/93 G06K9/00469

    Abstract: Systems and methods for assessing similarity of documents are provided. Embodiments of the systems and methods include extracting a reference document text from a reference document, extracting an archived document text from an archived document, and quantifying the reference document and the archived document. The systems and methods may also include determining a document similarity value of the quantified reference document and the archived document. Determining the document similarity value includes calculating a set of vector similarity values for a set of combinations of a reference document text vector and an archived document text vector, and calculating the document similarity value, including a sum of the plurality of vector similarity values.

    Method and system for assessing similarity of documents

    公开(公告)号:US10521656B2

    公开(公告)日:2019-12-31

    申请号:US15811118

    申请日:2017-11-13

    Abstract: Systems and methods for assessing similarity of documents are provided. Embodiments of the systems and methods include extracting a reference document text from a reference document, extracting an archived document text from an archived document, and quantifying the reference document and the archived document. The systems and methods may also include determining a document similarity value of the quantified reference document and the archived document. Determining the document similarity value includes calculating a set of vector similarity values for a set of combinations of a reference document text vector and an archived document text vector, and calculating the document similarity value, including a sum of the plurality of vector similarity values.

Patent Agency Ranking