Invention Grant
- Patent Title: Document retrieval using internal dictionary-hierarchies to adjust per-subject match results
- Application No.: US14077305
- Application Date: 2013-11-12
- Publication No.: US09235638B2
- Publication Date: 2016-01-12
- Inventor: Anne Elizabeth Gattiker, Fadi H. Gebara, Anthony N. Hylick, Rouwaida N. Kanj
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Mitch Harris, Atty at Law, LLC
- Agent: Andrew M. Harris; William J. Stock
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Techniques for managing big data include retrieval using per-subject dictionaries having multiple levels of sub-classification hierarchy within the subject. Entries may include subject-determining-power (SDP) scores that provide an indication of the descriptive power of the entry term with respect to the subject of the dictionary containing the term. The same term may have entries in multiple dictionaries with different SDP scores in each of the dictionaries. A retrieval request for one or more documents containing search terms descriptive of the one or more documents can be processed by identifying a set of candidate documents tagged with subjects, i.e., identifiers of per-subject dictionaries having entries corresponding to a search term, then using affinity values to adjust the aggregate score for the terms in the dictionaries. Documents are then selected for best match to the subject based on the adjusted scores. Alternatively, the adjustment may be performed after selecting the documents by re-ordering them according to adjusted scores.
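
The retrieval flow summarized in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the dictionary contents, the `Entry` layout, the document tags, and in particular the `affinity` function and the `aggregate * (1 + mean affinity)` adjustment. The abstract states only that affinity values adjust the aggregate score, not how they are computed.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Entry:
    sdp: float   # subject-determining-power score of the term in this dictionary
    path: tuple  # position of the term in the dictionary's sub-classification hierarchy


# Hypothetical per-subject dictionaries: subject -> term -> Entry.
# The same term ("virus") appears in two dictionaries with different SDP scores.
DICTIONARIES = {
    "medicine": {
        "virus": Entry(0.9, ("infection", "viral")),
        "vaccine": Entry(0.8, ("infection", "viral")),
        "fracture": Entry(0.7, ("injury", "bone")),
    },
    "computing": {
        "virus": Entry(0.6, ("security", "malware")),
        "firewall": Entry(0.9, ("security", "network")),
    },
}

# Hypothetical documents, each tagged with the subjects (dictionary identifiers).
DOCUMENTS = {
    "doc1": {"medicine"},
    "doc2": {"computing"},
    "doc3": {"medicine", "computing"},
}


def affinity(a: Entry, b: Entry) -> float:
    """Assumed affinity: fraction of leading hierarchy levels two entries share."""
    shared = 0
    for x, y in zip(a.path, b.path):
        if x != y:
            break
        shared += 1
    return shared / max(len(a.path), len(b.path))


def subject_scores(terms):
    """Aggregate SDP per subject, then adjust it by the mean pairwise affinity."""
    scores = {}
    for subject, dictionary in DICTIONARIES.items():
        hits = [dictionary[t] for t in terms if t in dictionary]
        if not hits:
            continue
        aggregate = sum(e.sdp for e in hits)
        pairs = list(combinations(hits, 2))
        mean_aff = sum(affinity(a, b) for a, b in pairs) / len(pairs) if pairs else 0.0
        scores[subject] = aggregate * (1.0 + mean_aff)  # assumed adjustment rule
    return scores


def retrieve(terms):
    """Rank candidate documents by the best adjusted score among their subjects."""
    scores = subject_scores(terms)
    ranked = []
    for doc, subjects in DOCUMENTS.items():
        matched = subjects & set(scores)
        if matched:
            ranked.append((doc, max(scores[s] for s in matched)))
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked


if __name__ == "__main__":
    for doc, score in retrieve(["virus", "vaccine"]):
        print(doc, round(score, 2))
```

In this sketch the query terms "virus" and "vaccine" sit in the same branch of the hypothetical medicine dictionary, so the medicine score is boosted and medicine-tagged documents rank ahead of the computing-only document. The alternative mentioned in the abstract, adjusting after selection, would correspond to selecting documents on the unadjusted aggregate and then re-sorting them by the adjusted score.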
Public/Granted literature
- US20150134666A1 DOCUMENT RETRIEVAL USING INTERNAL DICTIONARY-HIERARCHIES TO ADJUST PER-SUBJECT MATCH RESULTS Public/Granted day: 2015-05-14