发明申请
US20050138005A1 SYSTEMS AND METHODS FOR SEARCHING EACH LEVEL OF THE INNER STRUCTURE OF A STRING OVER A LANGUAGE HAVING A VOCABULARY AND A GRAMMAR
失效
用于搜索具有VOCABULARY和GRAMMAR的语言的STRING的内部结构的各个层次的系统和方法
- 专利标题: SYSTEMS AND METHODS FOR SEARCHING EACH LEVEL OF THE INNER STRUCTURE OF A STRING OVER A LANGUAGE HAVING A VOCABULARY AND A GRAMMAR
- 专利标题(中): 用于搜索具有VOCABULARY和GRAMMAR的语言的STRING的内部结构的各个层次的系统和方法
-
申请号: US10739192申请日: 2003-12-19
-
公开(公告)号: US20050138005A1公开(公告)日: 2005-06-23
- 发明人: Claude Roux , Bernard Jacquemin
- 申请人: Claude Roux , Bernard Jacquemin
- 申请人地址: US CT STAMFORD 06904-1600
- 专利权人: XEROX CORPORATION
- 当前专利权人: XEROX CORPORATION
- 当前专利权人地址: US CT STAMFORD 06904-1600
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
Systems and methods for indexing and searching the inner structure of a string over a language having a vocabulary and a grammar using bit vectors. The index preserves the inner grammatical structure of the string while allowing for a fast search. A single search provides immediate access to every level of a document, without having ato re-search a single string to determine which sub-parts of that string match the search string. When a string is indexed, the index maintains a compositional representation and the grammatical relationship between the elements of the vocabulary according to the language. The string is then indexed on different levels by disregarding some of the grammatical relationships of component levels.
公开/授权文献
信息查询