发明申请
- 专利标题: Compression of logs of language data
- 专利标题(中): 压缩日志的语言数据
-
申请号: US10796644申请日: 2004-03-09
-
公开(公告)号: US20050203934A1公开(公告)日: 2005-09-15
- 发明人: Scott Meredith , Peter Leonard , Hsiao-Wuen Hon
- 申请人: Scott Meredith , Peter Leonard , Hsiao-Wuen Hon
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 主分类号: G06F12/00
- IPC分类号: G06F12/00 ; G06F17/30 ; H03M7/30 ; G06F17/00
摘要:
A method and apparatus for compressing query logs is provided. Multiple levels of user-specifiable compression include character-based compression, token-based compression, and subsumption. An efficient method for performing subsumption is also provided. The compressed query logs are then used to train a statistical process such as a help function for a computer operating system.
信息查询