发明授权
- 专利标题: System and method for generating vocabulary from network data
- 专利标题(中): 用于从网络数据生成词汇的系统和方法
-
申请号: US12571390申请日: 2009-09-30
-
公开(公告)号: US08489390B2公开(公告)日: 2013-07-16
- 发明人: Thangavelu Arumugam , Satish K. Gannu , Virgil N. Mihailovici , Ashutosh A. Malegaonkar , Christian Posse , Sonali M. Sambhus , Nitasha Walia , Kui Zhang
- 申请人: Thangavelu Arumugam , Satish K. Gannu , Virgil N. Mihailovici , Ashutosh A. Malegaonkar , Christian Posse , Sonali M. Sambhus , Nitasha Walia , Kui Zhang
- 申请人地址: US CA San Jose
- 专利权人: Cisco Technology, Inc.
- 当前专利权人: Cisco Technology, Inc.
- 当前专利权人地址: US CA San Jose
- 代理机构: Patent Capital Group
- 主分类号: G10L19/00
- IPC分类号: G10L19/00
摘要:
A method is provided in one example and includes receiving data propagating in a network environment and separating the data into one or more fields. At least some of the fields are evaluated in order to identify nouns and noun phrases within the fields. The method also includes identifying selected words within the nouns and noun phrases based on a whitelist and a blacklist. The whitelist includes a plurality of designated words to be tagged and the blacklist includes a plurality of rejected words that are not to be tagged. A resultant composite is generated for the selected nouns and noun phrases that are tagged. The resultant composite is incorporated into the whitelist if the resultant composite is approved.
公开/授权文献
信息查询