Granted invention patent
US07921068B2 Data mining platform for knowledge discovery from heterogeneous data types and/or heterogeneous data sources
Legal status: Lapsed
- Patent title: Data mining platform for knowledge discovery from heterogeneous data types and/or heterogeneous data sources
- Application number: US11928606
- Filing date: 2007-10-30
- Publication number: US07921068B2
- Publication date: 2011-04-05
- Inventors: Isabelle Guyon, Edward P. Reiss, René Doursat, Jason Aaron Edward Weston
- Applicants: Isabelle Guyon, Edward P. Reiss, René Doursat, Jason Aaron Edward Weston
- Applicant address: Savannah, GA, US
- Assignee: Health Discovery Corporation
- Current assignee: Health Discovery Corporation
- Current assignee address: Savannah, GA, US
- Law firm: Procopio, Cory, Hargreaves & Savitch, LLP
- Agent: Eleanor M. Musick
- Main classification: G06F17/00
- IPC classification: G06F17/00; G06F15/00; G06F15/18; G06N5/00
Abstract:
The data mining platform comprises a plurality of system modules, each formed from a plurality of components. Each module has an input data component, a data analysis engine for processing the input data, an output data component for outputting the results of the data analysis, and a web server to access and monitor the other modules within the unit and to provide communication to other units. Each module processes a different type of data, for example, a first module processes microarray (gene expression) data while a second module processes biomedical literature on the Internet for information supporting relationships between genes and diseases and gene functionality. In the preferred embodiment, the data analysis engine is a kernel-based learning machine, and in particular, one or more support vector machines (SVMs). The data analysis engine includes a pre-processing function for feature selection, for reducing the amount of data to be processed by selecting the optimum number of attributes, or “features”, relevant to the information to be discovered.
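The abstract describes a feature-selection step that feeds an SVM but does not say which selection method is used. As a minimal sketch only, the Python below assumes one common approach: train a linear SVM, drop the feature with the smallest absolute weight, and repeat until the desired number of features remains. The function names, hyperparameters, and toy data are hypothetical and are not taken from the patent.

```python
def train_linear_svm(X, y, epochs=100, lr=0.1, lam=0.01):
    """Fit a linear SVM by batch subgradient descent on the L2-regularized hinge loss.

    X is a list of feature vectors and y a list of labels in {-1, +1}.
    Returns the weight vector and the bias.
    """
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:  # sample violates the margin, so the hinge loss is active
                for j in range(d):
                    grad_w[j] += yi * xi[j]
                grad_b += yi
        # Shrink the weights (L2 penalty) and step toward the margin violators.
        w = [wj * (1 - lr * lam) + lr * gj / n for wj, gj in zip(w, grad_w)]
        b += lr * grad_b / n
    return w, b


def select_features(X, y, keep):
    """Assumed selection rule: repeatedly retrain and drop the smallest-|weight| feature."""
    active = list(range(len(X[0])))  # column indices still under consideration
    while len(active) > keep:
        X_sub = [[row[j] for j in active] for row in X]
        w, _ = train_linear_svm(X_sub, y)
        weakest = min(range(len(active)), key=lambda k: abs(w[k]))
        del active[weakest]
    return active


# Hypothetical toy data: column 0 is strongly informative, column 1 weakly
# informative, and column 2 is noise that is uncorrelated with the label.
X = [
    [2.0, 0.5, 1.0],
    [2.0, 0.5, -1.0],
    [-2.0, -0.5, 1.0],
    [-2.0, -0.5, -1.0],
]
y = [1, 1, -1, -1]

print(select_features(X, y, keep=1))
```

On this toy data the noise column is eliminated first and the weakly informative column second, so only the strongly informative column is kept. Real microarray data has thousands of features, so a practical system would likely remove features in batches rather than one at a time.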