- 专利标题: Cluster-based word vector processing method, device, and apparatus
-
申请号: US16743224申请日: 2020-01-15
-
公开(公告)号: US10769383B2公开(公告)日: 2020-09-08
- 发明人: Shaosheng Cao , Xinxing Yang , Jun Zhou , Xiaolong Li
- 申请人: ALIBABA GROUP HOLDING LIMITED
- 申请人地址: US KY Grand Cayman
- 专利权人: Alibaba Group Holding Limited
- 当前专利权人: Alibaba Group Holding Limited
- 当前专利权人地址: US KY Grand Cayman
- 代理机构: Sheppard Mullin Richter & Hampton LLP
- 优先权: com.zzzhc.datahub.patent.etl.us.BibliographicData$PriorityClaim@dfb0735
- 主分类号: G06F17/28
- IPC分类号: G06F17/28 ; G06N3/02 ; G06N3/08 ; G06F40/30 ; G06F40/40
摘要:
Embodiments of the present application disclose a cluster-based word vector processing method, apparatus, and device. Solutions are include: in a cluster having a server cluster and a worker computer cluster, in which each worker computer in the worker computer cluster separately reads some corpuses in parallel, extracts a word and context words of the word from the read corpuses, obtains corresponding word vectors from a server in the server cluster, and trains the corresponding word vectors, and the server cluster updates word vectors of same words that are stored before the training according to training results of one or more respective worker computers with respect to the word vectors of the same words.
公开/授权文献
信息查询