-
公开(公告)号:US10387568B1
公开(公告)日:2019-08-20
申请号:US15269539
申请日:2016-09-19
Applicant: Amazon Technologies, Inc.
Inventor: Weiwei Cheng , Amanda Dee Bottorff , Sandeep Ranganathan
IPC: G06F16/00 , G06F17/27 , G06F16/248 , G06F16/951 , G06F16/2458 , G06F16/2457 , G10L15/26
Abstract: An unsupervised keyword extraction process is disclosed. A single input document can be analyzed to identify multiple candidate keywords by utilizing splitting terms. A keyword score is calculated for each of the candidate keywords. The keyword score for a particular candidate keyword is determined based on the length of the candidate keywords that contain the candidate keyword and the frequency of the words appearing in the candidate keywords. One or more keywords having the highest keyword scores are selected as the extracted keywords. The extracted keywords can be used in applications, such as refining search results, providing suggested search terms, or improving the match rate of a network page at a search engine.
-
公开(公告)号:US10796094B1
公开(公告)日:2020-10-06
申请号:US16534407
申请日:2019-08-07
Applicant: Amazon Technologies, Inc.
Inventor: Weiwei Cheng , Amanda Dee Bottorff , Sandeep Ranganathan
IPC: G06F16/00 , G06F40/289 , G06F16/2457 , G06F16/951 , G06F16/248 , G06F16/2458 , G10L15/26
Abstract: An unsupervised keyword extraction process is disclosed. A single input document can be analyzed to identify multiple candidate keywords by utilizing splitting terms. A keyword score is calculated for each of the candidate keywords. The keyword score for a particular candidate keyword is determined based on the length of the candidate keywords that contain the candidate keyword and the frequency of the words appearing in the candidate keywords. One or more keywords having the highest keyword scores are selected as the extracted keywords. The extracted keywords can be used in applications, such as refining search results, providing suggested search terms, or improving the match rate of a network page at a search engine.
-