-
公开(公告)号:US12190061B2
公开(公告)日:2025-01-07
申请号:US17644856
申请日:2021-12-17
Applicant: ADOBE INC.
Inventor: Shashank Shailabh , Madhur Panwar , Milan Aggarwal , Pinkesh Badjatiya , Simra Shahid , Nikaash Puri , S Sejal Naidu , Sharat Chandra Racha , Balaji Krishnamurthy , Ganesh Karbhari Palwe
IPC: G06F40/289 , G06F40/30 , G06F40/40
Abstract: Systems and methods for topic modeling are described. The systems and methods include encoding words of a document using an embedding matrix to obtain word embeddings for the document. The words of the document comprise a subset of words in a vocabulary, and the embedding matrix is trained as part of a topic attention network based on a plurality of topics. The systems and methods further include encoding a topic-word distribution matrix using the embedding matrix to obtain a topic embedding matrix. The topic-word distribution matrix represents relationships between the plurality of topics and the words of the vocabulary. The systems and methods further include computing a topic context matrix based on the topic embedding matrix and the word embeddings and identifying a topic for the document based on the topic context matrix.
-
公开(公告)号:US12223002B2
公开(公告)日:2025-02-11
申请号:US17454445
申请日:2021-11-10
Applicant: ADOBE INC.
Inventor: Pinkesh Badjatiya , Tanay Anand , Simra Shahid , Nikaash Puri , Milan Aggarwal , S Sejal Naidu , Sharat Chandra Racha
IPC: G06F16/9536 , G06F16/9538 , G06F40/20
Abstract: A method of finding online relevant conversing posts, comprises receiving, by a web server serving an online forum, a query post from an inquirer using the online forum, computing a contextual similarity score between each conversing post of a set of conversing posts with a query post, wherein the contextual similarity score is computed between the body of each of conversing posts and of the query post, wherein N1 conversing posts with a highest contextual similarity score are selected; computing a fine grained similarity score between the subject of the query post and of each of the N1 conversing posts, wherein N2 conversing posts with a highest fine grained similarity score are selected; and boosting the fine grained similarity score of the N2 conversing posts based on relevance metrics, wherein N3 highest ranked conversing posts are selected as a list of conversing posts most relevant to the query post.
-
公开(公告)号:US20230169271A1
公开(公告)日:2023-06-01
申请号:US17644856
申请日:2021-12-17
Applicant: ADOBE INC.
Inventor: Shashank Shailabh , Madhur Panwar , Milan Aggarwal , Pinkesh Badjatiya , Simra Shahid , Nikaash Puri , S Sejal Naidu , Sharat Chandra Racha , Balaji Krishnamurthy , Ganesh Karbhari Palwe
IPC: G06F40/289 , G06F40/40 , G06F40/30
CPC classification number: G06F40/289 , G06F40/40 , G06F40/30
Abstract: Systems and methods for topic modeling are described. The systems and methods include encoding words of a document using an embedding matrix to obtain word embeddings for the document. The words of the document comprise a subset of words in a vocabulary, and the embedding matrix is trained as part of a topic attention network based on a plurality of topics. The systems and methods further include encoding a topic-word distribution matrix using the embedding matrix to obtain a topic embedding matrix. The topic-word distribution matrix represents relationships between the plurality of topics and the words of the vocabulary. The systems and methods further include computing a topic context matrix based on the topic embedding matrix and the word embeddings and identifying a topic for the document based on the topic context matrix.
-
-