-
公开(公告)号:US12153588B2
公开(公告)日:2024-11-26
申请号:US18167724
申请日:2023-02-10
Applicant: ROKU, INC.
Inventor: Peter Martigny , Fedor Bartosh , Danish Shaikh , Vinh Nguyen , Manasi Deshmukh , Ratul Ray , Nitish Aggarwal , Srimaruti Manoj Nimmagadda , Kapil Kumar , Sameer Girolkar
IPC: G06F16/2457 , G06F16/242 , G06F16/9535
Abstract: A content retrieval system may receive a query associated with a plurality of content items in a repository. For each content item of the plurality of content items: a respective first and second similarity score may be generated based on a similarity between embeddings indicative of a first and second data type generated from the query and for the content item; and a respective normalized similarity score may be generated based on a combination of the respective first and second similarity scores. A set of content items with respective normalized similarity scores that satisfy a similarity score threshold may be identified. An exact-match (lexical) search may yield respective mapping scores for content items that may also be ranked. An output indicative of content items that are identified in the set of content items with high-ranking similarity scores and identified in the set of content items with high-ranking mapping scores.
-
公开(公告)号:US20250036638A1
公开(公告)日:2025-01-30
申请号:US18911887
申请日:2024-10-10
Applicant: ROKU, INC.
Inventor: Peter Martigny , Fedor Bartosh , Danish Shaikh , Vinh Nguyen , Manasi Deshmukh , Ratul Ray , Nitish Aggarwal , Srimaruti Manoj Nimmagadda , Kapil Kumar , Sameer Girolkar
IPC: G06F16/2457 , G06F16/242 , G06F16/9535
Abstract: A content retrieval system may receive a query associated with a plurality of content items in a repository. For each content item of the plurality of content items: a respective first and second similarity score may be generated based on a similarity between embeddings indicative of a first and second data type generated from the query and for the content item; and a respective normalized similarity score may be generated based on a combination of the respective first and second similarity scores. A set of content items with respective normalized similarity scores that satisfy a similarity score threshold may be identified. An exact-match (lexical) search may yield respective mapping scores for content items that may also be ranked. An output indicative of content items that are identified in the set of content items with high-ranking similarity scores and identified in the set of content items with high-ranking mapping scores.
-