Patent search ap:("Google LLC") AND inv:"Sanjiv Kumar" Page 4

31.

发明申请
Sampled Softmax with Random Fourier Features 有权

公开(公告)号：US20210019654A1

公开(公告)日：2021-01-21

申请号：US16931862

申请日：2020-07-17

Applicant: Google LLC

Inventor： Xinnan Yu , Ankit Singh Rawat , Jiecao Chen , Ananda Theertha Suresh , Sanjiv Kumar

IPC: G06N20/00 , G06F40/20 , G06K9/62 , G06F17/14 , G06F17/18

Abstract: Systems and methods for low bias negative sampling of classes according to the sampled softmax method are described herein. The systems and methods can include training a machine-learned model for classifying inputs into one or more classes of a plurality of classes, each of the plurality of classes having an associated class embedding in a plurality of class embeddings. The systems and methods can include selecting, by the one or more computing devices, one or more negative classes from the plurality of classes based at least in part on a probability distribution approximating a softmax distribution, wherein the probability distribution is determined based at least in part on a Random Fourier Features map.

32.

发明申请
LOCAL ORTHOGONAL DECOMPOSITION FOR MAXIMUM INNER PRODUCT SEARCH 审中-公开

公开(公告)号：US20200257668A1

公开(公告)日：2020-08-13

申请号：US16715620

申请日：2019-12-16

Applicant: GOOGLE LLC

Inventor： Xiang Wu , David Morris Simcha , Sanjiv Kumar , Ruiqi Guo

IPC: G06F16/22 , G06F16/27 , G06F16/2458 , G06F16/28 , G06F17/16 , G06F16/248

Abstract: Techniques of indexing a database and processing a query involve decomposing the residual term according to a projection matrix that is based on a given direction v. For example, for each database element of a partition, the residual for that database element is split into a component parallel to a given direction and a component perpendicular to that direction. The parallel component lies in a one-dimensional subspace spanned by the direction and may be efficiently quantized with a scalar quantization. The perpendicular component is quantized using multiscale quantization techniques. The quantized residual components and the center elements of each partition define the indexed database. Upon receipt of a query from a user, the inner products of q with the residual may be computed efficiently using the quantized residual components. From these inner products, the database elements that are most similar to the query are selected and returned to the user.

33.

发明授权
Decreasing neural network inference times using softmax approximation 审中-公开

公开(公告)号：US10671909B2

公开(公告)日：2020-06-02

申请号：US16586702

申请日：2019-09-27

Applicant: Google LLC

Inventor： Yang Li , Sanjiv Kumar , Pei-Hung Chen , Si Si , Cho-Jui Hsieh

IPC: G06N3/04 , G06F17/16 , G06F17/18 , G06K9/62

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for decreasing neural network inference times using softmax approximation. One of the methods includes maintaining data specifying a respective softmax weight vector for each output in a vocabulary of possible neural network outputs; receiving a neural network input; processing the neural network input using one or more initial neural network layers to generate a context vector for the neural network input; and generating an approximate score distribution over the vocabulary of possible neural network outputs for the neural network input, comprising: processing the context vector using a screening model configured to predict a proper subset of the vocabulary for the context input; and generating a respective logit for each output that is in the proper subset, comprising applying the softmax weight vector for the output to the context vector.

34.

发明申请
COMPARING EXTRACTED CARD DATA USING CONTINUOUS SCANNING 审中-公开

公开(公告)号：US20190108415A1

公开(公告)日：2019-04-11

申请号：US16210973

申请日：2018-12-05

Applicant: GOOGLE LLC

Inventor： Sanjiv Kumar , Henry Allan Rowley , Xiaohang Wang , Yakov Okshtein , Farhan Shamsi , Alessandro Bissacco

IPC: G06K9/62 , G06K9/78 , G06K9/34 , G06K9/00 , G06Q20/32 , G06K9/22 , G06K9/20 , G06K9/03 , G06K9/18 , G06Q20/40 , G06Q20/34 , G06T17/00

Abstract: Comparing extracted card data from a continuous scan comprises receiving, by one or more computing devices, a digital scan of a card; obtaining a plurality of images of the card from the digital scan of the physical card; performing an optical character recognition algorithm on each of the plurality of images; comparing results of the application of the optical character recognition algorithm for each of the plurality of images; determining if a configured threshold of the results for each of the plurality of images match each other; and verifying the results when the results for each of the plurality of images match each other. Threshold confidence level for the extracted card data can be employed to determine the accuracy of the extraction. Data is further extracted from blended images and three-dimensional models of the card. Embossed text and holograms in the images may be used to prevent fraud.

35.

发明申请
RELATIVE POSITION BIASES IN ATTENTION NEURAL NETWORKS USING FUNCTIONAL INTERPOLATION 有权

公开(公告)号：US20250111210A1

公开(公告)日：2025-04-03

申请号：US18900531

申请日：2024-09-27

Applicant: Google LLC

Inventor： Chong You , Guru Guruganesh , Joshua Timothy Ainslie , Manzil Zaheer , Sanjiv Kumar , Santiago Ontañón , Shanda Li , Venkata Sesha Pavana Srinadh Bhojanapalli , Sumit Sanghai

IPC: G06N3/0475

Abstract: Systems and methods for processing inputs using attention neural networks. In particular, one or more of the attention layers within the attention neural network compute relative position biases using functional interpolation.

36.

发明授权
Tuning approximate nearest neighbor search engines for speed-recall tradeoffs via Lagrange multiplier methods 有权

公开(公告)号：US12235845B2

公开(公告)日：2025-02-25

申请号：US18474907

申请日：2023-09-26

Applicant: Google LLC

Inventor： Philip Wenjie Sun , Ruiqi Guo , Sanjiv Kumar

IPC: G06F16/245 , G06F16/2453 , G06F16/95 , G06F16/953

Abstract: Example quantization-based approximate nearest neighbors (ANN) search methods and systems (e.g., search engines) are tuned to perform at the speed-recall pareto frontier. With a desired search cost or recall as input, embodiments employ Lagrangian-based methods to perform constrained optimization on theoretically-grounded search cost and recall models. The resulting tunings, when paired with the efficient quantization-based ANN implementation of the embodiments, exhibit excellent performance on standard benchmarks while requiring minimal tuning or configuration complexity.

37.

发明授权
Sampled softmax with Random Fourier features 有权

公开(公告)号：US12205005B2

公开(公告)日：2025-01-21

申请号：US16931862

申请日：2020-07-17

Applicant: Google LLC

Inventor： Xinnan Yu , Ankit Singh Rawat , Jiecao Chen , Ananda Theertha Suresh , Sanjiv Kumar

IPC: G06N3/08 , G06F17/14 , G06F17/18 , G06F18/2431 , G06F40/20 , G06N3/084 , G06N20/00 , G06N20/10 , G06V10/77

Abstract: Systems and methods for low bias negative sampling of classes according to the sampled softmax method are described herein. The systems and methods can include training a machine-learned model for classifying inputs into one or more classes of a plurality of classes, each of the plurality of classes having an associated class embedding in a plurality of class embeddings. The systems and methods can include selecting, by the one or more computing devices, one or more negative classes from the plurality of classes based at least in part on a probability distribution approximating a softmax distribution, wherein the probability distribution is determined based at least in part on a Random Fourier Features map.

38.

发明公开
DYNAMIC SELECTION FROM AMONG MULTIPLE CANDIDATE GENERATIVE MODELS WITH DIFFERING COMPUTATIONAL EFFICIENCIES 审中-公开

公开(公告)号：US20240311405A1

公开(公告)日：2024-09-19

申请号：US18337316

申请日：2023-06-19

Applicant: GOOGLE LLC

Inventor： Seungyeon Kim , Ankit Singh Rawat , Wittawat Jitkrittum , Hari Narasimhan , Sashank Reddi , Neha Gupta , Srinadh Bhojanapalli , Aditya Menon , Manzil Zaheer , Tal Schuster , Sanjiv Kumar , Toby Boyd , Zhifeng Chen , Emanuel Taropa , Vikram Kasivajhula , Trevor Strohman , Martin Baeuml , Leif Schelin , Yanping Huang

IPC: G06F16/332

CPC classification number: G06F16/3329

Abstract: Implementations disclose selecting, in response to receiving a request and from among multiple candidate generative models (e.g., multiple candidate large language models (LLMs)) with differing computational efficiencies, a particular generative model to utilize in generating a response to the request. Those implementations reduce latency and/or conserve computational resource(s) through selection, for various requests, of a more computationally efficient generative model for utilization in lieu of a less computationally efficient generative model. Further, those implementations seek to achieve such benefits, through utilization of more computationally efficient generative models, while also still selectively utilizing less computationally efficient generative models for certain requests to mitigate occurrences of a generated response being inaccurate and/or under-specified. This, in turn, can mitigate occurrences of computational and/or network inefficiencies that result from a user issuing a follow-up request to cure the inaccuracies and/or under-specification of a generated response.

39.

发明公开
Systems and Methods for Weighted Quantization 审中-公开

公开(公告)号：US20240061889A1

公开(公告)日：2024-02-22

申请号：US18456688

申请日：2023-08-28

Applicant: Google LLC

Inventor： Ruiqi Guo , David Simcha , Quan Geng , Felix Chern , Sanjiv Kumar , Xiang Wu

IPC: G06F16/906 , G06F16/2457 , G06F16/25 , H03M7/30

CPC classification number: G06F16/906 , G06F16/24578 , G06F16/258 , H03M7/30

Abstract: Generally, the present disclosure is directed to systems and methods of quantizing a database with respect to a novel loss or quantization error function which applies a weight to an error measurement of quantized elements respectively corresponding to the datapoints in the database. The weight is determined based on the magnitude of an inner product between the respective datapoints and a query compared therewith. In contrast to previous work, embodiments of the proposed loss function are responsive to the expected magnitude of an inner product between the respective datapoints and a query compared therewith and can prioritize error reduction for higher-ranked pairings of the query and the datapoints. Thus, the systems and methods of the present disclosure provide solutions to some of the problems with traditional quantization approaches, which regard all error as equally impactful.

40.

发明授权
Multiscale quantization for fast similarity search 有权

公开(公告)号：US11874866B2

公开(公告)日：2024-01-16

申请号：US18081376

申请日：2022-12-14

Applicant: Google LLC

Inventor： Xiang Wu , David Simcha , Daniel Holtmann-Rice , Sanjiv Kumar , Ananda Theertha Suresh , Ruiqi Guo , Xinnan Yu

IPC: G06F16/33 , G06F16/31 , G06N20/00

CPC classification number: G06F16/3347 , G06F16/313 , G06F16/319 , G06N20/00

Abstract: The present disclosure provides systems and methods that include or otherwise leverage use of a multiscale quantization model that is configured to provide a quantized dataset. In particular, the multiscale quantization model can receive and perform vector quantization of a first dataset. The multiscale quantization model can generate a residual dataset based at least in part on a result of the vector quantization. The multiscale quantization model can apply a rotation matrix to the residual dataset to generate a rotated residual dataset that includes a plurality of rotated residuals. The multiscale quantization model can perform reparameterization of each rotated residual in the rotated residual dataset into a direction component and a scale component. The multiscale quantization model can perform product quantization of the direction components of the plurality of rotated residuals, and perform scalar quantization of the scale components of the plurality of rotated residuals.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification