-
Publication No.: US09959323B2
Publication date: 2018-05-01
Application No.: US15064662
Filing date: 2016-03-09
Applicant: International Business Machines Corporation
Inventor: Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski , Tomasz Stradomski
IPC: G06F17/30
CPC classification number: G06F17/30536 , G06F17/30424 , G06F17/30864
Abstract: The invention relates to a computer-implemented method for processing a query in a database, the query comprising a search value. The database comprises a plurality of datasets, the datasets comprising entries, wherein distance statistics are assigned to the datasets. The distance statistics describe the minimum and maximum distance between the values of the entries of a dataset of the plurality of datasets and a reference value. The method comprises determining the distance between the search value and the reference value, said determination resulting in a search distance, determining a subset of datasets from the plurality of datasets for which the search distance is within the limits given by the minimum and maximum distances described by the respective distance statistics, and searching for the search value in the subset of datasets.
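The pruning step described in this abstract can be illustrated with a minimal sketch. It assumes numeric entries and uses the absolute difference as the distance; the abstract does not fix a distance function, and the names `Dataset`, `build_dataset`, and `search` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    entries: list      # values stored in this dataset
    min_dist: float    # smallest distance from any entry to the reference value
    max_dist: float    # largest distance from any entry to the reference value


def build_dataset(entries, reference):
    # Assumption: distance is the absolute difference between two numbers.
    dists = [abs(v - reference) for v in entries]
    return Dataset(list(entries), min(dists), max(dists))


def search(datasets, reference, search_value):
    # Distance between the search value and the reference value.
    search_dist = abs(search_value - reference)
    # Keep only datasets whose [min_dist, max_dist] interval contains it.
    candidates = [d for d in datasets if d.min_dist <= search_dist <= d.max_dist]
    # Scan the surviving datasets for the search value itself.
    hits = [d for d in candidates if search_value in d.entries]
    return candidates, hits
```

A dataset that fails the interval test cannot contain the search value, so it is skipped without being scanned. A dataset that passes may still not contain it, which is why the final scan is needed.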
-
Publication No.: US09928624B2
Publication date: 2018-03-27
Application No.: US14513260
Filing date: 2014-10-14
Applicant: International Business Machines Corporation
Inventor: Raymond S. Glover
CPC classification number: G06T11/206 , G06F17/30536 , G06F17/30554 , G06F17/30958 , G06F17/30994 , G06N5/02 , G06N5/022 , G06N7/00 , G06T13/80
Abstract: One or more processors receive a dataset that includes a plurality of nodes. One or more processors identify relationships between a plurality of interacting nodes within the dataset. One or more processors determine relationship strength values between a plurality of interacting node pairs within the dataset. One or more processors generate a graphical representation that represents the relationship strength values between the plurality of interacting nodes within the dataset. Interacting node pairs are connected by edges and the edges have a length that correlates with the relationship strength value between the interacting node pairs.
-
Publication No.: US09846953B2
Publication date: 2017-12-19
Application No.: US15387838
Filing date: 2016-12-22
Applicant: International Business Machines Corporation
Inventor: Raymond S. Glover
CPC classification number: G06T11/206 , G06F17/30536 , G06F17/30554 , G06F17/30958 , G06F17/30994 , G06N5/02 , G06N5/022 , G06N7/00 , G06T13/80
Abstract: One or more processors receive a dataset that includes a plurality of nodes. One or more processors identify relationships between a plurality of interacting nodes within the dataset. One or more processors determine relationship strength values between a plurality of interacting node pairs within the dataset. One or more processors generate a graphical representation that represents the relationship strength values between the plurality of interacting nodes within the dataset. Interacting node pairs are connected by edges and the edges have a length that correlates with the relationship strength value between the interacting node pairs.
-
Publication No.: US09787787B2
Publication date: 2017-10-10
Application No.: US15176449
Filing date: 2016-06-08
Applicant: Adobe Systems Incorporated
Inventor: Joao Manuel Pinto Filipe , Pleun Christiaan Bel , Tiago Cipriano Pires , Zoltan Papp
CPC classification number: H04L67/22 , G06F17/30306 , G06F17/30536 , H04L43/062 , H04L67/02
Abstract: A method and a system for processing measurement data for website statistics are provided. The measurement data is processed in parallel bucket writers and stored in buckets. Upon receiving a report request the buckets are processed in parallel bucket queriers to obtain report data.
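The write and query phases named in this abstract might look roughly like the sketch below. Everything beyond "parallel bucket writers" and "parallel bucket queriers" is an assumption: the record fields `visitor_id` and `page`, the modulo partitioning, the page-view report, and all function names are hypothetical, and in-memory lists stand in for real bucket storage.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def partition(measurements, num_buckets):
    # Assumption: route each measurement to a bucket by visitor id.
    parts = [[] for _ in range(num_buckets)]
    for m in measurements:
        parts[m["visitor_id"] % num_buckets].append(m)
    return parts


def write_bucket(part):
    # Stand-in for a bucket writer; a real one would persist the data.
    return list(part)


def query_bucket(bucket):
    # Stand-in for a bucket querier: partial page-view counts for one bucket.
    return Counter(m["page"] for m in bucket)


def store(measurements, num_buckets=4):
    # Run one writer per bucket in parallel.
    with ThreadPoolExecutor() as pool:
        return list(pool.map(write_bucket, partition(measurements, num_buckets)))


def page_view_report(buckets):
    # Run one querier per bucket in parallel, then merge the partial results.
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(query_bucket, buckets))
    report = Counter()
    for partial in partials:
        report.update(partial)
    return report
```

Because each bucket is queried independently, the report only needs a cheap merge step after the parallel phase.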
-
Publication No.: US20170228472A1
Publication date: 2017-08-10
Application No.: US15494874
Filing date: 2017-04-24
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Michal Bodziony , Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski , Tomasz Stradomski
IPC: G06F17/30
CPC classification number: G06F17/30442 , G06F17/30536
Abstract: Software for processing a database query that includes: (i) receiving a query of a database including a search value; (ii) determining a distance between the search value and at least one reference value; (iii) determining a maximum distance from the search value to be used in searching a plurality of datasets of the database, wherein the maximum distance from the search value defines a search range and is based, at least in part, on the determined distance between the search value and the at least one reference value; (iv) determining a subset of datasets from the plurality of datasets that includes datasets for which a data range with respect to each reference value overlaps with the search range; and (v) performing approximate string matching for the search value on the subset of datasets.
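Steps (ii) to (v) can be sketched as follows, assuming Levenshtein edit distance as the metric (the abstract does not name one). By the triangle inequality, any entry within `max_dist` of the search value lies at a distance between `d - max_dist` and `d + max_dist` from the reference, where `d` is the distance from the search value to the reference. As a simplification, `max_dist` is supplied by the caller here rather than derived from `d`; all function names are hypothetical.

```python
def levenshtein(a, b):
    # Classic dynamic-programming edit distance, one row at a time.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1,                  # deletion
                           cur[j - 1] + 1,               # insertion
                           prev[j - 1] + (ca != cb)))    # substitution
        prev = cur
    return prev[-1]


def data_range(entries, reference):
    # Per-dataset statistics: min and max distance of entries to the reference.
    dists = [levenshtein(e, reference) for e in entries]
    return min(dists), max(dists)


def fuzzy_search(datasets, reference, search_value, max_dist):
    # datasets: list of (entries, (dmin, dmax)) pairs built with data_range.
    d = levenshtein(search_value, reference)
    lo, hi = d - max_dist, d + max_dist   # search range around d
    matches, scanned = [], 0
    for entries, (dmin, dmax) in datasets:
        if dmax < lo or dmin > hi:
            continue                      # ranges do not overlap: skip dataset
        scanned += 1
        matches.extend(e for e in entries
                       if levenshtein(e, search_value) <= max_dist)
    return matches, scanned
```

With the empty string as reference, the distance to the reference is simply the string length, so datasets holding only much longer or much shorter strings are skipped without computing any edit distances against the search value.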
-
Publication No.: US20170206185A1
Publication date: 2017-07-20
Application No.: US15476899
Filing date: 2017-03-31
Applicant: Splunk Inc.
Inventor: Steve Yu Zhang
CPC classification number: G06F17/18 , G06F7/22 , G06F7/483 , G06F7/544 , G06F17/30536 , G06K9/6222
Abstract: A method, system, and processor-readable storage medium are directed towards calculating approximate order statistics on a collection of real numbers. In one embodiment, the collection of real numbers is processed to create a digest comprising a hierarchy of buckets. Each bucket is assigned a real number N having P digits of precision and ordinality O. The hierarchy is defined by grouping buckets into levels, where each level contains all buckets of a given ordinality. Each individual bucket in the hierarchy defines a range of numbers: all numbers that, after being truncated to that bucket's P digits of precision, are equal to that bucket's N. Each bucket additionally maintains a count of how many numbers have fallen within that bucket's range. Approximate order statistics may then be calculated by traversing the hierarchy and performing an operation on some or all of the ranges and counts associated with each bucket.
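The bucket-and-count idea can be illustrated with a deliberately simplified, single-level sketch. It omits the hierarchy and ordinality entirely, and it interprets "P digits of precision" as P decimal places, which is an assumption. The names `build_digest` and `approx_quantile` are hypothetical.

```python
import math
from collections import Counter


def build_digest(values, places):
    # Each bucket key is the value truncated to `places` decimal places,
    # stored as an integer; the Counter holds how many values fell in it.
    scale = 10 ** places
    return Counter(math.trunc(v * scale) for v in values)


def approx_quantile(digest, places, q):
    # Walk buckets in ascending order until the cumulative count
    # reaches the requested rank, then report that bucket's value.
    total = sum(digest.values())
    target = q * total
    seen = 0
    for key in sorted(digest):
        seen += digest[key]
        if seen >= target:
            return key / 10 ** places
    raise ValueError("empty digest")
```

The answer is only as precise as the bucket width: with `places=0`, every value between 2 and 3 is reported as 2.0. Binary floating point can also misplace values that sit on a bucket boundary after scaling, so a production version would need a more careful truncation.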
-
Publication No.: US09710815B2
Publication date: 2017-07-18
Application No.: US12010810
Filing date: 2008-01-30
Applicant: James W. MacIntyre , David Scherer , David Alan Rosenthal
Inventor: James W. MacIntyre , David Scherer , David Alan Rosenthal
CPC classification number: G06Q30/02 , G06F17/30536 , G06F17/30554 , G06Q10/063 , G06Q10/0637 , G06Q10/0639 , G06Q10/06393 , G06Q30/0201 , G06T11/206
Abstract: Systems and methods for processing and reporting information and data, such as business information, and more particularly, to systems, software, hardware, products, and processes for use by businesses, individuals and other organizations to collect, process, distribute, analyze and visualize information, including, but not limited to, business intelligence, data visualization, data warehousing, and data mining. Real-time monitoring of web site interactions allows users to modify and fine-tune their websites to maximize value realized.
-
Publication No.: US09697274B2
Publication date: 2017-07-04
Application No.: US14141635
Filing date: 2013-12-27
Applicant: International Business Machines Corporation
Inventor: Andrey Balmin , Vuk Ercegovac , Peter J. Haas , Liping Peng , John Sismanis
IPC: G06F17/30
CPC classification number: G06F17/30598 , G06F17/30324 , G06F17/30486 , G06F17/3053 , G06F17/30536 , G06F17/30867
Abstract: Stratified sampling of a plurality of records is performed. A plurality of records are partitioned into a plurality of splits, wherein each split includes at least a portion of the plurality of records. At least one split of the plurality of splits is provided to a mapper. The mapper assigns at least a portion of the records of the at least one split to a group based on a strata of the assigned records, and filters the records of the group based on a comparison of the weights of the records to a local threshold of the mapper. The mapper updates the local threshold of the mapper by communicating with a coordinator. The mapper shuffles the group to a reducer, where the reducer filters the records of the group based on the weights of the records. The reducer provides a stratified sampling of the plurality of records based on the group.
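One way to picture the mapper and reducer filtering is a bottom-k sketch: give every record a random weight and keep, per stratum, the k records with the smallest weights. This is a stand-in, not the patented method. The coordinator step is omitted, the "local threshold" is implicit (the k-th smallest weight seen by a mapper), and the function names and the `stratum_of` callback are hypothetical.

```python
import heapq
from collections import defaultdict


def mapper(split, stratum_of, k, rng):
    # Group records by stratum and attach a random weight to each.
    groups = defaultdict(list)
    for rec in split:
        groups[stratum_of(rec)].append((rng.random(), rec))
    # Local filter: a record outside the k smallest weights of its stratum
    # in this split can never be among the k smallest overall.
    return {s: heapq.nsmallest(k, pairs, key=lambda p: p[0])
            for s, pairs in groups.items()}


def reducer(mapper_outputs, k):
    # Merge the survivors from all mappers, stratum by stratum.
    merged = defaultdict(list)
    for out in mapper_outputs:
        for s, pairs in out.items():
            merged[s].extend(pairs)
    # Global filter: keep the k smallest weights per stratum.
    return {s: [rec for _, rec in heapq.nsmallest(k, pairs, key=lambda p: p[0])]
            for s, pairs in merged.items()}
```

Since the weights are independent and uniform, the k smallest per stratum form a simple random sample of that stratum, and each mapper ships at most k records per stratum to the reducer.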
-
Publication No.: US20170154082A1
Publication date: 2017-06-01
Application No.: US15209544
Filing date: 2016-07-13
Applicant: PALANTIR TECHNOLOGIES INC.
Inventor: Jean-Baptiste MICHEL , Alan HAMPTON , Ananya SHUKLA , Ishwar SIVAKUMAR
CPC classification number: G06F17/30536 , G06F17/30312 , G06F17/30699 , G06N7/005
Abstract: Systems and methods for using disparate data sets to attribute data to an entity are disclosed. Disparate data sets can be obtained from a variety of data sources. The disclosed systems and methods can obtain a first and second data set. Trajectories can represent multiple data records in a data set associated with an entity. Trajectories from the obtained data sets can be used to associate data stored among the various data sets. The association can be based on the agreement between the trajectories. The associated data records can further be used to associate the entities related to the associated data records.
-
Publication No.: US20170139907A1
Publication date: 2017-05-18
Application No.: US15360592
Filing date: 2016-11-23
Applicant: Veveo, Inc.
Inventor: Murali Aravamudan , Ajit Rajasekharan , Kajamalai G. Ramakrishnan
IPC: G06F17/30
CPC classification number: G06F17/3005 , G06F17/30029 , G06F17/30035 , G06F17/30442 , G06F17/30522 , G06F17/3053 , G06F17/30536 , G06F17/30554 , G06F17/30595 , G06F17/30699 , G06F17/30702 , G06F17/30752 , G06F17/30864 , G06F17/30867 , H04N21/44222 , H04N21/4828 , Y10S707/99933 , Y10S707/99934 , Y10S707/99937 , Y10S707/99943 , Y10S707/99945
Abstract: A method of selecting and presenting content based on learned user preferences is provided. The method includes providing a content system including a set of content items organized by genre characterizing the content items, and wherein the set of content items contains microgenre metadata further characterizing the content items. The method also includes receiving search input from the user for identifying desired content items and, in response, presenting a subset of content items to the user. The method further includes receiving content item selection actions from the user and analyzing the microgenre metadata within the selected content items to learn the preferred microgenres of the user. The method includes, in response to receiving subsequent user search input, selecting and presenting content items in an order that portrays as relatively more relevant those content items containing microgenre metadata that more closely match the learned microgenre preferences of the user.
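The learn-then-rank loop in this abstract can be sketched in a few lines. The scoring rule (count how often each microgenre was selected, then sum those counts per result) is an assumption chosen for illustration, as are the item layout with a `microgenres` list and the function names.

```python
from collections import Counter


def learn_preferences(selected_items):
    # Count how often each microgenre appears in the user's selections.
    prefs = Counter()
    for item in selected_items:
        prefs.update(item["microgenres"])
    return prefs


def rank(results, prefs):
    # Score each result by the learned counts of its microgenres;
    # microgenres never selected contribute 0.
    def score(item):
        return sum(prefs[g] for g in item["microgenres"])
    # sorted() is stable, so ties keep their original search order.
    return sorted(results, key=score, reverse=True)
```

Items whose microgenres match past selections more closely move toward the top of later search results, which is the ordering behavior the abstract describes.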