摘要:
Software for online active learning receives content posted to an online stream at a website. The software converts the content into an elemental representation and inputs the elemental representation into a probit model to obtain a predictive probability that the content is abusive. The software also calculates an importance weight based on the elemental representation. And the software updates the probit model using the content, the importance weight, and an acquired label if a condition is met. The condition depends on an instrumental distribution. The software removes the content from the online stream if a condition is met. The condition depends on the predictive probability, if an acquired label is unavailable.
摘要:
Embodiments are directed towards clustering cookies for identifying unique mobile devices for associating activities over a network with a given mobile device. The cookies are clustered based on a Bayes Factor similarity model that is trained from cookie features of known mobile devices. The clusters may be used to determine the number of unique mobile devices that access a website. The clusters may also be used to provide targeted content to each unique mobile device.
摘要:
Methods and systems for presenting content such as articles based on utility are provided. In one embodiment, a plurality of articles are determined, each article in the plurality of articles including article content and a corresponding preview icon, the preview icon defining a link to the corresponding article content when presented. For each article in the plurality of articles, a user experience utility value is determined. And for each article in the plurality of articles, an economic utility value is also determined. A ranked order of the articles is determined based upon each article's user experience utility value and economic utility value. And a portion of the preview icons of the articles are presented on a graphical display page in a priority orientation based on the ranked order of the articles.
摘要:
The present invention is directed generally to providing systems and methods for data analysis. More specifically, embodiments may provide system(s) and method(s) including dynamic user modeling techniques to capture the relational and dynamic patterns of information content and/or users' or entities' interests. Various embodiments may include system(s) and method(s) that are based on, for example, the past history of content semantics, temporal changes, and/or user community relationship. Various embodiments may include modeling and/or analysis of the dynamic nature of an item of interest's value to a user(s)/entity(ies) over time. The dynamic factors may be consider in any manner, such as, individually or combined, sequentially or simultaneously, etc. Further, some embodiments may include, for example, system(s) and method(s) relating to analyzing data to capture user/entity interests and/or characteristics, consider content semantics and evolutionary information, and/or using community relationships of users/entities to thereby analyze information and provide dynamic conclusion(s) (e.g., recommendation(s)).
摘要:
The invention is directed generally to providing methods and systems for trend extraction and analysis. Embodiments include methods and systems for trend extraction and analysis of information extracted from dynamically changing data included in computer systems and/or networks. Various exemplary embodiments are provided that may generate characteristic indicators for trend(s) and/or distribution(s) for one or more data sources by use of, for example, temporal indicators derived through analysis of the difference in contribution separate portions of the data to the whole data set being considered, contribution of individual sources, and/or the interaction of the separate portions of the data with one another. Some exemplary approaches may include the use of singular value decomposition (SVD) and higher-order singular value decomposition (HOSVD) data extraction and analysis techniques. One use of these techniques is in the analysis of the dynamic data contained in Weblogs and the blogosphere.
摘要:
The present invention is directed to systems and methods for data and/or information analysis. The systems and methods may be directed to knowledge management and/or user modeling. In various embodiments, the systems and methods may utilize relational representations and/or evolutionary representations of information. For example, expertise information and/or evolutional information related to expertise information may be analyzed and representations presented indicating relationships and temporal evolution.
摘要:
Projector and camera arrangements are provided for use in electronic whiteboard systems. Specifically, the present invention provides projector and camera arrangements wherein the projector and camera share the same imaging optics. By sharing the same projection and camera optics, the distortions that affect the projection system are the same as those of the camera system. Thus, the calibration step required in conventional whiteboard systems where the projector and camera are separate, i.e., each having their own distinct optics and settings, is no longer needed. Further, the arrangements provided in accordance with the invention are self-aligning, even when lens distortions are large and even in the presence of strong perspective effects. The shared optics projector and camera arrangements of the invention also provide for dynamic zooming. In addition, various active and passive optical marker or lightpen designs are provided for use in electronic whiteboard systems.
摘要:
Improved document annotation techniques are provided. For example, in one aspect of the invention, a technique for determining an annotation for a document includes the following steps/operations. A user-proposed annotation to be associated with the document is obtained. Then, the technique automatically determines, in accordance with a knowledge base, whether the user-proposed annotation matches at least one allowed annotation.
摘要:
According to an example embodiment, a method comprises executing instructions by a special purpose computing apparatus to, for labeled source domain data having a plurality of original labels, generate a plurality of first predicted labels for the labeled source domain data using a target function, the target function determined by using a plurality of labels from labeled target domain data. The method further comprises executing instructions by the special purpose computing apparatus to apply a label relation function to the first predicted labels for the source domain data and the original labels for the source domain data to determine a plurality of weighting factors for the labeled source domain data. The method further comprises executing instructions by the special purpose computing apparatus to generate a new target function using the labeled target domain data, the labeled source domain data, and the weighting factors for the labeled source domain data, and evaluate a performance of the new target function to determine if there is a convergence.
摘要:
A method of generating a time managed challenge-response test is presented. The method identifies a geometric shape having a volume and generates an entry object of the time managed challenge-response test. The entry object is overlaid onto the geometric shape, such that the entry object is distributed over a surface of the geometric shape, and a portion of the entry object is hidden at any point in time. The geometric shape is rotated, which reveals the portion of the entry object that is hidden. A display region on a display is identified for rendering the geometric shape and the geometric shape is presented in the display region of the display.