Abstract:
Embodiments are directed to a system, a computer program product, and a method for identification of linguistically related elements, and more specifically to prediction of a linguistically related element. A linguistic algorithm forms a cluster representation of corpus entries. A linguistic term is identified and applied to the cluster representation to identify proximally related linguistic terms. Associative relationships between the proximally related terms and category metadata are iteratively investigated. One or more linguistic terms related across two or more metadata categories are identified and designated as the linguistically related element.
Abstract:
Methods, systems and computer program products for data fetching in a networked computing environment. In one embodiment, the method comprises receiving a request from an analytical node for a set of data for a defined job, and identifying in networked storage a subset of the data for the job. The subset of data is loaded to the analytical node based on the sequence in which the data are projected to be accessed in the job. In an embodiment, the request includes a specification for the job, and the specification is analyzed to identify the subset of data. In one embodiment, the subset of data is identified by identifying another job having a relationship to the defined job, and identifying the data used for that other job. In an embodiment, the networked computing environment is a cloud computing environment, and the defined job is an analytics job.
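The loading step can be illustrated with a minimal sketch. It assumes the projected access sequence is already available as a list of block identifiers and that networked storage is a simple mapping; `plan_prefetch`, `projected_access`, and `available` are hypothetical names, not part of the abstract.

```python
from typing import Dict, List


def plan_prefetch(projected_access: List[str], available: Dict[str, int]) -> List[str]:
    """Return the block IDs to load, ordered by first projected access.

    projected_access: block IDs in the order the job is expected to read them
                      (assumed to come from analyzing the job specification).
    available:        block ID -> size in networked storage (illustrative).
    """
    seen = set()
    order = []
    for block_id in projected_access:
        # Skip blocks that storage does not hold and blocks already scheduled.
        if block_id in available and block_id not in seen:
            seen.add(block_id)
            order.append(block_id)
    return order


storage = {"a": 10, "b": 20, "c": 5, "d": 7}
job_access = ["c", "a", "c", "x", "b"]
print(plan_prefetch(job_access, storage))
```

Block "d" is never loaded because the job is not projected to read it, which is the sense in which only a subset of the data is fetched.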
Abstract:
A computer-implemented method of executing a virtual agent bot includes receiving, via a computer server, at least one input query from a user, and analyzing the at least one input query to extract at least one input term. The method further comprises determining a semantic skill set of the virtual agent bot among a plurality of different candidate skill sets based at least in part on the at least one input term; and invoking the virtual agent bot to provide a semantic topic response corresponding to the semantic skill set to provide an answer to the at least one input query.
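One simple way to picture the skill-set determination is keyword overlap between the extracted input terms and each candidate skill set. The sketch below assumes that scoring rule; the abstract does not say how the determination is made, and `extract_terms`, `select_skill_set`, and the sample skill sets are hypothetical.

```python
import re
from typing import Dict, Optional, Set


def extract_terms(query: str) -> Set[str]:
    """Lowercase the query and split it into alphanumeric terms."""
    return set(re.findall(r"[a-z0-9]+", query.lower()))


def select_skill_set(query: str, skill_sets: Dict[str, Set[str]]) -> Optional[str]:
    """Pick the candidate skill set sharing the most terms with the query.

    Assumption: overlap count stands in for whatever semantic scoring
    a real system would use. Returns None when nothing overlaps.
    """
    terms = extract_terms(query)
    best, best_score = None, 0
    for name, keywords in skill_sets.items():
        score = len(terms & keywords)
        if score > best_score:
            best, best_score = name, score
    return best


skills = {
    "billing": {"invoice", "refund", "payment"},
    "account": {"password", "login", "reset"},
}
print(select_skill_set("How do I reset my password?", skills))
```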
Abstract:
An approach is provided that receives a question at a question answering (QA) system. The question includes a plurality of words. The approach operates by calculating weights that correspond to search terms derived from the plurality of words. The search terms include the words themselves and may include terms that are one or more sequences of adjacent words included in the question. Based on the calculated weights and the words in the question, the approach generates a query that is used to search a corpus managed by the QA system, and the search returns one or more search results.
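As a rough illustration, the search terms can be generated as single words plus sequences of adjacent words, each with a weight. The abstract does not state how weights are calculated, so the sketch assumes a weight equal to the number of words in the term; `build_weighted_terms` and `format_query` are hypothetical names, and the boost-style query string is only one possible output format.

```python
import re
from typing import Dict


def build_weighted_terms(question: str, max_n: int = 2) -> Dict[str, float]:
    """Map each search term (word or run of adjacent words) to a weight.

    Assumption: longer runs of adjacent words get a higher weight.
    """
    words = re.findall(r"\w+", question.lower())
    terms: Dict[str, float] = {}
    for n in range(1, max_n + 1):
        for i in range(len(words) - n + 1):
            terms[" ".join(words[i:i + n])] = float(n)
    return terms


def format_query(terms: Dict[str, float]) -> str:
    """Render the weighted terms as a boosted OR query string."""
    return " OR ".join(f'"{term}"^{weight:g}' for term, weight in terms.items())


weights = build_weighted_terms("capital of France")
print(format_query(weights))
```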
Abstract:
Automatically associating information technology resource patterns with specific information technology products by receiving a set of data about information technology assets, matching a subset of that data to a pattern in a set of patterns, determining that the subset of the data represents a product associated with that pattern, reporting this determination, receiving feedback on the accuracy of the determination, and updating pattern set information in response to that feedback.
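The match, report, and feedback loop can be sketched as follows. The sketch assumes a pattern is a set of required asset attributes and that feedback is stored as simple confirm and reject counters; `Pattern`, `match_patterns`, `record_feedback`, and the sample product name are hypothetical.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Set


@dataclass
class Pattern:
    product: str              # product the pattern is associated with
    required: FrozenSet[str]  # asset attributes that must all be present
    confirmed: int = 0        # feedback: determinations judged correct
    rejected: int = 0         # feedback: determinations judged wrong


def match_patterns(asset_data: Set[str], patterns: List[Pattern]) -> List[Pattern]:
    """Return every pattern whose required attributes are a subset of the data."""
    return [p for p in patterns if p.required <= asset_data]


def record_feedback(pattern: Pattern, correct: bool) -> None:
    """Update the pattern set information from one piece of feedback."""
    if correct:
        pattern.confirmed += 1
    else:
        pattern.rejected += 1


patterns = [Pattern("ExampleDB", frozenset({"proc:exdbd", "port:5555"}))]
observed = {"proc:exdbd", "port:5555", "os:linux"}
for hit in match_patterns(observed, patterns):
    print("detected", hit.product)
    record_feedback(hit, correct=True)
```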
Abstract:
A computer program product for provisioning resources in a balanced data transfer pipeline may associate a first requested task with a first category, allocate a first resource of a data path to the first requested task based on the first resource corresponding to the first category, and allocate a second resource as part of the data path to the first requested task based on the second resource corresponding to the first category, wherein the first resource corresponds to a first segment of the data path and the second resource corresponds to a second segment of the data path.
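A minimal sketch of the per-segment allocation, assuming each segment of the data path offers a list of resources tagged with a category and that the first matching resource is chosen; `allocate_path` and the resource names are hypothetical.

```python
from typing import List, Tuple

Resource = Tuple[str, str]  # (resource name, category)


def allocate_path(task_category: str, segments: List[List[Resource]]) -> List[str]:
    """Pick one resource per segment whose category matches the task's category.

    Raises LookupError if some segment has no resource in that category.
    """
    path = []
    for index, resources in enumerate(segments):
        match = next((name for name, cat in resources if cat == task_category), None)
        if match is None:
            raise LookupError(f"segment {index} has no '{task_category}' resource")
        path.append(match)
    return path


segments = [
    [("nic-slow", "bulk"), ("nic-fast", "interactive")],    # first segment
    [("disk-hdd", "bulk"), ("disk-nvme", "interactive")],   # second segment
]
print(allocate_path("interactive", segments))
```

Matching every segment to the same category is what keeps the path balanced in this sketch: a fast first segment is never paired with a slow second one.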
Abstract:
Techniques for license reconciliation with multiple license types and restrictions. A method includes grouping a collection of multiple software installation instances, a collection of multiple hardware devices and a collection of multiple software licenses into multiple clusters, generating a reconciliation matrix for each cluster, wherein each row in the reconciliation matrix represents a software installation instance or a hardware device, each column in the reconciliation matrix represents a license type and/or an individual license, and each cell in the reconciliation matrix represents a license requirement and applicability of each software installation instance or hardware device, solving each reconciliation matrix, and generating a license reconciliation plan based on the solved reconciliation matrices.
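The reconciliation matrix for one cluster can be pictured as a grid of applicability flags. The sketch below assumes boolean cells, one column per individual license, and a greedy first-fit solver; the abstract does not say how the matrices are solved, and `build_matrix`, `greedy_solve`, and the sample records are hypothetical.

```python
from typing import Callable, Dict, List, Optional


def build_matrix(rows: List[Dict], licenses: List[Dict],
                 applies: Callable[[Dict, Dict], bool]) -> List[List[bool]]:
    """One row per installation or device, one column per license."""
    return [[applies(row, lic) for lic in licenses] for row in rows]


def greedy_solve(matrix: List[List[bool]], capacity: List[int]) -> List[Optional[int]]:
    """Assign each row the first applicable license with capacity left.

    Assumption: first-fit stands in for a real solver. None marks an
    uncovered row.
    """
    remaining = list(capacity)
    plan: List[Optional[int]] = []
    for row in matrix:
        chosen = None
        for col, ok in enumerate(row):
            if ok and remaining[col] > 0:
                remaining[col] -= 1
                chosen = col
                break
        plan.append(chosen)
    return plan


installs = [{"product": "db", "cores": 4}, {"product": "db", "cores": 16},
            {"product": "app", "cores": 2}]
licenses = [{"product": "db", "max_cores": 8}, {"product": "db", "max_cores": 32},
            {"product": "app", "max_cores": 4}]


def applies(inst: Dict, lic: Dict) -> bool:
    return inst["product"] == lic["product"] and inst["cores"] <= lic["max_cores"]


matrix = build_matrix(installs, licenses, applies)
print(greedy_solve(matrix, capacity=[1, 1, 1]))
```

First-fit can leave a row uncovered that a better assignment would cover, so a production solver would more likely treat each matrix as an assignment or optimization problem.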
Abstract:
In an approach to generating and using a sentence model for answer generation, one or more computer processors ingest a first corpus of a plurality of text sentences. One or more computer processors convert the plurality of text sentences into a plurality of sentence vectors. One or more computer processors group the plurality of sentence vectors into a plurality of sentence clusters, wherein a sentence cluster is composed of sentences that are semantically similar. One or more computer processors receive a second corpus. One or more computer processors determine, for each sentence cluster of the plurality of sentence clusters, a frequency with which the sentence cluster appears in the second corpus. Based on the determined frequency, one or more computer processors calculate a probability of each sentence cluster of the plurality of sentence clusters. Based on the calculated probabilities, one or more computer processors generate a first sentence model.
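The pipeline above can be sketched end to end under strong simplifying assumptions: bag-of-words counts stand in for sentence vectors, clusters are formed greedily around the first sentence that is not similar to any earlier cluster, and a cluster's probability is its share of matched sentences in the second corpus. The function names and the 0.5 threshold are hypothetical.

```python
import math
import re
from collections import Counter
from typing import List


def vectorize(sentence: str) -> Counter:
    """Bag-of-words vector (assumption: a stand-in for a learned embedding)."""
    return Counter(re.findall(r"[a-z]+", sentence.lower()))


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def cluster(vectors: List[Counter], threshold: float) -> List[Counter]:
    """Greedy clustering: keep one representative vector per cluster."""
    reps: List[Counter] = []
    for v in vectors:
        if not any(cosine(v, r) >= threshold for r in reps):
            reps.append(v)
    return reps


def sentence_model(first: List[str], second: List[str],
                   threshold: float = 0.5) -> List[float]:
    """Probability of each cluster, from its frequency in the second corpus."""
    reps = cluster([vectorize(s) for s in first], threshold)
    counts = [0] * len(reps)
    for s in second:
        sims = [cosine(vectorize(s), r) for r in reps]
        if sims and max(sims) >= threshold:
            counts[sims.index(max(sims))] += 1
    total = sum(counts)
    return [c / total if total else 0.0 for c in counts]


first = ["reset your password", "please reset your password", "restart the router"]
second = ["reset password now", "restart router", "reset your password", "hello"]
print(sentence_model(first, second))
```

Sentences in the second corpus that match no cluster, such as "hello" here, are ignored, so the probabilities sum to one over the matched sentences only.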