Abstract:
Displaying a comparison of genotypic information between relatives is disclosed, including receiving an indication that a first individual is a grandparent, receiving an indication that a second individual is a grandchild of the first individual, comparing the genotypic information of the first individual and the second individual and calculating a similarity score, and displaying an indication of the similarity score graphically using colors.
Abstract:
Determining relative connections between individuals includes: obtaining identification information of a first individual and identification information of a second individual; determining, based at least in part on a relative connections graph, a relative connections path connecting the first individual, the second individual, and at least one additional individual; and outputting information pertaining to the relative connections path.
Abstract:
In an implementation, a query signature corresponding to a query sequence based on a set of closest cluster centroids is generated. Based on the query signature, one or more target reference signatures from a plurality of reference signatures are identified. Further based on the one or more target reference signatures, a probable taxonomic group is identified and assigned to the query sequence.
Abstract:
A universal data-mining platform is provided capable of analyzing mass spectrometry (MS) serum proteomic profiles and/or gene array data to produce biologically meaningful classification; i.e., group together biologically related specimens into clades. This platform utilizes the principles of phylogenetics, such as parsimony, to reveal susceptibility to cancer development (or other physiological or pathophysiological conditions), diagnosis and typing of cancer, identifying stages of cancer, as well as post-treatment evaluation. By outgroup comparison, the parsing algorithm identifies under and/or overexpressed gene values or in the case of sera, (i) novel or (ii) vanished MS peaks, and peaks signifying (iii) up or (iv) down regulated proteins, and scores the variations as either derived (do not exist in the outgroup set) or ancestral (exist in the outgroup set); the derived is given a score of “1”, and the ancestral a score of “0”—these are called the polarized values.
Abstract:
Described embodiments enable identification of family networks using combinations of DNA analysis and genealogical information. Genealogical data is provided by users of a genealogical research service or collected from other sources and used to create family trees for each user. DNA samples are also received from the users. By analyzing the DNA samples, potential genetic relationships can be identified between some users. Once these DNA-suggested relationships have been identified, common ancestors can be sought in the respective trees of the potentially related users. Where these common ancestors exist, an inference is drawn that the DNA-suggested relationship accurately represents a familial overlap between the individuals in question. People descended from the same common ancestor are each members of a family network. Members of a family network not in a user's tree may be identified for the user, enabling the user to discover additional ancestors that might otherwise have remained unknown.
Abstract:
Determining relative relationship includes receiving recombinable deoxyribonucleic acid (DNA) information of a first user and recombinable DNA information of a second user, determining, based at least in part on the recombinable DNA information of the first user and recombinable DNA information of the second user, a predicted degree of relationship between the first user and the second user, and in the event that the expected degree of relationship between the first user and the second user at least meets the threshold, notifying at least the first user about a relative relationship with the second user.
Abstract:
A method and system for representing a similarity between at least two genomes that includes detecting gene clusters which are common to the at least two genomes and representing the common gene clusters in a PQ tree. The PQ tree includes a first internal node (P node), that allows permutation of the children thereof, and a second internal node (Q node), that maintains unidirectional order of the children thereof.
Abstract:
In silico nucleic acid recombination methods, related integrated systems utilizing genetic operators and libraries made by in silico shuffling methods are provided. One disclosed method involves identifying and producing a polypeptide. The method may involve (a) receiving data identifying sequences of two or more parental polypeptides or parental nucleic acids that encode the polypeptides; (b) selecting one or more cross-over sites on the sequences thereby defining one or more recombinant polypeptides or recombinant nucleic acids; (c) selecting at least one of the recombinant polypeptides or recombinant nucleic acids; (d) recombining one or more oligonucleotides, at least one of which has a sequence matching some or all of that of the recombinant polypeptides or recombinant nucleic acids selected in (c); (e) selecting at least some of the recombined oligonucleotides produced in (d); and (f) producing a polypeptide encoded by the selected nucleic acid.
Abstract:
A genealogical research and record keeping system and method for identifying commonalities in haplotypes and other genetic characteristics of two or more individual members of a biological sample. Chromosomal fragments identical by descent identify family ties between siblings, parents and children and ancestors and progeny across many generations. It is particularly useful in corroborating and improving the accuracy of genealogical data, and identifying previously unknown genetic relationships.
Abstract:
A method of generating a cell lineage tree of a plurality of cells of an individual is provided. The method comprising: (a) determining at least one genotypic marker for each cell of the plurality of cells; and (b) computationally clustering data representing the at least one genotypic marker to thereby generate the cell lineage tree of the plurality of cells of the individual.