Abstract:
A method for detecting large rearrangements in BRCA1 and BRCA2 genes includes amplifying a nucleic acid sample in the presence of a primer pool to produce amplicons, where the primer pool includes target-specific primers targeting regions of exons of the BRCA1 and BRCA2 genes. The method further includes sequencing the amplicons to generate a plurality of reads, mapping the reads to a reference sequence, determining a number of reads per amplicon for the amplicons associated with the exons of the BRCA1 and BRCA2 genes, determining exon copy numbers for the exons of the BRCA1 and BRCA2 genes based on the number of reads per amplicon, detecting an exon deletion or duplication based on the exon copy numbers, and detecting a whole gene deletion of the BRCA1 or BRCA2 gene based on the number of reads per amplicon associated with the exons of the BRCA1 and BRCA2 genes.
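The step from reads per amplicon to exon copy numbers can be illustrated with a minimal sketch. The abstract does not say how counts are normalized, so everything here is an assumption: the use of a normal baseline sample, the median-ratio scaling, the thresholds, and the names `exon_copy_numbers` and `call_exon_events` are hypothetical. Whole gene deletion detection is not covered.

```python
from statistics import median


def exon_copy_numbers(sample_counts, baseline_counts, amplicon_to_exon, ploidy=2):
    """Estimate per-exon copy number from per-amplicon read counts.

    Assumption (not from the abstract): each amplicon's count is divided by
    its count in a normal baseline, the ratios are rescaled so that their
    median is 1, and an exon's estimate is ploidy times the median rescaled
    ratio of its amplicons.
    """
    raw = {a: sample_counts[a] / baseline_counts[a] for a in amplicon_to_exon}
    scale = median(raw.values())  # robust to a minority of altered amplicons
    ratios_by_exon = {}
    for amplicon, exon in amplicon_to_exon.items():
        ratios_by_exon.setdefault(exon, []).append(raw[amplicon] / scale)
    return {exon: ploidy * median(r) for exon, r in ratios_by_exon.items()}


def call_exon_events(copy_numbers, loss_below=1.5, gain_above=2.5):
    """Label each exon using illustrative (assumed) copy number thresholds."""
    calls = {}
    for exon, cn in copy_numbers.items():
        if cn < loss_below:
            calls[exon] = "deletion"
        elif cn > gain_above:
            calls[exon] = "duplication"
        else:
            calls[exon] = "normal"
    return calls
```

Median scaling only works while most amplicons are unaltered, which is one reason a whole gene deletion would need a separate comparison.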
Abstract:
In some embodiments, the disclosure relates generally to methods, as well as related systems, compositions, kits, apparatuses and computer-readable media, comprising a multiplex molecular tagging procedure that employs a plurality of tags that are appended to a plurality of polynucleotides. The tags have characteristics, including a sequence, length and/or detectable moiety, or any other characteristic, that uniquely identify the polynucleotide molecule to which each tag is appended and permit tracking of individual tagged molecules in a mixture of tagged molecules. For example, a tag having a unique tag sequence can uniquely identify the individual polynucleotide to which it is appended and distinguish that polynucleotide from other tagged polynucleotides in the mixture. In some embodiments, the multiplex molecular tagging procedure can be used for generating error-corrected sequencing data and for detecting a target polynucleotide which is present at low abundance in a nucleic acid sample.
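One common way tag sequences support error correction is to group reads that share a tag into a family and take a per-position majority vote. The sketch below illustrates that idea only; it is not taken from the disclosure. It assumes reads in a family are already the same length and in register, and the function names, `min_fraction`, and `min_family_size` are hypothetical.

```python
from collections import Counter, defaultdict


def group_by_tag(tagged_reads):
    """Group (tag, sequence) pairs into families keyed by tag sequence."""
    families = defaultdict(list)
    for tag, sequence in tagged_reads:
        families[tag].append(sequence)
    return dict(families)


def consensus(reads, min_fraction=0.6):
    """Per-position majority vote; positions without a clear majority become 'N'.

    Assumes all reads in the family have equal length (already in register).
    """
    if len({len(r) for r in reads}) != 1:
        raise ValueError("reads in a family must have equal length")
    bases = []
    for column in zip(*reads):
        base, count = Counter(column).most_common(1)[0]
        bases.append(base if count / len(reads) >= min_fraction else "N")
    return "".join(bases)


def error_corrected_reads(tagged_reads, min_family_size=3):
    """Return one consensus read per tag family that is large enough to trust."""
    return {
        tag: consensus(reads)
        for tag, reads in group_by_tag(tagged_reads).items()
        if len(reads) >= min_family_size
    }
```

A base error seen in only one member of a family is outvoted, while a true variant appears in every member of the family and survives into the consensus.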
Abstract:
Systems and methods for determining variants can receive mapped reads and call variants. In embodiments, flow space information for the reads can be aligned to a flow space representation of a corresponding portion of the reference. Reads spanning a position with a potential variant can be grouped, and a score can be calculated for the variant. Based on the scores, a list of probable variants can be provided. In various embodiments, low frequency variants can be identified where multiple potential variants are present at a position.
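The grouping and scoring steps can be sketched in simplified form. This example works in base space with ungapped reads rather than in flow space, and the binomial-tail score against a fixed `error_rate` is a stand-in chosen for illustration, not the scoring described in the disclosure. All names are hypothetical.

```python
import math
from collections import Counter


def alleles_at(reads, position):
    """Collect the base each ungapped (start, sequence) read shows at a position."""
    return [
        seq[position - start]
        for start, seq in reads
        if start <= position < start + len(seq)
    ]


def binomial_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(math.comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def score_variants(alleles, ref_allele, error_rate=0.01):
    """Rank non-reference alleles by a Phred-like score.

    The score is -10*log10 of the probability of seeing at least this many
    supporting reads from sequencing error alone (an assumed error model).
    """
    n = len(alleles)
    counts = Counter(a for a in alleles if a != ref_allele)
    scored = []
    for allele, k in counts.items():
        p_value = binomial_tail(n, k, error_rate)
        scored.append((allele, k / n, -10 * math.log10(max(p_value, 1e-300))))
    return sorted(scored, key=lambda item: item[2], reverse=True)
```

Because every non-reference allele at the position is scored separately, a low frequency allele can still be reported when a second candidate is present at the same position.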
Abstract:
Nucleic acid sequence mapping/assembly methods are disclosed. The methods initially map only a contiguous portion of each read to a reference sequence and then extend the mapping of the read at both ends of the mapped contiguous portion until the entire read is mapped (aligned). In various embodiments, a mapping score can be calculated for the read alignment using a scoring function, score(i, j) = M + mx, where M can be the number of matches in the extended alignment, x can be the number of mismatches in the alignment, and m can be a negative penalty for each mismatch. The mapping score can be utilized to rank or choose the best alignment for each read.
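The seed-then-extend idea and the score M + mx can be sketched as follows. This is a minimal illustration under simplifying assumptions: the seed is an exact match taken from the middle of the read, extension is ungapped, and the function names, `seed_len`, and the default penalty of -2 are all hypothetical.

```python
def seed_hits(seed, reference):
    """Return every start index where the seed occurs exactly in the reference."""
    hits, start = [], reference.find(seed)
    while start != -1:
        hits.append(start)
        start = reference.find(seed, start + 1)
    return hits


def extend_and_score(read, reference, seed_start, seed_len, ref_hit, mismatch_penalty=-2):
    """Extend an exact seed hit to both read ends (ungapped) and score it.

    Returns (reference offset of read position 0, M + m*x), or None if the
    read would run off either end of the reference.
    """
    offset = ref_hit - seed_start
    if offset < 0 or offset + len(read) > len(reference):
        return None
    matches, mismatches = seed_len, 0
    left = range(seed_start - 1, -1, -1)             # extend toward the read start
    right = range(seed_start + seed_len, len(read))  # extend toward the read end
    for i in list(left) + list(right):
        if read[i] == reference[offset + i]:
            matches += 1
        else:
            mismatches += 1
    return offset, matches + mismatch_penalty * mismatches


def best_alignment(read, reference, seed_len=4, mismatch_penalty=-2):
    """Seed with the middle of the read, extend every hit, keep the top score."""
    seed_start = (len(read) - seed_len) // 2
    seed = read[seed_start:seed_start + seed_len]
    candidates = [
        extend_and_score(read, reference, seed_start, seed_len, hit, mismatch_penalty)
        for hit in seed_hits(seed, reference)
    ]
    candidates = [c for c in candidates if c is not None]
    return max(candidates, key=lambda c: c[1]) if candidates else None
```

For example, an 8-base read with 6 matches and 2 mismatches scores 6 + (-2)(2) = 2 under the default penalty, so it ranks below a perfect placement that scores 8.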
Abstract:
A method for compressing nucleic acid sequence data, wherein each sequence read is associated with a molecular tag sequence and a portion of the sequence read alignments correspond to sequence reads mapped to a targeted fusion reference sequence, includes: determining a consensus sequence read for each family of sequence reads based on flow space signal measurements corresponding to the family of sequence reads; determining a consensus sequence alignment for each family of sequence reads, wherein a portion of the consensus sequence alignments correspond to the consensus sequence reads aligned with the targeted fusion reference sequence; generating a compressed data structure comprising consensus compressed data, the consensus compressed data including the consensus sequence read and the consensus sequence alignment for each family; and detecting a fusion using the consensus sequence reads and the consensus sequence alignments from the compressed data structure.
Abstract:
Systems and methods for determining variants can receive mapped reads and determine a distribution of matched-filter residuals from a plurality of reads at a homopolymer region. The distribution of matched-filter residuals can be fit to uni-modal and bi-modal models. Based on the model that best fits the distribution of matched-filter residuals, the heterozygosity of the sample and the absence or presence of an insertion/deletion in the homopolymer can be determined.
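Comparing a uni-modal fit with a bi-modal fit can be sketched with Gaussian models. The abstract does not name the model family or the selection criterion, so the single Gaussian, the two-component Gaussian mixture fitted by expectation-maximization, and the BIC comparison below are all assumptions, as are the function names. How the residuals themselves are computed is outside this sketch.

```python
import math


def _log_normal(x, mean, var):
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


def fit_unimodal(xs):
    """Maximum-likelihood single Gaussian; returns the log-likelihood."""
    mean = sum(xs) / len(xs)
    var = max(sum((x - mean) ** 2 for x in xs) / len(xs), 1e-6)
    return sum(_log_normal(x, mean, var) for x in xs)


def fit_bimodal(xs, iterations=100):
    """Two-component Gaussian mixture fitted by EM; returns the log-likelihood."""
    ordered, half = sorted(xs), len(xs) // 2
    means = [sum(ordered[:half]) / half, sum(ordered[half:]) / (len(xs) - half)]
    overall = sum(xs) / len(xs)
    spread = max(sum((x - overall) ** 2 for x in xs) / len(xs), 1e-6)
    variances, weights = [spread, spread], [0.5, 0.5]

    def component_logs(x):
        return [math.log(weights[k]) + _log_normal(x, means[k], variances[k]) for k in (0, 1)]

    for _ in range(iterations):
        # E-step: responsibility of each component for each point.
        resp = []
        for x in xs:
            logs = component_logs(x)
            top = max(logs)
            probs = [math.exp(v - top) for v in logs]
            resp.append([p / sum(probs) for p in probs])
        # M-step: re-estimate weights, means and variances (with a variance floor).
        for k in (0, 1):
            nk = max(sum(r[k] for r in resp), 1e-9)
            weights[k] = nk / len(xs)
            means[k] = sum(r[k] * x for r, x in zip(resp, xs)) / nk
            variances[k] = max(sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, xs)) / nk, 1e-6)

    total = 0.0
    for x in xs:
        logs = component_logs(x)
        top = max(logs)
        total += top + math.log(sum(math.exp(v - top) for v in logs))
    return total


def choose_model(xs):
    """Pick the model with the lower BIC (2 vs 5 free parameters)."""
    n = len(xs)
    bic_uni = 2 * math.log(n) - 2 * fit_unimodal(xs)
    bic_bi = 5 * math.log(n) - 2 * fit_bimodal(xs)
    return "bimodal" if bic_bi < bic_uni else "unimodal"
```

Under this sketch, a "bimodal" result would be read as two read populations with different homopolymer lengths, consistent with a heterozygous insertion/deletion, while "unimodal" would suggest a single length.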
Abstract:
Systems and methods for determining variants can receive mapped reads and align flow space information to a flow space representation of a corresponding portion of the reference. Reads spanning a position with a potential variant can be evaluated in a context-specific manner. A list of probable variants can be provided.