VALIDATION METHODS AND SYSTEMS FOR SEQUENCE VARIANT CALLS

    公开(公告)号:US20240304280A1

    公开(公告)日:2024-09-12

    申请号:US18664975

    申请日:2024-05-15

    Applicant: ILLUMINA, INC.

    CPC classification number: G16B20/20 G16B30/00 G16B40/00

    Abstract: Presented herein are techniques for identifying and/or validating sequence variants in genomic sequence data. The techniques include generating an error rate reflective of sequence errors present in the genomic sequence data. The error rate may be used to validate potential sequence variants. The error rate may be based on errors identified during consensus sequence confirmation for sequence reads associated with individual unique molecular identifiers.

    Systems and methods for grouping and collapsing sequencing reads

    公开(公告)号:US11688489B2

    公开(公告)日:2023-06-27

    申请号:US16667642

    申请日:2019-10-29

    Applicant: Illumina, Inc.

    CPC classification number: G16B30/10 G06F16/2255 G06F16/24578 G16B30/20

    Abstract: Disclosed herein are systems and methods for collapsing sequencing reads and identifying similar sequencing reads. In one example, a method includes generating a plurality of first identifier subsequences from a first identifier sequence of each nucleotide sequencing read and generating a first signature for the nucleotide sequencing read by applying hashing to the plurality of first identifier subsequences. The method may include assigning the nucleotide sequencing read to a first particular bin of a first data structure based on the first signature and determining a nucleotide sequence for each first particular bin of the first data structure with one or more nucleotide sequencing reads assigned.

    VALIDATION METHODS AND SYSTEMS FOR SEQUENCE VARIANT CALLS

    公开(公告)号:US20190206510A1

    公开(公告)日:2019-07-04

    申请号:US16206552

    申请日:2018-11-30

    Applicant: ILLUMINA, INC.

    CPC classification number: G16B20/20 G16B30/00 G16B40/00

    Abstract: Presented herein are techniques for identifying and/or validating sequence variants in genomic sequence data. The techniques include generating an error rate reflective of sequence errors present in the genomic sequence data. The error rate may be used to validate potential sequence variants. The error rate may be based on errors identified during consensus sequence confirmation for sequence reads associated with individual unique molecular identifiers.

Patent Agency Ranking