SYSTEMS, METHODS, AND MEDIA FOR CLASSIFYING GENETIC SEQUENCING RESULTS BASED ON PATHOGEN-SPECIFIC ADAPTIVE THRESHOLDS

    公开(公告)号:US20230274790A1

    公开(公告)日:2023-08-31

    申请号:US18008004

    申请日:2021-06-03

    Applicant: Arc Bio, LLC

    CPC classification number: G16B20/00 G16B40/20 G16B30/10 G06N3/02

    Abstract: In accordance with some embodiments, systems, methods, and media for classifying genetic sequencing results based on pathogen-specific adaptive thresholds are provided. In some embodiments, a system comprises a processor programmed to: receive negative control results, each comprising values indicative of a number of reads detected in the respective negative control sample for an organism; generate a model based on the negative control results; receive a clinical sample result for a clinical sample, comprising values indicative of a number of reads detected in the clinical sample for an organism of a plurality of organisms; identify, utilizing the model, any values in the clinical sample that are likely to be diagnostically significant; generate a report based on the clinical sample result and organisms associated with a value likely to be diagnostically significant; and cause the report to be presented to a user.

    SAMPLE CONTAMINATION DETECTION OF CONTAMINATED FRAGMENTS FOR CANCER CLASSIFICATION

    公开(公告)号:US20230272477A1

    公开(公告)日:2023-08-31

    申请号:US17993597

    申请日:2022-11-23

    Applicant: GRAIL, LLC

    Abstract: Methods and systems for detecting contaminated fragments in a biological sample for cancer classification are disclosed. The system identifies multiple SNP site contamination markers and indel site contamination markers. The multiple SNP site contamination markers include at least two SNP sites within a threshold distance, having population haplotype frequency within a range of threshold frequencies, excluding guanine-adenine polymorphisms and/or cytosine-thymine polymorphisms, ensuring Hardy-Weinberg equilibrium, or any combination of the parameters above. The indel site contamination markers include indel sequences that are within a threshold length, having high complexity, having population haplotype frequency within a range of threshold frequencies, ensuring Hardy-Weinberg equilibrium, or any combination of the parameters above. The system identifies contamination markers for which the sample is homozygous. The system estimates the contamination level of the sample by identifying fragments having a haplotype that is different than the homozygous haplotype of the respective contamination marker site.

Patent Agency Ranking