Invention Grant
- Patent Title: Methods for compression of molecular tagged nucleic acid sequence data
-
Application No.: US17135196Application Date: 2020-12-28
-
Publication No.: US11468972B2Publication Date: 2022-10-11
- Inventor: Cheng-Zong Bai
- Applicant: LIFE TECHNOLOGIES CORPORATION
- Applicant Address: US CA Carlsbad
- Assignee: LIFE TECHNOLOGIES CORPORATION
- Current Assignee: LIFE TECHNOLOGIES CORPORATION
- Current Assignee Address: US CA Carlsbad
- Agent Carolyn Koenig
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G16B50/50 ; H03M7/30 ; G16B50/00 ; G16B30/00 ; G16B40/00 ; G16B20/00 ; G16B40/10 ; G16B20/20 ; G16B20/40 ; C12Q1/6869 ; G16B30/10

Abstract:
A method for compressing molecular tagged sequence data includes: grouping sequence reads associated with a molecular tag sequence to form a family of sequence reads, corresponding vectors of flow space signal measurements and corresponding sequence alignments, calculating an arithmetic mean of the corresponding vectors of flow space signal measurements to form a vector of consensus flow space signal measurements, calculating a standard deviation of the corresponding vectors of flow space signal measurements to form a vector of standard deviations, determining a consensus base sequence based on the vector of consensus flow space signal measurements, determining a consensus sequence alignment and generating a compressed data structure comprising consensus compressed data, the consensus compressed data including for each family, the consensus base sequence, the consensus sequence alignment, the vector of consensus flow space signal measurements, the vector of standard deviations and the number of members.
Public/Granted literature
- US20210202044A1 METHODS FOR COMPRESSION OF MOLECULAR TAGGED NUCLEIC ACID SEQUENCE DATA Public/Granted day:2021-07-01
Information query