Sequencing Data-Encoded Peptides from Tandem Mass Spectra

    公开(公告)号:US20230326558A1

    公开(公告)日:2023-10-12

    申请号:US18297685

    申请日:2023-04-10

    CPC classification number: G16B50/30 G06F16/215 G06F16/9024 G16B40/10

    Abstract: Peptide sequencing is important in decoding data stored in a data-encoded peptide. Tandem mass spectrometry (MS/MS) is particularly useful for peptide sequencing. In a computer-implemented method for sequencing the data-encoded peptide from an experimental spectrum, raw data of the experimental spectrum are first preprocessed to remove uninterpretable peaks to yield preprocessed data. A first set of one or more candidate sequences contending for a peptide sequence of the peptide is identified from a spectrum graph. The spectrum graph is formed according to the preprocessed data rather than the raw data for generating a fewer number of candidate sequences to thereby reduce a time cost in sequencing. The first candidate-sequence set is then processed to estimate the peptide sequence to thereby obtain a set of peptide-sequence estimate(s). Each estimate is verified whether it is invalid. The set of peptide-sequence estimate(s) is purged to remove any invalid estimate.

    SYSTEMS AND METHODS FOR CREATING BIOMOLECULE EMBEDDINGS

    公开(公告)号:US20230253113A1

    公开(公告)日:2023-08-10

    申请号:US18164542

    申请日:2023-02-03

    Applicant: Seer, Inc.

    CPC classification number: G16H50/20 G16B20/00 G16B40/10 G16B40/20

    Abstract: In some aspects, the present disclosure describes a method for determining a biological state associated with a polyamino acid descriptor. In some cases, the method comprises receiving the polyamino acid descriptor comprising at least one dimension representing a polyamino acid association with a given assay method. In some cases, the method comprises generating, in a latent space, a latent descriptor based at least in part on the polyamino acid descriptor, and wherein the latent descriptor comprises sufficiently fewer dimensions than the polyamino acid descriptor such that at least a portion of information in the polyamino acid descriptor is lost in the latent descriptor. In some cases, the method comprises determining, based at least in part on the latent descriptor, the biological state associated with the polyamino acid descriptor.

Patent Agency Ranking