-
公开(公告)号:US20240331798A1
公开(公告)日:2024-10-03
申请号:US18619010
申请日:2024-03-27
Applicant: The Chinese University of Hong Kong
Inventor: Yu LI , Jiayang CHEN
Abstract: A foundation model for analysis of RNA sequences, including ncRNA sequences, can be trained to provide output embeddings (in a high-dimensional space) corresponding to input RNA sequences. Training of the RNA foundation model can use a large-scale dataset of RNA sequences without any annotation as to structure or function. The trained RNA foundation model can thereafter be used to produce embeddings that can be used as input features in downstream task-specific machine-learning models (or other computer models) that can learn to predict particular aspects of structure and/or function for a given RNA sequence.