-
Publication No.: US20240144656A1
Publication Date: 2024-05-02
Application No.: US18394249
Application Date: 2023-12-22
Applicant: Lemon Inc. , Beijing Youzhuju Network Technology Co., Ltd.
Inventor: Song Bai , Junhao Zhang , Heng Wang , Rui Yan , Chuhui Xue , Wenqing Zhang
IPC: G06V10/774 , G06V10/40 , G06V10/74 , G06V10/772 , G06V10/82
CPC classification number: G06V10/774 , G06V10/40 , G06V10/761 , G06V10/772 , G06V10/82
Abstract: A method, apparatus, device, and medium for image processing are provided. The method includes generating, using an image generation process, a first set of synthetic images based on a first set of codes associated with a first image class in a codebook and based on a first class feature associated with the first image class; generating, using a feature extraction process, a first set of reference features based on the first set of synthetic images and generating a first set of target features based on a plurality of sets of training images belonging to the first image class in a training image set; and updating the image generation process and the codebook according to at least a first training objective to reduce a difference between each reference feature in the first set of reference features and a corresponding target feature in the first set of target features.
-
Publication No.: US20240121485A1
Publication Date: 2024-04-11
Application No.: US18543956
Application Date: 2023-12-18
Inventor: Hongyang ZHANG
IPC: H04N21/84 , H04N21/442 , H04N21/482
CPC classification number: H04N21/84 , H04N21/44204 , H04N21/4826
Abstract: Embodiments of the present disclosure relate to a method, apparatus, device, storage medium, and program product for obtaining text material, comprising: in response to a material obtaining instruction, obtaining a set of candidate videos associated with a target object comprising a plurality of posted candidate videos; for each candidate video, determining the popularity of the candidate video based on the playing data of the candidate video and/or the object information of the explained object included in the candidate video; selecting recommended videos based on popularity for display; in response to a selection operation for the recommended video, generating text material corresponding to the target object based on key information in the recommended video. In this way, users can quickly review the recommended videos and obtain creative inspiration.
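The popularity-based selection step can be illustrated with a minimal sketch. It assumes popularity is a weighted sum of play and like counts; the abstract does not give a formula, so the field names `plays` and `likes`, the weights, and the function names are all hypothetical.

```python
def popularity(video, play_weight=1.0, like_weight=2.0):
    """Hypothetical popularity score: a weighted sum of playing data."""
    return play_weight * video["plays"] + like_weight * video["likes"]


def recommend(candidates, top_k=3):
    """Return the top_k candidate videos, most popular first."""
    return sorted(candidates, key=popularity, reverse=True)[:top_k]


candidates = [
    {"id": "a", "plays": 100, "likes": 0},
    {"id": "b", "plays": 10, "likes": 60},
    {"id": "c", "plays": 50, "likes": 5},
]
top_two = recommend(candidates, top_k=2)
```

A real system would presumably also fold in the object information mentioned in the abstract; the sketch covers only the playing-data branch.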
-
Publication No.: US20240112299A1
Publication Date: 2024-04-04
Application No.: US18255473
Application Date: 2021-12-01
Applicant: BEIJING YOUZHUJU NETWORK TECHNOLOGY CO. LTD.
Inventor: Hao WU , Changhu WANG
CPC classification number: G06T3/0012 , G06T3/4007
Abstract: This disclosure relates to a video cropping method and apparatus, storage medium, and electronic device. The method includes: acquiring an original video to be cropped; performing frame extraction processing on the original video to obtain a plurality of target video frames; determining, for each of the target video frames, a target candidate cropping box corresponding to the target video frame according to a main content in the target video frame; performing interpolation processing according to the target candidate cropping box corresponding to each of the target video frames to determine a target cropping box corresponding to each frame picture in the original video; and cropping the original video according to the target cropping box corresponding to each frame picture.
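The interpolation step can be sketched as follows. This is a minimal sketch, assuming linear interpolation between the boxes found on the sampled frames and boxes given as (x, y, width, height) tuples; the abstract does not say which interpolation is used, and the name `interpolate_crop_boxes` is hypothetical.

```python
from bisect import bisect_right


def interpolate_crop_boxes(keyframes, total_frames):
    """Return one crop box per frame of the original video.

    keyframes maps a sampled frame index to its (x, y, w, h) box.
    Frames before the first or after the last sampled frame reuse
    the nearest sampled box.
    """
    if not keyframes:
        raise ValueError("at least one sampled frame is required")
    indices = sorted(keyframes)
    boxes = []
    for frame in range(total_frames):
        pos = bisect_right(indices, frame)
        if pos == 0:
            boxes.append(keyframes[indices[0]])
        elif pos == len(indices):
            boxes.append(keyframes[indices[-1]])
        else:
            lo, hi = indices[pos - 1], indices[pos]
            t = (frame - lo) / (hi - lo)  # 0.0 at lo, approaching 1.0 at hi
            a, b = keyframes[lo], keyframes[hi]
            boxes.append(tuple(p + t * (q - p) for p, q in zip(a, b)))
    return boxes


boxes = interpolate_crop_boxes({0: (0, 0, 100, 100), 4: (40, 0, 100, 100)}, 6)
```

Interpolating between sampled frames keeps the crop window moving smoothly without running content detection on every frame.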
-
Publication No.: US20240095451A1
Publication Date: 2024-03-21
Application No.: US18469461
Application Date: 2023-09-18
Applicant: BEIJING YOUZHUJU NETWORK TECHNOLOGY CO. LTD.
Inventor: Yuxiang ZOU , Zejun MA
IPC: G06F40/284 , G06F40/263 , G10L13/10
CPC classification number: G06F40/284 , G06F40/263 , G10L13/10
Abstract: Provided are a method, an electronic device, and a computer-readable storage medium. The method includes: acquiring a text to be analyzed; performing token conversion on words in the text to be analyzed to obtain a token sequence to be analyzed, where tokens in token sequences to be analyzed corresponding to texts to be analyzed in different languages belong to a same type; and performing feature extraction on the token sequence to be analyzed, and processing a target task based on the extracted feature, to determine an analysis result for the text to be analyzed.
-
Publication No.: US11899639B1
Publication Date: 2024-02-13
Application No.: US17877644
Application Date: 2022-07-29
Applicant: Lemon Inc. , Beijing Youzhuju Network Technology Co., Ltd.
Inventor: Viacheslav Dubeyko , Jian Wang
IPC: G06F16/00 , G06F16/22 , G06F16/248 , G06F16/29
CPC classification number: G06F16/2228 , G06F16/248 , G06F16/29
Abstract: A first combined key may be generated based on a geographic location, a first time, and a first user that are associated with a first event. The first combined key and first data indicating the first event may be stored in a database, the first combined key configured to identify the first data. A second combined key may be generated based on the geographic location, a second time, and a second user that are associated with a second event. The second combined key and second data indicating the second event may be stored in the database, the second combined key configured to identify the second data. A set of events associated with the geographic location and comprising the first event and the second event may be presented by retrieving the first data and the second data using the first combined key and the second combined key, respectively.
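The combined-key scheme can be illustrated with a minimal sketch. It assumes the key is a string that concatenates location, time, and user in that order, and it uses an in-memory dictionary in place of a database; the key layout, the `|` separator, and the names `make_combined_key` and `EventStore` are hypothetical and not taken from the patent.

```python
def make_combined_key(location_id, timestamp, user_id):
    """Build a key from location, time, and user (assumed layout).

    Putting the location first lets all events at one place share a
    prefix; zero-padding the time keeps keys sorted chronologically.
    location_id is assumed not to contain the '|' separator.
    """
    return f"{location_id}|{timestamp:012d}|{user_id}"


class EventStore:
    """In-memory stand-in for the database described in the abstract."""

    def __init__(self):
        self._rows = {}

    def put(self, location_id, timestamp, user_id, data):
        key = make_combined_key(location_id, timestamp, user_id)
        self._rows[key] = data
        return key

    def events_at(self, location_id):
        """Look up every event stored for one location, oldest first."""
        prefix = location_id + "|"
        keys = sorted(k for k in self._rows if k.startswith(prefix))
        return [self._rows[k] for k in keys]


store = EventStore()
store.put("cafe-1", 200, "bob", "second event")
store.put("cafe-1", 100, "alice", "first event")
store.put("park-9", 150, "alice", "elsewhere")
```

Calling `store.events_at("cafe-1")` finds both keys for that location and retrieves their data, mirroring the presentation step in the abstract.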
-
Publication No.: US20240046921A1
Publication Date: 2024-02-08
Application No.: US18365765
Application Date: 2023-08-04
Inventor: Linhao DONG , Zhenlin Liang , Zhiyun Fan , Yi Liu , Zejun Ma
IPC: G10L15/18 , G10L15/183 , G10L17/06
CPC classification number: G10L15/1815 , G10L15/183 , G10L17/06
Abstract: Embodiments of the present disclosure provide a method, apparatus, electronic device, and medium for speech processing. The method comprises generating a token-level semantic feature of target speech data based on a frame-level acoustic feature of the target speech data. The method further comprises generating a token-level voiceprint feature of the target speech data based on the frame-level acoustic feature. The method further comprises determining a token in the target speech data where speaker change occurs based on the token-level semantic feature and the token-level voiceprint feature. According to embodiments of the present disclosure, speaker change in speech data is detected at the token level in conjunction with the speaker's acoustic features and speech contents, and speaker-based speech recognition results are output directly without post-processing, simplifying the speech recognition process.
-
Publication No.: US20240037349A1
Publication Date: 2024-02-01
Application No.: US18255790
Application Date: 2021-11-17
Inventor: Chengqi ZHAO , Jianze LIANG , Mingxuan WANG , Lei LI
Abstract: Provided are a model training method and apparatus, a machine translation method and apparatus, a device, and a storage medium. The model training method includes the steps described below. Through a neural network pruning technique, a respective influence degree of each parameter in multiple parameters in a first translation model on a translation result in a first field is determined to obtain at least one first parameter and at least one second parameter. By using the first corpus of the first field, the at least one first parameter is trained to obtain a second translation model, and the at least one second parameter remains unchanged. Similarity between a translation result of the second translation model in the first field and a translation result of the first translation model in the first field meets a preset condition.
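Training some parameters while holding the rest fixed can be sketched with a masked gradient step. This is a minimal sketch under stated assumptions: influence scores are supplied by the caller, parameters whose score falls below a threshold are treated as the trainable "first" parameters, and plain SGD is used. The abstract does not state which side of the split is trained or which optimizer is used, and the names `select_trainable` and `masked_sgd_step` are hypothetical.

```python
def select_trainable(influence, threshold):
    """Mark parameters as trainable when their influence is below threshold.

    Assumption: low-influence parameters are the ones that get trained.
    """
    return [score < threshold for score in influence]


def masked_sgd_step(params, grads, trainable, lr=0.1):
    """Apply one SGD update to trainable parameters; leave the rest as-is."""
    return [
        p - lr * g if is_trainable else p
        for p, g, is_trainable in zip(params, grads, trainable)
    ]


params = [1.0, 2.0, 3.0]
grads = [1.0, 1.0, 1.0]
mask = select_trainable([0.9, 0.1, 0.5], threshold=0.6)
updated = masked_sgd_step(params, grads, mask, lr=0.5)
```

Here the first parameter has a high influence score, so it stays at 1.0 while the other two move.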
-
Publication No.: US20240021202A1
Publication Date: 2024-01-18
Application No.: US18037546
Application Date: 2021-11-19
CPC classification number: G10L15/26 , G10L15/02 , G10L25/78 , G10L25/45 , G10L2025/783
Abstract: Embodiments of the disclosure provide a method and apparatus for speech recognition, an electronic device, and a medium. The method includes: acquiring audio data to be recognized (201), the audio data to be recognized including a speech segment; determining a start and end time corresponding to the speech segment included in the audio data (202); extracting at least one speech segment from the audio data to be recognized based on the determined start and end time (203); and performing speech recognition on the at least one extracted speech segment to generate recognition text corresponding to the audio data to be recognized (204).
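Steps 203 and 204 can be sketched as follows. This is a minimal sketch, assuming the audio is a flat sequence of samples, the start and end times from step 202 are already available in seconds, and the recognizer is passed in as a callable; the names `extract_segments`, `transcribe`, and `recognize` are hypothetical.

```python
def extract_segments(samples, sample_rate, spans):
    """Cut out the sample ranges given by (start_s, end_s) pairs in seconds."""
    segments = []
    for start_s, end_s in spans:
        start = max(0, int(round(start_s * sample_rate)))
        end = min(len(samples), int(round(end_s * sample_rate)))
        if end > start:  # skip empty or inverted spans
            segments.append(samples[start:end])
    return segments


def transcribe(samples, sample_rate, spans, recognize):
    """Run the supplied recognizer on each segment and join the text."""
    segments = extract_segments(samples, sample_rate, spans)
    return " ".join(recognize(segment) for segment in segments)


samples = list(range(10))  # stand-in for real audio samples
segments = extract_segments(samples, 2, [(1.0, 2.0), (3.5, 10.0)])
text = transcribe(samples, 2, [(1.0, 2.0), (3.5, 10.0)], lambda seg: str(len(seg)))
```

Recognizing only the extracted segments means the non-speech portions of the audio never reach the recognizer.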
-
Publication No.: US20230394671A1
Publication Date: 2023-12-07
Application No.: US18251228
Application Date: 2021-09-27
CPC classification number: G06T7/11 , G06T3/40 , G06T2207/20221
Abstract: Provided are an image segmentation method and apparatus, a device, and a storage medium. The image segmentation method includes: fusing a visual feature corresponding to an original image with a text feature corresponding to a description language to obtain a multimodal feature, where the description language is used for specifying a target object to be segmented in the original image; determining a visual region of the target object according to an image corresponding to the multimodal feature and recording an image corresponding to the visual region as a response heat map; and determining a segmentation result of the target object according to the image corresponding to the multimodal feature and the response heat map.
-
Publication No.: US20230359364A1
Publication Date: 2023-11-09
Application No.: US18223477
Application Date: 2023-07-18
Applicant: Lemon Inc. , BEIJING YOUZHUJU NETWORK TECHNOLOGY CO., LTD.
Inventor: Viacheslav Dubeyko , Neema Mashayekhi , Cong Wang , Jian Wang
IPC: G06F3/06
CPC classification number: G06F3/0614 , G06F3/0679 , G06F3/0649
Abstract: The present disclosure describes techniques for guaranteeing online services based on predicting failures of storage devices. Statistical data may be extracted on a regular basis by each of a plurality of storage devices. Each of the plurality of storage devices may comprise a set of NAND dies. Each of the set of NAND dies may be configured to measure and track a set of metrics indicating characteristics of each NAND die. Prediction data indicating potential failures of the plurality of storage devices may be generated. The prediction data may be shared with a host on a periodic basis. A strategy of decommissioning an aged storage device and adding a new storage device based on the prediction data may be created by the host. The data migration to the new storage device may be implemented.
-