METHOD, APPARATUS, DEVICE, MEDIUM AND PROGRAM PRODUCT FOR OBTAINING TEXT MATERIAL

    公开(公告)号:US20240121485A1

    公开(公告)日:2024-04-11

    申请号:US18543956

    申请日:2023-12-18

    Inventor: Hongyang ZHANG

    CPC classification number: H04N21/84 H04N21/44204 H04N21/4826

    Abstract: Embodiments of the present disclosure relate to a method, apparatus, device, storage medium, and program product for obtaining text material, comprising: in response to a material obtaining instruction, obtaining a set of candidate videos associated with a target object comprising a plurality of posted candidate videos; for each candidate video, determining the popularity of the candidate video based on the playing data of the candidate video and/or the object information of the explained object included in the candidate video; selecting recommended videos based on popularity for display; in response to a selection operation for the recommended video, generating text material corresponding to the target object based on key information in the recommended video. In this way, users can quickly review the recommended videos and obtain creative inspiration.

    VIDEO CROPPING METHOD AND APPARATUS, STORAGE MEDIUM AND ELECTRONIC DEVICE

    公开(公告)号:US20240112299A1

    公开(公告)日:2024-04-04

    申请号:US18255473

    申请日:2021-12-01

    Inventor: Hao WU Changhu WANG

    CPC classification number: G06T3/0012 G06T3/4007

    Abstract: This disclosure relates to a video cropping method and apparatus, storage medium, and electronic device. The present disclosure method: acquiring an original video to be cropped; performing frame extraction processing on the original video to obtain a plurality of target video frames; determining, for each of the target video frames, a target candidate cropping box corresponding to the target video frame according to a main content in the target video frame; performing interpolation processing according to the target candidate cropping box corresponding to each of the target video frames to determine a target cropping box corresponding to each frame picture in the original video; and cropping the original video according to the target cropping box corresponding to the each frame picture.

    Data storage based on geographic location

    公开(公告)号:US11899639B1

    公开(公告)日:2024-02-13

    申请号:US17877644

    申请日:2022-07-29

    CPC classification number: G06F16/2228 G06F16/248 G06F16/29

    Abstract: A first combined key may be generated based on a geographic location, a first time, and a first user that are associated with a first event. The first combined key and first data indicating the first event may be stored in a database, the first combined key configured to identify the first data. A second combined key may be generated based on the geographic location, a second time, and a second user that are associated with a second event. The second combined key and second data indicating the second event may be stored in the database, the second combined key configured to identify the second data. A set of events associated with the geographic location and comprising the first event and the second event may be presented by retrieving the first data and the second data using the first combined key and the second combined key, respectively.

    METHOD, APPARATUS, ELECTRONIC DEVICE, AND MEDIUM FOR SPEECH PROCESSING

    公开(公告)号:US20240046921A1

    公开(公告)日:2024-02-08

    申请号:US18365765

    申请日:2023-08-04

    CPC classification number: G10L15/1815 G10L15/183 G10L17/06

    Abstract: Embodiments of the present disclosure provide a method, apparatus, electronic device, and medium for speech processing. The method comprises generating a token-level semantic feature of target speech data based on a frame-level acoustic feature of the target speech data. The method further comprises generating a token-level voiceprint feature of the target speech data based on the frame-level acoustic feature. The method further comprises determining a token in the target speech data where speaker change occurs based on the token-level semantic feature and the token-level voiceprint feature. According to embodiments of the present disclosure, speaker change in speech data is detected at the token level in conjunction with the speaker's acoustic features and speech contents, and speaker-based speech recognition results are output directly without post-processing, simplifying the speech recognition process.

    MODEL TRAINING METHOD AND APPARATUS, MACHINE TRANSLATION METHOD AND APPARATUS, AND DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20240037349A1

    公开(公告)日:2024-02-01

    申请号:US18255790

    申请日:2021-11-17

    CPC classification number: G06F40/58 G06F40/51

    Abstract: Provided are a model training method and apparatus, a machine translation method and apparatus, a device, and a storage medium. The model training method includes the steps described below. Through a neural network pruning technique, a respective influence degree of each parameter in multiple parameters in a first translation model on a translation result in a first field is determined to obtain at least one first parameter and at least one second parameter. By using the first corpus of the first field, the at least one first parameter is trained obtain the second translation model, and the at least one second parameter remains unchanged. Similarity between a translation result of the second translation model in the first field and a translation result of the first translation model in the first field meets a preset condition.

    IMAGE SEGMENTATION METHOD AND APPARATUS, AND DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20230394671A1

    公开(公告)日:2023-12-07

    申请号:US18251228

    申请日:2021-09-27

    Inventor: Tao KONG Ya JING Lei LI

    CPC classification number: G06T7/11 G06T3/40 G06T2207/20221

    Abstract: Provided are an image segmentation method and apparatus, a device, and a storage medium. The image segmentation method includes: fusing a visual feature corresponding to an original image with a text feature corresponding to a description language to obtain a multimodal feature, where the description language is used for specifying a target object to be segmented in the original image; determining a visual region of the target object according to an image corresponding to the multimodal feature and recording an image corresponding to the visual region as a response heat map; and determining a segmentation result of the target object according to the image corresponding to the multimodal feature and the response heat map.

    GUARANTEEING ONLINE SERVICES BASED ON PREDICTING FAILURES OF STORAGE DEVICES

    公开(公告)号:US20230359364A1

    公开(公告)日:2023-11-09

    申请号:US18223477

    申请日:2023-07-18

    CPC classification number: G06F3/0614 G06F3/0679 G06F3/0649

    Abstract: The present disclosure describes techniques for guaranteeing online services based on predicting failures of storage devices. Statistical data may be extracted on a regular basis by each of a plurality of storage devices. Each of the plurality of storage devices may comprise a set of NAND dies. Each of the set of NAND dies may be configured to measure and track a set of metrics indicating characteristics of each NAND die. Prediction data indicating potential failures of the plurality of storage devices may be generated. The prediction data may be shared with a host on a periodic basis. A strategy of decommissioning an aged storage device and adding a new storage device based on the prediction data may be created by the host. The data migration to the new storage device may be implemented.

Patent Agency Ranking