ARTIFICIAL INTELLIGENCE (AI) ENCODING APPARATUS AND METHOD AND AI DECODING APPARATUS AND METHOD FOR REGION OF OBJECT OF INTEREST IN IMAGE

    公开(公告)号:US20230276070A1

    公开(公告)日:2023-08-31

    申请号:US18195221

    申请日:2023-05-09

    CPC classification number: H04N19/59 H04N19/124 H04N19/188

    Abstract: An artificial intelligence (AI) encoding apparatus includes a memory storing one or more instructions, and a processor configured to execute the one or more instructions stored in the memory to identify an object region of interest in an original image, obtain, from the original image, a plurality of original part images respectively including the object region of interest and a non-interest region, obtain a plurality of first images by performing AI scaling on the plurality of original part images through a scaling neural network (NN) that is configured to operate with NN setting information selected from among a plurality of pieces of NN setting information, at least based on whether the plurality of original part images include the object region of interest or the non-interest region, generate image data by encoding the plurality of first images, and transmit the image data, and AI data including information related to the AI scaling.

    Apparatus and method for processing audio

    公开(公告)号:US12062377B2

    公开(公告)日:2024-08-13

    申请号:US17722569

    申请日:2022-04-18

    CPC classification number: G10L19/008 G06N3/08 H04S3/008 H04S2400/01

    Abstract: An audio processing apparatus may obtain second audio signals corresponding to channels included in a second channel group from first audio signals corresponding to channels included in a first channel group, downsample at least one third audio signal corresponding to at least one channel identified based on a correlation with the second channel group from among the channels included in the first channel group, by using an artificial intelligence (AI) model, and generate a bitstream including the second audio signals corresponding to the channels included in the second channel group and the downsampled at least one third audio signal. The first channel group includes a channel group of an original audio signal, and the second channel group is constructed by combining at least two channels from among the channels included in the first channel group.

    Apparatus and method for processing multi-channel audio signal

    公开(公告)号:US12200464B2

    公开(公告)日:2025-01-14

    申请号:US17728037

    申请日:2022-04-25

    Abstract: According to various embodiments of the disclosure, an audio processing apparatus includes at least one processor configured to execute one or more instructions to obtain a second audio signal down-mixed from at least one first audio signal, obtain information related to error removal for the at least one first audio signal, de-mix the at least one first audio signal from the down-mixed second audio signal, and reconstruct the at least one first audio signal by applying the information related to the error removal for the at least one first audio signal to the at least one first audio signal de-mixed from the second audio signal. The information related to the error removal having been generated using at least one of an original signal power of the at least one first audio signal or a second signal power of the at least one first audio signal after decoding.

Patent Agency Ranking