MACHINE LEARNING BASED RATE-DISTORTION OPTIMIZER FOR VIDEO COMPRESSION

    公开(公告)号:US20220256169A1

    公开(公告)日:2022-08-11

    申请号:US17165680

    申请日:2021-02-02

    Abstract: Systems and techniques are described for data encoding using a machine learning approach to generate a distortion prediction {circumflex over (D)} and a predicted bit rate {circumflex over (R)}, and to use {circumflex over (D)} and {circumflex over (R)} to perform rate-distortion optimization (RDO). For example, a video encoder can generate the distortion prediction {circumflex over (D)} and the bit rate residual prediction based on outputs of the one or more neural networks in response to the one or more neural networks receiving a residual portion of a block of a video frame as input. The video encoder can determine bit rate metadata prediction based on metadata associated with a mode of compression, and determine {circumflex over (R)} to be the sum of and . The video encoder can determine a rate-distortion cost prediction Ĵ as a function of {circumflex over (D)} and {circumflex over (R)}, and can determine a prediction mode for compressing the block based on Ĵ.

    MULTIPLE HYPOTHESIS TESTING FOR WORD DETECTION
    16.
    发明申请
    MULTIPLE HYPOTHESIS TESTING FOR WORD DETECTION 有权
    用于词检测的多重假设测试

    公开(公告)号:US20150063700A1

    公开(公告)日:2015-03-05

    申请号:US14268904

    申请日:2014-05-02

    CPC classification number: G06K9/18 G06K9/344 G06K9/6821 G06K9/723 G06K2209/01

    Abstract: Embodiments disclosed pertain to Optical Character Recognition using Multiple Hypothesis Testing based techniques on images occurring in a variety of settings, including images captured by mobile stations. In some embodiments, a set of bifurcation points for a character cluster in an image may be determined. The character cluster may comprise non-uniformly spaced text or closely spaced text. A plurality of hypotheses may be determined for the character cluster, where each hypothesis is based on a subset of the bifurcation points and comprises a set of words generated from the character cluster. A plurality of scores corresponding to the plurality of hypotheses may be determined, where each score corresponds to a hypothesis, and a hypothesis may be selected from among the plurality of hypotheses based on a score associated with the selected hypothesis.

    Abstract translation: 所公开的实施例涉及使用基于多种假设检验技术的光学字符识别,所述技术包括在各种设置中出现的图像,包括由移动台捕获的图像。 在一些实施例中,可以确定图像中的字符簇的一组分叉点。 字符簇可以包括非均匀间隔的文本或紧密间隔的文本。 可以针对字符簇确定多个假设,其中每个假设基于分支点的子集,并且包括从字符簇生成的一组单词。 可以确定与多个假设相对应的多个分数,其中每个分数对应于假设,并且可以基于与所选择的假设相关联的评分从多个假设中选择假设。

Patent Agency Ranking