Frequency segmentation to obtain bands for efficient coding of digital media
    72.
    发明申请
    Frequency segmentation to obtain bands for efficient coding of digital media 有权
    频率分割以获得有效编码数字媒体的频带

    公开(公告)号:US20070016412A1

    公开(公告)日:2007-01-18

    申请号:US11183087

    申请日:2005-07-15

    CPC classification number: G10L19/0208 G10L19/24

    Abstract: Frequency segmentation is important to the quality of encoding spectral data. Segmentation involves breaking the spectral data into units called sub-bands or vectors. Homogeneous segmentation may be suboptimal. Various features are described for providing spectral data intensity dependent segmentation. Finer segmentation is provided for regions of greater spectral variance and coarser segmentation is provided for more homogeneous regions. Sub-bands which have similar characteristics may be merged with very little effect on quality, whereas sub-bands with highly variable data may be better represented if a sub-band is split. Various methods are described for measuring tonality, energy, or shape of a sub-band. These various measurements are discussed in light of making decisions of when to split or merge sub-bands to provide variable frequency segmentation.

    Abstract translation: 频率分割对于编码光谱数据的质量很重要。 分段涉及将频谱数据分解成称为子带或向量的单元。 均匀分割可能不是最佳的。 描述了各种特征,用于提供频谱数据强度相关分割。 为更大的频谱方差的区域提供更精细的分割,为更均匀的区域提供较粗的分割。 具有相似特性的子带可以对质量影响很小,而如果子带被分割,则可以更好地表示具有高度可变数据的子带。 描述了用于测量子带的音调,能量或形状的各种方法。 根据决定何时拆分或合并子带以提供可变频率分段来讨论这些各种测量。

    Text detection in continuous tone image segments
    73.
    发明授权
    Text detection in continuous tone image segments 失效
    连续色调图像段中的文本检测

    公开(公告)号:US07085420B2

    公开(公告)日:2006-08-01

    申请号:US10186887

    申请日:2002-06-28

    Inventor: Sanjeev Mehrotra

    CPC classification number: G06T9/00

    Abstract: For encoding of mixed-mode images containing text and continuous-tone content, the pixels in the image that form the text content are detected and separated. Text detection classifies pixels as text or continuous tone content by accumulating pixel counts for groups of contiguous, non-smooth pixels with the same color. Groups whose pixel count exceeds a threshold are classified as text. The text detection technique further reduces classification errors by testing for boundary dimensions and pixel density of the group characteristic of long straight lines or large borders. The text detection technique further searches the neighborhood of groups qualifying as text for pixels of the same color, so as to also detect pixels for isolated text marks like dots, accents or punctuation. The separated text and continuous-tone content can be encoded separately for efficient compression while preserving text quality, and the text again superimposed on the continuous tone content at decompression.

    Abstract translation: 对于包含文本和连续色调内容的混合模式图像的编码,检测和分离形成文本内容的图像中的像素。 文本检测通过为具有相同颜色的连续非平滑像素组累加像素计数来将像素分类为文本或连续色调内容。 像素数超过阈值的组被分类为文本。 文本检测技术通过测试长直线或大边界特征组的边界尺寸和像素密度,进一步减少分类误差。 文本检测技术进一步搜索符合相同颜色的像素的文本的组的邻域,以便还检测像点,重音符或标点符号之类的孤立文本标记的像素。 分离的文本和连续色调内容可以单独编码以进行有效压缩,同时保持文本质量,并且文本在解压缩时再次叠加在连续色调内容上。

    Intra compression of pixel blocks using predicted mean
    74.
    发明申请
    Intra compression of pixel blocks using predicted mean 有权
    使用预测平均值的像素块的内部压缩

    公开(公告)号:US20050259877A1

    公开(公告)日:2005-11-24

    申请号:US10403962

    申请日:2003-03-31

    Abstract: An apparatus and method for encoding video frames is provided. The video frames are divided into blocks for encoding. Encoding of the video blocks utilizes motion detection, motion estimation and adaptive compression, to obtain the desired compression for a particular bit rate. Adaptive compression includes intra compression (without regard to other frames) and inter compression (with regard to other frames). Intra compression, inter compression with motion detection, and inter compression with motion estimation are performed on a block by block basis, as needed. Segmentation is provided to compare encoding of a block with encoding of its sub-blocks, and to select the best block size for encoding.

    Abstract translation: 提供了一种用于编码视频帧的装置和方法。 视频帧被分成块进行编码。 视频块的编码利用运动检测,运动估计和自适应压缩,以获得针对特定比特率的期望的压缩。 自适应压缩包括帧内压缩(不考虑其他帧)和帧间压缩(关于其他帧)。 根据需要逐块地执行内部压缩,具有运动检测的帧间压缩和具有运动估计的帧间压缩。 提供分段以比较块的编码与其子块的编码,并选择用于编码的最佳块大小。

    Complex-transform channel coding with extended-band frequency coding
    75.
    发明授权
    Complex-transform channel coding with extended-band frequency coding 有权
    具有扩展频带编码的复变换信道编码

    公开(公告)号:US09105271B2

    公开(公告)日:2015-08-11

    申请号:US12907889

    申请日:2010-10-19

    CPC classification number: G10L21/038 G10L19/008

    Abstract: An audio encoder receives multi-channel audio data comprising a group of plural source channels and performs channel extension coding, which comprises encoding a combined channel for the group and determining plural parameters for representing individual source channels of the group as modified versions of the encoded combined channel. The encoder also performs frequency extension coding. The frequency extension coding can comprise, for example, partitioning frequency bands in the multi-channel audio data into a baseband group and an extended band group, and coding audio coefficients in the extended band group based on audio coefficients in the baseband group. The encoder also can perform other kinds of transforms. An audio decoder performs corresponding decoding and/or additional processing tasks, such as a forward complex transform.

    Abstract translation: 音频编码器接收包括一组多个源信道的多声道音频数据,并执行信道扩展编码,其包括对该组的组合信道进行编码,并确定用于表示该组的各个源信道的多个参数,作为编码组合的修改版本 渠道。 编码器还执行频率扩展编码。 频率扩展编码可以包括例如将多声道音频数据中的频带划分成基带组和扩展频带组,并且基于基带组中的音频系数对扩展频带组中的音频系数进行编码。 编码器还可以执行其他类型的转换。 音频解码器执行相应的解码和/或附加处理任务,例如前向复合变换。

    Content aware chunking for achieving an improved chunk size distribution
    76.
    发明授权
    Content aware chunking for achieving an improved chunk size distribution 有权
    内容感知分块实现改进的块大小分布

    公开(公告)号:US08918375B2

    公开(公告)日:2014-12-23

    申请号:US13222198

    申请日:2011-08-31

    Abstract: The subject disclosure is directed towards partitioning a file into chunks that satisfy a chunk size restriction, such as maximum and minimum chunk sizes, using a sliding window. For file positions within the chunk size restriction, a signature representative of a window fingerprint is compared with a target pattern, with a chunk boundary candidate identified if matched. Other signatures and patterns are then checked to determine a highest ranking signature (corresponding to a lowest numbered Rule) to associate with that chunk boundary candidate, or set an actual boundary if the highest ranked signature is matched. If the maximum chunk size is reached without matching the highest ranked signature, the chunking mechanism regresses to set the boundary based on the candidate with the next highest ranked signature (if no candidates, the boundary is set at the maximum). Also described is setting chunk boundaries based upon pattern detection (e.g., runs of zeros).

    Abstract translation: 本发明涉及使用滑动窗口将文件分成满足块大小限制的块,例如最大和最小块大小。 对于块大小限制内的文件位置,将窗口指纹的签名代表与目标模式进行比较,如果匹配则识别出块边界候选。 然后检查其他签名和模式以确定与该块块边界候选者相关联的最高排名签名(对应于最小编号的规则),或者如果最高排名签名匹配则设置实际边界。 如果没有匹配最高排名的签名达到最大块大小,则分块机制基于具有下一个最高排名的签名的候选者(如果没有候选,边界被设置为最大)而退化以设置边界。 还描述了基于模式检测(例如,零的运行)设置块边界。

    Client-side audio signal mixing on low computational power player using beat metadata
    77.
    发明授权
    Client-side audio signal mixing on low computational power player using beat metadata 有权
    使用节拍元数据在低计算能力播放器上的客户端音频信号混合

    公开(公告)号:US08426715B2

    公开(公告)日:2013-04-23

    申请号:US11958331

    申请日:2007-12-17

    CPC classification number: G11B20/10527 G11B27/10 G11B2020/10546

    Abstract: A low computational power digital audio player achieves beat continuous transitioning between digital audio pieces based on beat metadata, which can be generated via offline processing on a higher computational power computer or via background or idle processing on the digital audio player. The digital audio player produces playlists of beat matching compatible songs based on the metadata, or pick lists of songs that are beat matching compatible with a currently playing song. By facilitating selection of songs with beat matching compatible tempos based on metadata, the beat continuous transitions can be achieved without altering the beat tempo of digital audio pieces, or with simple resampling.

    Abstract translation: 低计算能力数字音频播放器可以基于节拍元数据实现数字音频片段之间的节拍连续转换,其可以通过在较高计算能力计算机上的离线处理或经由数字音频播放器上的背景或空闲处理来生成。 数字音频播放器产生基于元数据的匹配兼容歌曲的播放列表,或者选择与当前播放的歌曲兼容的节拍匹配的歌曲列表。 通过促进基于元数据的匹配匹配兼容节奏的歌曲的选择,可以在不改变数字音频片段的节拍速度的情况下或者通过简单的重采样来实现节拍连续转变。

    Lossless and near lossless scalable audio codec
    79.
    发明授权
    Lossless and near lossless scalable audio codec 有权
    无损和近无损可扩展音频编解码器

    公开(公告)号:US08386271B2

    公开(公告)日:2013-02-26

    申请号:US12055223

    申请日:2008-03-25

    CPC classification number: G10L19/24 G10L19/0017 G10L19/008 G10L19/0212

    Abstract: A scalable audio codec encodes an input audio signal as a base layer at a high compression ratio and one or more residual signals as an enhancement layer of a compressed bitstream, which permits a lossless or near lossless reconstruction of the input audio signal at decoding. The scalable audio codec uses perceptual transform coding to encode the base layer. The residual is calculated in a transform domain, which includes a frequency and possibly also multi-channel transform of the input audio. For lossless reconstruction, the frequency and multi-channel transforms are reversible.

    Abstract translation: 可扩展音频编解码器将输入音频信号编码为高压缩比的基本层和作为压缩比特流的增强层的一个或多个残留信号,其允许在解码时输入音频信号的无损或近无损重建。 可扩展音频编解码器使用感知变换编码来对基本层进行编码。 在变换域中计算残差,其包括输入音频的频率和可能的多通道变换。 对于无损重建,频率和多通道变换是可逆的。

Patent Agency Ranking