Efficient transformation techniques for video coding
    1.
    发明授权
    Efficient transformation techniques for video coding 有权
    高效的视频编码技术

    公开(公告)号:US08654833B2

    公开(公告)日:2014-02-18

    申请号:US11861804

    申请日:2007-09-26

    Abstract: This disclosure describes efficient transformation techniques that can be used in video coding. In particular, intermediate results of computations associated with transformation of a first block of video data are reused in the transformation of a second block of video data. The techniques may be used during a motion estimation process in which video blocks of a search space are transformed, but this disclosure is not necessarily limited in this respect. Pipelining techniques may be used to accelerate the efficient transformation techniques, and transposition memories can be implemented to facilitate efficient pipelining.

    Abstract translation: 本公开描述了可用于视频编码的有效的转换技术。 特别地,与第一视频数据块的变换相关联的计算的中间结果在第二视频数据块的变换中被重新使用。 可以在其中对搜索空间的视频块进行变换的运动估计处理期间使用这些技术,但是本发明并不限于此。 可以使用流水线技术来加速有效的转换技术,并且可以实现转置存储器以便于有效的流水线化。

    EFFICIENT TRANSFORMATION TECHNIQUES FOR VIDEO CODING
    2.
    发明申请
    EFFICIENT TRANSFORMATION TECHNIQUES FOR VIDEO CODING 有权
    视频编码的高效转换技术

    公开(公告)号:US20090080515A1

    公开(公告)日:2009-03-26

    申请号:US11861804

    申请日:2007-09-26

    Abstract: This disclosure describes efficient transformation techniques that can be used in video coding. In particular, intermediate results of computations associated with transformation of a first block of video data are reused in the transformation of a second block of video data. The techniques may be used during a motion estimation process in which video blocks of a search space are transformed, but this disclosure is not necessarily limited in this respect. Pipelining techniques may be used to accelerate the efficient transformation techniques, and transposition memories can be implemented to facilitate efficient pipelining.

    Abstract translation: 本公开描述了可用于视频编码的有效的转换技术。 特别地,与第一视频数据块的变换相关联的计算的中间结果在第二视频数据块的变换中被重新使用。 可以在其中对搜索空间的视频块进行变换的运动估计处理期间使用这些技术,但是本发明并不限于此。 可以使用流水线技术来加速有效的转换技术,并且可以实现转置存储器以便于有效的流水线化。

    Efficient fetching for motion compensation video decoding process
    3.
    发明授权
    Efficient fetching for motion compensation video decoding process 有权
    高效提取运动补偿视频解码过程

    公开(公告)号:US08559514B2

    公开(公告)日:2013-10-15

    申请号:US11495270

    申请日:2006-07-27

    CPC classification number: H04N19/57 H04N19/433 H04N19/44

    Abstract: Efficient memory fetching techniques are described that can improve data fetching during a motion compensation decoding process. The techniques propose several different memory fetching modes that may be very efficient in different scenarios of the motion compensation decoding process. A motion compensator may a particular memory fetch mode from a plurality of possible modes on a case-by-case basis for the memory fetches associated with a motion compensation decoding process of a macroblock. The techniques described herein may be particularly useful when fractional interpolation to sub-integer pixels is used in the inter-frame compression.

    Abstract translation: 描述了可以在运动补偿解码过程期间改进数据获取的高效存储器获取技术。 该技术提出了在运动补偿解码过程的不同情况下可能非常有效的几种不同的存储器提取模式。 运动补偿器可以针对与宏块的运动补偿解码过程相关联的存储器提取,在多个可能模式中针对具体存储器提取模式。 当在帧间压缩中使用对子整数像素的分数插值时,本文描述的技术可能是特别有用的。

    Efficient fetching for motion compensation video decoding process
    4.
    发明申请
    Efficient fetching for motion compensation video decoding process 有权
    高效提取运动补偿视频解码过程

    公开(公告)号:US20080025398A1

    公开(公告)日:2008-01-31

    申请号:US11495270

    申请日:2006-07-27

    CPC classification number: H04N19/57 H04N19/433 H04N19/44

    Abstract: This disclosure describes efficient memory fetching techniques that can improve data fetching during a motion compensation decoding process. The techniques propose several different memory fetching modes that may be very efficient in different scenarios of the motion compensation decoding process. In some embodiments, a motion compensator selects a particular memory fetches associated with a motion compensation decoding process of a macroblock. The techniques described herein may be particularly useful when fractional interpolation to sub-integer pixels is used in the inter-frame compression.

    Abstract translation: 本公开描述了可以在运动补偿解码过程期间改进数据获取的有效存储器获取技术。 该技术提出了在运动补偿解码过程的不同情况下可能非常有效的几种不同的存储器提取模式。 在一些实施例中,运动补偿器选择与宏块的运动补偿解码过程相关联的特定存储器提取。 当在帧间压缩中使用对子整数像素的分数插值时,本文描述的技术可能是特别有用的。

    Waveform fetch unit for processing audio files
    5.
    发明授权
    Waveform fetch unit for processing audio files 有权
    用于处理音频文件的波形提取单元

    公开(公告)号:US07807914B2

    公开(公告)日:2010-10-05

    申请号:US12041834

    申请日:2008-03-04

    CPC classification number: G10H7/004 G10H2230/031 G10H2250/641

    Abstract: This disclosure describes techniques that make use of a waveform fetch unit that operates to retrieve waveform samples on behalf of each of a plurality of hardware processing elements that operate simultaneously to service various audio synthesis parameters generated from one or more audio files, such as musical instrument digital interface (MIDI) files. In one example, a method comprises receiving a request for a waveform sample from an audio processing element, and servicing the request by calculating a waveform sample number for the requested waveform sample based on a phase increment contained in the request and an audio synthesis parameter control word associated with the requested waveform sample, retrieving the waveform sample from a local cache using the waveform sample number, and sending the retrieved waveform sample to the requesting audio processing element.

    Abstract translation: 本公开描述了利用波形获取单元的技术,其操作以代表同时操作的多个硬件处理元件中的每一个来检索波形采样,以服务从一个或多个音频文件(例如乐器)生成的各种音频合成参数 数字接口(MIDI)文件。 在一个示例中,一种方法包括从音频处理元件接收对波形样本的请求,并且通过基于包含在请求中的相位增量和音频合成参数控制来计算所请求的波形样本的波形采样数来为该请求提供服务 与所请求的波形样本相关联的单词,使用波形采样号从本地高速缓存中检索波形采样,以及将检索的波形样本发送到请求音频处理单元。

    Voice and text communication system, method and apparatus
    6.
    发明申请
    Voice and text communication system, method and apparatus 审中-公开
    语音和文本通信系统,方法和设备

    公开(公告)号:US20100030557A1

    公开(公告)日:2010-02-04

    申请号:US11497011

    申请日:2006-07-31

    Abstract: The disclosure relates to systems, methods and apparatus to convert speech to text and vice versa. One apparatus comprises a vocoder, a speech to text conversion engine, a text to speech conversion engine, and a user interface. The vocoder is operable to convert speech signals into packets and convert packets into speech signals. The speech to text conversion engine is operable to convert speech to text. The text to speech conversion engine is operable to convert text to speech. The user interface is operable to receive a user selection of a mode from among a plurality of modes, wherein a first mode enables the speech to text conversion engine, a second mode enables the text to speech conversion engine, and a third mode enables the speech to text conversion engine and the text to speech conversion engine.

    Abstract translation: 本公开涉及将语音转换为文本的系统,方法和装置,反之亦然。 一种装置包括声码器,语音到文本转换引擎,文本到语音转换引擎和用户界面。 声码器可操作以将语音信号转换成分组并将分组转换为语音信号。 语音到文本转换引擎可操作以将语音转换为文本。 文本到语音转换引擎可操作以将文本转换为语音。 用户接口可操作以从多个模式中接收用户对模式的选择,其中第一模式使得语音能够进行文本转换引擎,第二模式使文本到语音转换引擎成为可能,第三模式使语音 将文字转换引擎和文字转换为语音转换引擎。

    AUDIO PROCESSING HARDWARE ELEMENTS
    7.
    发明申请
    AUDIO PROCESSING HARDWARE ELEMENTS 失效
    音频处理硬件元件

    公开(公告)号:US20080229919A1

    公开(公告)日:2008-09-25

    申请号:US12042181

    申请日:2008-03-04

    CPC classification number: G10H7/004 G10H1/0066 G10H2230/031 G10H2250/541

    Abstract: This disclosure describes techniques that make use of a plurality of hardware elements that operate simultaneously to service synthesis parameters generated from one or more audio files, such as musical instrument digital interface (MIDI) files. In one example, a method comprises storing audio synthesis parameters generated for one or more audio files of an audio frame, processing a first audio synthesis parameter using a first audio processing element of a hardware unit to generate first audio information, processing a second audio synthesis parameter using a second audio processing element of the hardware unit to generate second audio information, and generating audio samples for the audio frame based at least in part on a combination of the first and second audio information.

    Abstract translation: 本公开描述了利用多个硬件元件的技术,该多个硬件元件同时工作来服务从诸如乐器数字接口(MIDI)文件的一个或多个音频文件生成的合成参数。 在一个示例中,一种方法包括存储为音频帧的一个或多个音频文件生成的音频合成参数,使用硬件单元的第一音频处理元件处理第一音频合成参数以产生第一音频信息,处理第二音频合成 参数,使用所述硬件单元的第二音频处理元件来产生第二音频信息,以及至少部分地基于所述第一和第二音频信息的组合来生成所述音频帧的音频样本。

    PIPELINE TECHNIQUES FOR PROCESSING MUSICAL INSTRUMENT DIGITAL INTERFACE (MIDI) FILES
    8.
    发明申请
    PIPELINE TECHNIQUES FOR PROCESSING MUSICAL INSTRUMENT DIGITAL INTERFACE (MIDI) FILES 失效
    用于处理音乐数字接口(MIDI)文件的管道技术

    公开(公告)号:US20080229918A1

    公开(公告)日:2008-09-25

    申请号:US12042170

    申请日:2008-03-04

    CPC classification number: G10H1/0066 G10H7/004

    Abstract: This disclosure describes techniques for processing audio files that comply with the musical instrument digital interface (MIDI) format. In particular, various tasks associated with MIDI file processing are delegated between software operating on a general purpose processor, firmware associated with a digital signal processor (DSP), and dedicated hardware that is specifically designed for MIDI file processing. Alternatively, a multi-threaded DSP may be used instead of a general purpose processor and the DSP. In one aspect, this disclosure provides a method comprising parsing MIDI files and scheduling MIDI events associated with the MIDI files using a first process, processing the MIDI events using a second process to generate MIDI synthesis parameters, and generating audio samples using a hardware unit based on the synthesis parameters.

    Abstract translation: 本公开描述了用于处理符合乐器数字接口(MIDI)格式的音频文件的技术。 具体而言,与在通用处理器上运行的软件,与数字信号处理器(DSP)相关联的固件以及专门为MIDI文件处理而专门设计的专用硬件之间的任何与MIDI文件处理相关的各种任务被委派。 或者,可以使用多线程DSP来代替通用处理器和DSP。 在一个方面,本公开提供了一种方法,包括使用第一处理解析MIDI文件和调度与MIDI文件相关联的MIDI事件,使用第二处理来处理MIDI事件以产生MIDI合成参数,以及使用基于硬件单元生成音频样本 对合成参数。

    Memory organizational scheme and controller architecture for image and video processing
    9.
    发明申请
    Memory organizational scheme and controller architecture for image and video processing 有权
    内存组织方案和图像和视频处理的控制器架构

    公开(公告)号:US20070153095A1

    公开(公告)日:2007-07-05

    申请号:US11327241

    申请日:2006-01-05

    CPC classification number: G06T1/60 H04N19/423 H04N19/436 H04N19/61

    Abstract: This disclosure describes an organizational scheme for memory that is useful for image processing. A memory controller architecture is also described, which takes advantage of the organizational scheme. The organizational scheme and controller architecture is particularly useful for high performance, high quality image processing of images that form a video sequence, but may also be applied in other image processing settings. The described techniques and organizational structure of the memory also allows the memory to be shared for other storage applications of a video device.

    Abstract translation: 本公开描述了对于图像处理有用的存储器的组织方案。 还描述了利用组织方案的存储器控​​制器架构。 组织方案和控制器架构对于形成视频序列的图像的高性能,高质量图像处理特别有用,但也可以应用于其他图像处理设置。 所描述的存储器的技术和组织结构还允许为视频设备的其他存储应用共享存储器。

    Error recovery of corrupted MPEG-4 bitstreams using fuzzy decoding of start codes and resync markers
    10.
    发明授权
    Error recovery of corrupted MPEG-4 bitstreams using fuzzy decoding of start codes and resync markers 失效
    使用起始码和重新同步标记的模糊解码错误恢复损坏的MPEG-4比特流

    公开(公告)号:US06728318B2

    公开(公告)日:2004-04-27

    申请号:US09681234

    申请日:2001-03-02

    CPC classification number: H04N19/68 H04N19/89

    Abstract: A decoder for motion-picture-experts group (MPEG-4) video detects start codes at the beginning of video object planes (VOP) and resync markers at the start of each video packet (VP) in the VOP. When an error occurs in the bitstream, a parser searched for a next start code or resync marker to find the start of the next video packet. A partial match of the unique start-code bit sequence signals a fuzzy match, allowing the VOP header and data to be decoded even when bit errors occur in the VOP start code. A fuzzy match of the shorter resync marker can also be enabled. Fuzzy matching of VOP start codes and resync markers allows for faster recovery from corrupted bitstreams such as those transmitted over wireless networks.

    Abstract translation: 用于运动图像专家组(MPEG-4)视频的解码器检测VOP中每个视频分组(VP)开始处的视频对象平面(VOP)开始处的开始代码和重新同步标记。 当比特流中出现错误时,解析器搜索下一个起始码或重新同步标记以找到下一个视频包的开始。 唯一起始码位序列的部分匹配发出模糊匹配,即使在VOP起始码中出现位错误,也允许对VOP头和数据进行解码。 也可以启用更短的重新同步标记的模糊匹配。 VOP起始码和再同步标记的模糊匹配允许从损坏的比特流(例如通过无线网络传送的比特流)更快地恢复。

Patent Agency Ranking