Error recovery of corrupted MPEG-4 bitstreams using fuzzy decoding of start codes and resync markers
    1.
    发明授权
    Error recovery of corrupted MPEG-4 bitstreams using fuzzy decoding of start codes and resync markers 失效
    使用起始码和重新同步标记的模糊解码错误恢复损坏的MPEG-4比特流

    公开(公告)号:US06728318B2

    公开(公告)日:2004-04-27

    申请号:US09681234

    申请日:2001-03-02

    CPC classification number: H04N19/68 H04N19/89

    Abstract: A decoder for motion-picture-experts group (MPEG-4) video detects start codes at the beginning of video object planes (VOP) and resync markers at the start of each video packet (VP) in the VOP. When an error occurs in the bitstream, a parser searched for a next start code or resync marker to find the start of the next video packet. A partial match of the unique start-code bit sequence signals a fuzzy match, allowing the VOP header and data to be decoded even when bit errors occur in the VOP start code. A fuzzy match of the shorter resync marker can also be enabled. Fuzzy matching of VOP start codes and resync markers allows for faster recovery from corrupted bitstreams such as those transmitted over wireless networks.

    Abstract translation: 用于运动图像专家组(MPEG-4)视频的解码器检测VOP中每个视频分组(VP)开始处的视频对象平面(VOP)开始处的开始代码和重新同步标记。 当比特流中出现错误时,解析器搜索下一个起始码或重新同步标记以找到下一个视频包的开始。 唯一起始码位序列的部分匹配发出模糊匹配,即使在VOP起始码中出现位错误,也允许对VOP头和数据进行解码。 也可以启用更短的重新同步标记的模糊匹配。 VOP起始码和再同步标记的模糊匹配允许从损坏的比特流(例如通过无线网络传送的比特流)更快地恢复。

    Constrained discrete-cosine-transform coefficients for better error detection in a corrupted MPEG-4 bitstreams
    2.
    发明授权
    Constrained discrete-cosine-transform coefficients for better error detection in a corrupted MPEG-4 bitstreams 失效
    约束离散余弦变换系数,用于在损坏的MPEG-4比特流中进行更好的错误检测

    公开(公告)号:US06721362B2

    公开(公告)日:2004-04-13

    申请号:US09681423

    申请日:2001-03-30

    CPC classification number: H04N19/20 H04N19/65 H04N19/89

    Abstract: Error detection is added to a motion-picture-experts group (MPEG) decoder by checking each 8×8-pixel block for constraints. The constraints are added during compression by adjusting discrete cosine transform (DCT) coefficients in the block to meet a constraint. When the decoder determines that the constraint is not met by the DCT coefficients, an error is signaled for that block. The error can then be concealed using pixels from another frame or block. In one embodiment, the constraint is that the last two non-zero coefficients have the same magnitude. The constraint is added during compression after quantization but before variable-length coding by averaging the magnitudes and using the average magnitude for the last two non-zero coefficients. This minimizes visible distortion caused by the constraints and reduces computations.

    Abstract translation: 通过检查每个8×8像素块的约束将错误检测添加到运动图像专家组(MPEG)解码器。 在压缩期间通过调整块中的离散余弦变换(DCT)系数来满足约束来加入约束。 当解码器确定DCT系数不满足约束时,向该块发出错误信号。 然后可以使用来自另一个帧或块的像素隐藏该错误。 在一个实施例中,约束是最后两个非零系数具有相同的大小。 在量化之后的压缩期间,通过对可变长度编码进行平均,并且使用最后两个非零系数的平均幅度来增加约束。 这最大限度地减少了由约束引起的可见失真并减少了计算。

    Efficient fetching for motion compensation video decoding process
    3.
    发明授权
    Efficient fetching for motion compensation video decoding process 有权
    高效提取运动补偿视频解码过程

    公开(公告)号:US08559514B2

    公开(公告)日:2013-10-15

    申请号:US11495270

    申请日:2006-07-27

    CPC classification number: H04N19/57 H04N19/433 H04N19/44

    Abstract: Efficient memory fetching techniques are described that can improve data fetching during a motion compensation decoding process. The techniques propose several different memory fetching modes that may be very efficient in different scenarios of the motion compensation decoding process. A motion compensator may a particular memory fetch mode from a plurality of possible modes on a case-by-case basis for the memory fetches associated with a motion compensation decoding process of a macroblock. The techniques described herein may be particularly useful when fractional interpolation to sub-integer pixels is used in the inter-frame compression.

    Abstract translation: 描述了可以在运动补偿解码过程期间改进数据获取的高效存储器获取技术。 该技术提出了在运动补偿解码过程的不同情况下可能非常有效的几种不同的存储器提取模式。 运动补偿器可以针对与宏块的运动补偿解码过程相关联的存储器提取,在多个可能模式中针对具体存储器提取模式。 当在帧间压缩中使用对子整数像素的分数插值时,本文描述的技术可能是特别有用的。

    Waveform fetch unit for processing audio files
    4.
    发明授权
    Waveform fetch unit for processing audio files 有权
    用于处理音频文件的波形提取单元

    公开(公告)号:US07807914B2

    公开(公告)日:2010-10-05

    申请号:US12041834

    申请日:2008-03-04

    CPC classification number: G10H7/004 G10H2230/031 G10H2250/641

    Abstract: This disclosure describes techniques that make use of a waveform fetch unit that operates to retrieve waveform samples on behalf of each of a plurality of hardware processing elements that operate simultaneously to service various audio synthesis parameters generated from one or more audio files, such as musical instrument digital interface (MIDI) files. In one example, a method comprises receiving a request for a waveform sample from an audio processing element, and servicing the request by calculating a waveform sample number for the requested waveform sample based on a phase increment contained in the request and an audio synthesis parameter control word associated with the requested waveform sample, retrieving the waveform sample from a local cache using the waveform sample number, and sending the retrieved waveform sample to the requesting audio processing element.

    Abstract translation: 本公开描述了利用波形获取单元的技术,其操作以代表同时操作的多个硬件处理元件中的每一个来检索波形采样,以服务从一个或多个音频文件(例如乐器)生成的各种音频合成参数 数字接口(MIDI)文件。 在一个示例中,一种方法包括从音频处理元件接收对波形样本的请求,并且通过基于包含在请求中的相位增量和音频合成参数控制来计算所请求的波形样本的波形采样数来为该请求提供服务 与所请求的波形样本相关联的单词,使用波形采样号从本地高速缓存中检索波形采样,以及将检索的波形样本发送到请求音频处理单元。

    Voice and text communication system, method and apparatus
    5.
    发明申请
    Voice and text communication system, method and apparatus 审中-公开
    语音和文本通信系统,方法和设备

    公开(公告)号:US20100030557A1

    公开(公告)日:2010-02-04

    申请号:US11497011

    申请日:2006-07-31

    Abstract: The disclosure relates to systems, methods and apparatus to convert speech to text and vice versa. One apparatus comprises a vocoder, a speech to text conversion engine, a text to speech conversion engine, and a user interface. The vocoder is operable to convert speech signals into packets and convert packets into speech signals. The speech to text conversion engine is operable to convert speech to text. The text to speech conversion engine is operable to convert text to speech. The user interface is operable to receive a user selection of a mode from among a plurality of modes, wherein a first mode enables the speech to text conversion engine, a second mode enables the text to speech conversion engine, and a third mode enables the speech to text conversion engine and the text to speech conversion engine.

    Abstract translation: 本公开涉及将语音转换为文本的系统,方法和装置,反之亦然。 一种装置包括声码器,语音到文本转换引擎,文本到语音转换引擎和用户界面。 声码器可操作以将语音信号转换成分组并将分组转换为语音信号。 语音到文本转换引擎可操作以将语音转换为文本。 文本到语音转换引擎可操作以将文本转换为语音。 用户接口可操作以从多个模式中接收用户对模式的选择,其中第一模式使得语音能够进行文本转换引擎,第二模式使文本到语音转换引擎成为可能,第三模式使语音 将文字转换引擎和文字转换为语音转换引擎。

    AUDIO PROCESSING HARDWARE ELEMENTS
    6.
    发明申请
    AUDIO PROCESSING HARDWARE ELEMENTS 失效
    音频处理硬件元件

    公开(公告)号:US20080229919A1

    公开(公告)日:2008-09-25

    申请号:US12042181

    申请日:2008-03-04

    CPC classification number: G10H7/004 G10H1/0066 G10H2230/031 G10H2250/541

    Abstract: This disclosure describes techniques that make use of a plurality of hardware elements that operate simultaneously to service synthesis parameters generated from one or more audio files, such as musical instrument digital interface (MIDI) files. In one example, a method comprises storing audio synthesis parameters generated for one or more audio files of an audio frame, processing a first audio synthesis parameter using a first audio processing element of a hardware unit to generate first audio information, processing a second audio synthesis parameter using a second audio processing element of the hardware unit to generate second audio information, and generating audio samples for the audio frame based at least in part on a combination of the first and second audio information.

    Abstract translation: 本公开描述了利用多个硬件元件的技术,该多个硬件元件同时工作来服务从诸如乐器数字接口(MIDI)文件的一个或多个音频文件生成的合成参数。 在一个示例中,一种方法包括存储为音频帧的一个或多个音频文件生成的音频合成参数,使用硬件单元的第一音频处理元件处理第一音频合成参数以产生第一音频信息,处理第二音频合成 参数,使用所述硬件单元的第二音频处理元件来产生第二音频信息,以及至少部分地基于所述第一和第二音频信息的组合来生成所述音频帧的音频样本。

    PIPELINE TECHNIQUES FOR PROCESSING MUSICAL INSTRUMENT DIGITAL INTERFACE (MIDI) FILES
    7.
    发明申请
    PIPELINE TECHNIQUES FOR PROCESSING MUSICAL INSTRUMENT DIGITAL INTERFACE (MIDI) FILES 失效
    用于处理音乐数字接口(MIDI)文件的管道技术

    公开(公告)号:US20080229918A1

    公开(公告)日:2008-09-25

    申请号:US12042170

    申请日:2008-03-04

    CPC classification number: G10H1/0066 G10H7/004

    Abstract: This disclosure describes techniques for processing audio files that comply with the musical instrument digital interface (MIDI) format. In particular, various tasks associated with MIDI file processing are delegated between software operating on a general purpose processor, firmware associated with a digital signal processor (DSP), and dedicated hardware that is specifically designed for MIDI file processing. Alternatively, a multi-threaded DSP may be used instead of a general purpose processor and the DSP. In one aspect, this disclosure provides a method comprising parsing MIDI files and scheduling MIDI events associated with the MIDI files using a first process, processing the MIDI events using a second process to generate MIDI synthesis parameters, and generating audio samples using a hardware unit based on the synthesis parameters.

    Abstract translation: 本公开描述了用于处理符合乐器数字接口(MIDI)格式的音频文件的技术。 具体而言,与在通用处理器上运行的软件,与数字信号处理器(DSP)相关联的固件以及专门为MIDI文件处理而专门设计的专用硬件之间的任何与MIDI文件处理相关的各种任务被委派。 或者,可以使用多线程DSP来代替通用处理器和DSP。 在一个方面,本公开提供了一种方法,包括使用第一处理解析MIDI文件和调度与MIDI文件相关联的MIDI事件,使用第二处理来处理MIDI事件以产生MIDI合成参数,以及使用基于硬件单元生成音频样本 对合成参数。

    Memory organizational scheme and controller architecture for image and video processing
    8.
    发明申请
    Memory organizational scheme and controller architecture for image and video processing 有权
    内存组织方案和图像和视频处理的控制器架构

    公开(公告)号:US20070153095A1

    公开(公告)日:2007-07-05

    申请号:US11327241

    申请日:2006-01-05

    CPC classification number: G06T1/60 H04N19/423 H04N19/436 H04N19/61

    Abstract: This disclosure describes an organizational scheme for memory that is useful for image processing. A memory controller architecture is also described, which takes advantage of the organizational scheme. The organizational scheme and controller architecture is particularly useful for high performance, high quality image processing of images that form a video sequence, but may also be applied in other image processing settings. The described techniques and organizational structure of the memory also allows the memory to be shared for other storage applications of a video device.

    Abstract translation: 本公开描述了对于图像处理有用的存储器的组织方案。 还描述了利用组织方案的存储器控​​制器架构。 组织方案和控制器架构对于形成视频序列的图像的高性能,高质量图像处理特别有用,但也可以应用于其他图像处理设置。 所描述的存储器的技术和组织结构还允许为视频设备的其他存储应用共享存储器。

    Residual coding in compliance with a video standard using non-standardized vector quantization coder
    9.
    发明授权
    Residual coding in compliance with a video standard using non-standardized vector quantization coder 有权
    使用非标准化矢量量化编码器的符合视频标准的残留编码

    公开(公告)号:US07804901B2

    公开(公告)日:2010-09-28

    申请号:US11271218

    申请日:2005-11-10

    Applicant: Stephen Molloy

    Inventor: Stephen Molloy

    CPC classification number: H04N19/94 H04N19/61

    Abstract: This disclosure describes residual coding techniques that use vector quantization coding, which relies on template matching. However, rather than transmit vector quantization codebook indexes (or codewords) that may have no meaning to standard compliant CODECs, the techniques described herein map the residual templates to pre-computed compressed residual blocks that are compliant with a video standard. The standard compliant compressed residual blocks can then be transmitted to another device, which complies with the standard. The techniques can exploit the advantages associated with vector quantization coding, such as the ability to perform more parallel processing, while still maintaining compliance with a video coding standard.

    Abstract translation: 本公开描述了使用依赖于模板匹配的矢量量化编码的残差编码技术。 然而,本文描述的技术不是传送对标准兼容编解码器无意义的矢量量化码本索引(或码字),而是将残留模板映射到符合视频标准的预先计算的压缩残余块。 然后,可以将符合标准的压缩残余块传输到符合标准的另一个设备。 这些技术可以利用与矢量量化编码相关联的优点,例如执行更多并行处理的能力,同时仍然保持与视频编码标准的一致性。

    Pipeline techniques for processing musical instrument digital interface (MIDI) files
    10.
    发明授权
    Pipeline techniques for processing musical instrument digital interface (MIDI) files 失效
    用于处理乐器数字接口(MIDI)文件的管道技术

    公开(公告)号:US07663046B2

    公开(公告)日:2010-02-16

    申请号:US12042170

    申请日:2008-03-04

    CPC classification number: G10H1/0066 G10H7/004

    Abstract: This disclosure describes techniques for processing audio files that comply with the musical instrument digital interface (MIDI) format. In particular, various tasks associated with MIDI file processing are delegated between software operating on a general purpose processor, firmware associated with a digital signal processor (DSP), and dedicated hardware that is specifically designed for MIDI file processing. Alternatively, a multi-threaded DSP may be used instead of a general purpose processor and the DSP. In one aspect, this disclosure provides a method comprising parsing MIDI files and scheduling MIDI events associated with the MIDI files using a first process, processing the MIDI events using a second process to generate MIDI synthesis parameters, and generating audio samples using a hardware unit based on the synthesis parameters.

    Abstract translation: 本公开描述了用于处理符合乐器数字接口(MIDI)格式的音频文件的技术。 具体而言,与在通用处理器上运行的软件,与数字信号处理器(DSP)相关联的固件以及专门为MIDI文件处理而专门设计的专用硬件之间的任何与MIDI文件处理相关的各种任务被委派。 或者,可以使用多线程DSP来代替通用处理器和DSP。 在一个方面,本公开提供了一种方法,包括使用第一处理解析MIDI文件和调度与MIDI文件相关联的MIDI事件,使用第二处理来处理MIDI事件以产生MIDI合成参数,以及使用基于硬件单元生成音频样本 对合成参数。

Patent Agency Ranking