Method and apparatus for implementing motion estimation in video compression
    1.
    发明授权
    Method and apparatus for implementing motion estimation in video compression 失效
    用于在视频压缩中实现运动估计的方法和装置

    公开(公告)号:US06584226B1

    公开(公告)日:2003-06-24

    申请号:US08819587

    申请日:1997-03-14

    Abstract: Methods and apparatus for processing video data that is divided into frames are presented. In one aspect, this relates to a method for processing video data that is divided into frames. The video data includes a current frame, which has an associated current macroblock, and an adjacent frame, which also has an associated macroblock. The method for processing video data involves obtaining an uncompressed current block that is a part of the current macroblock and an adjacent block that is part of the adjacent macroblock, and calculating a distance between the uncompressed current block and the adjacent block. It is determined whether the distance between the uncompressed current block and the adjacent block is acceptable. If the distance is unacceptable, then the motion between the uncompressed current block and the adjacent block is estimated, and the uncompressed current block is adaptively compressed.

    Abstract translation: 提出了分割为帧的视频数据处理方法和装置。 一方面,这涉及一种用于处理被划分成帧的视频数据的方法。 视频数据包括具有相关联的当前宏块的当前帧以及也具有相关宏块的相邻帧。 用于处理视频数据的方法涉及获得作为当前宏块的一部分的未压缩的当前块和作为相邻宏块的一部分的相邻块,以及计算未压缩的当前块与相邻块之间的距离。 确定未压缩的当前块与相邻块之间的距离是否可接受。 如果距离不可接受,则估计未压缩的当前块和相邻块之间的运动,并且自压压缩未压缩的当前块。

    Method and apparatus for implementing motion detection in video compression
    2.
    发明授权
    Method and apparatus for implementing motion detection in video compression 失效
    用于在视频压缩中实现运动检测的方法和装置

    公开(公告)号:US06639945B2

    公开(公告)日:2003-10-28

    申请号:US09759408

    申请日:2001-01-12

    Abstract: The present invention relates, in one aspect, to a method for processing video data that is divided into frames. The video data includes a current frame, which has an associated current macroblock, and an adjacent frame, which also has an associated macroblock. The method for processing video data involves obtaining an uncompressed current block that is a part of the current macroblock and an adjacent block that is part of the adjacent macroblock, and calculating a distance between the uncompressed current block and the adjacent block. It is determined whether the distance between the uncompressed current block and the adjacent block is acceptable. If the distance is unacceptable, then the current block is adoptively compressed.

    Abstract translation: 本发明在一个方面涉及一种用于处理被划分成帧的视频数据的方法。 视频数据包括具有相关联的当前宏块的当前帧以及也具有相关宏块的相邻帧。 用于处理视频数据的方法涉及获得作为当前宏块的一部分的未压缩的当前块和作为相邻宏块的一部分的相邻块,以及计算未压缩的当前块与相邻块之间的距离。 确定未压缩的当前块与相邻块之间的距离是否可接受。 如果距离不可接受,则当前块被过度压缩。

    Multiple multicasting of multimedia streams
    3.
    发明授权
    Multiple multicasting of multimedia streams 失效
    多媒体流多播

    公开(公告)号:US06728775B1

    公开(公告)日:2004-04-27

    申请号:US08855246

    申请日:1997-05-13

    Applicant: Navin Chaddha

    Inventor: Navin Chaddha

    CPC classification number: G06T9/40 H04N19/30 H04N19/51 H04N19/527

    Abstract: A method of interactively providing a number of client computers with a dynamically selectable and scalable range of multimedia data over a diverse computer network including local area networks (LANs) wide area networks (WANs) such as the internet. Multimedia data is provided by a server to the client computers includes a base layer and one or more enhancement layers. Enhancement layers can be spatial and/or temporal in nature. Depending on the implementation, the server may also provide information about the multimedia data to the client computers. The server splits the multimedia data for streaming via multiple multicast group (MMG) addresses. Information about the portion of the multimedia data carried by each MMG is broadcasted to the client computers. Armed with the information about the multimedia data, client computers can intelligently join and leave MMGs as needed. In some embodiments, the client computers provide feedback about the usage and/or need for the multimedia data, enabling the server to right-size, e.g., grow and/or prune, the multimedia data for network efficiency. With right sizing, the content of the base layer may be increased or decreased with the corresponding growing and pruning of the enhancement layers. Enhancement layers may also be grown and/or pruned independently of the base layer, i.e., without a corresponding change in the base layer.

    Abstract translation: 一种通过包括例如因特网的局域网(WAN)广域网(WAN)在内的各种计算机网络交互地向多个客户端计算机提供动态可选择和可扩展的多媒体数据范围的方法。 多媒体数据由服务器提供给客户端计算机包括基本层和一个或多个增强层。 增强层可以是空间和/或时间性质的。 根据实现,服务器还可以向客户端计算机提供关于多媒体数据的信息。 服务器通过多个组播组(MMG)地址分割多媒体数据进行流式传输。 关于每个MMG携带的多媒体数据的部分的信息被广播到客户端计算机。 掌握有关多媒体数据的信息,客户端计算机可以根据需要智能地加入并离开MMG。 在一些实施例中,客户端计算机提供关于多媒体数据的使用和/或需要的反馈,使得服务器能够正确地大小,例如增长和/或修剪多媒体数据以实现网络效率。 利用正确的尺寸,可以随着增强层的相应生长和修剪而增加或减少基底层的含量。 增强层也可以独立于基层生长和/或修剪,即基底层没有相应的变化。

    Multimedia compression system with additive temporal layers
    4.
    发明授权
    Multimedia compression system with additive temporal layers 失效
    具有加性时间层的多媒体压缩系统

    公开(公告)号:US06392705B1

    公开(公告)日:2002-05-21

    申请号:US08888422

    申请日:1997-07-07

    Applicant: Navin Chaddha

    Inventor: Navin Chaddha

    CPC classification number: G06T9/40 H04N19/30 H04N19/51 H04N19/527

    Abstract: A multimedia compression system for generating frame rate scaleable data in the case of video, and, more generally, universally scaleable data. Universally scaleable data is scaleable across all of the relevant characteristics of the data. In the case of video, these characteristics include frame rate, resolution, and quality. The scaleable data generated by the compression system is comprised of multiple additive layers for each characteristic across which the data is scaleable. In the case of video, the frame rate layers are additive temporal layers, the resolution layers are additive base and enhancement layers, and the quality layers are additive index planes of embedded codes. Various techniques can be used for generating each of these layers (e.g., Laplacian pyramid decomposition or wavelet decomposition for generating the resolution layers; tree structured vector quantization or tree structured scalar quantization for generating the quality layers). The compression system further provides for embedded inter-frame compression in the context of frame rate scalability, and non-redundant layered multicast network delivery of the scaleable data.

    Abstract translation: 一种用于在视频情况下生成帧率可缩放数据的多媒体压缩系统,以及更普遍地,普遍可扩展的数据。 通用数据可扩展到数据的所有相关特征。 在视频的情况下,这些特征包括帧率,分辨率和质量。 由压缩系统产生的可扩展数据由对于数据可扩展的每个特性的多个附加层组成。 在视频的情况下,帧速率层是加性时间层,分辨率层是加性基础和增强层,质量层是嵌入代码的附加索引平面。 可以使用各种技术来生成这些层中的每一层(例如,用于生成分辨率层的拉普拉斯金字塔分解或小波分解;用于生成质量层的树结构矢量量化或树结构标量量化)。 压缩系统进一步提供了在帧速率可伸缩性的上下文中的嵌入式帧间压缩,以及可缩放数据的非冗余分层组播网络递送。

    Dynamic bandwidth selection for efficient transmission of multimedia streams in a computer network
    5.
    发明授权
    Dynamic bandwidth selection for efficient transmission of multimedia streams in a computer network 失效
    动态带宽选择,用于在计算机网络中有效传输多媒体流

    公开(公告)号:US06292834B1

    公开(公告)日:2001-09-18

    申请号:US08818127

    申请日:1997-03-14

    Abstract: An efficient transmission protocol for transmitting multimedia streams from a server to a client computer over a diverse computer network including local area networks (LANs) and wide area networks (WANs) such as the internet. The client computer includes a playout buffer, and the transmission rate is dynamically matched to the available bandwidth capacity of the network connection between the server and the client computer. If a playtime of the playout buffer, which is one measure of the number of data packets currently in the playout buffer, drops below a dynamically computed Decrease_Bandwidth (DEC_BW) threshold, then the transmission rate is decreased by sending a DEC_BW message to the server. Conversely, if the number of packets remaining in the playout buffer rises above a dynamically computed Upper Increase_Bandwidth (INC_BW) threshold and does not drop below a Lower INC_BW threshold for at least an INC_BW wait period, then the transmission rate is incremented. The transmission rate can be selected from among a predetermined set of discrete bandwidth values or from within a continuous range of bandwidth values. In one variation, in addition to responding to changes in network connection capacity, the client computer also determines an average client computational capacity. Accordingly, if the average client computational capacity is less than the network capacity, the lower of the two capacities is the determining one, thereby avoiding a playout buffer overrun.

    Abstract translation: 一种用于通过包括诸如因特网的局域网(LAN)和广域网(WAN)的不同计算机网络从服务器向客户端计算机传输多媒体流的有效传输协议。 客户端计算机包括播放缓冲器,并且传输速率与服务器和客户端计算机之间的网络连接的可用带宽容量动态匹配。 如果播放缓冲器的播放时间是当前在播放缓冲器中的数据分组的数量的一个度量,则低于动态计算的减少_Bandwidth(DEC_BW)阈值,则通过向服务器发送DEC_BW消息来降低传输速率。 相反,如果在播放缓冲器中剩余的分组数量上升到高于动态计算的UpperBay_Bandwidth(INC_BW)阈值,并且在INC_BW等待期间内不降低到较低的INC_BW阈值,则传输速率增加。 传输速率可以从预定的一组离散带宽值中选择,或者从带宽值的连续范围内选择。 在一个变型中,除了响应网络连接容量的变化之外,客户端计算机还确定平均客户端计算能力。 因此,如果平均客户端计算容量小于网络容量,则两个容量中较小的一个是确定的,从而避免播出缓冲区溢出。

    Table based compression with embedded coding
    6.
    发明授权
    Table based compression with embedded coding 失效
    基于表的压缩与嵌入式编码

    公开(公告)号:US6154572A

    公开(公告)日:2000-11-28

    申请号:US885159

    申请日:1997-06-30

    Applicant: Navin Chaddha

    Inventor: Navin Chaddha

    Abstract: An image compression system includes a vectorizer and a hierarchical vector quantization table that outputs embedded code. The vectorizer converts an image into image vectors representing respective blocks of image pixels. The table provides computation-free transformation and compression of the image vectors. Table design can be divided into codebook design and fill-in procedures for each stage. Codebook design for the preliminary stages uses a splitting generalized Lloyd algorithm (LBG/GLA) using a perceptually weighted distortion measure. Codebook design for the final stage uses a greedily-grown and then entropy-pruned tree-structure variation of GLA with an entropy-constrained distortion measure. Table fill-in for all stages uses an unweighted proximity measure for assigning inputs to codebook vectors. Transformations and compression are fast because they are computation free. The hierarchical, multi-stage, character of the table allow it to operate with low memory requirements. The embedded output allows convenient scalability suitable for collaborative video applications over heterogeneous networks.

    Abstract translation: 图像压缩系统包括矢量化器和输出嵌入代码的分层矢量量化表。 向量化器将图像转换成表示各个图像像素块的图像矢量。 该表提供了图像矢量的无计算变换和压缩。 表格设计可分为码本设计和每个阶段的填写程序。 用于初级阶段的码本设计使用一种分解广义劳埃德算法(LBG / GLA),使用感知加权失真度量。 最终阶段的Codebook设计使用熵约束失真测量的贪婪生长,然后熵修剪GLA的树结构变化。 所有阶段的表格填充使用未加权的邻近度量来将输入分配给码本向量。 转换和压缩很快,因为它们是免费的。 表的分级,多级,字符允许它以低内存要求运行。 嵌入式输出允许适用于异构网络上的协作视频应用程序的方便的可扩展性。

    Multimedia compression system with additive temporal layers

    公开(公告)号:US07082164B2

    公开(公告)日:2006-07-25

    申请号:US10151455

    申请日:2002-05-20

    Applicant: Navin Chaddha

    Inventor: Navin Chaddha

    CPC classification number: G06T9/40 H04N19/30 H04N19/51 H04N19/527

    Abstract: A multimedia compression system for generating frame rate scaleable data in the case of universally scaleable data. Universally scaleable data is scaleable across all of the relevant characteristics of the data (e.g., frame rate, resolution, and quality for video). The scaleable data generated by the compression system includes multiple additive layers for each characteristic across which the data is scaleable. For video, the frame rate layers are additive temporal layers, the resolution layers are additive base and enhancement layers, and the quality layers are additive index planes of embedded codes. Various techniques can be used for generating these layers (e.g., Laplacian pyramid decomposition or wavelet decomposition for generating the resolution layers; tree structured vector quantization or tree structured scalar quantization for generating the quality layers). The system further provides for embedded inter-frame compression in the context of frame rate scalability, and non-redundant layered multicast network delivery of the scaleable data.

    Multimedia compression system with adaptive block sizes
    8.
    发明授权
    Multimedia compression system with adaptive block sizes 失效
    具有自适应块大小的多媒体压缩系统

    公开(公告)号:US06233017B1

    公开(公告)日:2001-05-15

    申请号:US08885011

    申请日:1997-06-30

    Applicant: Navin Chaddha

    Inventor: Navin Chaddha

    Abstract: A multimedia compression system for generating frame rate scalable data in the case of video, and, more generally, universally scalable data. Universally scalable data is scalable across all of the relevant characteristics of the data. In the case of video, these characteristics include frame rate, resolution, and quality. The scalable data generated by the compression system is comprised of multiple additive layers for each characteristic across which the data is scalable. In the case of video, the frame rate layers are additive temporal layers, the resolution layers are additive base and enhancement layers, and the quality layers are additive index planes of embedded codes. Various techniques can be used for generating each of these layers (e.g., Laplacian pyramid decomposition or wavelet decomposition for generating the resolution layers; tree structured vector quantization or tree structured scalar quantization for generating the quality layers). The compression system further provides for embedded inter-frame compression in the context of frame rate scalability, and non-redundant layered multicast network delivery of the scalable data.

    Abstract translation: 一种用于在视频情况下生成帧速可伸缩数据的多媒体压缩系统,以及更普遍地,普遍可扩展的数据。 通用数据可扩展到数据的所有相关特征。 在视频的情况下,这些特征包括帧率,分辨率和质量。 由压缩系统产生的可伸缩数据由数据可扩展的每个特性的多个附加层组成。 在视频的情况下,帧速率层是加性时间层,分辨率层是加性基础和增强层,质量层是嵌入代码的附加索引平面。 可以使用各种技术来生成这些层中的每一层(例如,用于生成分辨率层的拉普拉斯金字塔分解或小波分解;用于生成质量层的树结构矢量量化或树结构标量量化)。 该压缩系统进一步提供了帧率可扩展性的上下文中的嵌入式帧间压缩,以及可伸缩数据的非冗余分层组播网络传递。

    Production of a video stream with synchronized annotations over a computer network

    公开(公告)号:US06230172B1

    公开(公告)日:2001-05-08

    申请号:US09389541

    申请日:1999-09-03

    Abstract: The production of synchronization scripts and associated annotated multimedia streams for servers and client computers coupled to each other by a diverse computer network which includes local area networks (LANs) and/or wide area networks (WANs) such as the intermet. Annotated multimedia streams can include a compressed video stream for display in a video window, an accompanying compressed audio stream and annotations. Synchronization scripts include annotation streams for synchronizing the display of video streams with annotations, e.g., displayable events, such textual/graphical data in the form of HTML pages with Java applets to be displayed in one or more event windows. The producer includes a capture module and an author module for capturing video streams and generating annotation streams, respectively. The capture module compresses the video stream using a suitable compression format. Annotation streams include annotation frames which provide either pointer(s) to the event(s) of interest or include displayable data embedded within the annotation stream. Accordingly, each annotation frame includes either an event locator or an event data. In addition, each annotation frame includes an event time marker which corresponds to the time stamp(s) of associated video frame(s) within the video stream. Embedded displayable data include ticker tape data embedded within the annotation stream. Examples of event locators to displayable events include URL addresses pointing to HTML web pages. The video/audio streams and annotation streams are stored in stream server(s) for subsequent retrieval by client computer(s) in a coordinated manner, so that the client computer(s) is able to synchronously display the video frames and displayable event(s) in a video window and event window(s), respectively. In one implementation, annotation streams include a flipper stream for locating HTML pages and a ticker stream which include ticker (tape) data.

    Streaming and displaying a video stream with synchronized annotations over a computer network
    10.
    发明授权
    Streaming and displaying a video stream with synchronized annotations over a computer network 失效
    通过计算机网络流式传输并显示具有同步注释的视频流

    公开(公告)号:US06173317B2

    公开(公告)日:2001-01-09

    申请号:US08819585

    申请日:1997-03-14

    Abstract: Client computer(s) retrieve and display synchronized annotated multimedia streams from servers dispersed over a diverse computer network which includes local area networks (LANs) and/or wide area networks (WANs) such as the internet. Multimedia streams provided to the client computer(s) can include a compressed video stream for display in a video window and an accompanying compressed audio stream. Annotations, i.e., displayable events, include textual/graphical data in the form of HTML pages with Java applets to be displayed in one or more event windows. The video/audio and annotation streams are produced and then stored in stream server(s). Annotation streams include annotation frames which provide either pointer(s) to the event(s) of interest or include displayable data embedded within the annotation stream. Accordingly, each annotation frame includes either an event locator or an event data. In addition, each annotation frame includes an event time marker which corresponds to the time stamp(s) of associated video frame(s) within the video stream. Examples of embedded displayable data include ticker tape data embedded within the annotation stream. Examples of event locators to displayable events include URL addresses pointing to HTML web pages. Video/audio streams and annotation streams are provided by the stream server(s) to the client computer(s) in a coordinated manner, so that the client computer(s) is able to synchronously display the video frames and displayable event(s) in a video window and event window(s), respectively.

    Abstract translation: 客户端计算机从分散在包括诸如互联网的局域网(LAN)和/或广域网(WAN)的不同计算机网络的服务器上检索和显示同步的注释多媒体流。 提供给客户端计算机的多媒体流可以包括用于在视频窗口中显示的压缩视频流和伴随的压缩音频流。 注释,即可显示的事件,包括HTML页面形式的文本/图形数据,Java applet将被显示在一个或多个事件窗口中。 生成视频/音频和注释流,然后存储在流服务器中。 注释流包括对所感兴趣的事件提供指针或者包括嵌入在注释流内的可显示数据的注释帧。 因此,每个注释帧都包括事件定位器或事件数据。 另外,每个注释帧包括对应于视频流内的相关联的视频帧的时间戳的事件时间标记。 嵌入式可显示数据的示例包括嵌入在注释流内的代码磁带数据。 可显示事件的事件定位器的示例包括指向HTML网页的URL地址。 视频/音频流和注释流由流服务器以协调的方式提供给客户端计算机,使得客户端计算机能够同步地显示视频帧和可显示事件, 分别在视频窗口和事件窗口中。

Patent Agency Ranking