Selective conference digest
    41.
    Granted invention patent

    Publication No.: US11076052B2

    Publication date: 2021-07-27

    Application No.: US15548265

    Filing date: 2016-02-03

    Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve receiving audio data corresponding to a recording of at least one conference involving a plurality of conference participants. In some examples, only a portion of the received audio data will be selected as playback audio data. The selection process may involve a topic selection process, a talkspurt filtering process and/or an acoustic feature selection process. Some examples involve receiving an indication of a target playback time duration. Selecting the portion of audio data may involve making a time duration of the playback audio data within a threshold time difference of the target playback time duration.
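The duration constraint in this abstract can be illustrated with a minimal sketch. It is not the patented method: the `Segment` type, its `score` field (a stand-in for whatever the topic, talkspurt or acoustic-feature selection would produce) and the greedy strategy are all assumptions made for illustration. The sketch picks the highest-scoring segments that fit and reports whether the total lands within the threshold of the target playback duration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    start: float     # seconds from the start of the recording
    duration: float  # segment length in seconds
    score: float     # hypothetical relevance score (assumption, not from the abstract)


def select_playback_segments(segments, target_duration, threshold):
    """Greedily keep the highest-scoring segments that fit the time budget.

    Returns (chosen segments in chronological order, total duration,
    whether the total is within `threshold` of `target_duration`).
    """
    chosen = []
    total = 0.0
    for seg in sorted(segments, key=lambda s: s.score, reverse=True):
        # Never overshoot the target by more than the allowed threshold.
        if total + seg.duration <= target_duration + threshold:
            chosen.append(seg)
            total += seg.duration
    chosen.sort(key=lambda s: s.start)  # play back in original order
    within = abs(total - target_duration) <= threshold
    return chosen, total, within


if __name__ == "__main__":
    recording = [
        Segment(0.0, 30.0, 0.9),
        Segment(40.0, 50.0, 0.2),
        Segment(100.0, 25.0, 0.8),
        Segment(130.0, 20.0, 0.5),
    ]
    picked, total, ok = select_playback_segments(recording, 60.0, 10.0)
    print([s.start for s in picked], total, ok)
```

A greedy pass is only one possible choice; an exact fit to the target would need a knapsack-style search, which the abstract neither requires nor rules out.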

    Jitter buffer apparatus and method
    42.
    Granted invention patent

    Publication No.: US10812401B2

    Publication date: 2020-10-20

    Application No.: US16084932

    Filing date: 2017-03-16

    Abstract: Disclosed is an apparatus and method operative to receive packets of media from a network, including: a receiver unit operative to receive the packets from the network; a jitter buffer data structure for receiving the packets in an ordered queue, the jitter buffer data structure having a tail into which the packets are input; a plurality of heads defining points in the jitter buffer data structure from which the ordered queue of packets is to be played back, the heads comprising an adjustable actual playback head coupled to an actual playback unit and at least one prototype head, each prototype head having associated therewith a target latency; a processor having decision logic operable to determine a cost of achieving the associated target latency for each prototype head, wherein the decision logic compares the costs determined for each prototype head to identify a particular target latency and head location for the actual playback head of the buffer; and a playback unit coupled to the processor for actual playback of the playback head of the buffer, such that the particular target latency of the jitter buffer data structure is determined at playback of the buffer rather than upon input of the packets into the jitter buffer data structure.
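The comparison of prototype heads can be sketched as follows. The abstract does not say how the cost is computed, so the cost function here is an assumption: a penalty proportional to the fraction of packets that would arrive too late for a given target latency, plus a term proportional to the latency itself. The names `head_cost`, `choose_target_latency`, `late_penalty` and `latency_weight` are hypothetical.

```python
def head_cost(delays_ms, target_latency_ms, late_penalty=500.0, latency_weight=1.0):
    """Assumed cost of one prototype head: late-packet penalty plus added latency."""
    if not delays_ms:
        raise ValueError("need at least one observed packet delay")
    # A packet is late if its network delay exceeds the head's target latency.
    late = sum(1 for d in delays_ms if d > target_latency_ms)
    late_fraction = late / len(delays_ms)
    return late_penalty * late_fraction + latency_weight * target_latency_ms


def choose_target_latency(delays_ms, prototype_latencies_ms):
    """Evaluate every prototype head and return the cheapest target latency."""
    costs = {t: head_cost(delays_ms, t) for t in prototype_latencies_ms}
    best = min(costs, key=costs.get)
    return best, costs


if __name__ == "__main__":
    observed = [20, 25, 30, 35, 40, 45, 50, 55, 60, 120]  # delays in ms
    best, costs = choose_target_latency(observed, [40, 60, 80, 120])
    print(best, costs)
```

Because every prototype head is scored against packets already in the buffer, the decision is made at playback time, which matches the point the abstract stresses; the specific weights are illustrative only.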

    Adaptive audio construction
    43.
    Granted invention patent

    Publication No.: US10728688B2

    Publication date: 2020-07-28

    Application No.: US16424409

    Filing date: 2019-05-28

    Abstract: Described herein is a method for creating an object-based audio signal from an audio input, the audio input including one or more audio channels that are recorded to collectively define an audio scene. The one or more audio channels are captured from a respective one or more spatially separated microphones disposed in a stable spatial configuration. The method includes the steps of: a) receiving the audio input; b) performing spatial analysis on the one or more audio channels to identify one or more audio objects within the audio scene; c) determining contextual information relating to the one or more audio objects; d) defining respective audio streams including audio data relating to at least one of the identified one or more audio objects; and e) outputting an object-based audio signal including the audio streams and the contextual information.

    Method of rendering one or more captured audio soundfields to a listener

    Publication No.: US10362420B2

    Publication date: 2019-07-23

    Application No.: US16009154

    Filing date: 2018-06-14

    Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.

    Perceptually continuous mixing in a teleconference

    Publication No.: US10009475B2

    Publication date: 2018-06-26

    Application No.: US15121744

    Filing date: 2015-02-17

    CPC classification number: H04M3/568 G10L25/51 G10L25/78 H04M3/569

    Abstract: In an audio teleconference mixing system, of the type mixing a first plurality of audio uplink input streams containing audio information including sensed audio and associated control information, to produce at least one audio downlink output stream for downlinking to at least one conference participant, wherein the audio uplink input streams potentially can include continuous transmission (CTX) and discontinuous transmission (DTX) streams, a method of mixing multiple current audio uplink streams together to produce the at least one audio output stream, the method including the steps of: (a) determining a verbosity measure indicative of the likely importance of each current audio uplink stream; (b) where at least one current audio uplink stream can comprise a CTX stream, utilizing at least one CTX stream in the mix to produce at least one current downlink output stream.
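Steps (a) and (b) can be sketched roughly as below. The abstract does not define the verbosity measure, so this sketch assumes a simple leaky integrator of voice activity that rises while a participant speaks and decays otherwise. The `Uplink` type, the `attack` and `decay` constants and the selection rule are hypothetical illustrations, not the claimed method.

```python
from dataclasses import dataclass


@dataclass
class Uplink:
    name: str
    is_ctx: bool            # True for a continuous transmission (CTX) stream
    verbosity: float = 0.0  # assumed importance measure in [0, 1]


def update_verbosity(stream, speech_active, attack=0.2, decay=0.05):
    """Step (a), assumed form: rise toward 1 during speech, decay toward 0 otherwise."""
    if speech_active:
        stream.verbosity += attack * (1.0 - stream.verbosity)
    else:
        stream.verbosity -= decay * stream.verbosity
    return stream.verbosity


def select_for_mix(streams, max_streams):
    """Step (b), assumed form: mix the most verbose streams, keeping one CTX stream."""
    ranked = sorted(streams, key=lambda s: s.verbosity, reverse=True)
    chosen = ranked[:max_streams]
    ctx = [s for s in ranked if s.is_ctx]
    # If CTX streams exist but none made the cut, swap the least verbose
    # chosen stream for the most verbose CTX stream.
    if ctx and chosen and not any(s.is_ctx for s in chosen):
        chosen[-1] = ctx[0]
    return chosen


if __name__ == "__main__":
    a, b, c = Uplink("a", False), Uplink("b", False), Uplink("c", True)
    for _ in range(5):
        update_verbosity(a, True)
    for _ in range(3):
        update_verbosity(b, True)
    update_verbosity(c, False)
    print([s.name for s in select_for_mix([a, b, c], 2)])
```

Keeping one CTX stream in the mix even when it is quiet is one way to read the "perceptually continuous" aim in the title, since a CTX endpoint sends audio continuously and dropping it entirely would remove its background from the downlink.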
