Efficient QoS support for software packet processing on general purpose servers

    公开(公告)号:US10237171B2

    公开(公告)日:2019-03-19

    申请号:US15270377

    申请日:2016-09-20

    Abstract: Methods and apparatus for facilitating efficient Quality of Service (QoS) support for software-based packet processing by offloading QoS rate-limiting to NIC hardware. Software-based packet processing is performed on packet flows received at a compute platform, such as a general purpose server, and/or packet flows generated by local applications running on the compute platform. The packet processing includes packet classification that associates packets with packet flows using flow IDs, and identifying a QoS class for the packet and packet flow. NIC Tx queues are dynamically configured or pre-configured to effect rate limiting for forwarding packets enqueued in the NIC Tx queues. New packet flows are detected, and mapping data is created to map flow IDs associated with flows to the NIC Tx queues used to forward the packets associated with the flows.

    TECHNOLOGIES FOR MANAGING SINGLE-PRODUCER AND SINGLE CONSUMER RINGS

    公开(公告)号:US20190044871A1

    公开(公告)日:2019-02-07

    申请号:US16144384

    申请日:2018-09-27

    Abstract: Technologies for managing a single-producer and single-consumer ring include a producer of a compute node that is configured to allocate data buffers, produce work, and indicate that work has been produced. The compute node is configured to insert reference information for each of the allocated data buffers into respective elements of the ring and store the produced work into the data buffers. The compute node includes a consumer configured to request the produced work from the ring. The compute node is further configured to dequeue the reference information from each of the elements of the ring that correspond to the portion of data buffers in which the produced work has been stored, and set each of the elements of the ring for which the reference information has been dequeued to an empty (i.e., NULL) value. Other embodiments are described herein.

    [ICE] ARCHITECTURE AND MECHANISMS TO ACCELERATE TUPLE-SPACE SEARCH WITH INTERGRATED GPU

    公开(公告)号:US20190042304A1

    公开(公告)日:2019-02-07

    申请号:US15829938

    申请日:2017-12-03

    Abstract: Methods, apparatus, systems, and software for architectures and mechanisms to accelerate tuple-space search with integrated GPUs (Graphic Processor Units). One of the architectures employs GPU-side lookup table sorting, under which local and global hit count histograms are maintained for work groups, and sub-tables containing rules for tuple matching are re-sorted based on the relative hit rates of the different sub-tables. Under a second architecture, two levels of parallelism are implemented: packet-level parallelism and lookup table-parallelism. Under a third architecture, dynamic two-level parallel processing with pre-screen is implemented. Adaptive decision making mechanisms are also disclosed to select which architecture is optimal in view of multiple considerations, including application preferences, offered throughput, and available GPU resources. The architectures leverage utilization of both processor cores and GPU processing elements to accelerate tuple-space searches, including searches using wildcard masks.

    Concurrent network application scheduling for reduced power consumption

    公开(公告)号:US10042414B2

    公开(公告)日:2018-08-07

    申请号:US14129783

    申请日:2013-10-01

    Abstract: Technologies for scheduling network requests to reduce power consumption include a mobile computing device configured to receive a network request from a network application, and determine whether the network request is delayable, based on execution constraints of the application. Execution constraints may include required processing resources, acceptable latency, priority, application class, and others. If the request is delayable, the mobile computing device may delay the request until a threshold number of delayable requests are received, or until a non-delayable request is received. The mobile computing device performs the delayed request and any subsequently received requests concurrently. The execution constraints may be supplied by each network application, or may be determined by the mobile computing device through observation. Other embodiments are described and claimed.

Patent Agency Ranking