Techniques for storing and distributing metadata among nodes in a storage cluster system

    公开(公告)号:US11509720B2

    公开(公告)日:2022-11-22

    申请号:US17151329

    申请日:2021-01-18

    申请人: NetApp Inc.

    IPC分类号: H04L67/1097

    摘要: Various embodiments are generally directed to techniques for reducing the time required for a node to take over for a failed node or to boot. An apparatus includes an access component to retrieve a metadata from a storage device coupled to a first D-module of a first node during boot, the metadata generated from a first mutable metadata portion and an immutable metadata portion, and the first metadata specifying a first address of a second D-module of a second node; a replication component to contact the second data storage module at the first address; and a generation component to, in response to failure of the contact, request a second mutable metadata portion from a N-module of the first node and generate a second metadata from the second mutable metadata portion and the immutable metadata portion, the second mutable metadata portion specifying a second address of the second D-module.

    TECHNIQUES FOR COORDINATING PARALLEL PERFORMANCE AND CANCELLATION OF COMMANDS IN A STORAGE CLUSTER SYSTEM

    公开(公告)号:US20200162555A1

    公开(公告)日:2020-05-21

    申请号:US16774108

    申请日:2020-01-28

    申请人: NetApp Inc.

    IPC分类号: H04L29/08 H04L29/06 G06F11/20

    摘要: Various embodiments are directed to techniques for coordinating at least partially parallel performance and cancellation of data access commands between nodes of a storage cluster system. An apparatus may include a processor component of a first node coupled to a first storage device storing client device data; an access component to perform replica data access commands of replica command sets on the client device data, each replica command set assigned a set ID; a communications component to analyze a set ID included in a network packet to determine whether a portion of a replica command set in the network packet is redundant, and to reassemble the replica command set from the portion based if the portion is not redundant; and an ordering component to provide the communications component with set IDs of replica command sets of which the access component has fully performed the set of replica data access commands.

    Techniques for maintaining communications sessions among nodes in a storage cluster system

    公开(公告)号:US09830238B2

    公开(公告)日:2017-11-28

    申请号:US14473779

    申请日:2014-08-29

    申请人: NETAPP, INC.

    IPC分类号: G06F11/00 G06F11/20 G06F11/14

    摘要: Various embodiments are generally directed to techniques for preparing to respond to failures in performing a data access command to modify client device data in a storage cluster system. An apparatus may include a processor component of a first node coupled to a first storage device; an access component to perform a command on the first storage device; a replication component to exchange a replica of the command with the second node via a communications session formed between the first and second nodes to enable at least a partially parallel performance of the command by the first and second nodes; and a multipath component to change a state of the communications session from inactive to active to enable the exchange of the replica based on an indication of a failure within a third node that precludes performance of the command by the third node. Other embodiments are described and claimed.

    TECHNIQUES FOR ERROR HANDLING IN PARALLEL SPLITTING OF STORAGE COMMANDS
    4.
    发明申请
    TECHNIQUES FOR ERROR HANDLING IN PARALLEL SPLITTING OF STORAGE COMMANDS 审中-公开
    存储命令并行分割中的错误处理技术

    公开(公告)号:US20170054529A1

    公开(公告)日:2017-02-23

    申请号:US15343365

    申请日:2016-11-04

    申请人: NetApp Inc.

    IPC分类号: H04L1/08 G06F3/06 G06F11/20

    摘要: Various embodiments are generally directed to techniques for handling errors affecting the at least partially parallel performance of data access commands between nodes of a storage cluster system. An apparatus may include a processor component of a first node, an access component to perform a command received from a client device via a network to alter client device data stored in a first storage device coupled to the first node, a replication component to transmit a replica of the command to a second node via the network to enable performance of the replica by the second node at least partially in parallel, an error component to retry transmission of the replica based on a failure indicated by the second node and a status component to select a status indication to transmit to the client device based on the indication of failure and results of retrial of transmission of the replica.

    摘要翻译: 各种实施例通常涉及用于处理影响存储集群系统的节点之间的数据访问命令的至少部分并行性能的错误的技术。 装置可以包括第一节点的处理器组件,访问组件,用于执行从客户端设备经由网络接收的命令,以改变存储在耦合到第一节点的第一存储设备中的客户机设备数据,复制组件,用于发送 经由网络将命令的副本复制到第二节点以使得第二节点至少部分地并行地执行副本的性能;错误组件,用于基于由第二节点指示的故障和状态组件来重试发送副本; 根据失败的指示和复制传输的重试结果,选择要发送给客户端设备的状态指示。

    TECHNIQUES FOR COORDINATING PARALLEL PERFORMANCE AND CANCELLATION OF COMMANDS IN A STORAGE CLUSTER SYSTEM
    5.
    发明申请
    TECHNIQUES FOR COORDINATING PARALLEL PERFORMANCE AND CANCELLATION OF COMMANDS IN A STORAGE CLUSTER SYSTEM 审中-公开
    在存储集群系统中协调并行性能和取消命令的技术

    公开(公告)号:US20160088082A1

    公开(公告)日:2016-03-24

    申请号:US14491799

    申请日:2014-09-19

    申请人: NETAPP, INC.

    IPC分类号: H04L29/08 H04L29/06

    摘要: Various embodiments are directed to techniques for coordinating at least partially parallel performance and cancellation of data access commands between nodes of a storage cluster system. An apparatus may include a processor component of a first node coupled to a first storage device storing client device data; an access component to perform replica data access commands of replica command sets on the client device data, each replica command set assigned a set ID; a communications component to analyze a set ID included in a network packet to determine whether a portion of a replica command set in the network packet is redundant, and to reassemble the replica command set from the portion based if the portion is not redundant; and an ordering component to provide the communications component with set IDs of replica command sets of which the access component has fully performed the set of replica data access commands.

    摘要翻译: 各种实施例涉及用于协调在存储集群系统的节点之间的数据访问命令的至少部分并行性能和消除的技术。 装置可以包括耦合到存储客户端设备数据的第一存储设备的第一节点的处理器组件; 访问组件,用于在客户端设备数据上执行副本命令集的副本数据访问命令,每个副本命令集分配了集合ID; 通信组件,用于分析网络分组中包括的集合ID,以确定所述网络分组中的所述副本命令的一部分是否是冗余的,并且如果所述部分不是冗余的,则从所述部分重新组合所述副本命令集; 以及排序组件,用于向通信组件提供其访问组件完全执行了该副本数据访问命令集的副本命令集的集合ID。

    TECHNIQUES FOR STORING AND DISTRIBUTING METADATA AMONG NODES IN A STORAGE CLUSTER SYSTEM

    公开(公告)号:US20210144208A1

    公开(公告)日:2021-05-13

    申请号:US17151329

    申请日:2021-01-18

    申请人: NetApp Inc.

    IPC分类号: H04L29/08

    摘要: Various embodiments are generally directed to techniques for reducing the time required for a node to take over for a failed node or to boot. An apparatus includes an access component to retrieve a metadata from a storage device coupled to a first D-module of a first node during boot, the metadata generated from a first mutable metadata portion and an immutable metadata portion, and the first metadata specifying a first address of a second D-module of a second node; a replication component to contact the second data storage module at the first address; and a generation component to, in response to failure of the contact, request a second mutable metadata portion from a N-module of the first node and generate a second metadata from the second mutable metadata portion and the immutable metadata portion, the second mutable metadata portion specifying a second address of the second D-module.

    Techniques for maintaining communications sessions among nodes in a storage cluster system

    公开(公告)号:US10552275B2

    公开(公告)日:2020-02-04

    申请号:US15820717

    申请日:2017-11-22

    申请人: NetApp Inc.

    IPC分类号: G06F11/00 G06F11/20 G06F11/14

    摘要: Various embodiments are generally directed to techniques for preparing to respond to failures in performing a data access command to modify client device data in a storage cluster system. An apparatus may include a processor component of a first node coupled to a first storage device; an access component to perform a command on the first storage device; a replication component to exchange a replica of the command with the second node via a communications session formed between the first and second nodes to enable at least a partially parallel performance of the command by the first and second nodes; and a multipath component to change a state of the communications session from inactive to active to enable the exchange of the replica based on an indication of a failure within a third node that precludes performance of the command by the third node. Other embodiments are described and claimed.

    Techniques for performing resynchronization on a clustered system

    公开(公告)号:US09720752B2

    公开(公告)日:2017-08-01

    申请号:US14518422

    申请日:2014-10-20

    申请人: NETAPP, INC.

    IPC分类号: G06F17/30 G06F11/00

    摘要: Various embodiments are generally directed an apparatus and method for receiving information to write on a clustered system comprising at least a first cluster and a second cluster, determining that a failure event has occurred on the clustered system creating unsynchronized information, the unsynchronized information comprising at least one of inflight information and dirty region information, and performing a resynchronization operation to synchronize the unsynchronized information on the first cluster and the second cluster based on log information in at least one of an inflight tracker log for the inflight information and a dirty region log for the dirty region information.

    Servicing of Network Software Components of Nodes of a Cluster Storage System
    10.
    发明申请
    Servicing of Network Software Components of Nodes of a Cluster Storage System 审中-公开
    维护群集存储系统节点的网络软件组件

    公开(公告)号:US20160239437A1

    公开(公告)日:2016-08-18

    申请号:US15137906

    申请日:2016-04-25

    申请人: NetApp, Inc.

    摘要: Described herein are method and apparatus for servicing software components of nodes of a cluster storage system. During data-access sessions with clients, client IDs and file handles for accessing files are produced and stored to clients and stored (as session data) to each node. A serviced node is taken offline, whereby network connections to clients are disconnected. Each disconnected client is configured to retain its client ID and file handles and attempt reconnections. Session data of the serviced node is made available to a partner node (by transferring session data to the partner node). After clients have reconnected to the partner node, the clients may use the retained client IDs and file handles to continue a data-access session with the partner node since the partner node has access to the session data of the serviced node and thus will recognize and accept the retained client ID and file handles.

    摘要翻译: 这里描述了用于维护集群存储系统的节点的软件组件的方法和装置。 在与客户端的数据访问会话期间,生成用于访问文件的客户端ID和文件句柄,并将其存储到客户端并存储(作为会话数据)到每个节点。 服务节点脱机,从而断开与客户端的网络连接。 每个断开连接的客户端被配置为保留其客户端ID和文件句柄并尝试重新连接。 服务节点的会话数据使对方节点可用(通过将会话数据传送到伙伴节点)。 在客户端重新连接到伙伴节点之后,客户端可以使用保留的客户端ID和文件句柄来继续与伙伴节点的数据访问会话,因为伙伴节点可以访问服务节点的会话数据,并且因此将识别和 接受保留的客户端ID和文件句柄。