-
Publication No.: US11308235B2
Publication Date: 2022-04-19
Application No.: US16812029
Filing Date: 2020-03-06
Inventors: Rajesh M. Desai, Mu Qiao, Roger C. Raphael, Ramani Routray
Abstract: A method, system and computer program product for detecting sensitive personal information in a storage device. A block delta list containing a list of changed blocks in the storage device is processed. After identifying the changed blocks from the block delta list, a search is performed on those identified changed blocks for sensitive personal information using a character scanning technique. After identifying a changed block deemed to contain sensitive personal information, the changed block is translated from the block level to the file level using a hierarchical reverse mapping technique. By only analyzing the changed blocks to determine if they contain sensitive personal information, a lesser quantity of blocks needs to be processed in order to detect sensitive personal information in the storage device in near real-time. In this manner, sensitive personal information is detected in the storage device using fewer computing resources in a shorter amount of time.
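The flow described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the regular expression stands in for the "character scanning technique", the two dictionaries stand in for the "hierarchical reverse mapping" (block to inode, inode to path), and all names (`scan_changed_blocks`, `blocks_to_files`, `SSN_PATTERN`) are hypothetical.

```python
import re
from typing import Dict, List

# Assumption: a US Social Security number shape stands in for "sensitive data".
SSN_PATTERN = re.compile(rb"\b\d{3}-\d{2}-\d{4}\b")


def scan_changed_blocks(blocks: Dict[int, bytes], delta_list: List[int]) -> List[int]:
    """Scan only the blocks named in the delta list; return those that match."""
    return [n for n in delta_list if SSN_PATTERN.search(blocks.get(n, b""))]


def blocks_to_files(
    flagged: List[int],
    block_to_inode: Dict[int, int],
    inode_to_path: Dict[int, str],
) -> List[str]:
    """Reverse-map flagged blocks to file paths (block -> inode -> path)."""
    paths: List[str] = []
    for n in flagged:
        path = inode_to_path.get(block_to_inode.get(n))
        if path is not None and path not in paths:
            paths.append(path)
    return paths


blocks = {0: b"hello world", 1: b"ssn 123-45-6789", 2: b"ssn 987-65-4321"}
delta_list = [0, 1]  # block 2 did not change, so it is never scanned
flagged = scan_changed_blocks(blocks, delta_list)
files = blocks_to_files(
    flagged,
    {0: 10, 1: 11, 2: 12},
    {10: "/a.txt", 11: "/b.txt", 12: "/c.txt"},
)
```

Block 2 also holds a matching string, but it is skipped because it is absent from the delta list. That is the saving the abstract refers to.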
-
Publication No.: US11210410B2
Publication Date: 2021-12-28
Application No.: US16573326
Filing Date: 2019-09-17
Inventors: Roger C. Raphael, Hani Talal Jamjoom, Rajesh M. Desai, Iun Veng Leong, Uttama Shakya, Arjun Natarajan
Abstract: Serving data assets based on security policies is provided. A request to access an asset received from a user having a particular context is evaluated based on a set of asset access enforcement policies. An asset access policy enforcement decision is generated based on evaluating the request. It is determined whether the asset access policy enforcement decision is to transform particular data of the asset prior to allowing access. In response to determining that the asset access policy enforcement decision is to transform the particular data of the asset prior to allowing access, a transformation specification that includes an ordered subset of unit transformations for transforming the particular data of the asset is generated based on the particular context of the user and the set of asset access enforcement policies. A transformed asset is generated by applying the transformation specification to the asset transforming the particular data of the asset.
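One way to picture a "transformation specification" is as an ordered list of small functions chosen from the user's context. The sketch below assumes a role string as the context and two made-up unit transformations (`drop_ssn`, `mask_email`). None of these names come from the patent.

```python
from typing import Callable, Dict, List

Record = Dict[str, str]
UnitTransform = Callable[[Record], Record]


def mask_email(rec: Record) -> Record:
    """Keep the first character of the local part, mask the rest."""
    out = dict(rec)
    if "email" in out:
        user, _, domain = out["email"].partition("@")
        out["email"] = user[:1] + "***@" + domain
    return out


def drop_ssn(rec: Record) -> Record:
    """Remove the 'ssn' field entirely."""
    return {k: v for k, v in rec.items() if k != "ssn"}


def build_spec(role: str) -> List[UnitTransform]:
    """Hypothetical policy: pick an ordered subset of unit transformations."""
    if role == "admin":
        return []  # access allowed without transformation
    if role == "analyst":
        return [drop_ssn, mask_email]
    raise PermissionError(f"role {role!r} may not access this asset")


def apply_spec(rec: Record, spec: List[UnitTransform]) -> Record:
    """Apply the unit transformations in the order given by the spec."""
    for transform in spec:
        rec = transform(rec)
    return rec


asset = {"name": "Ann", "email": "ann@example.com", "ssn": "123-45-6789"}
served = apply_spec(asset, build_spec("analyst"))
```

Here `served` is `{"name": "Ann", "email": "a***@example.com"}`, while the original `asset` is left unchanged.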
-
Publication No.: US11178186B2
Publication Date: 2021-11-16
Application No.: US16824466
Filing Date: 2020-03-19
IPC Classification: H04W12/67, H04W12/72, H04L29/06, H04L12/813, H04L12/24
Abstract: A method, apparatus, system, and computer program product for evaluating enforcement decisions on an asset using a policy. Rules in the policy are applied by a computer system to the asset, taking into account a context for a request to access the asset, in response to receiving the request to access the asset, wherein the rules in the policy determine whether access to the asset is allowed. A determination is made by the computer system as to whether a conflict is present in an initial decision made using the rules in the policy. A set of conflict resolution processes is applied by the computer system when the conflict is present such that a final decision is made on the request to access the asset.
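The rule evaluation and conflict handling described here might look like the sketch below. The abstract does not say which resolution processes are used. The "deny overrides" resolver and the default-deny fallback are assumptions chosen for illustration, as are all function names.

```python
from typing import Callable, Dict, List, Optional, Set

Context = Dict[str, str]
Rule = Callable[[Context], Optional[str]]        # returns "allow", "deny", or None
Resolver = Callable[[Set[str]], Optional[str]]   # returns a final decision or None


def owner_rule(ctx: Context) -> Optional[str]:
    return "allow" if ctx.get("user") == ctx.get("owner") else None


def region_rule(ctx: Context) -> Optional[str]:
    return "deny" if ctx.get("region") != "US" else None


def deny_overrides(decisions: Set[str]) -> Optional[str]:
    """Assumed resolution process: any 'deny' wins."""
    return "deny" if "deny" in decisions else None


def evaluate(rules: List[Rule], ctx: Context, resolvers: List[Resolver]) -> str:
    decisions = {d for d in (rule(ctx) for rule in rules) if d is not None}
    if not decisions:
        return "deny"            # assumption: default-deny when no rule applies
    if len(decisions) == 1:
        return decisions.pop()   # no conflict in the initial decision
    for resolve in resolvers:    # conflict: try each resolution process in order
        final = resolve(decisions)
        if final is not None:
            return final
    return "deny"


rules = [owner_rule, region_rule]
conflicted = evaluate(rules, {"user": "ann", "owner": "ann", "region": "EU"}, [deny_overrides])
clean = evaluate(rules, {"user": "ann", "owner": "ann", "region": "US"}, [deny_overrides])
```

In the first request the two rules disagree, so the resolver decides (`"deny"`). In the second only one rule fires and its decision (`"allow"`) stands.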
-
Publication No.: US11157477B2
Publication Date: 2021-10-26
Application No.: US16202215
Filing Date: 2018-11-28
Abstract: A method, computer system, and computer program product for segment differential-based document text-index modeling are provided. The embodiment may include receiving, by a processor, a document with a valid document ID and version ID tuple. The embodiment may also include determining the received document is a new version of a previously stored document and consequently multiplexing versions of the document into a single indexed document. The embodiment may further include segmenting the received document and building a token vector. The embodiment may also include calculating a difference between the received new version of the document and the previously stored document using information obtained from the segmentation. The embodiment may further include in response to the calculated difference being below a pre-configured threshold value, discarding the received new version.
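The threshold test at the end of this abstract can be illustrated with a toy difference measure. The abstract does not specify how the difference is calculated. The sketch below assumes blank-line segmentation and a Jaccard-style distance over segments, and the names `segment`, `segment_difference`, and `should_index` are hypothetical.

```python
from typing import List


def segment(text: str) -> List[str]:
    """Assumed segmentation: split on blank lines and drop empty segments."""
    return [s.strip() for s in text.split("\n\n") if s.strip()]


def segment_difference(old: str, new: str) -> float:
    """Fraction of distinct segments that appear in only one of the versions."""
    old_segs, new_segs = set(segment(old)), set(segment(new))
    union = old_segs | new_segs
    if not union:
        return 0.0
    return len(old_segs ^ new_segs) / len(union)


def should_index(old: str, new: str, threshold: float = 0.1) -> bool:
    """Discard the new version when the difference falls below the threshold."""
    return segment_difference(old, new) >= threshold


v1 = "Intro\n\nTerms\n\nSignature"
v2_same = "Intro\n\nTerms\n\nSignature"
v2_changed = "Intro\n\nTerms\n\nRevised signature"

diff_same = segment_difference(v1, v2_same)
diff_changed = segment_difference(v1, v2_changed)
```

With these inputs the identical version scores 0.0 and would be discarded, while the revised version scores 0.5 (two differing segments out of four distinct ones) and would be indexed.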
-
Publication No.: US10783112B2
Publication Date: 2020-09-22
Application No.: US15470194
Filing Date: 2017-03-27
Abstract: Provided are techniques for a high performance compliance mechanism for structured and unstructured data in an enterprise. A record to represent a collection of structured objects is generated. The record is stored in a file plan container associated with a disposition schedule. The collection of the structured objects represented by the record is disposed in accordance with the disposition schedule.
-
Publication No.: US09971761B2
Publication Date: 2018-05-15
Application No.: US14734404
Filing Date: 2015-06-09
IPC Classification: G06F17/27
CPC Classification: G06F17/277, G06F17/2785
Abstract: In an approach for parallelizing document processing in an information handling system, a processor receives a document, wherein the document includes text content. A processor extracts information from the text content, utilizing natural language processing and semantic analysis, to form tokenized semantic partitions, comprising a plurality of sub-documents. A processor schedules a plurality of concurrently executing threads to process the plurality of sub-documents.
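The scheduling step can be sketched with a thread pool. Splitting on blank lines is only a stand-in for the natural language processing and semantic analysis named in the abstract, and token counting stands in for whatever per-sub-document work is actually done. Both are assumptions, as are the function names.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List


def partition(document: str) -> List[str]:
    """Stand-in for semantic partitioning: one sub-document per paragraph."""
    return [p for p in document.split("\n\n") if p.strip()]


def count_tokens(sub_document: str) -> int:
    """Placeholder per-sub-document task: count whitespace-separated tokens."""
    return len(sub_document.split())


def process(document: str, workers: int = 4) -> List[int]:
    """Process sub-documents on concurrently executing threads, in input order."""
    sub_documents = partition(document)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(count_tokens, sub_documents))


counts = process("a b c\n\nd e")
```

`pool.map` returns results in the order of the inputs, so `counts` lines up with the sub-documents even though the threads may finish in a different order.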
-
Publication No.: US09606998B2
Publication Date: 2017-03-28
Application No.: US14298111
Filing Date: 2014-06-06
CPC Classification: G06F17/30091, G06F17/30011, G06F17/30174, G06F17/3023
Abstract: According to one embodiment of the present invention, a system extends a content repository by creating an auxiliary data store outside of the content repository and storing auxiliary data in the auxiliary data store, wherein the auxiliary data is associated with a collection of documents in the content repository. The system stores version information for the auxiliary data store and records of operations against the auxiliary data store in a log in the repository. In response to receiving a request for an operation against the auxiliary data store, the system determines that the auxiliary data store and repository are consistent based on the version information and applies the operation against the auxiliary data store. Embodiments of the present invention further include a method and computer program product for extending a content repository data model in substantially the same manners described above.
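A very small model of the consistency check is shown below. It assumes a single integer version counter kept on both sides and a simple "put" operation. The classes `Repository` and `AuxStore` and the function `apply_operation` are hypothetical and only illustrate the order of steps in the abstract.

```python
from typing import Dict, List, Tuple


class Repository:
    """Content repository: keeps the aux store's version and an operation log."""

    def __init__(self) -> None:
        self.aux_version = 0
        self.log: List[Tuple[str, str, str]] = []


class AuxStore:
    """Auxiliary data store that lives outside the repository."""

    def __init__(self) -> None:
        self.version = 0
        self.data: Dict[str, str] = {}


def apply_operation(repo: Repository, store: AuxStore, key: str, value: str) -> None:
    # Check consistency using the version information before touching the store.
    if store.version != repo.aux_version:
        raise RuntimeError("auxiliary store and repository are out of sync")
    store.data[key] = value
    store.version += 1
    # Record the operation and the new version in the repository.
    repo.log.append(("put", key, value))
    repo.aux_version = store.version


repo, store = Repository(), AuxStore()
apply_operation(repo, store, "doc1", "reviewed")
```

If the two version numbers ever diverge, for example after restoring one side from a backup, the next operation is refused rather than applied to an inconsistent store.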
-
Publication No.: US09594813B2
Publication Date: 2017-03-14
Application No.: US14556099
Filing Date: 2014-11-29
CPC Classification: G06F17/30554, G06F17/30011, G06F17/30345, G06F17/30386, G06F17/30516, G06F17/30864
Abstract: In searching electronic documents, prior to executing a query, a reviewer indicates whether a result set of the query will be dynamic or static. The query is then executed on the electronic documents to obtain an original result set, which is provided to the reviewer through a user interface. Upon determining that one or more changes to one or more of the electronic documents have occurred, and if the result set is static, then the original result set continues to be provided to the reviewer without re-executing the query. If the result set is dynamic, then the query is re-executed on the electronic documents to obtain an updated result set, and the updated result set is provided to the reviewer through the user interface. The original result set may be associated with a search session and/or may be a random sample of the electronic documents for an overview query.
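The static versus dynamic behavior can be modeled with a small session object. The substring match used as the "query" and the class name `SearchSession` are illustrative assumptions. The abstract does not describe the query engine or how changes are detected.

```python
from typing import Dict, List


class SearchSession:
    """Holds a result set that is either frozen (static) or refreshed (dynamic)."""

    def __init__(self, documents: Dict[str, str], query: str, dynamic: bool) -> None:
        self.documents = documents
        self.query = query
        self.dynamic = dynamic          # chosen by the reviewer before execution
        self.results = self._execute()  # original result set

    def _execute(self) -> List[str]:
        # Toy query: IDs of documents whose text contains the query string.
        return sorted(doc_id for doc_id, text in self.documents.items() if self.query in text)

    def on_documents_changed(self) -> None:
        """Called once a change to the documents has been detected."""
        if self.dynamic:
            self.results = self._execute()  # re-execute for an updated result set
        # Static: keep serving the original result set without re-executing.


docs = {"d1": "contract draft"}
static_session = SearchSession(docs, "contract", dynamic=False)
dynamic_session = SearchSession(docs, "contract", dynamic=True)

docs["d2"] = "contract final"
static_session.on_documents_changed()
dynamic_session.on_documents_changed()
```

After the change, `static_session.results` is still `["d1"]`, while `dynamic_session.results` becomes `["d1", "d2"]`.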
-
Publication No.: US11151132B2
Publication Date: 2021-10-19
Application No.: US16440971
Filing Date: 2019-06-13
IPC Classification: G06F16/00, G06F16/2453, G06F16/2458, G06F16/2457, G06F16/84
Abstract: Provided are a computer program product, system, and method for distributed processing of a query with distributed posting lists. A dispatch map has entries, wherein each entry identifies one of a plurality of terms in a dictionary, wherein for each of the terms there is a posting list identifying zero or more objects including the term, wherein at least one of the dispatch map entries indicates at least one distributed processing element including the posting list for the term. The dispatch map is used to dispatch sub-expressions comprising portions of a query to distributed processing elements having the posting lists for terms in the sub-expressions, wherein the distributed processing elements that receive the sub-expressions execute them on the posting lists for the terms in the sub-expressions.
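A dispatch map of this kind can be pictured as a dictionary from term to the processing element that holds its posting list. The sketch below simulates the processing elements as in-process dictionaries and treats each sub-expression as a single-term lookup combined with AND. Those simplifications, and the names `dispatch_map`, `nodes`, and `execute_and`, are assumptions.

```python
from typing import Dict, List, Set

# Term -> name of the processing element that holds the term's posting list.
dispatch_map: Dict[str, str] = {"apple": "node_a", "banana": "node_b"}

# Simulated processing elements, each with its own posting lists (object IDs).
nodes: Dict[str, Dict[str, Set[int]]] = {
    "node_a": {"apple": {1, 2, 3}},
    "node_b": {"banana": {2, 3, 4}},
}


def run_on_node(node: str, term: str) -> Set[int]:
    """Execute a single-term sub-expression on the element that owns the term."""
    return nodes[node].get(term, set())


def execute_and(terms: List[str]) -> Set[int]:
    """Dispatch one sub-expression per term, then intersect the partial results."""
    partials = [run_on_node(dispatch_map[t], t) for t in terms]
    return set.intersection(*partials) if partials else set()


matches = execute_and(["apple", "banana"])
```

Each term is resolved on the element that stores its posting list, and only the partial results are combined by the caller. Here the combined result is `{2, 3}`.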
-
Publication No.: US20210263977A1
Publication Date: 2021-08-26
Application No.: US16795678
Filing Date: 2020-02-20
IPC Classification: G06F16/93, G06F16/906, G06Q50/18
Abstract: Discovering second-order documents and latent custodians in an e-discovery system is provided. A list of first-order documents and document custodians within a base state of the e-discovery system are identified based on a plurality of terms corresponding to a meet and confer practice for a legal matter instance. The plurality of terms is masked within the first-order documents. The first-order documents having the plurality of terms masked are divided into groups. A list of second-order documents is generated from a group of documents. A list of second-order document custodians is generated based on corresponding custodian relationships to second-order documents. Finally, each second-order document custodian in the list of second-order document custodians that has a corresponding rank exceeding a defined rank threshold level is identified as an official document custodian in the e-discovery system.