Identifying information in plain text narratives EMRs

Granted invention patent

US11532387B2 Identifying information in plain text narratives EMRs (in force)

Patent title: Identifying information in plain text narratives EMRs
Application number: US16548443

Filing date: 2019-08-22
Publication (announcement) number: US11532387B2

Publication (announcement) date: 2022-12-20
Inventors: Bharath Dandala, Ananya Aniruddha Poddar, Murthy V. Devarakonda
Applicant: International Business Machines Corporation
Applicant address: Armonk, NY, US
Patentee: International Business Machines Corporation
Current patentee: International Business Machines Corporation
Current patentee address: Armonk, NY, US
Agent: Stephen J. Walder, Jr.; Kristofer Haggerty
Main classification: G16H15/00
IPC classifications: G16H15/00; G06N20/10; G16H10/60; G06N3/08; G06F40/30

Abstract:

A clinical information extraction and training mechanism is provided for automatically extracting and identifying information in plain text narratives in a set of electronic medical records. The mechanism segments each clinical note in a plurality of clinical notes into one or more identified sections, labels each identified section with an associated tag, and generates a tag data structure utilizing explicitly tagged sequences of sentences and associated tags. The mechanism performs statistical analysis of the identified sections that contain tags identified in the tag data structure to identify one or more valid stop/start conditions, extracts a first set of positive examples of sentences for a selected type of information, and then trains a cognitive system, using the positive examples of sentences for different types of information, to identify sentences in the plurality of clinical notes that fail to have a tag associated with the selected type.
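
The first steps of the abstract (segmenting notes into sections, labeling each section with a tag, building a tag data structure, and pulling positive example sentences for one type of information) can be illustrated with a minimal sketch. It is not the patented method: the header pattern, the function names, and the one-sentence-per-line convention are all assumptions made for illustration, and the statistical analysis of stop/start conditions and the training of the cognitive system are not shown.

```python
import re
from collections import defaultdict

# Assumption: a section starts at a line such as "Medications:" on its own.
# Real clinical notes use far more varied header conventions.
HEADER = re.compile(r"^([A-Za-z][A-Za-z ]*):\s*$")


def segment_note(note):
    """Split one note into (tag, sentences) pairs using explicit header lines.

    Lines that appear before the first header carry no tag and are skipped.
    Each non-empty line is treated as one sentence (a simplification).
    """
    sections = []
    tag, sentences = None, []
    for raw in note.splitlines():
        line = raw.strip()
        if not line:
            continue
        match = HEADER.match(line)
        if match:
            if tag is not None:
                sections.append((tag, sentences))
            tag, sentences = match.group(1).lower(), []
        elif tag is not None:
            sentences.append(line)
    if tag is not None:
        sections.append((tag, sentences))
    return sections


def build_tag_index(notes):
    """Build a hypothetical tag data structure: tag -> tagged sentences."""
    index = defaultdict(list)
    for note in notes:
        for tag, sentences in segment_note(note):
            index[tag].extend(sentences)
    return dict(index)


def positive_examples(index, tag):
    """Return the explicitly tagged sentences for one type of information."""
    return list(index.get(tag, []))


if __name__ == "__main__":
    notes = [
        "Medications:\nAspirin 81 mg daily.\n\nAllergies:\nPenicillin.\n",
        "Seen in clinic today.\nMedications:\nMetformin 500 mg.\n",
    ]
    index = build_tag_index(notes)
    print(positive_examples(index, "medications"))
```

In this sketch the sentences returned by `positive_examples` would serve as training data for a classifier that later labels untagged sentences, such as the "Seen in clinic today." line that precedes any header.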

Publication/grant documents

US20210057068A1 Identifying Information in Plain Text Narratives EMRs, publication/grant date: 2021-02-25

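IPC classification:

G	Physics
G16	Information and communication technology [ICT] specially adapted for specific application fields
G16H	Healthcare informatics, i.e. information and communication technology [ICT] specially adapted for the handling or processing of medical or healthcare data
G16H15/00	ICT specially adapted for medical reports, e.g. generation or transmission thereof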