- 专利标题: Identifying information in plain text narratives EMRs
-
申请号: US16548443申请日: 2019-08-22
-
公开(公告)号: US11532387B2公开(公告)日: 2022-12-20
- 发明人: Bharath Dandala , Ananya Aniruddha Poddar , Murthy V. Devarakonda
- 申请人: International Business Machines Corporation
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理商 Stephen J. Walder, Jr.; Kristofer Haggerty
- 主分类号: G16H15/00
- IPC分类号: G16H15/00 ; G06N20/10 ; G16H10/60 ; G06N3/08 ; G06F40/30
摘要:
A clinical information extraction and training mechanism is provided for automatically extracting and identifying information in plain text narratives in a set of electronic medical records. The mechanism segments each clinical note in a plurality of clinical notes into one or more identified sections, labels each identified section with an associated tag, and generate a tag data structure utilizing explicitly tagged sequences of sentences and associated tags. The mechanism performs statistical analysis of the identified sections that contain tags identified in the tag data structure to identify one or more valid stop/start conditions; extracts a first set of positive examples of sentences for a selected type of information, and then trains a cognitive system to identify sentences in the plurality of clinical notes that fail to have a tag associated with the selected type using the positive examples of sentences for different types of information.
公开/授权文献
- US20210057068A1 Identifying Information in Plain Text Narratives EMRs 公开/授权日:2021-02-25
信息查询