Invention Application
- Patent Title: METHOD FOR SEGMENTING COMMUNICATION TRANSCRIPTS USING UNSUPERVISED AND SEMI-SUPERVISED TECHNIQUES
- Application No.: US12060469
- Application Date: 2008-04-01
- Publication No.: US20090112571A1
- Publication Date: 2009-04-30
- Inventor: Krishna Kummamuru, Deepak S. Padmanabhan, Shourya Roy, L. Venkata Subramaniam
- Applicant: Krishna Kummamuru, Deepak S. Padmanabhan, Shourya Roy, L. Venkata Subramaniam
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Main IPC: G06F17/20
- IPC: G06F17/20

Abstract:
A method is provided for forming discrete segment clusters of one or more sequential sentences from a corpus of communication transcripts of transactional communications that comprises dividing the communication transcripts of the corpus into a first set of sentences spoken by a caller and a second set of sentences spoken by a responder; generating a set of sentence clusters by grouping the first and second sets of sentences according to a measure of lexical similarity using an unsupervised partitional clustering method; generating a collection of sequences of sentence types by assigning a distinct sentence type to each sentence cluster and representing each sentence of each communication transcript of the corpus with the sentence type assigned to the sentence cluster into which the sentence is grouped; and generating a specified number of discrete segment clusters by successively merging sentence clusters according to a proximity-based measure between the sentence types assigned to the sentence clusters within sequences of the collection.
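
The four steps of the abstract can be illustrated with a minimal sketch. It is not the patented implementation; every concrete choice is an assumption made for illustration: the function names (`cosine`, `partition`, `segment_corpus`), the toy two-call corpus, cosine similarity over word counts as the lexical measure, a small k-means-style loop as the partitional clustering method, clustering each speaker's sentences separately, and mean positional gap within a call as the proximity-based measure.

```python
import math
from collections import Counter
from itertools import combinations

def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def partition(sentences, k, iters=10):
    """Toy partitional clustering (k-means style, cosine similarity)."""
    vecs = [Counter(s.lower().split()) for s in sentences]
    centers = [vecs[0]]
    while len(centers) < k:  # deterministic farthest-first seeding
        centers.append(min(vecs, key=lambda v: max(cosine(v, c) for c in centers)))
    labels = [0] * len(vecs)
    for _ in range(iters):
        labels = [max(range(k), key=lambda j: cosine(v, centers[j])) for v in vecs]
        for j in range(k):
            members = [v for v, lab in zip(vecs, labels) if lab == j]
            if members:  # summed counts; cosine ignores the scale
                centers[j] = sum(members, Counter())
    return labels

def segment_corpus(transcripts, k_per_speaker, n_segments):
    # Step 1: divide sentences by speaker (caller vs. responder).
    by_speaker = {}
    for call in transcripts:
        for speaker, sentence in call:
            by_speaker.setdefault(speaker, []).append(sentence)
    # Step 2: cluster each speaker's sentences by lexical similarity.
    sentence_type = {}
    for speaker, sentences in by_speaker.items():
        unique = sorted(set(sentences))
        labels = partition(unique, min(k_per_speaker, len(unique)))
        for sentence, lab in zip(unique, labels):
            sentence_type[(speaker, sentence)] = f"{speaker}{lab}"
    # Step 3: rewrite every call as a sequence of sentence types.
    sequences = [[sentence_type[turn] for turn in call] for call in transcripts]

    # Step 4: successively merge the two groups of types that sit closest
    # together (smallest mean positional gap) inside the sequences.
    def gap(a, b):
        d = [abs(i - j) for seq in sequences
             for i, x in enumerate(seq) if x in a
             for j, y in enumerate(seq) if y in b]
        return sum(d) / len(d) if d else math.inf

    groups = [frozenset([t]) for t in sorted({t for seq in sequences for t in seq})]
    while len(groups) > n_segments:
        a, b = min(combinations(groups, 2), key=lambda pair: gap(*pair))
        groups = [g for g in groups if g not in (a, b)] + [a | b]
    return sequences, groups

# Hypothetical two-call corpus, for illustration only.
transcripts = [
    [("caller", "hello i want to book a car"), ("responder", "hello thanks for calling"),
     ("caller", "thanks bye"), ("responder", "thanks goodbye")],
    [("caller", "hello i want to rent a car"), ("responder", "hello thanks for calling"),
     ("caller", "ok thanks bye"), ("responder", "thanks goodbye")],
]
sequences, segments = segment_corpus(transcripts, k_per_speaker=2, n_segments=2)
print(sequences[0])
print(segments)
```

On this toy corpus the greeting sentences of both speakers end up in one segment cluster and the closing sentences in the other, because those sentence types sit next to each other in every call. A real corpus would call for a proper clustering library and a tuned choice of the number of clusters.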
Public/Granted literature
- US07912714B2: Method for segmenting communication transcripts using unsupervised and semi-supervised techniques (Public/Granted day: 2011-03-22)