-
公开(公告)号:US20240211994A1
公开(公告)日:2024-06-27
申请号:US18145279
申请日:2022-12-22
Applicant: Verizon Patent and Licensing Inc.
Inventor: Subham Biswas , Saurabh Tahiliani
IPC: G06Q30/0251 , G06V10/82 , G06V20/40
CPC classification number: G06Q30/0251 , G06V10/82 , G06V20/41
Abstract: A method may include receiving frames associated with a video stream, identifying a first object image included in at least some of the frames and masking a region, in the at least some of the frames, associated with the first object image. The method may also include receiving information identifying at least one attribute associated with a user and identifying, based on the received information, a second object image to replace the first object image. The method may further include replacing pixel values in the masked region with contextually suitable pixel values associated with the second object image and outputting the video stream with the second object image replacing the first object image in the at least some of the frames.
-
公开(公告)号:US11889168B1
公开(公告)日:2024-01-30
申请号:US17811732
申请日:2022-07-11
Applicant: Verizon Patent and Licensing Inc.
Inventor: Subham Biswas , Saurabh Tahiliani
IPC: H04N21/8549 , G10L15/02
CPC classification number: H04N21/8549 , G10L15/02 , G10L2015/025
Abstract: A video summary device may generate a textual summary of a transcription of a virtual event. The video summary device may generate a phonemic transcription of the textual summary and generate a text embedding based on the phonemic transcription. The video summary device may generate an audio embedding based on a target voice. The video summary device may generate an audio output of the phonemic transcription uttered by the target voice. The audio output may be generated based on the text embedding and the audio embedding. The video summary device may generate an image embedding based on video data of a target user. The image embedding may include information regarding images of facial movements of the target user. The video summary device may generate a video output of different facial movements of the target user uttering the phonemic transcription, based on the text embedding and the image embedding.
-
公开(公告)号:US11825353B2
公开(公告)日:2023-11-21
申请号:US17536971
申请日:2021-11-29
Applicant: Verizon Patent and Licensing Inc.
Inventor: Seng Gan , Subham Biswas , Christopher A. Graffeo , Saurabh Tahiliani
IPC: H04W28/08
CPC classification number: H04W28/0958 , H04W28/095
Abstract: A system described herein may provide a technique for the assignment of Centralized Units (“CUs”) to Distributed Units (“DUs”) in a radio access network (“RAN”) that includes a distributed or hierarchical arrangement of network infrastructure equipment. Different groups of DUs may be modeled based on usage or traffic patterns, and complementary groups of DUs may be identified based on measures of usage that may vary with time. For example, one model associated with one group of DUs may experience relatively heavy usage during morning hours and light usage during evening hours, and another model associated with a complementary group of DUs may experience relatively light usage during morning hours and heavy usage during evening hours.
-
公开(公告)号:US20250124452A1
公开(公告)日:2025-04-17
申请号:US18488557
申请日:2023-10-17
Applicant: Verizon Patent and Licensing Inc.
Inventor: Durgesh Kumar , Saurabh Tahiliani
IPC: G06Q20/40
Abstract: The present teaching relates to customer service with AI-based automated auditing on agent fraud. Real-time features of a communication between an agent and a customer are obtained. To detect agent fraud, a batch feature vector is computed based on real-time features extracted from communications involving the agent and accumulated over a batch period. Agent fraud is detected based on a model and the detection result is used to audit the agent for service performance.
-
15.
公开(公告)号:US20250053992A1
公开(公告)日:2025-02-13
申请号:US18447839
申请日:2023-08-10
Applicant: Verizon Patent and Licensing Inc.
Inventor: Durgesh Kumar , Saurabh Tahiliani
IPC: G06Q30/015 , G06Q30/0601
Abstract: The present teaching relates to conduct persona-adaptive communications with a customer at a geo-locale. Transcripts of a current and historic communications involving the customer are used to characterize the persona of the customer. Transcripts of historic communications with customers at the geo-locale are used to characterize the persona of the geo-locale. Current persona of the customer exhibited in the current communication is combined with the customer's persona and the geo-locale's persona to compute a response input vector, A language model generates, based on the response input vector, a persona-adaptive response, which is then sent to the customer a response.
-
公开(公告)号:US12200322B2
公开(公告)日:2025-01-14
申请号:US18389764
申请日:2023-12-19
Applicant: Verizon Patent and Licensing Inc.
Inventor: Subham Biswas , Saurabh Tahiliani
IPC: H04N21/8549 , G10L15/02
Abstract: A video summary device may generate a textual summary of a transcription of a virtual event. The video summary device may generate a phonemic transcription of the textual summary and generate a text embedding based on the phonemic transcription. The video summary device may generate an audio embedding based on a target voice. The video summary device may generate an audio output of the phonemic transcription uttered by the target voice. The audio output may be generated based on the text embedding and the audio embedding. The video summary device may generate an image embedding based on video data of a target user. The image embedding may include information regarding images of facial movements of the target user. The video summary device may generate a video output of different facial movements of the target user uttering the phonemic transcription, based on the text embedding and the image embedding.
-
公开(公告)号:US20240311983A1
公开(公告)日:2024-09-19
申请号:US18122159
申请日:2023-03-16
Applicant: Verizon Patent and Licensing Inc.
Inventor: Subham Biswas , Saurabh Tahiliani
IPC: G06T5/00 , G06T5/50 , G06V10/774 , G06V10/82 , G06V20/40
CPC classification number: G06T5/77 , G06T5/50 , G06T5/80 , G06V10/774 , G06V10/82 , G06V20/40 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084
Abstract: In an example, an image may be identified. Object detection may be performed on the image to identify a region including a distorted representation of an object. The region may be masked to generate a masked image including a masked region corresponding to the object. Using a machine learning model, the masked region may be replaced with an undistorted representation of the object to generate a modified image.
-
公开(公告)号:US11750742B2
公开(公告)日:2023-09-05
申请号:US17930480
申请日:2022-09-08
Applicant: Verizon Patent and Licensing Inc.
Inventor: Subham Biswas , Saurabh Tahiliani
CPC classification number: H04M3/5183 , G10L15/1815 , G10L25/06 , G10L25/63 , H04M3/5175
Abstract: A device may receive audio data of a first call between a first user and a second user. The device may generate, based on the audio data, time series data associated with an audio signal of the first call and may process, using a first machine learning model, the time series data to generate first call insight information regarding one or more first insights associated with the first call. The device may process the audio data to generate image data associated with the audio signal and may process, using a second machine learning model, the image data to generate second call insight information regarding one or more second insights associated with the first call. The device may combine the first call insight information and the second call insight information to generate combined call insight information and cause an action to be performed based on the combined call insight information.
-
公开(公告)号:US20230215128A1
公开(公告)日:2023-07-06
申请号:US17568770
申请日:2022-01-05
Applicant: Verizon Patent and Licensing Inc.
Inventor: Prakash Ranganathan , Saurabh Tahiliani
CPC classification number: G06V10/273 , G06V20/41 , G06V10/82 , G06V20/52 , G06V40/103
Abstract: Systems and methods described herein utilize synthetic pixel generation using a custom neural network to generate synthetic versions of objects hidden by occlusions for effective detection and tracking. A computing device stores an object detector model and a synthetic image generator model; receives a video feed; detects objects of interest in a current frame of the video feed; identifies an occluded object in the current frame; retrieves a previous frame from the video feed; generates synthetic data based on the previous frame for the occluded object; and forwards a modified version of the current frame to an object tracking system, wherein the modified version of the current frame includes the synthetic data.
-
公开(公告)号:US11451666B1
公开(公告)日:2022-09-20
申请号:US17445360
申请日:2021-08-18
Applicant: Verizon Patent and Licensing Inc.
Inventor: Subham Biswas , Saurabh Tahiliani
Abstract: A device may receive audio data of a first call between a first user and a second user. The device may generate, based on the audio data, time series data associated with an audio signal of the first call and may process, using a first machine learning model, the time series data to generate first call insight information regarding one or more first insights associated with the first call. The device may process the audio data to generate image data associated with the audio signal and may process, using a second machine learning model, the image data to generate second call insight information regarding one or more second insights associated with the first call. The device may combine the first call insight information and the second call insight information to generate combined call insight information and cause an action to be performed based on the combined call insight information.
-
-
-
-
-
-
-
-
-