-
11.
Publication number: US20180192058A1
Publication date: 2018-07-05
Application number: US15840893
Application date: 2017-12-13
Applicant: Sony Interactive Entertainment Inc.
Inventor: Eric Hsuming Chen , Hung-Ju Lee , Jason N. Wang , Rathish Krishnan , Deepali Arya
IPC: H04N19/167 , H04N19/119 , H04N19/172 , G06T3/40 , G06T7/11 , G06T11/60
CPC classification number: H04N19/167 , G06T3/403 , G06T7/11 , G06T7/73 , G06T11/60 , G06T2207/30201 , H04N19/119 , H04N19/132 , H04N19/17 , H04N19/172 , H04N19/59
Abstract: Gaze tracking data is analyzed to determine one or more regions of interest within an image of a video stream. The video stream data is selectively scaled so that sections within the regions of interest maintain high resolution while areas not within the region of interest are down-scaled to reduce bandwidth cost of transmission. A scheme for reduction of motion sickness by reducing the size of the high resolution area is also claimed.
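As a rough illustration of the first step in this abstract, the sketch below derives a rectangular region of interest from a single gaze point. The function name `roi_from_gaze`, the fixed rectangle size, and the clamping rule are assumptions made for illustration; the publication does not specify them here. Passing a smaller `roi_w`/`roi_h` corresponds loosely to shrinking the high-resolution area.

```python
def roi_from_gaze(gaze_x, gaze_y, frame_w, frame_h, roi_w, roi_h):
    """Return (left, top, right, bottom) of an ROI centred on the gaze point.

    Hypothetical helper: the rectangle is shifted, not shrunk, when the gaze
    point is close to a frame edge, so the ROI always has the requested size.
    """
    left = min(max(gaze_x - roi_w // 2, 0), frame_w - roi_w)
    top = min(max(gaze_y - roi_h // 2, 0), frame_h - roi_h)
    return (left, top, left + roi_w, top + roi_h)


# Gaze at the centre of a 1920x1080 frame, 640x360 high-resolution window.
centre_roi = roi_from_gaze(960, 540, 1920, 1080, 640, 360)

# Gaze near the top-left corner: the window is clamped to the frame.
corner_roi = roi_from_gaze(100, 50, 1920, 1080, 640, 360)
```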
-
12.
Publication number: US20180007362A1
Publication date: 2018-01-04
Application number: US15199686
Application date: 2016-06-30
Applicant: Sony Interactive Entertainment Inc.
Inventor: Rathish Krishnan
IPC: H04N19/132 , H04N19/70 , H04N19/33 , H04N19/169 , H04N19/172 , H04L29/06 , H04N19/159
Abstract: Input digital frames may be down-sampled to create one or more base frames characterized by a lower resolution than the input digital frames. Enhancement information corresponding to a difference between pixel values for the one or more input digital frames and corresponding pixel values of up-sampled versions of the one or more base frames is then created. The one or more base frames are encoded to form a set of base data and the enhancement information is encoded to form a set of enhancement data. The base data and enhancement data may then be transmitted over a network or stored in a memory.
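The base/enhancement split described in this abstract can be sketched in a few lines. This is a minimal illustration, not the method of the publication: the block-average down-sampler, the nearest-neighbour up-sampler, and all function names are assumptions, and the actual encoding of the two layers is not modelled. Frame dimensions are assumed to be divisible by `factor`.

```python
def downsample(frame, factor=2):
    """Average each factor x factor block (assumed filter) to form a base frame."""
    h, w = len(frame), len(frame[0])
    area = factor * factor
    return [
        [
            sum(frame[y + dy][x + dx]
                for dy in range(factor) for dx in range(factor)) // area
            for x in range(0, w, factor)
        ]
        for y in range(0, h, factor)
    ]


def upsample(base, factor=2):
    """Nearest-neighbour up-sampling back to the input resolution."""
    return [
        [base[y // factor][x // factor] for x in range(len(base[0]) * factor)]
        for y in range(len(base) * factor)
    ]


def make_layers(frame, factor=2):
    """Return (base frame, enhancement = input minus up-sampled base)."""
    base = downsample(frame, factor)
    predicted = upsample(base, factor)
    enhancement = [
        [p - q for p, q in zip(row, prow)]
        for row, prow in zip(frame, predicted)
    ]
    return base, enhancement


def reconstruct(base, enhancement, factor=2):
    """Add the enhancement back onto the up-sampled base frame."""
    predicted = upsample(base, factor)
    return [
        [q + e for q, e in zip(prow, erow)]
        for prow, erow in zip(predicted, enhancement)
    ]


frame = [
    [10, 12, 200, 202],
    [14, 16, 204, 206],
    [50, 50, 90, 94],
    [50, 50, 98, 102],
]
base, enhancement = make_layers(frame)
restored = reconstruct(base, enhancement)
```

A receiver that gets only the base data can still display `upsample(base)`; one that also gets the enhancement data recovers the input exactly in this lossless toy setting.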
-
Publication number: US12271975B2
Publication date: 2025-04-08
Application number: US18060552
Application date: 2022-11-30
Applicant: Sony Interactive Entertainment Inc.
Inventor: Rathish Krishnan , Deepali Arya , Manoj Srivastava , Seema Kataria
IPC: G06T11/00 , A63F13/213 , A63F13/26 , G06N3/0442
Abstract: A machine learning (ML) model is trained using pairs of images. Each pair includes an image of a human face and a duplicate of the image with a computer game headset overlaid on the face using computer graphics. The ML model subsequently can be used to receive an image of a gamer wearing a headset and output a full-face image of the gamer for use in, e.g., social network settings related to the game.
-
Publication number: US20250108291A1
Publication date: 2025-04-03
Application number: US18478174
Application date: 2023-09-29
Applicant: Sony Interactive Entertainment Inc.
Inventor: Rathish Krishnan , Eric Hsuming Chen , Jason Wang , Deepali Arya , Hung-Ju Lee
IPC: A63F13/335
Abstract: Techniques are described for reducing latency in networked gaming by reducing I-frame sizes (which also results in automatically increasing P-frame sizes) to reduce the overall amount of video being transmitted. The reduced size of the I-frames is compensated for by increasing the size of other frames using a low pass filter (LPF) such as a Gaussian filter which reduces sharpness that the decoder can try to recover, or by use of lower resolution. The I-frame can be reduced by rotating it or flipping/mirroring it to produce the smaller coded frame, sending a flag to signal the orientation.
-
Publication number: US20210266571A1
Publication date: 2021-08-26
Application number: US17313882
Application date: 2021-05-06
Applicant: Sony Interactive Entertainment Inc.
Inventor: Eric Hsuming Chen , Hung-Ju Lee , Jason N. Wang , Rathish Krishnan , Deepali Arya
IPC: H04N19/167 , G06T7/73 , H04N19/59 , H04N19/17 , H04N19/132 , H04N19/119 , H04N19/172 , G06T7/11 , G06T3/40 , G06T11/60
Abstract: Video stream data is selectively scaled so that sections within regions of interest (ROI) maintain high resolution while areas not within the region of interest are down-scaled to reduce bandwidth cost of transmission. A low compression encoder compresses sections of a video frame corresponding to one or more ROI without motion search or prediction mode decision to generate low-compression section data. The video frame is downscaled and a high compression encoder compresses the resulting downscaled video frame with prediction mode decision to generate high-compression frame data.
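The two data paths in this abstract (full-resolution ROI sections plus a downscaled whole frame) can be pictured with the sketch below. It only prepares the inputs that the two encoders would receive; the encoders themselves are not modelled. The helper names and the decimating scaler are assumptions for illustration.

```python
def crop(frame, roi):
    """Cut the ROI section out of the frame at full resolution."""
    left, top, right, bottom = roi
    return [row[left:right] for row in frame[top:bottom]]


def downscale(frame, factor):
    """Keep every factor-th sample in each direction.

    Assumed scaler: a real system would low-pass filter before decimating.
    """
    return [row[::factor] for row in frame[::factor]]


def split_for_encoding(frame, roi, factor=2):
    """Return (input for the low-compression path, input for the high-compression path)."""
    return crop(frame, roi), downscale(frame, factor)


def sample_count(image):
    return sum(len(row) for row in image)


# 8x8 test frame whose sample value encodes its position.
frame = [[y * 8 + x for x in range(8)] for y in range(8)]
roi = (2, 2, 6, 6)  # left, top, right, bottom (exclusive)

roi_section, low_res = split_for_encoding(frame, roi, factor=2)
total_samples = sample_count(roi_section) + sample_count(low_res)
```

In this toy case the two paths together carry 32 samples instead of the 64 in the original frame, which is the bandwidth saving the abstract refers to.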
-
Publication number: US20210142520A1
Publication date: 2021-05-13
Application number: US16721733
Application date: 2019-12-19
Applicant: Sony Interactive Entertainment Inc.
Inventor: Rathish Krishnan , Jason N. Wang
IPC: G06T9/00 , G06T3/40 , G06T5/20 , G06T5/00 , H04N19/167 , H04N19/132 , H04N19/176
Abstract: A method, system and computer readable instructions for video encoding comprise determining one or more region of interest (ROI) parameters for pictures in a picture stream and a temporal down-sampling interval. One or more areas outside the ROI in a picture in the picture stream are temporally down-sampled according to the interval. The resulting temporally down-sampled picture is then encoded and the encoded picture is transmitted. Additionally, a picture encoded in this way in an encoded picture stream may be decoded and areas outside an ROI of the picture may be temporally up-sampled. The temporally up-sampled areas outside the ROI are inserted into the decoded picture stream.
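One way to picture temporal down-sampling outside the ROI is sketched below. It is an illustration under stated assumptions, not the claimed method: samples outside the ROI are kept only in every `interval`-th picture (marked `None` otherwise), and the decoder side restores them by repeating the most recent transmitted value. The publication may use a different up-sampling rule, such as interpolation; all names are hypothetical.

```python
def in_roi(x, y, roi):
    left, top, right, bottom = roi
    return left <= x < right and top <= y < bottom


def temporal_downsample(pictures, roi, interval):
    """Drop samples outside the ROI except in every interval-th picture."""
    sparse = []
    for i, pic in enumerate(pictures):
        keep_outside = (i % interval == 0)
        sparse.append([
            [v if keep_outside or in_roi(x, y, roi) else None
             for x, v in enumerate(row)]
            for y, row in enumerate(pic)
        ])
    return sparse


def temporal_upsample(sparse):
    """Fill dropped samples by holding the last available value (assumed rule)."""
    restored, last = [], None
    for pic in sparse:
        full = [
            [v if v is not None else last[y][x] for x, v in enumerate(row)]
            for y, row in enumerate(pic)
        ]
        restored.append(full)
        last = full
    return restored


# Three 2x3 pictures; the sample value is 10 * picture index + column.
pictures = [[[t * 10 + x for x in range(3)] for _ in range(2)] for t in range(3)]
roi = (0, 0, 1, 2)  # only column 0 is inside the ROI

sparse = temporal_downsample(pictures, roi, interval=2)
restored = temporal_upsample(sparse)
```

Picture 0 always carries its outside-ROI samples, so the hold rule has a value to repeat from the start.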
-
Publication number: US20190108859A1
Publication date: 2019-04-11
Application number: US16215295
Application date: 2018-12-10
Applicant: Sony Interactive Entertainment Inc.
Inventor: Rathish Krishnan
IPC: G11B27/10 , H04N21/81 , H04N5/232 , H04N5/262 , H04N21/218
Abstract: Some embodiments provide methods of playing back content, comprising: accessing video content comprising a series of frames that if fully decoded would extend beyond a viewer's field of view, and wherein each encoded frame comprises multiple encoded sections; determining a field of view of the viewer; identifying one or more sections of the first frame that are at least partially within the field of view; decoding the one or more sections of the first frame while not decoding one or more of the sections of the first frame that are not within the field of view; and displaying the one or more decoded sections of the first frame such that the portion of the first frame is displayed, and wherein less than all of the first frame is decoded and less than all of the first frame is displayed during playback.
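The section-selection step in this abstract amounts to a rectangle overlap test. The sketch below assumes a regular grid of equally sized sections and a rectangular field of view in frame coordinates; it ignores the horizontal wrap-around a 360-degree frame would need. The function name and the grid layout are assumptions for illustration.

```python
def sections_in_view(frame_w, frame_h, cols, rows, fov):
    """Return (row, col) of every section at least partially inside the field of view.

    fov is (left, top, right, bottom) in frame pixels, right/bottom exclusive.
    """
    left, top, right, bottom = fov
    sec_w, sec_h = frame_w // cols, frame_h // rows
    visible = []
    for r in range(rows):
        for c in range(cols):
            x0, y0 = c * sec_w, r * sec_h
            x1, y1 = x0 + sec_w, y0 + sec_h
            # Two rectangles overlap when they overlap on both axes.
            if x0 < right and x1 > left and y0 < bottom and y1 > top:
                visible.append((r, c))
    return visible


# 3840x1920 frame split into 4 columns x 2 rows of 960x960 sections.
to_decode = sections_in_view(3840, 1920, cols=4, rows=2, fov=(1000, 200, 2200, 900))
```

Only the sections in `to_decode` would be passed to the decoder; the remaining six sections of this frame would be skipped.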
-
Publication number: US20240408483A1
Publication date: 2024-12-12
Application number: US18208159
Application date: 2023-06-09
Applicant: Sony Interactive Entertainment Inc.
Inventor: Rathish Krishnan , Chockalingam Ravi Sundaram , Charlie Denison , Ryder McMinn , Orlando Cardoso , Warren Benedetto , Vinit Acharya
IPC: A63F13/52 , A63F13/5378 , A63F13/60 , G06F3/04817 , G06V20/70
Abstract: A system for generating gameplay context information for a game may include a game screen classification module trained to classify contextually relevant data from gameplay data, one or more game object recognition modules trained to detect game icons from gameplay data, and a multimodal context generation neural network module trained to generate structured gameplay context information from the contextually relevant data and icons within the gameplay data. The multimodal context generation neural network module at least partially generates structured gameplay context information. The modules may include neural networks trained by suitable machine learning algorithms using suitable masked data and labeled data.
-
Publication number: US20240177359A1
Publication date: 2024-05-30
Application number: US18060552
Application date: 2022-11-30
Applicant: Sony Interactive Entertainment Inc.
Inventor: Rathish Krishnan , Deepali Arya , Manoj Srivastava , Seema Kataria
IPC: G06T11/00 , A63F13/213 , A63F13/26 , G06N3/0442
CPC classification number: G06T11/00 , A63F13/213 , A63F13/26 , G06N3/0442 , A63F2300/8082 , G06T2210/62
Abstract: A machine learning (ML) model is trained using pairs of images. Each pair includes an image of a human face and a duplicate of the image with a computer game headset overlaid on the face using computer graphics. The ML model subsequently can be used to receive an image of a gamer wearing a headset and output a full-face image of the gamer for use in, e.g., social network settings related to the game.
-
Publication number: US20230222754A1
Publication date: 2023-07-13
Application number: US17571397
Application date: 2022-01-07
Applicant: Sony Interactive Entertainment Inc.
Inventor: Rathish Krishnan
CPC classification number: G06V10/24 , H04N5/23296 , H04N5/08 , G06V10/25 , G06T7/33
Abstract: Responsive to a zoom command when presenting a first video, a second video is combined with the first video and presented. The first and second videos are generated from substantially the same camera location as each other at substantially the same time with substantially the same resolution. However, the second video is generated by a physical or virtual lens having a field of view (FOV) smaller than the FOV of a physical or virtual lens used in generating the first video. Modules are described for using alignment metrics to correctly place the second video over the inner video and make it appear seamless.