-
公开(公告)号:US20230062598A1
公开(公告)日:2023-03-02
申请号:US17893693
申请日:2022-08-23
发明人: Roi Nathan , Tal Rosenwein , Nir Sancho , Yonatan Wexler , Amnon Shashua
IPC分类号: G06F3/16 , G10L15/22 , G10L15/30 , G10L25/78 , G06V20/50 , G06V40/16 , G10L15/25 , G06V20/62 , G06V40/20 , H04R1/08 , H04R3/00 , G06V30/19
摘要: A method for adjusting an audio transmission when a user of the system is being spoken to by another person includes receiving audio signals representative of sounds from an environment of the user captured by at least one microphone; determining at least from the received audio signals that the another person is speaking to user; and subject to the user being spoken to by the another person, adjusting the audio transmission to the user and signaling to the user that the user is being spoken to.
-
公开(公告)号:US11580727B2
公开(公告)日:2023-02-14
申请号:US17141985
申请日:2021-01-05
发明人: Yonatan Wexler , Amnon Shashua
摘要: System and methods for processing audio signals are disclosed. In one implementation, a system may comprise a wearable camera configured to capture images from an environment of a user; a microphone configured to capture sounds from the environment of the user; and a processor. The processor may be configured to receive at least one image of the plurality of images, the at least one image comprising a plurality of image portions associated with corresponding image portion timestamps; receive at least one audio signal representative of the sounds captured by the at least one microphone; identify an audio timestamp associated with a portion of the audio signal; identify an image portion from among the plurality of image portions, the image portion having an image portion timestamp associated with the audio timestamp; and analyze the image portion to identify a voice originating from an object represented in the image.
-
公开(公告)号:US20230012272A1
公开(公告)日:2023-01-12
申请号:US17756991
申请日:2020-12-10
发明人: Yonatan Wexler , Amnon Shashua , Tal Rosenwein
IPC分类号: G10L13/02 , G06V30/148 , G06V40/20 , G06F3/01
摘要: Systems and methods are disclosed for selectively reading text. A system may comprise an image capture device, an audio capture device, and a processor. The processor may be configured to receive images captured by the image capture device and audio signals captured by the audio capture device. The processor may analyze the image to identify text represented in the image; identify, based on the image, a structural element of the text; identify a request to read a first portion of the text associated with the structural element, the request being identified by at least one of analyzing the audio signals to detect a spoken request or detecting a gesture in the plurality of images; and present the first portion of text to the user of the wearable device.
-
公开(公告)号:US20230005471A1
公开(公告)日:2023-01-05
申请号:US17850578
申请日:2022-06-27
发明人: Yonatan Wexler , Amnon Shashua , Tal Rosenwein
IPC分类号: G10L15/08 , G10L15/18 , G10L15/183 , G06F40/284
摘要: A method for responding to a user query based on captured images and audio. An audio signal captured by at least one microphone is analyzed to determine at least one word. At least one image captured by at least one image sensor is analyzed to determine at least one identifier of at least one of a person, an object, a location, or an event represented in the image. The at least one word and the at least one identifier are stored in a database. A question is received from the user and is analyzed to determine at least one term. The database is searched to determine a correlation between the at least one term and the at least one word or between the at least one term and the at least one identifier. A response to the question is generated based on the correlation and is provided to the user.
-
公开(公告)号:US11546690B2
公开(公告)日:2023-01-03
申请号:US17241283
申请日:2021-04-27
发明人: Yonatan Wexler , Amnon Shashua
IPC分类号: H04R3/00 , H04R5/04 , G06F16/683 , G10L15/08 , G06V40/16
摘要: A wearable device may include an image sensor configured to capture a plurality of images from an environment, a microphone configured to capture sounds from the environment, and at least one processor. The at least one processor may be programmed to receive audio signals representative of the sounds captured by the at least one microphone, and receive a first image including a representation of a first individual from among the plurality of images captured by the image sensor. The at least one processor may also be programmed to obtain a first audio segment from the audio signals using the first image. The first audio segment may include a first portion of the audio signals in which the first individual is speaking. The at least one processor may also be programmed to receive a second image including a representation of a second individual from among the plurality of images captured by the image sensor, and obtain a second audio segment from the audio signals using the second image. The second audio segment may include a second portion of the audio signals in which the second individual is speaking. The at least one processor may also be programmed to receive a third image including a representation of the first individual from among the plurality of images captured by the image sensor, and using the third image, obtain a third audio segment from the audio signals. The audio segment may include a third portion of the audio signals in which the first individual is speaking. The at least one processor may also associate the first and third audio segments with the first individual and associate the second audio segment with the second individual.
-
公开(公告)号:US20220248149A1
公开(公告)日:2022-08-04
申请号:US17585853
申请日:2022-01-27
发明人: Yonatan WEXLER , Amnon Shashua
摘要: A hearing aid and related systems and methods are disclosed. In one implementation, a system may comprise a microphone and a processor. The processor may be configured to receive an original audio signal representative of sounds captured by the microphone; determine that the original audio signal includes a voice of the user; process the original audio signal according to a first processing scheme to generate a first processed audio signal; transmit the first processed audio signal to a hearing interface device after a first time delay associated with the first processing scheme; determine that the original audio signal includes an additional sound; process the original audio signal according to a second processing scheme to generate a second processed audio signal; and transmit the second processed audio signal to the hearing interface device after a second time delay associated with the second processing scheme.
-
公开(公告)号:US20220191568A9
公开(公告)日:2022-06-16
申请号:US15649036
申请日:2017-07-13
发明人: Yonatan Wexler , Amnon Shashua
IPC分类号: H04N21/2668 , H04N21/442 , G06T7/11 , H04N21/422 , H04N7/18 , H04N5/247 , H04L29/08 , H04N21/4223 , G06F1/16 , G06F17/30 , G06K9/00 , G06Q30/02 , G06K9/46 , G10L15/30
摘要: A wearable apparatus is provided for capturing and processing images from an environment of a user. In one implementation, a wearable apparatus for monitoring activities includes a wearable image sensor configured to capture a plurality of images from an environment of a user of the wearable apparatus. The wearable apparatus also includes at least one processing device programmed to analyze the plurality of images to identify in one or more of the plurality of images at least one indicator of an activity, and to transmit, to an external device, the at least one indicator of the activity.
-
公开(公告)号:US20210350823A1
公开(公告)日:2021-11-11
申请号:US17315780
申请日:2021-05-10
发明人: Yonatan Wexler , Amnon Shashua
摘要: A wearable device for processing audio signals may include a microphone configured to capture sounds from an environment of a user and at least one processor. The processor may be programmed to receive first audio signals captured by the microphone during a first time period during which the user is in a location, and obtain an audio segment from the first audio signals. The audio segment may include a portion of the first audio signals in which an individual is speaking. The processor may also be programmed to generate a voice print of the individual using at least the audio segment, and receive second audio signals representative of additional sounds captured by the microphone. The additional sounds may include sounds made by the individual. The second audio signals may be at least one of audio signals captured by the microphone within a predetermined time period after the first time period, or audio signals captured by the microphone while the user is in the location. The at least one processor may also be programmed to process the second audio signals using the generated voice print.
-
公开(公告)号:US20210209362A1
公开(公告)日:2021-07-08
申请号:US17141985
申请日:2021-01-05
发明人: Yonatan Wexler , Amnon Shashua
摘要: System and methods for processing audio signals are disclosed. In one implementation, a system may comprise a wearable camera configured to capture images from an environment of a user; a microphone configured to capture sounds from the environment of the user; and a processor. The processor may be configured to receive at least one image of the plurality of images, the at least one image comprising a plurality of image portions associated with corresponding image portion timestamps; receive at least one audio signal representative of the sounds captured by the at least one microphone, identify an audio timestamp associated with a portion of the audio signal; identify an image portion from among the plurality of image portions, the image portion having an image portion timestamp associated with the audio timestamp; and analyze the image portion to identify a voice originating from an object represented in the image.
-
公开(公告)号:USD910735S1
公开(公告)日:2021-02-16
申请号:US29681243
申请日:2019-02-22
设计人: Yonatan Wexler , Dave Bortz , Amnon Shashua
-
-
-
-
-
-
-
-
-