Abstract:
An intention identification model learning method includes receiving positive data that corresponds to a first skill; generating, based on the positive data, negative data that corresponds to the first skill; determining a second skill similar to the first skill; obtaining data that corresponds to each second skill; generating a second base model based on the data that corresponds to the second skill and a first base model stored on a server; and performing learning based on the second base model and on the positive data and the negative data that correspond to the first skill, to generate an intention identification model.
Abstract:
This application provides an image aesthetic processing method and an electronic device. A method for generating an image aesthetic scoring model includes: constructing a first neural network based on a preset convolutional structure set; obtaining an image classification neural network, where the image classification neural network is used to classify image scenarios; obtaining a second neural network based on the first neural network and the image classification neural network, where the second neural network is a neural network containing scenario information; and determining an image aesthetic scoring model based on the second neural network, where output information of the image aesthetic scoring model includes image scenario classification information. In this method, scenario information is integrated into a backbone neural network, so that a resulting image aesthetic scoring model is interpretable. In addition, scoring accuracy of the image aesthetic scoring model can be improved by using the preset convolutional structure set.
Abstract:
A synchronous audio playback method, apparatus and system are provided. The method includes calculating a first playback delay generated when an audio file is played; acquiring a second playback delay; calculating a delay offset; calculating a data adjustment amount; adding to-be-adjusted data to or deleting to-be-adjusted data from a current to-be-played part of the audio file according to a value of the delay offset, to obtain a replacement part corresponding to the current to-be-played part, where a data amount of the to-be-adjusted data is the data adjustment amount; and playing the replacement part. With the technical solutions of the present invention, playback duration of a current to-be-played part can be adjusted in a manner of adding or deleting data, so that a controlling device and a controlled device generate sounds simultaneously when playing a to-be-played clip in a very simple implementation process.
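The adjustment step above can be sketched as follows. This is a minimal illustration, not the claimed implementation: it assumes the first playback delay belongs to the local device and the second to the peer device, that delays are given in milliseconds, that samples are deleted from the start of the part, and that added data is silence. The function name `adjust_chunk` and all parameter names are hypothetical.

```python
def adjust_chunk(chunk, first_delay_ms, second_delay_ms, sample_rate_hz):
    """Return a replacement for `chunk` that compensates the delay offset.

    Assumption: a positive offset means the local device lags its peer,
    so samples are deleted; a negative offset means it leads, so silence
    is added.
    """
    # Delay offset between the two devices.
    offset_ms = first_delay_ms - second_delay_ms
    # Data adjustment amount, expressed in samples.
    amount = round(abs(offset_ms) * sample_rate_hz / 1000)

    if offset_ms > 0:
        # Delete data: drop samples from the start of the part.
        amount = min(amount, len(chunk))
        return list(chunk[amount:])
    if offset_ms < 0:
        # Add data: prepend silent samples.
        return [0] * amount + list(chunk)
    return list(chunk)
```

With a 1000 Hz sample rate and a 20 ms offset, a 100-sample part would shrink or grow by 20 samples, depending on the sign of the offset.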
Abstract:
A digital compressor for compressing an input audio signal is presented. The digital compressor comprises a compression gain control for providing a compression gain parameter, and a compression parameter determiner for determining a compression ratio from the compression gain parameter. The compression parameter determiner may be configured to weight the compression gain parameter by a predetermined weight factor to obtain the compression ratio. The digital compressor further comprises an auxiliary signal generator for manipulating the input audio signal in dependence of the compression ratio to obtain a first auxiliary signal, and a combiner unit for combining the first auxiliary signal with the compression gain parameter to obtain a second auxiliary signal, and for combining the input audio signal with the second auxiliary signal to obtain the compressed audio signal.
Abstract:
The present disclosure relates to a method, apparatus, and system for encoding and decoding signals. The encoding method includes: converting a first-domain signal into a second-domain signal; performing Linear Prediction (LP) processing and Long-Term Prediction (LTP) processing on the second-domain signal; obtaining a long-term flag value according to a decision criterion; obtaining a second-domain predictive signal according to the LP processing result and the LTP processing result when the long-term flag value is a first value; obtaining a second-domain predictive signal according to the LP processing result when the long-term flag value is a second value; converting the second-domain predictive signal into a first-domain predictive signal, and calculating a first-domain predictive residual signal; and outputting a bit stream that includes the first-domain predictive residual signal.
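The flag-controlled prediction can be illustrated with a short sketch. The abstract does not state the decision criterion, how the LP and LTP results are combined, or what the two domains are, so everything here is an assumption made for illustration: the criterion is a gain threshold, the results are combined by addition, and the domain conversion is passed in as a function. All names are hypothetical.

```python
def choose_long_term_flag(ltp_gain, threshold=0.5):
    # Hypothetical decision criterion: use LTP only when its gain is high enough.
    return 1 if ltp_gain >= threshold else 0


def second_domain_prediction(lp_pred, ltp_pred, long_term_flag):
    # First value (1): combine LP and LTP results (assumed to be a sum).
    if long_term_flag == 1:
        return [a + b for a, b in zip(lp_pred, ltp_pred)]
    # Second value (0): use the LP result alone.
    return list(lp_pred)


def first_domain_residual(signal, lp_pred, ltp_pred, long_term_flag, to_first_domain):
    # Predict in the second domain, convert back, then subtract.
    pred2 = second_domain_prediction(lp_pred, ltp_pred, long_term_flag)
    pred1 = [to_first_domain(p) for p in pred2]
    return [x - p for x, p in zip(signal, pred1)]
```

The residual returned by `first_domain_residual` is what would be written to the bit stream, together with whatever side information a decoder needs to repeat the prediction.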
Abstract:
A coding method includes: obtaining a value of each sample of an input data frame; determining pulse samples and non-pulse samples in the input data frame according to the distribution of values of samples of the input data frame; encoding the determined pulse samples in the input data frame in a first coding mode to obtain a first data stream; encoding the determined non-pulse samples in the input data frame in a second coding mode to obtain a second data stream; and multiplexing the first data stream and the second data stream to obtain an output coded data stream of the input data frame. The technical solution under the present disclosure reduces the number of bits required for encoding the entire data frame and improves the compression efficiency for data frames with a wide dynamic range.
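A minimal sketch of the pulse/non-pulse split follows. The abstract does not say how pulses are detected or what the two coding modes are, so the sketch makes assumptions: a sample counts as a pulse when its magnitude exceeds `k` times the mean magnitude of the frame, pulses are represented as (position, value) pairs, and non-pulse samples are kept as a plain list. The names `split_frame`, `multiplex`, `decode_frame`, and `k` are hypothetical.

```python
def split_frame(frame, k=4.0):
    """Separate pulse samples from non-pulse samples (assumed threshold rule)."""
    if not frame:
        return [], []
    mean_abs = sum(abs(s) for s in frame) / len(frame)
    threshold = k * mean_abs
    # First stream: pulses stored with their positions.
    pulses = [(i, s) for i, s in enumerate(frame) if abs(s) > threshold]
    # Second stream: the remaining samples, in order.
    non_pulses = [s for s in frame if abs(s) <= threshold]
    return pulses, non_pulses


def multiplex(pulses, non_pulses):
    # Combine both streams into one output structure.
    return {"pulses": pulses, "rest": non_pulses}


def decode_frame(stream, frame_len):
    # Re-insert each pulse at its recorded position.
    pulse_at = dict(stream["pulses"])
    rest = iter(stream["rest"])
    return [pulse_at[i] if i in pulse_at else next(rest) for i in range(frame_len)]
```

Once the few large samples are taken out, the remaining samples span a much narrower range, which is where a second coding mode could save bits.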
Abstract:
A method, device, and system for signal encoding and decoding are disclosed. The method includes: encoding a core layer signal to obtain a core layer signal code; selecting an enhancement sample point that requires enhancement layer signal encoding according to the core layer signal code and the number of bits that can be used by an enhancement layer; obtaining an enhancement layer signal code of the enhancement sample point; and outputting a bit stream, where the bit stream includes the core layer signal code and the enhancement layer signal code. The enhancement sample point that requires enhancement layer signal encoding is selected according to the number of bits that can be used by the enhancement layer, and the enhancement layer signal of the selected enhancement sample point is encoded and decoded. As a result, the enhancement quality of the core layer can be improved even when insufficient bits are available for the enhancement layer.
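The selection step can be sketched as a ranking under a bit budget. The abstract does not give the selection rule, so this sketch assumes that each core layer code exposes an exponent, that a larger exponent means coarser core quantization and therefore higher priority, and that every enhancement point costs a fixed number of bits. The names `select_enhancement_points`, `core_exponents`, and `bits_per_point` are hypothetical.

```python
def select_enhancement_points(core_exponents, available_bits, bits_per_point=1):
    """Pick the sample indices that receive enhancement bits.

    Assumption: samples with a larger core-layer exponent are enhanced
    first; ties are broken by sample position.
    """
    max_points = available_bits // bits_per_point
    ranked = sorted(
        range(len(core_exponents)),
        key=lambda i: (-core_exponents[i], i),
    )
    # Return the chosen indices in playback order.
    return sorted(ranked[:max_points])
```

Because the rule uses only the core layer code and the bit budget, a decoder that knows both could repeat the same selection without extra side information.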
Abstract:
The disclosure provides a display device for head mounting, in particular a Maxwellian display device, with an enlarged eye-box. The display device includes a light source emitting light beams for forming an image and an optical system for guiding the light beams to a beam forming element. The beam forming element is configured to converge each of multiple sets of the light beams into a different point of convergence, for example on an exit pupil of the display device. Each point of convergence is a convergent viewing point for a user. The eye-box is thus enlarged.
Abstract:
The present invention provides a display system for displaying image data. The display system comprises a light source; a spatial light modulator including a plurality of controllable reflective elements; a light source controller, configured to control the light source to emit, in each of a plurality of time intervals, light with a different illumination power level, wherein the time intervals have the same length; and a spatial light modulator controller, configured to activate, based on received image data, the plurality of controllable reflective elements for at least one of the time intervals to direct the light from the light source for displaying the image data. Accordingly, the frame rate and bandwidth of the display system are increased significantly, making the system suitable for light field applications.
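One way to picture equal-length intervals with different illumination power is a bit-plane scheme, sketched below. This is an assumption, not something the abstract states: interval `k` is lit with relative power `2**k`, and each reflective element is switched on in exactly those intervals whose bit is set in the pixel value. The names `bit_planes` and `integrated_intensity` are hypothetical.

```python
def bit_planes(pixels, n_intervals=8):
    """For each interval k, list which elements are on (1) or off (0)."""
    return [[(p >> k) & 1 for p in pixels] for k in range(n_intervals)]


def integrated_intensity(planes):
    """Sum the light each element reflects over all intervals.

    Assumption: interval k uses relative illumination power 2**k.
    """
    n_pixels = len(planes[0]) if planes else 0
    totals = [0] * n_pixels
    for k, plane in enumerate(planes):
        power = 1 << k
        for i, on in enumerate(plane):
            totals[i] += power * on
    return totals
```

Under these assumptions, eight equal intervals reproduce 256 gray levels, whereas a scheme with constant power would need binary-weighted interval lengths for the same range.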