RussianPatents.com

Method of creating codebook and search therein during vector quantisation of data. RU patent 2504027.

Method of creating codebook and search therein during vector quantisation of data. RU patent 2504027.
IPC classes for russian patent Method of creating codebook and search therein during vector quantisation of data. RU patent 2504027. (RU 2504027):

G10L19/09 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, e.g. for compression or expansion, source-filter models or; psychoacoustic analysis
Another patents in same IPC classes:
Parametric stereophonic upmix apparatus, parametric stereophonic decoder, parametric stereophonic downmix apparatus, parametric stereophonic encoder Parametric stereophonic upmix apparatus, parametric stereophonic decoder, parametric stereophonic downmix apparatus, parametric stereophonic encoder / 2497204
Parametric stereophonic upmix apparatus (300, 400) generating a left signal (206) and a right signal (207) from a monophonic downmix signal (204) based on spatial parameters (205). Said parametric stereophonic upmix is characterised by that it comprises a means (310) for predicting a difference signal (311) comprising a difference between the left signal (206) and the right signal (207) based on the monophonic downmix signal (204) scaled with a prediction coefficient (321). Said prediction coefficient is derived from the spatial parameters (205). Said parametric stereophonic upmix apparatus (300, 400) further comprises an arithmetic means (330) for deriving the left signal (206) and the right signal (207) based on a sum and a difference of the monophonic downmix signal (204) and said difference signal (311).
Sound encoding device, sound decoding device, sound encoding and decoding device and teleconferencing system Sound encoding device, sound decoding device, sound encoding and decoding device and teleconferencing system / 2495503
Sound encoding device includes: a downmixing signal generating unit (410) which generates in the time domain a first downmixing signal which is one of a 1-channel sound signal and a 2-channel sound signal from an input multi-channel sound signal; a downmixing signal encoding unit (404) which encodes the first downmixing signal; a first conversion unit t-f (401), which converts the input multi-channel sound signal into a frequency-domain multi-channel sound signal; and a spatial information computing unit (409), which generates spatial information for generating a multi-channel sound signal from the downmixing signal.
Lossless multi-channel audio codec using adaptive segmentation with random access point (rap) and multiple prediction parameter set (mpps) capability Lossless multi-channel audio codec using adaptive segmentation with random access point (rap) and multiple prediction parameter set (mpps) capability / 2495502
Invention relates to lossless multi-channel audio codec which uses adaptive segmentation with random access point (RAP) and multiple prediction parameter set (MPPS) capability. The lossless audio codec encodes/decodes a lossless variable bit rate (VBR) bit stream with random access point (RAP) capability to initiate lossless decoding at a specified segment within a frame and/or multiple prediction parameter set (MPPS) capability partitioned to mitigate transient effects. This is accomplished with an adaptive segmentation technique that fixes segment start points based on constraints imposed by the existence of a desired RAP and/or detected transient in the frame and selects a optimum segment duration in each frame to reduce encoded frame payload subject to an encoded segment payload constraint. RAP and MPPS are particularly applicable to improve overall performance for longer frame durations.
Apparatus, method and computer programme for providing set of spatial indicators based on microphone signal and apparatus for providing double-channel audio signal and set of spatial indicators Apparatus, method and computer programme for providing set of spatial indicators based on microphone signal and apparatus for providing double-channel audio signal and set of spatial indicators / 2493617
Apparatus for providing a set of spatial indicators associated with an upmix audio signal, having more than two channels, based on a double-channel microphone signal, comprises a signal analyser and an additional spatial information generator. The signal analyser is configured to receive component energy information and direction information based on the double-channel microphone signal such that the component energy information describes an estimate of energy of the direct sound component of the double-channel microphone signal, and such that the direction information describes an estimate of the direction from which the direct sound component of the double-channel microphone signal arrives. The additional spatial information generator is configured to compare component energy information and direction information with spatial indicator information which describes the set of spatial indicators associated with an upmix audio signal, having more than two channels.
Efficient use of stepwise transmitted information in audio encoding and decoding Efficient use of stepwise transmitted information in audio encoding and decoding / 2491657
Audio signal can be derived using correlation information indicating a correlation between first and second input audio signals, when a signal characterisation information, indicating at least a first or a second different characteristic of the input audio signal, is additionally considered. Phase information indicating a phase relation between the first and the second input audio signals is derived, when the input audio signals have the first characteristic. The phase information and a correlation measure are included into the encoded representation when the input audio signals have the first characteristic, and only the correlation information is included into the encoded representation when the input audio signals have the second characteristic.
Audio signal decoder and method of controlling audio signal decoder balance Audio signal decoder and method of controlling audio signal decoder balance / 2491656
To support stereo perception, localisation vibration of a decoded signal is suppressed. A selection unit (220) selects balance parameters if balance parameters are input data from a gain decoding unit (210), or selects input data of balance parameters from a gain computation unit (223) if there are no input data of the balance parameter from the gain decoding unit (210), and outputs the selected balance parameters to a multiplier unit (221). The multiplier unit (221) multiplies gain input data from the selection unit (220) by decoded input data of a monophonic signal from a monophonic decoding unit (202), to process balance control.
Coding device, decoding device and method Coding device, decoding device and method / 2488897
Speech coding device comprises a coding section of the first level, which performs processing by coding in respect to the input speech signal; a decoding section of the first level, which performs processing by decoding using coded data of the first level; a section for calculation of erroneous transformation coefficients of the first level, which converts the erroneous signal of the first level into a frequency area for calculation of erroneous coefficients of transformation of the first level; and a coding section of the second level, which performs processing by coding in respect to erroneous transformation coefficients of the first level, besides, the coding section of the second level comprises: a setting section; a selection section; a connected strip configuration section, which connects a strip selected from the low-frequency strip, and a fixed strip from the high-frequency strip, in order to configure the connected strip; and a section of coded data generation, which codes erroneous transformation coefficients of the first level, included into the connected strip, in order to generate coded data of the second level.
Mixing of incoming information flows and generation of outgoing information flow Mixing of incoming information flows and generation of outgoing information flow / 2488896
Device (500) is used for mixing of multiple incoming information flows (510), in which each of incoming information flows (510) comprises a frame (540) of audio data in a spectral area, the frame (540) of the incoming information flow (510), comprising spectral information for multiple spectral components. The device comprises a unit of data processing (520), arranged to compare frames (540) of multiple incoming information flows (510). The data processing unit (520) is also arranged to determine, on the basis of comparison, for a spectral component of an outgoing frame (550) of an outgoing information flow (530) only one incoming information flow (510) from multiple incoming information flows (510). The data processing unit (520) is further arranged to generate an outgoing information flow (530) by means of copying of at least a part of information of the appropriate spectral component of the frame of the certain information flow (510), in order to describe the spectral component of the outgoing frame (550) of the outgoing information flow (530).
Quantiser, encoder and methods thereof Quantiser, encoder and methods thereof / 2486609
Quantising apparatus that quantises a value related to transformation coefficients when performing a principal component analysis transformation of a first vector signal and a second vector signal, the apparatus comprising: a power and correlation calculating section that calculates power of the first vector signal, power of the second vector signal and a correlation value between the first vector signal and the second vector signal; an intermediate value calculating section that calculates, as an intermediate value, a result of performing a difference computation using the power of the first vector signal and the power of the second vector signal; a codebook that holds a plurality of pairs of a first coefficient and a second coefficient, which are related to the transformation coefficients and numbered; and a quantising section that calculates, as a reference value, an addition result of a first multiplication result acquired by multiplying the first coefficient by the correlation value and a second multiplication value acquired by multiplying the second coefficient by the intermediate value, and, based on magnitude of the reference value, selects the number as a code.
Quantiser, encoder and methods thereof Quantiser, encoder and methods thereof / 2486609
Quantising apparatus that quantises a value related to transformation coefficients when performing a principal component analysis transformation of a first vector signal and a second vector signal, the apparatus comprising: a power and correlation calculating section that calculates power of the first vector signal, power of the second vector signal and a correlation value between the first vector signal and the second vector signal; an intermediate value calculating section that calculates, as an intermediate value, a result of performing a difference computation using the power of the first vector signal and the power of the second vector signal; a codebook that holds a plurality of pairs of a first coefficient and a second coefficient, which are related to the transformation coefficients and numbered; and a quantising section that calculates, as a reference value, an addition result of a first multiplication result acquired by multiplying the first coefficient by the correlation value and a second multiplication value acquired by multiplying the second coefficient by the intermediate value, and, based on magnitude of the reference value, selects the number as a code.
Method for compaction and decompaction of speech messages Method for compaction and decompaction of speech messages / 2244963
Method comprises steps of preliminarily, at reception and transmission forming R matrices of allowed vectors, each matrix has dimension m2 x m1 of unit and zero elements; then from unidimensional analog speech signal forming initial matrix of N x N elements; converting received matrix to digital one; forming rectangular matrices with dimensions N x m and m x N being digital representation of initial matrix from elements of lines of permitted vectors; transmitting elements of those rectangular matrices through digital communication circuit; correcting errors at transmission side on base of testing matching of element groups of received rectangular matrices to line elements of preliminarily formed matrices of permitted vectors; then performing inverse operations for decompacting speech messages. Method is especially suitable for telephone calls by means of digital communication systems at rate 6 - 16 k bit/s.
Method and system for abolishing quantizer saturation during communication with data transfer in speech signal band Method and system for abolishing quantizer saturation during communication with data transfer in speech signal band / 2249860
Method and system for decreasing prediction error an averaging device for calculation of transfer coefficient is used, pulse detector, signals classifier, decision-taking means and transfer coefficient compensation device, wherein determining of compensated transfer coefficient of quantizer count is performed in process of coding/decoding of transferred data in speech signal band by use of vector linear non-adaptive predicting-type algorithm.
Method and device for reproducing speech signals and method for transferring said signals Method and device for reproducing speech signals and method for transferring said signals / 2255380
During encoding speech signals are separated on frames and separated signals are encoded on frame basis for output of encoding parameters like parameters of linear spectral couple, tone height, vocalized/non-vocalized signals or spectral amplitude. During calculation of altered parameters of encoding, encoding parameters are interpolated for calculation of altered encoding parameters, connected to temporal periods based on frames. During decoding harmonic waves and noise are synthesized on basis of altered encoding parameters and synthesized speech signals are selected.
Method for simulating auditory patient perception of acoustic signal after cochlear implantation Method for simulating auditory patient perception of acoustic signal after cochlear implantation / 2277375
Method involves applying analog-to-digital input signal transformation expressed as word, dividing transformed signal spectrum into odd and even frequency bands, summing odd bands, carrying out digital-to-analog transformation of resulting summed signal and training its perception by preliminarily getting familiar with the word shown for listening and following testing. Spectrum division is based on tonotopic frequency distribution law over cochlea axis. Frequency bands having odd numbers are arranged in equal distances along basilar membrane length in agreement with normal tonotopic frequency distribution law over cochlea axis. At least three odd spectrum bands are summed up. Training is carried out by multiple repetition of the word shown for listening until unambiguous correlation to the known word meaning given in preliminary acquaintance takes place. The same words are to be shown in testing and training.
Method and device for encoding an audio signal with usage of harmonics extraction Method and device for encoding an audio signal with usage of harmonics extraction / 2289858
In accordance to audio signal encoding method, harmonic components are extracted with usage of information resulting from fast Fourier transformation, which is received with usage of psycho-acoustic model 2 to received audio data of impulse-code modulation. Then, extracted harmonic components are removed from received audio data of impulse-code modulation. After that audio data, from which extracted harmonic components have been removed, are subjected to modified discontinuous cosine transformation and quantization.
Method and device for transmission of speech activity in distribution system of voice recognition Method and device for transmission of speech activity in distribution system of voice recognition / 2291499
Distributed system of voice recognition has voice recognition (VR) local mechanism in user unit and VR server mechanism in server. VR local mechanism has module for selection of features (FS), which selects features from voice signals. Voice activity detector (VAD) module detects voice activity invoice signal. Indication of voice activity is transmitted before features from user unit to server.
Alternating frame length encoding optimized for precision Alternating frame length encoding optimized for precision / 2305870
In accordance to the invention, polyphonic signals are used for creation of main signal, typically, a signal and a collateral signal. A row of encoding schemes of collateral signal (xside) is implemented, each encoding scheme is characterized by a set of sub-frames of varying length, while total length of sub-frames corresponds to encoding frame length of encoding scheme. Encoding scheme for collateral signal (xside) is selected on basis of current content of polyphonic signals, and collateral remainder signal is created as a difference between collateral signal and main signal, scaled with usage of balancing coefficient, which is selected for minimization of collateral remainder signal. Optimized collateral remainder signal and balancing coefficient are encoded and implemented as encoding parameters, representing the collateral signal.
Method and device for efficient masking of deleted shots in speech coders on basis of linear prediction Method and device for efficient masking of deleted shots in speech coders on basis of linear prediction / 2325707
Invention is related to method and device for improvement of masking shots of coded sound signal, which were deleted in the process of transfer from coder to decoder and for acceleration of restoration in decoder, after non-deleted shots of coded sound signal were accepted. When parameters of masking/restoration are determined in coder, they are transferred to decoder, where masking of deleted shots takes place and deleted shots are restored in accordance with parameters of masking/restoration. Masking/restoration parameters may be selected from the group that consists of the following: parameter of signal classification, parameter of energy information and parameter of phase information. Determination of masking/restoration parameters contains classification of sequential shots of coded sound signal as unvoiced shot, unvoiced conversion, voiced conversion, voiced shot or access shot, moreover, this classification is defined on the basis of the least part of the following parameters: parameter of normalized correlation, parameter of spectral gradient, parameter of relationship signal-noise, parameter of parameter main tone stability relative to parameter of shot relative energy and parameter of transition through zero.
Device and method for processing signal containing sequence of discrete values Device and method for processing signal containing sequence of discrete values / 2325708
Said utility invention relates to signal processing in the form of successive values, e.g., audio signal samples or video signal samples, which, in particular, are especially suitable for lossless coding applications. During processing of a signal containing a sequence of discrete values, having the first frequency band with high energy signal and the second frequency band with low energy signal, the sequence of discrete values is manipulated initially (202) to obtain a sequence of manipulated values so that at least one of the manipulated values would be different from an integer. After that, the sequence of manipulated values is rounded (204) to obtain a sequence of rounded manipulated values. Rounding is performed in order to create a generated rounding error spectrum so that the rounding error with the spectrum created would have higher energy in the first frequency band as compared to the second frequency band.
Improved error concealment in frequency range Improved error concealment in frequency range / 2328775
Essence of the invention lies in the concealment of coding coefficients with errors, using correlation of the coding coefficient on time as well as on frequency. The concealment technique can be applied to any type of information, such as audio data, video data and image data, which are compressed into coding coefficients and transmitted under unfavourable conditions in the channel. Error concealment is achieved using redundancy of the initial data signal on time as well as on frequency. This provides the possibility of using redundancy between frames (inter-frame), as well as within the limits of frames (intra-frame). Use of coding coefficients from the same frame, consisting of the coding coefficient with the error, is sometimes called correlation of intra-frame coefficients, and it is a more special case than the general frequency correlation.

FIELD: information technology.

SUBSTANCE: method can be used to reduce consumption of computational resources and the required size of storage devices when creating codebooks and executing reference vector search algorithms therein, including when performing low-speed speech signal coding. The technical result of the disclosed method is reducing the required size of storage devices and reducing consumption of computing computational resources when performing search in a codebook during vector quantisation. The set task is achieved by constructing a special codebook structure based on neural networks using training algorithms with adjustment. Search is performed in form of step-by-step hierarchical vector quantisation. The resultant vector is a sum of code vectors found at each step. The disclosed method can be used to reduce consumption of computational resources and the required size of storage devices when executing reference vector search algorithms in a codebook.

EFFECT: reducing consumption of computational resources and the required size of storage devices.

7 dwg

 

The invention relates to the field of digital communication, namely, the methods of reducing the amount of data during their processing. The proposed method can be used to reduce the cost of computing resources and the required volume of storage devices when creating code books and realization of algorithms of search of support vectors in them, including in the implementation of low-rate coding of speech signals.

Vector quantization requires a sufficiently large number of operations with the formation of the code books and search vectors in them, which leads to high computational complexity of these procedures, especially with large volumes of the processed information. Reduction of the number of computational operations when looking for a vector in the code book is an urgent task.

Known methods of vector quantization for the procedure of coding of speech [Makhoul D., C., Gish, Vector quantization in coding of speech. // . - 1985. - .73. - №11. - P.19-61], [V. Ramasubramanian and Kuldip K. Paliwal «Fast Nearest-Neighbor Search Based on Voronoi Projections and Its Application to Vector Quantization Encoding» in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 1, no. 2, March 1999]. Also known way to create code book and depth first search, presented in the patent RU 2175454 C2, which proposes a tree structure with a preset number of levels. This method is characterized by a greater computational complexity, since it uses probabilistic methods for implementing the procedure of finding the support vector-centroid of each cell in the code book. In the patent RU 2391715 C2 describes the principle of multidimensional vector quantization using multi level code books. The disadvantage of this method are the requirements for the availability of relatively large amount of memory needed to store the tables coordinates support vector machine centroids.

The closest to the technical nature of the claimed method is a method referred to in the patent US 6161086, where the vector quantization of the data in a linear prediction speech signal signal conditioning excitation synthesizing filter use the combined code book composed of a fixed and adaptive code books, and adaptive correction code book produced with the help of the inverse procedure BIFT (Backward and Inverse Filtered Target) on three levels adaptive code book. Fixed code book contains a stochastic component of the excitation signal, which displays component. Adaptive code book is formed on the basis of the tonal component of excitation signal and shows the availability of long-term correlations due to structure of the speech signal and carries information on the number of readings of the corresponding period of the main tone of the analyzed frame speech.

Finding support vector machine is using a tree structure, find the nearest vector in the code book, which requires high computational complexity.

The disadvantage of the prototype is the large amount of memory needed to store the table of coordinates of support vectors, as well as the high computational complexity of this operation.

The objective of the invention is to create a code book and search in it when vector quantization, allowing to reduce the amount of storage devices and the computational complexity of implementing the procedure of search in the code book.

This task is solved by the fact that when creating the code book and search in it when vector quantization in the combined code book for fixed codebook used self-learning neural network Kohonen, also known as card self - SOM (self-organizing ball), and for adaptive code book apply neural network with quantization - LVQ (learning vector quantization). A procedure of finding implement fully hierarchical vector quantization, which provides small loss of accuracy while increasing the speed of calculations.

Consider the claimed method more. Neural network SOM is intended for transformation of received vectors signals with arbitrary dimension, in one - or two-dimensional discrete map. In this case, such a transformation is performed adaptively in topologically consistent manner. Figure 1 shows the schematic diagram of the two-dimensional lattice of neurons used as a discrete card for the functioning of fixed codebook. All the neurons of the lattice connected to all the nodes in the input layer. This network has the structure of direct distribution with one computing layer composed of neurons that are arranged in columns and rows.

The essence of self-learning neural network SOM consists in the formation of the card (space) coordinates of vectors with the division into cell (Voronoi polygons)with support vector-centroid of each polygon. The essential characteristics of the algorithm of self-learning, necessary for the formation of fixed codebook, are:

continuous input image space activation (stochastic vectors excitation signals), which is generated according to some probability distribution;

- the topology of the neural networks in the form of a grid consisting of neurons, defining a discrete output space;

- the time-dependent function surroundings h j,i(x) (n), which determines the radius of the surroundings neuron-winner of i(x);

- the speed parameter training η(n), for which to set the initial value η 0 and which is gradually decreasing in time n, but never reaches zero.

It is experimentally established that the formation of fixed code book value η(n)equal to 0.005, it is necessary for a proper statistical accuracy during convergence. However, to create a codebook was used signal balance of long-term linear prediction speech from the output filter synthesis. Description of the linear prediction speech fairly presented in detail in (SERGEY Bykov, V.I. Zhuravlev, Shalimov I.A. Digital telephony: Textbook. manual for high schools. - M: Radio and communication, 2003. - Pp. 102-105).

The sequence of steps prior training in fixed code book next.

1) Initialization. For source vectors of synaptic weights of the neural network w j (0) take a random values from the set of input vectors

. As a condition of correct training on vectors of excitation filter synthesizing of speech signals, you must difference vectors for different values of j=1,2,...,l, where l is the total number of neurons in the lattice.

2) Subsampling. Choose the vector x of the input space with a certain probability. This vector is an institution, which is applied to the lattice of neurons. The dimension of the vector x: is m.

3) Search of maximum likelihood. To find the most suitable (who) neuron i(x) at step n, using the criterion of minimum Euclidean distance:

4) Correction. Correction vectors of synaptic weights of all neurons

w j (n+1)=w j (n)+η(n)h j,i(x) (n)(x-w j (n)),

where η(n) - the speed parameter learning; (h j,i(x) (n) is the function of the neighborhood with the center in conquered the neuron i(x). Both of these parameters dynamically change during training to obtain the best result.

5) Continued. Return to step 2 and calculation to achieve a given number of iterations.

Upon completion of the process of convergence of the self-organizing map (SOM) displays important statistical characteristics of the space of stochastic vectors excitation signals. Because the algorithm SOM refers to the algorithms of training of neural networks «without a teacher», formed by the space of Voronoi cells is close in terms of placement support vector machine centroids in the N-dimensional coordinate system. The approximation is determined by the vectors of synaptic weights of neurons on the map features.

As a mechanism for fine-tuning is necessary to make quantization vectors training. For the quantization of the vector-centroid is used method of training «teacher»that uses the class information for small-displacement support vector and, accordingly, the Voronoi cell borders and, consequently, to improve the quality of regions of the decision of the classifier. If the label class input vector x and vector-centroid (vector Voronoi) w consistent, then the latter is shifted towards the first. When vector-centroid moves in the direction opposite to the vector x. Briefly the process of quantization is described as follows:

1) in the case of the maximum proximity of vector Voronoi w C to input vector x i w C (n+1)=w (C (n)+a n [x i w C (n)], where 0<α n <1;

2) if there is a mismatch w C (n+1)=w (C (n)-a n [x i w C (n)];

3) other vectors the Voronoi not changed.

Constant training α n for the formation of fixed codebook choose monotonously decreasing with an initial value of 0.05. As a result of the adaptation procedure after a few passes over the data input the coordinates of support vector Voronoi stops scrolling, and consequently, and the creation of a space Voronoi polygons for fixed codebook stochastic component signals excitation.

For adaptive code book offered to use the neural network LVQ. Figure 2 presents the structure of the neural network that functions as a code book, which stores information about the space of Voronoi cells with vectors tone component of excitation signal. In the case of speech signal processing network LVQ is a cascade connection layer SOM and network. Self-organizing layer captures significant signs of process (localizes them based on the input), after which they attributed to the input vector in layer. Because of good localization signs process tone excitation of the first layer of the network in most applications, speech processing suffice application perceptron, containing only one layer of neurons (often linear).

LVQ network is taught on the basis of the set of pairs of input/output, composed of elements of the training sequence {R,T}: {p-1 , t 1 },{p-2 , t 2 },...,{P Q t Q }. Each target vector has the only element equal to 1, and the rest are equal to 0. For the network training determine the vector of input p, such that in a competing layer configured the elements of the matrix of weights W 1.1 . Weights neuron i* most similar to the input vector p and the neuron i* wins the competition. Then competing activation function returns 1 as the item i* vector and 1 , and all other elements and 1 is equal to 0. In the second, layer work W 2.1 *a 1 highlights some column of the matrix W 2.1 and an associated class of k*. Thus, the network connects the input vector p class k*. This destination can be either correct or wrong. Therefore, in the process of training you need to edit a line i* matrix W 1.1 thus, to bring it closer to the vector p, assignment of the right, and remove from the vector p, if the destination is incorrect. Based on this, a rule setting the following parameters:

As a mechanism for fine-tuning produce quantization vectors learning similarly correction fixed codebook. It is experimentally established, that for the formation of adaptive code book monotonically decreasing constant training α n is equal to 0.07.

Flowchart algorithms of fixed and adaptive code books are presented in figure 3 and figure 4.

When performing a search procedure is used multi-stage hierarchical vector quantization, which accelerates the speed of the search unlike the tree search support vector-centroid. Multilevel hierarchical vector quantization shares a common search operation on a lot of sub-operations, each of which requires a small amount of calculations for fixed and adaptive code book. In each sub-operation process the remainder of the vector, formed in the previous . Input vector quantizing Li - tier vector quantizer, the remainder (error) quantization fed to the input of the second Lj - tier vector quantizer. The process can be repeated any number of sub-stages.

The final quantized vector value for both code books are in the form of the sum of output vectors of intermediate and final .

The analysis of the equipment has allowed to establish that the analogues, characterized by the totality of features identical to all features of the claimed technical solutions, no, that indicates compliance of the claimed device with the condition of patentability "novelty".

Search results known solutions in this and related areas of technology for signs that coincide with the distinctive features of the prototype of the declared object, showed that they are obvious from the prior art. Of the level of technology also found no known impact involves the essential features of the claimed invention transformations to achieve the specified result. Therefore, the claimed invention meets the condition of patentability of "inventive step".

Industrial applicability of the invention by the fact that it can be done with the help of modern element base, with the achievement of specified in the invention of destination.

For fixed codebook it consists of the creation of initial data vectors training stochastic component signals excitation 1, the output of the unit 1 is connected with the entrance of the block of preliminary training of a neural network SOM 2, the output of the unit 2 connected to the input of correction block support vector machine centroids 3, block 3, connected to the input of the block storage indexed in the vector table 4 candidates.

Functional scheme of the procedure for formation of fixed codebook and indexed tables vectors of candidates is shown in figure 5.

For adaptive code book schema contains the set of initial data vectors training tone component signals excitation 5, the output of the unit 5 is connected to the entrance of the block of the first level of adaptation of the neural network LVQ 6, block 6 is connected to the entrance of the block of the second level of adaptation of the neural network LVQ 7, exit block 7 connected to the input of correction block support vector machine centroids 8, the output of the unit 8 is connected with the entrance of the block storage indexed in the vector table 9 candidates.

Functional diagram of the implementation of the procedure of the formation of adaptive code book and indexed tables vectors of candidates is shown in Fig.6.

Box 1 contains information about the source data for neural network training SOM is the vectors excitation for synthesizers speech signals containing stochastic (noise) components. Data vectors to the input unit 2 of the preliminary training of a neural network SOM fixed code book, in block 2 of the procedure is performed, the settings of weight coefficients specified neural network, thereby creating space Voronoi cells with vectors-centroids, with an output of unit 2 information to the input of the unit 3, where there is a correction of support vectors according to description of the mechanism of fine-tuning (P.5), with an output of unit 3 unit 4 are submitted data about the coordinates of the Voronoi cells and vectors candidates for their storage in the form of a table which represents the fixed code book.

Box 5 contains information about the source data for neural network training LVQ is vectors excitation for synthesizers speech signals containing tone () components. For teaching two-level neural network LVQ adaptive code book vectors excitation served with an output of unit 5 unit 6, where there is a setting weights of the first layer of the network LVQ, according to the algorithm of learning without a teacher», similar to the algorithm of training SOM. From the output of the block 6 block 7 of the second layer of a neural network LVQ served previously created the coordinates of the cell and vectors Voronoi, which is the completion of the adaptation procedure using a learning algorithm with a «teacher»as the second level of adaptation is layer neural network LVQ. With an output of unit 7 information to the input of the unit 8, where there is a correction of support vectors according to the mechanism of fine-tuning (P.5), with an output of unit 8 in block 9 move the data on the coordinates of Voronoi cells and vectors candidates for their storage in the form of a table which represents the adaptive code book.

Block diagram of the algorithm for the multi-stage hierarchical vector quantization performing the procedure for search in fixed and adaptive code books, presents the Fig.7.

Application of the proposed method will substantially reduce required for the implementation of the volume of the storage devices on 25-30%and the realization process of the multi-stage hierarchical vector quantization reduce computational costs on 20-23% in comparison with the known solutions in this sphere.

Way to create code book and search in it when vector quantization of the data, according to which to obtain the excitation signal synthesizing the filter at a linear prediction speech signal uses a code book composed of a fixed and adaptive code books

wherein the to create a fixed codebook form the original data vectors training stochastic component signals excitation, train the neural network Kohonen SOM (self-organizing map), for self-learning algorithm which determines the continuous input image space activation of stochastic vectors excitation signals generated according to some probability distribution, then form the topology of the neural networks in the form of a grid consisting of neurons and defining the discrete output space, then calculate the time-dependent function surroundings h

j,i(x)

(n) to find the radius of the surroundings neuron-winner and gradually decreasing in time, but never reaches zero parameter speed training η(n) with initial value η 0 , then adjust the support vectors centroid for storage index table of vectors of candidates, and to create an adaptive code book form the original data vectors training tone component signals excitation, produce a two-level adaptation neural network with quantization of the LVQ (learning vector quantization), correct the reference vectors centroid for storage index table vectors of candidates, the procedure of search in the code books implement with the use of multi-stage hierarchical vector quantization.

 

© 2013-2014 Russian business network RussianPatents.com - Special Russian commercial information project for world wide. Foreign filing in English.