Adaptive weighing of reference images during video signal coding

FIELD: physics.

SUBSTANCE: said utility invention relates to video encoders and, in particular, to the use of adaptive weighing of reference images in video encoders. A video encoder and a method of video signal data processing for an image block and the specific reference image index for predicting this image block are proposed, which use the adaptive weighing of reference images to increase the video signal compression, the encoder having a reference image weighting factor assigning module for the assignment of the weighting factor corresponding to the said specific reference image index.

EFFECT: increased efficiency of reference image predicting.

8 cl, 7 dwg

 

Cross-reference to related applications

This application claims the priority of provisional patent application U.S. serial number 60/395 .843 (registry attorneyPU020340), entitled "Adaptive Weighting Of Reference Pictures In Video CODEC" and filed July 15, 2002, which is fully included in the materials of this application by reference. Additionally, this application claims the priority of provisional patent application U.S. serial number 60/395 .874 (registry attorney PU020339), entitled "Motion Estimation With Weighting Prediction" and also submitted July 15, 2002, which is fully included in the materials of this application by reference.

The technical field to which the invention relates

The present invention relates to video encoders and, in particular, to the use of adaptive weighting of reference pictures in video encoders.

Prior art

Basically, the video information is processed and transmitted as a stream of bits. Conventional encoders and decoders video compression (CODEC) achieve the highest effectiveness of their compression by generating a prediction reference image for the image to be encoded, and encoding the difference between the current image and the prediction. The stronger correlation of forecast the Oia with the current image, the smaller the number of bits required to compress this image, thereby increasing the efficiency of the process. Accordingly, it is preferred to form the best possible prediction reference image.

Many standards video compression, including standards Expert group on the moving image such as MPEG-1, MPEG-2 and MPEG-4, as a prediction for the current image used motion compensated version of the previous reference image and encodes only the difference between the current image and the prediction. When using one prediction picture (P-picture), the reference image is not scaled when forming a motion compensated prediction. When using bi-directional predicted pictures (B-pictures)are formed intermediate the predictions of two different images, and then two intermediate predictions are averaged using equal weights (1/2, 1/2) for each, to form a single averaged predictions. In the above-mentioned MPEG, from two reference images for B-pictures, one always corresponds to the forward direction and one direction back.

The INVENTION

These and other drawbacks and nedostatki prior art are eliminated by the system and method of adaptive weighting of reference pictures in video encoders and videodecoder.

Disclosed to the video encoder and corresponding methods for processing video data for a block of image and index-specific reference image for prediction of the block image, which use adaptive weighting of reference pictures to enhance video compression. The encoder contains the target module is the weighting factor of the reference image to assign a weighting factor referred to the index of a particular reference image.

The appropriate method of encoding video data for a block image includes receiving essentially uncompressed block image and assigning the weighting factor for the block image corresponding to the particular reference picture having a corresponding index. Calculates the motion vectors corresponding to the difference between the block image and a particular reference image. In accordance with the motion vectors are motion compensation specific reference image and the motion compensated reference image is changed by the assigned weighting factor for the formation of the weighted motion compensated reference image. Essentially uncompressed block image is compared with the weighted motion compensated reference from the reflection, and the signal indicating the difference between the essentially uncompressed block image and the weighted motion compensated reference picture, together with the corresponding index of a particular reference image, is encoded.

LIST of DRAWINGS

Adaptive weighting of reference pictures in video encoders and videodecoder in accordance with the principles of the present invention is illustrated by the following illustrative drawings:

Figure 1 - block diagram for the standard decoder.

Figure 2 - block diagram of a video decoder with adaptive dual prediction.

Figure 3 - block diagram for a decoder with weighted reference image in accordance with the principles of the present invention.

4 is a block diagram for a standard video encoder.

5 is a block diagram of a video encoder with weighted reference image in accordance with the principles of the present invention.

6 is a block diagram of the sequence of operations of the decoding process in accordance with the principles of the present invention.

7 is a block diagram of the sequence of operations of the encoding process in accordance with the principles of the present invention.

A DETAILED DESCRIPTION of the PREFERRED embodiments

The present invention represents a device is on and the method for estimating the motion vector and the adaptive assignment of the weighting coefficient of the reference image. In some sequences, including sequences with sinking, the current image or unit image, which must be encoded more strongly correlated with the reference image, the scaled weight ratio than directly with the reference image. The codecs without applying weighting coefficients to the reference images to encode sequences with a sinking very inefficient. When using the weight coefficients in the coding of the encoder must determine and weighting coefficients and motion vectors, but the best choice for each of them depends on another, with motion estimation, which is usually computationally most intensive part of the digital encoder video compression.

In the proposed compression standard video signal of the combined group for video signals (JVT), each P-picture may use multiple reference images to generate a prediction image, but each individual unit movement, or region 8×8 macroblock, using only one reference picture for prediction. In addition to the coding and transmission of motion vectors for each block traffic, or areas 8×8 is passed the index of the reference image, which indicates that the reference image used is changing. In the encoder and in the decoder stores a limited set of possible reference images and is passed the number of the reference images.

In the JVT standard for images with double prediction (also called B-pictures), for each block, motion, or areas 8×8, are formed two predictors, each of which may be separate from the reference image, and these two predictors are averaged to form a single averaged predictor. For block motion encoded with a double prediction, both the reference image may correspond to the forward direction, both may be backwards, or one forward and one direction back. Supports two list of available reference images that can be used for prediction. Two reference images have been identified as predictors of list 0 and list 1. For each reference image is encoded and transmitted index ref_idx_I0 and ref_idx_I1 for the reference image list 0 and list 1, respectively. Image with double prediction, or image, of the consolidated group video (JVT) provide the adaptive weighting of the two predictions, ie,

Pred = [(P0)(Pred0)] + [(P1)(Pred1)] + D

where P0 and P1 are weighting coefficients, Pred0 and Pred1 - prediction reference shows the I for list 0 and list 1, respectively, and D is the offset.

We proposed two ways to specify the weights. In the first weighting factors are determined by the directions, which are used for the reference images. In this way, if the index ref_idx_I0 does not exceed ref_idx_I1, used weights (1/2, 1/2), otherwise the coefficients of (2, -1).

In the second proposed method is passed an arbitrary number of weights for each section. Then passed the index weighting coefficient for each block traffic, or areas 8×8 macroblock, which uses bidirectional prediction. The decoder uses the received index the weighting factor for selecting the appropriate weighting factor, from the passed set for use when decoding block traffic, or areas 8×8. For example, if the section were transferred to the three weighting factor, they must comply with the indexes 0, 1 and 2 are weighting coefficients, respectively.

The following description only illustrates the principles of the invention. Accordingly, it will be clear that the experts in the art can devise various means, which, although not described in or expressly granted herein, embody the principles of the invention and included in its scope. In addition, all possible options and conditional the language, presented here is largely designed specifically for training purposes, in order to facilitate the understanding of the principles of the invention and the concepts contributed by the inventor in the development of technology, and should not be considered as possible options and conditions that limit. In addition, the descriptions of the guidelines, aspects and embodiments of the invention, and also their particular possible options are designed to cover their structural and functional equivalents. Additionally, it is understood that such equivalents include cash equivalents, known at the present time, and equivalents, which will be developed in the future, i.e. any developed elements that perform the same function, regardless of structure.

For example, specialists in the art should be understood that the block diagram here represent conceptual views of illustrative circuits embodying the principles of the invention. Similarly, it is clear that any flowchart of sequences of operations, schematics, algorithms, charts, transitions between States, pseudo-code, etc. represent various processes which, essentially, can be represented in a machine-readable medium and executed by a computer or processor, regardless of whether shown whether such computers is or processor explicitly.

The functions of the various elements shown in the drawings, may be provided through the use of specialized hardware and hardware that can run the software, together with the corresponding software. While providing a processor, the functions may be provided by one specialized processor, a single shared processor or multiple processors, some of which may be used together. Moreover, explicit use of the term "processor" or "controller" should not be construed as a reference solely on hardware that can run the software, and may implicitly include, but not restrictively, hardware digital signal processor (DSP, DSP), a persistent storage device (RAM, ROM) for storing software, random access memory (RAM, RAM) and nonvolatile memory. May also include other hardware, standard and/or customized. Similarly, any of the switches shown in the drawings, only conceptual. Their function can be performed through the operation of the software logical tools, specialized logical means of interaction programme the management and specialized logic means, or even manually, the particular technique can be selected by the implementing party, as more specifically understood from the context.

In the claims, any element that acts as a tool to perform a specific function, is intended to cover any means of performing this function, including, for example, a) a combination of circuit elements that perform a specified function or b) software in any form to perform functions including, therefore, firmware, microcode, and so on, combined with appropriate schemes to run this software. The invention, as defined by the mentioned claims, is actually typical that the functionality provided by the various means described, are combined and joined together, as required by the claims. Accordingly, the applicant considers any tools that can provide the aforementioned functionality in the form of equivalent presented here functionality.

According to figure 1 standard video decoder is indicated in General by the reference position 100. The video decoder 100 includes a decoder 110 variable length fields (VLD), coupled with the possibility of exchanging signals with the inverse ofthe quantizer 120. The inverse quantizer 120 is coupled to exchange signals with the inverse of the Converter 130. Inverter 130 is connected with the possibility of exchanging signals with the first input of the adder or summing connection 140, and the output of the summing connection 140 provides an output of decoder 100. The output of the summing connection 140 is connected with the possibility of exchanging signals with the storage 150 reference images. Store 150 of the reference images is connected with the possibility of exchanging signals with the compensator 160 movement, which is connected with the possibility of exchange of signals with the second input of the summing connection 140.

According to figure 2, the video decoder with adaptive dual prediction is indicated in General by the reference position 200. The video decoder 200 includes VLD 210, coupled with the possibility of exchanging signals with the inverse quantizer 220. The inverse quantizer 220 is connected with the possibility of exchanging signals with the inverse Converter 230. Inverter 230 is connected with the possibility of exchanging signals with the first input of the summing connection 240, and the output of the summing connection 240 provides the output of the decoder 200. The output of the summing connection 240 is connected with the possibility of exchanging signals with the storage 250 of the reference images. Store 250 of the reference images is connected with the possibility of exchanging signals with the compensator 260 motion, which is connected with the possibility of exchanging the signals with the first input of the multiplier 270.

VLD 210 is additionally connected to exchange signals with the tool 280 search weighting factor reference image to provide a means 280 search index coefficient of an adaptive dual prediction (ADP, ABP). First output means 280 search is designed to provide the weighting factor and is connected with the possibility of exchange of signals with the second input of the multiplier 270. The output of the multiplier 270 is connected with the possibility of exchanging signals with the first input of the summing connection 290. Second output means 280 of the search is intended to provide the offset and connected to exchange signals with the second input of the summing connection 290. The output of the summing connection 290 is connected with the possibility of exchange of signals with the second input of the summing connection 240.

According to figure 3, the video decoder with weighted reference image designated in General by the reference position 300. The video decoder 300 includes VLD 310, coupled with the possibility of exchanging signals with the inverse quantizer 320. The inverse quantizer 320 is connected with the possibility of exchanging signals with the inverse Converter 330. Inverter 330 is connected with the possibility of exchanging signals with the first input of the summing connection 340, and the output of the summing connection 340 provides the output of the decoder 300. Output summerhouse the connection 340 is connected with the possibility of exchanging signals with the vault 350 reference images. Store 350 reference images are connected with the possibility of exchanging signals with the compensator 360 movement, which is connected with the possibility of exchanging signals with the first input of the multiplier 370.

VLD 310 is additionally connected to exchange signals with the tool 380 search weighting factor reference image to provide a means 380 search index of a reference image. First output means 380 search is designed to provide the weighting factor and is connected with the possibility of exchange of signals with the second input of the multiplier 370. The output of multiplier 370 is connected with the possibility of exchanging signals with the first input of the summing connection 390. Second output means 380 search is designed to provide offset and connected to exchange signals with the second input of the summing connection 390. The output of the summing connection 390 is connected with the possibility of exchange of signals with the second input of the summing connection 340.

According to figure 4 standard video encoder is indicated in General by the reference position 400. The entrance to the encoder 400 is connected with the possibility of exchanging signals with reinvestiruet input summing connection 410. The output of the summing connection 410 is connected with the possibility of exchanging signals with the transducer 420the blocks. The inverter 420 is connected with the possibility of exchanging signals with kV is Novatel 430. The output of the quantizer 430 is connected with the possibility of exchanging signals with the encoder variable length fields (VLC) 440, where the output of the VLC 440 is the output of the encoder 400, accessible from the outside.

The output of the quantizer 430 is additionally connected to exchange signals with the inverse quantizer 450. The inverse quantizer 450 is connected with the possibility of exchanging signals with the inverse Converter 460 blocks, which, in turn, is connected with the possibility of exchanging signals with the storage 470 reference images. The first output of the storage 470 reference images is connected with the possibility of exchanging signals with the first input module 480 of the motion estimation. The entrance to the encoder 400 is additionally connected with the possibility of exchange of signals with the second input module 480 of the motion estimation. The output module 480 motion estimation is connected with the possibility of exchanging signals with the first input of the compensator 490 motion. The second output of the storage 470 reference images is connected with the possibility of exchange of signals with the second input of the compensator 490 motion. The output of the compensator 490 motion is connected with the possibility of exchanging signals with the inverting input of the summing connection 410.

According to figure 5 video encoder with weighted reference image designated in General by the reference position 500. The entrance to the encoder 500 is connected with the possibility of exchanging signals with reinvestiruet input summing with the organisations 510. The output of the summing connection 510 is connected with the possibility of exchanging signals with the transducer 520 blocks. Converter 520 is connected with the possibility of exchanging signals with the quantizer 530. The output of the quantizer 530 is connected with the possibility of exchanging signals with VLC 540, where the output of the VLC 440 is the output of the encoder 500, accessible from the outside.

The output of the quantizer 530 additionally connected to exchange signals with the inverse quantizer 550. The inverse quantizer 550 is connected with the possibility of exchanging signals with the inverse Converter 560 units, which, in turn, is connected with the possibility of exchanging signals with the storage 570 reference images. The first output of the storage 570 reference images is connected with the possibility of exchanging signals with the first module input572destinationthe weight ratio of the reference image. The entrance to the encoder 500 is additionally connected with the possibility of exchange of signals with the second input module 572 assigning the weighting factor of the reference image. The output module 572 assignment weighting factor reference image indicating a weighting factor that is connected with the possibility of exchanging signals with the first input module 580 of the motion estimation. The second output of the storage 570 reference images is connected with the possibility of exchange of signals with the second input module 580 of the motion estimation.

In the od in the encoder 500 is additionally connected to exchange signals with the third input module 580 of the motion estimation. The output module 580 motion estimation, indicating the motion vectors, is connected with the possibility of exchanging signals with the first input of the compensator 590 movement. The third outlet store 570 reference images is connected with the possibility of exchange of signals with the second input of the compensator 590 movement. The output of the compensator 590 movement indicating the reference image, the compensated movement, connected with the possibility of exchanging signals with the first input of the multiplier 592. The output module 572 assignment weighting factor reference image indicating a weighting factor that is connected with the possibility of exchange of signals with the second input of the multiplier 592. The output of multiplier 592 is connected with the possibility of exchanging signals with the inverting input of the summing connection 510.

According to Fig.6 illustrative process for decoding video data for a block of the image is indicated in General by the reference position 600. The process includes a step 610 the beginning, which transfer control to step 612 input. At step 612 accept input data of the compressed block image and transfer control to step 614 input. At step 614 input accept at least one index of the reference image data for a block of the image, with each index of the reference image corresponding to a particular reference image. At step 614 input passed to control the functional group stage 616, which determines a weighting factor corresponding to each of the received index of the reference image, and transmit control optional functional stage 617. The optional functional stage 617 determine the offset corresponding to each of the received index of the reference image, and transmit the control functional step 618. At functional step 618 extract the reference image corresponding to each of the received index of the reference image, and transmit the control functional stage 620. At functional step 620, in turn, perform motion compensation of the extracted reference image and transfer control to the functional stage 622. On the functional stage 622 multiply compensated by movement of the reference image to the corresponding weighting factor and transmit control optional functional step 623. The optional functional stage 623 summarize the motion compensated reference image with the corresponding offset and transmit control functional step 624. At functional step 624, in turn, form a weighted motion compensated reference image and transfer control to step 626 finish.

According to Fig.7 illustrative process of encoding videos from the drove for a block of the image is indicated in General by the reference position 700. The process includes the step 710 the beginning, which transfer control to step 712 input. At step 712 input accept, in essence, the data is uncompressed block image and transfer control to the functional stage 714. At functional step 714 assign a weighting factor for the block image corresponding to the particular reference picture having a corresponding index. At functional step 714 transmit control optional functional step 715. The optional functional stage 715 designate the offset for the block image corresponding to the particular reference picture having a corresponding index. The optional functional stage 715 delegate the management of a functional step 716, which calculates the motion vectors corresponding to the difference between the block image and a particular reference picture, and passes the control to the functional stage 718. At functional step 718, in accordance with the motion vectors, perform motion compensation of a particular reference image and transfer control to the functional stage 720. At functional step 720, in turn, multiply motion compensated reference picture by the assigned weighting factor for the formation of the weighted motion compensated reference image is supply and relay control optional functional step 721. The optional functional step 721, in turn, summarize the motion compensated reference image with an assigned shift for forming a weighted motion compensated reference image and transfer control to functional step 722. On the functional stage 722 subtract the weighted motion compensated reference picture from, essentially, the uncompressed block image and transfer control to the functional stage 724. At functional step 724, in turn, encode the signal with the difference between essentially uncompressed block image and the weighted motion compensated reference image together with the corresponding index of a particular reference image and transfer control to step 726 is complete.

In presents a possible embodiment, for each encoded image or partition, each valid reference image, in which relation can be encoded blocks of the current image associated weighting factor. When encoding or decoding each block in the current image to generate the weighted predictor, to the reference prediction applied weighting factor(s) and offset(I), the index of the reference image. All the blocks in time is barely, which are encoded relative to the same reference image, is applied to the prediction reference image in the same weight ratio.

Whether to use adaptive weighting for encoding image or not can be specified in the set image settings or set parameterssequence or in the image header or section. For each partition or image using adaptive weighting may be given a weighting factor for each of the permissible reference image, which can be used for encoding of this section or image. The number ofthe permissible reference image is transmitted in the header section. For example, if the encoding of the current section can be used three reference image, is transferred to the three weighting coefficients, and they are mapped to the reference image with the same index.

If not passed a single weighting factor, used weight set by default. In one embodiment of the present invention, when not passed a single weighting factor, used weight default (1/2, 1/2). The weighting coefficients can be transmitted using the codes of fixed or variable length.

Unlike the standard is s systems, each weighting factor, which is given with each section, unit, or an image corresponding to a particular index of a reference image. Previously, any set of weights that is passed with each section or image was not mapped to any specific reference image. Instead, for each block, motion, or areas 8×8, passed the index of the adaptive weighting of double predictions, to select which of the weighting coefficients of the transmitted set should be used for that specific block traffic, or areas 8×8.

In the present embodiment, the index weighting coefficient for each block traffic, or areas 8×8, is not passed explicitly. Instead, it uses a weighting factor associated transmitted index of the reference image. This significantly reduces the amount of unproductive service information in the transmitted bit stream to enable adaptive weighting of reference pictures.

The system and method can be applied to P-image prediction, which is encoded by a single predictor, or B-pictures with double prediction, which are encoded by the two predictors. The following describes the processes of decoding for the cases of P - and B-pictures, which shall take place in the encoder, and decoders. Alternatively, the technique may also be applied to the coding system uses a concept similar to the I, B and P-pictures.

For unidirectional prediction in B-pictures, and for bidirectional prediction in B-pictures can be used identical weights. When the macroblock using one predictor, P-pictures or for unidirectional prediction in B-pictures for this block is passed the index of the reference image. After the decoding process at the stage motion compensation create a predictor, this predictor apply a weighting factor. Then the weighted predictor summarize with an encoded residue and the result of summation perform cut-off at the boundaries of the region for forming the decoded image. When used for blocks in P-images or blocks in a B-images that use only the prediction list 0, the weighted predictor form as

Pred=W0*Pred0+D0 (1),

where W0 is the weight of the associated reference picture list 0, D0 is the displacement of the associated reference picture list 0, and Pred0 - block motion compensated prediction from a reference picture list 0.

To use DL the blocks in B-pictures, making use of only the prediction list 0, the weighted predictor form as

Pred=W1*Pred1+D1 (2),

where W1 is the weighting factor associated reference picture list 1, D0 is the displacement of the associated reference picture list 1, and Pred1 - block motion compensated prediction from a reference picture list 1.

To ensure that the resulting values will be within the allowable range of pixel values, typically from 0 to 255, can be cut-off at the boundaries of the region in relation to the weighted predictors. Precision multiplication formulas weighting may be limited by any pre-defined number of bits of resolution.

In the case of a double prediction is passed the index of the reference image for each of the two predictors. For the formation of the two predictors is motion compensation. For the formation of two weighted predictors, each predictor uses a weighting factor associated with his index reference image. Then two weighted predictors are averaged together to calculate an average of the predictor, which is then coded with the rest.

To use for blocks in B-images that use predictors of list 0 and list the 1, weighted predictor form as

Pred=(P0*Pred0+D0+P1*Pred1+D1)/2 (3)

To ensure that the resulting values will be within the allowable range of pixel values, typically from 0 to 255, a weighted predictor or any of the intermediate values in the calculation of the weighted predictor can be applied cut-off at the boundaries of the region.

Accordingly, the prediction reference image decoder and encoder video compression that use multiple reference images, applies a weighting factor. The weighting factor is adapted to the individual blocks of the motion within the image based on the index of a reference image that is used for this block movement. Because the index of the reference image is transmitted in the bit stream of the compressed video signal is significantly reduced amount of additional overhead service information to adapt the weighting factor based on block motion. All blocks movement, coded relative to the same reference image, is applied to the prediction reference image in the same weight ratio.

Experts in the art on the basis of these principles can easily discover these and other features and advantages of the present invention. The concept is but the principles of the present invention can be implemented in various forms of hardware, software, hardware, specialized processors or their combinations.

The most preferred implementation of the principles of the present invention as a combination of hardware and software. In addition, the software is preferably implemented as an application program, physically implemented in the block of memory to store programs. An application program can be loaded into a computing device, having any appropriate architecture, and run it. Preferably, the computing device is implemented on the platform of the computer, which has the same hardware as one or more Central processing units (CPU), random access memory (RAM) and interfaces input/output (I/O). The platform of the computer can include the code of the microinstructions and the operating system. Various processes and functions described herein may be part of the code of microinstructions or part of the application program, or any combination thereof, which may be performed by the CPU. In addition to the platform of the computer can be connected to various peripheral devices, such as additional memory block for the injury data and a printing device.

Also it should be understood that, because some of the constituent system components and methods depicted in the attached drawings, preferably implemented in software, the actual connections between the system components or functional blocks of the process can vary depending on how you programmed the present invention. In the presence here of the principles of a specialist in this field of technology can provide these and similar implementations or configurations of the present invention.

While here, according to the attached drawings, and described illustrative embodiments of, it is clear that the present invention is not limited to these options for implementation and that a person skilled in the art do not depart from the essence and not going beyond the scope of the present invention, can make various changes and modifications. It is implied that all such changes and modifications are included within the scope of present invention defined by the attached claims.

1. Video encoder for encoding video data for a block of image and index of a particular reference image containing

a receiving module for receiving essentially uncompressed block image,

the target module is the weighting factor on what I purpose for this block image weighting factor, corresponding to the first specific reference image having a corresponding index,

the calculator tool to calculate the motion vectors corresponding to the difference between the block image and the first concrete reference image

module motion compensation for performing motion compensation in respect of the first specific reference image in accordance with said motion vectors,

the module multiplication for multiplying the offset by the movement of the first specific reference image on the assigned weighting factor to form a weighted motion compensated first reference image,

the second assignment module weighting factor to assign to the above-mentioned block image weighting factor corresponding to the second particular reference picture having a second corresponding index,

the calculator tool to calculate the motion vectors corresponding to the difference between the block image and the second particular reference picture

module motion compensation for performing motion compensation in relation to the second particular reference picture in accordance with said motion vectors,

modulometer for multiplying the motion compensated second particular reference picture by the assigned weighting factor to form a weighted motion compensated reference image,

the merge tool to merge the weighted motion compensated first reference image with a weighted motion compensated second reference picture for the formation of the motion compensated result of the reference image,

the subtraction module for subtracting the motion compensated result of the reference image from the mentioned essentially uncompressed block image, and

the encryption module to encode a signal indicating the difference between these essentially uncompressed block image and the motion compensated result of the reference image, together with the corresponding indexes mentioned two specific reference images used to generate a motion compensated result of the reference image.

2. The video encoder according to claim 1, additionally containing store reference images associated with the possibility of exchanging signals with the mentioned modules assigning the weighting factor of the reference image, to provide reference images corresponding to the indices mentioned specific reference images.

3. The video encoder according to claim 1, used with predictors image with dual prediction and additional content is of ASI tool predictions for the formation of the first and second predictors from two different reference images.

4. The video encoder according to claim 3, in which both of these two different reference images correspond to the same direction relative to that of the block image.

5. A method of encoding video data for a block of an image, comprising the steps in which

take, essentially, the uncompressed block image,

appointed for this block image weighting factor corresponding to the first specific reference image having a corresponding index,

calculates the motion vectors corresponding to the difference between the block image and the first concrete reference image

perform motion compensation in respect of the first specific reference image in accordance with said motion vectors,

multiply compensated by movement of the first specific reference image on the assigned weighting factor for the formation of the weighted motion compensated first reference image,

appointed for the said unit image, the weighting factor corresponding to the second particular reference picture having a corresponding index,

calculates the motion vectors corresponding to the difference between the block image and the second specific this is as image

perform motion compensation in relation to the second particular reference picture in accordance with said motion vectors,

multiply compensated by movement of the second particular reference picture by the assigned weighting factor for the formation of the weighted motion compensated second reference picture,

combine the weighted motion compensated first reference image with a weighted motion compensated second reference picture for the formation of the motion compensated result of the reference image,

subtract the compensated movement resulting reference image of these essentially uncompressed block image, and

encode the signal indicating the difference between these essentially uncompressed block image and the motion compensated result of the reference image, together with the corresponding indexes mentioned two specific reference images used to generate a motion compensated result of the reference image.

6. The method according to claim 5, in which the calculation of motion vectors includes the steps, which perform validation within the search area for any p the motion within a predetermined range of displacements on the said unit image, calculate at least one of a sum of absolute difference and standard error of each pixel in said block image with motion compensated reference image, and choose the offset with the lowest sum of absolute differences or the root mean squared error as a motion vector.

7. The method according to claim 5, in which both of these two different reference images correspond to the same direction relative to that of the block image.

8. The method according to claim 5, in which the calculation of motion vectors includes the steps, which perform validation within the search area, subject to any move within a predetermined range of displacements on the said unit image, calculates at least one of a sum of absolute difference and standard error of each pixel in said block image with the first motion compensated reference image that corresponds to the first predictor, choose the offset with the lowest sum of absolute differences or the root mean squared error as a motion vector for the first predictor, calculate at least one of a sum of absolute difference and standard error of each pixel in the image block with the second, skompensirovannogo movement of the reference image, corresponding to the second predictor, and choose the offset with the lowest sum of absolute differences or the root mean squared error as a motion vector for the second predictor.



 

Same patents:

FIELD: movement estimation, in particular, estimation of movement on block basis in video image compression application.

SUBSTANCE: method and device are claimed for conducting search for movement in video encoder system using movement vectors which represent difference between coordinates of macro-block of data in current frame of video data and coordinates of corresponding macro-block of data in standard frame of video data. A set of movement vector prediction parameters is received, where movement vector prediction parameters represent approximations of possible movement vectors for current macro-block, movement vector search pattern is determined and search is conducted around each movement vector prediction parameter from the set of movement vector prediction parameters using search pattern, and on basis of search result, the final movement vector is determined.

EFFECT: increased efficiency of video signals compression.

3 cl, 7 dwg

FIELD: video encoding, in particular, methods and devices for ensuring improved encoding and/or prediction methods related to various types of video data.

SUBSTANCE: the method is claimed for usage during encoding of video data in video encoder, containing realization of solution for predicting space/time movement vector for at least one direct mode macro-block in B-image, and signaling of information of space/time movement vector prediction solution for at least one direct mode macro-block in the header, which includes header information for a set of macro-blocks in B-image, where signaling of aforementioned information of space/time movement vector prediction solution in the header transfers a space/time movement vector prediction solution into video decoder for at least one direct mode macro-block in B-image.

EFFECT: creation of improved encoding method, which is capable of supporting newest models and usage modes of bi-directional predictable (B) images in a series of video data with usage of spatial prediction or time distance.

2 cl, 17 dwg

FIELD: compensation of movement in video encoding, namely, method for encoding coefficients of interpolation filters used for restoring pixel values of image in video encoders and video decoders with compensated movement.

SUBSTANCE: in video decoder system for encoding a video series, containing a series of video frames, each one of which has a matrix of pixel values, interpolation filter is determined to restore pixel values during decoding. System encodes interpolation filter coefficients differentially relatively to given base filter, to produce a set of difference values. Because coefficients of base filter are known to both encoder and decoder and may be statistically acceptably close to real filters, used in video series, decoder may restore pixel values on basis of a set of difference values.

EFFECT: efficient encoding of values of coefficients of adaptive interpolation filters and ensured resistance to errors of bit stream of encoded data.

5 cl, 17 dwg

FIELD: video decoders; measurement engineering; TV communication.

SUBSTANCE: values of motion vectors of blocks are determined which blocks are adjacent with block where the motion vector should be determined. On the base of determined values of motion vectors of adjacent blocks, the range of search of motion vector for specified block is determined. Complexity of evaluation can be reduced significantly without making efficiency of compression lower.

EFFECT: reduced complexity of determination.

7 cl, 2 dwg

The invention relates to the field of digital signal processing

The invention relates to the formation, transmission and processing of the television program guide for broadcast television service

The invention relates to the field of digital signal processing
The invention relates to the field of digital signal processing image and can be used when transmitting video over narrowband communication channels and implementing quick view large archival databases

FIELD: video processing.

SUBSTANCE: method for encoding includes determining if all flags of current layer included in the specified unit area are equal flags of the main layer, setting pre-defined flag of forecast according to the result of determining, and if it is determined that flags of current layer are equal to flags of the main layer, flags of current layer are omitted and flags of the main layer are inserted and said forecast flag into the bit stream.

EFFECT: improvement of encoding efficiency of various flags used in multilayer scaled video coder-decoder based on interlayer correlation; method and device are suggested for efficient encoding of various flags used in multilayer scaled video coder-decoder based on interlayer correlation.

21 cl, 12 dwg

FIELD: movement estimation, in particular, estimation of movement on block basis in video image compression application.

SUBSTANCE: method and device are claimed for conducting search for movement in video encoder system using movement vectors which represent difference between coordinates of macro-block of data in current frame of video data and coordinates of corresponding macro-block of data in standard frame of video data. A set of movement vector prediction parameters is received, where movement vector prediction parameters represent approximations of possible movement vectors for current macro-block, movement vector search pattern is determined and search is conducted around each movement vector prediction parameter from the set of movement vector prediction parameters using search pattern, and on basis of search result, the final movement vector is determined.

EFFECT: increased efficiency of video signals compression.

3 cl, 7 dwg

FIELD: engineering of systems for encoding digital video signals, in particular, indication of values of quantization parameters in video encoding system.

SUBSTANCE: method and device for encoding a digital video series are claimed, where indication of quantization parameter is given out in encoded bit stream for use during decoding. Indication of information concerning the quantization parameter is ensured by insertion of SQP value - series level quantization parameter value. In particular, instead of encoding absolute values of parameters of quantization of image/section, indication of difference ΔQP between series level quantization parameter SQP and QP of image/section, is given out.

EFFECT: increased efficiency when encoding digital video signals and reduced speed of data transmission in bits.

4 cl, 8 dwg

FIELD: optics.

SUBSTANCE: method of optical registration of process, changing in time, is realized due to frame-by-frame shooting of digital camera with CCD array to get time dependence of at least one cinematic characteristic of tested process. During frame-by-frame shooting, registration is carried out in area of one or row of spatially fixed fragments of field of any frame. Video camera reads out information at any field from those columns or lines of CCD array only which are disposed in field of specified fragment. After that images of any registered fragment are combined to get time dependence in form of time-base.

EFFECT: improved reliability.

5 cl, 3 dwg

FIELD: method for interpolating values of sub-pixels during encoding and decoding of data.

SUBSTANCE: method of interpolation during video data encoding is claimed, which features an image, containing pixels ordered in rows and columns and represented by values having given dynamic range, where pixels in rows are in integral horizontal positions, and pixels in rows are in integral vertical positions, which image is interpolated in such a way that values of sub-pixels are generated in fractional horizontal and vertical positions, aforementioned method containing following stages: a) when values are required for sub-pixels in half-integral horizontal positions and integral vertical positions and in integral horizontal positions and half-integral vertical positions, such values are interpolated directly using weighted sums of pixels located in integral horizontal and integral vertical positions; b) when values are required for sub-pixels in half-integral horizontal positions and half-integral vertical positions, such values are interpolated directly using a weighted sum of values for sub-pixels located in half-integral horizontal positions and integral vertical positions, computed in accordance with stage a); and c), when values are required for sub-pixel in quaternary horizontal position and quaternary vertical position, such values are interpolated by averaging of at least one pair from first pair of values of sub-pixel located in half-integral horizontal position and half-integral vertical position, and of sub-pixel, located in integral horizontal position and integral vertical position, and second pair of values of pixel, located in integral horizontal position and integral vertical position, and of sub-pixel, located in semi-integral horizontal position and semi-integral vertical position.

EFFECT: creation of improved method for interpolating values of sub-pixels during encoding and decoding of data.

13 cl, 26 dwg, 2 tbl

FIELD: observation of moving objects.

SUBSTANCE: method includes using movement sensors, capable of recording two-dimensional distributions of intensity in form of images, where sensors are positioned with known spatial orientation, making it possible to perform simultaneous observation of one and the same scene, periodical query of sensors is performed during whole time period after their enabling, processing and analysis of data received from sensors is performed, which constitutes series of images, and output signal is generated in case of detection of three-dimensional moving object and determining of its spatial position, which signal is injected into output device.

EFFECT: increased trustworthiness when determining spatial position of a moving object.

3 cl, 1 dwg

FIELD: systems for automatic video surveillance of an object.

SUBSTANCE: system for automatic detection and tracking of individuals on basis of images and biometric identity recognition based on target list, realizes following operations: on basis of three-dimensional data about scene and two-dimensional data, characterizing optical flow, detection of objects-noises of scene is performed, static background objects are selected, and regular dynamic object-noises; on basis of comparison of two-dimensional and two-dimensional data about the scene in current frame with reference data on previous frames and a map of object-noises changes are determined on a scene, in three-dimensional zones of interest, preliminary check of presence of human-like objects is performed, zones of interest are determined more precisely and their changes are tracked: a contour of separate elements of human body is singled out, zones of interest are divided onto a set of sub-zones of interest for elements, detection of three-dimensional head of individual is performed and it is tracked in each zone of interest; face of individual is tracked in each zone of interest; images of detected face are normalized in terms of dimensions, angles and brightness; recognition is performed.

EFFECT: objectivity and stability of system operation.

1 dwg

FIELD: video encoding, in particular, methods and devices for ensuring improved encoding and/or prediction methods related to various types of video data.

SUBSTANCE: the method is claimed for usage during encoding of video data in video encoder, containing realization of solution for predicting space/time movement vector for at least one direct mode macro-block in B-image, and signaling of information of space/time movement vector prediction solution for at least one direct mode macro-block in the header, which includes header information for a set of macro-blocks in B-image, where signaling of aforementioned information of space/time movement vector prediction solution in the header transfers a space/time movement vector prediction solution into video decoder for at least one direct mode macro-block in B-image.

EFFECT: creation of improved encoding method, which is capable of supporting newest models and usage modes of bi-directional predictable (B) images in a series of video data with usage of spatial prediction or time distance.

2 cl, 17 dwg

FIELD: mobile robot, such as cleaner robot, and, in particular, device for tracking movement of mobile robot.

SUBSTANCE: suggested device for tracking movement of mobile robot includes: video camera for filming an individual object; unit for tracking movement and creation of image for setting support one in an image for current moment by means of filming of individual object by video camera and creation of image in current moment, for which support zone is set; unit for selecting image of difference of pixels of image support zone limit based on difference between pixels present only at limit of support zone of aforementioned images; and micro-computer for tracking movement of separate object on basis of selected image of difference.

EFFECT: decreased time of pixel comparison operation and increased efficiency of room perception.

5 cl, 4 dwg

FIELD: system for encoding moving image, in particular, method for determining movement vector being predicted, of image block in B-frame in process of decoding of moving image.

SUBSTANCE: in accordance to method, at least one movement vector is produced for at least one block, different from current block, while aforementioned at least one block is related to one, at least, supporting frame in a row of supporting frame, movement vector is predicted for current block on basis of received one, at least, movement vector, while prediction operation includes also operation of comparison of value of order number of B-frame to value of order number of one, at least, supporting frame, while movement vector for current block and aforementioned one, at least, movement vector are vectors of forward movement.

EFFECT: increased efficiency.

2 cl, 1 dwg

FIELD: television.

SUBSTANCE: support frame is assigned with sign, showing information about direction of support frame, and during determining of predicted vector of movement of encoded block averaging operation is performed with use of vectors of movement of neighboring blocks, during which, if one of aforementioned blocks has movement vectors, information about direction of support frames is received, to which these movement vectors are related, and one of movement vectors is selected with reference to received information about direction, than averaging operation is performed with use of selected movement vector to receive subject movement vector of encoded block.

EFFECT: higher precision, higher reliability.

3 cl, 1 dwg, 3 ex

Up!