Method for scalable encoding of multimedia objects

FIELD: technology for encoding multimedia objects.

SUBSTANCE: the method for encoding a multimedia object includes the following stages: the multimedia object is encoded to produce a bit stream, and quality information is added to the bit stream. The quality information indicates the quality of the multimedia object relative to a given position in, or a given part of, the bit stream, and is provided in quality tags. Each quality tag provides a quality tag value, which characterizes the distortion in the reproduced encoded multimedia object when the bit stream is truncated at the point associated with the quality tag.

EFFECT: development of improved and efficient method/system for encoding multimedia objects.

13 cl, 2 dwg

 

The invention relates to the encoding of media objects.

Scalable compression, in particular fine-grained scalable compression of objects such as media objects, has the useful property that the encoded bit stream can be truncated at a given point while the remaining stream can still be decoded (albeit at a lower quality of the object). A standard for such scalable coding, MPEG-4 Fine Granularity Scalability (MPEG-4 FGS), is currently being defined; see ISO/IEC 14496-2/AMD 4, document ISO/IEC JTC1/SC29/WG11 N3315, March 2000 (hereinafter N3315), which is incorporated herein by reference. An improved method for scalable encoding is described in European patent application 00201037.9, filed 23.03.2000 (our ref. PHNL000153), not published prior to the grant of a patent; this application is also incorporated herein by reference.

The availability of such a scalable bit stream greatly simplifies system design, because it virtually eliminates the need for buffer control when adapting the encoded bit stream to a predetermined bit rate or memory size. In particular, one and the same bit stream can serve different channels with different bandwidths at the same time, without transcoding the original data. It also becomes much easier to adapt in real time to varying channel bandwidth (as on the Internet or on wireless channels).

Before fine-grained scalability, some limited forms of scalability already existed. In these, the bit stream consists of a few large layers, i.e. a base layer and, for example, one or two enhancement layers. Such scalability is defined, for example, in the JPEG standard (hierarchical coding) and in the MPEG-2 standard (SNR scalability, spatial scalability, temporal scalability).

The object of the invention is to provide improved coding. To this end, the invention provides: encoding of a media object to obtain a bit stream, controlling a bit stream, transmitting a bit stream, receiving a bit stream, a multiplexer or a network node, a (scalable) bit stream representing a media object, a storage medium, a computer program and a signal carrying a computer program, as defined in the independent claims. Preferred embodiments are defined in the dependent claims.

According to a first aspect of the invention, a media object is encoded to obtain a bit stream, and quality information is added to the bit stream; this quality information indicates the quality of the object in relation to a given position in (or a given part of) the bit stream. By adding quality information to the bit stream, the joint storage and transmission of multiple coded objects can be optimized, because quality can be considered per object. This aspect of the invention is based on the insight that the bit rate of a compressed object is easy to determine, whereas another important parameter, a measure of quality such as the distortion, is not so easily determined. In fact, the distortion can be obtained accurately only during encoding, when the complete source information is still available. According to this aspect of the invention, the syntax of the bit stream is improved by adding quality (distortion) information. This can be done with no increase, or only a slight increase, in bit rate, and this aspect of the invention is applicable to many encoding schemes. The media object may be audio and/or video, or any other reproducible object for which it is meaningful to speak of quality. The media object may be an image or an image sequence, such as a program.

Preferably, the encoding is scalable coding and the resulting bit stream is a scalable bit stream. Quality information is especially advantageous for scalable coding schemes, because their bit streams are suited to truncation. For many applications in which scalable bit streams are truncated, it is important to have an indication of the quality of the bit stream obtained after truncation, which is easily achieved by using the quality information included in the scalable bit stream.

Preferably, the quality information represents the quality of the reproduced object. The quality of the reproduced object as a function of the number of bits is then easily determined. To quantify the quality, signal-to-noise ratio (SNR) or peak signal-to-noise ratio (PSNR) values are preferably used.

While the coded stream of a single object can be truncated optimally by simply fitting it to the available bandwidth or memory, this approach is not suitable when several objects are handled simultaneously. For an optimal allocation of a given bandwidth or memory among many simultaneously encoded objects, the rate-distortion curve of each encoded object must be known. Although this curve is obtained relatively easily during encoding of the object (when the original is available), obtaining it later (by estimation), when only a truncated version of the encoded bit stream remains, is not straightforward. Assessing quality at a later time requires detailed knowledge of the compression method as well as at least partial decoding of the encoded bit stream.

In a practical embodiment of the invention, quality tags added to the scalable bit stream represent the reproduction quality of the encoded object when the bit stream is truncated at the point associated with the tag in question. Although adding quality information may incur some overhead, this overhead can be kept low. An important advantage is that the quality information makes it easy to truncate the bit streams of many objects jointly and optimally. This multiple-truncation problem arises, for example, in a flexible memory, as described in European patent application 00200890.2, filed 13.03.2000 (our ref. PHNL000110), not published prior to the grant of a patent, which is incorporated herein by reference. Another application is a multiplexer or network node in which the output bandwidth is temporarily smaller than the input bandwidth, so that the incoming scalable compressed bit streams must be truncated.

When compression is applied, the input data is usually compressed into many portions (for example, parts of DCT (discrete cosine transform) blocks, parts of the frequency bands of a wavelet-transformed image, or layers). Each encoded portion usually contains a header with various parameters or tags. In a preferred embodiment of the invention, a parameter is added to such a header that indicates the quality of the object when the bit stream is truncated immediately after (or, alternatively, immediately before) the current portion of encoded data. One example of such a quality parameter is a number related to the root mean square error (or the PSNR or SNR) of the reproduction; this number may also be a visually weighted (P)SNR. The type (or types) of quality indication can be standardized so that all encoders use the same, or a limited number of, quality measures. The quality could also be relative (for example, expressed as a percentage), so that the encoder does not have to describe the quality measure. The relative quality could then range from 0 to 100% on a certain scale for each individual object, with an additional scale/weight parameter for each object to allow different weights for different objects.

Quality tags can be placed at approximately equal distances (expressed in number of bits) along the encoded stream, or they can be placed more densely where the quality changes rapidly as a function of bit rate. When the encoded bit stream has to be truncated, the quality at any truncation point that does not coincide with a tag position can be approximated by interpolation (linear or more complex) of the tag values.
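The linear interpolation between tag values can be illustrated with a minimal Python sketch. The function name `quality_at`, the representation of tags as (bit position, quality in dB) pairs, and the choice to clamp outside the tagged range are assumptions made for illustration only; the description does not prescribe them.

```python
from bisect import bisect_right

def quality_at(truncation_bits, tags):
    """Estimate the quality at a truncation point by linear interpolation.

    tags: list of (bit_position, quality_db) pairs sorted by strictly
    increasing bit_position (hypothetical representation of quality tags).
    """
    positions = [pos for pos, _ in tags]
    i = bisect_right(positions, truncation_bits)
    if i == 0:
        # Before the first tag: clamp to the first value (assumption).
        return tags[0][1]
    if i == len(tags):
        # Beyond the last tag: clamp to the last value (assumption).
        return tags[-1][1]
    (p0, q0), (p1, q1) = tags[i - 1], tags[i]
    return q0 + (q1 - q0) * (truncation_bits - p0) / (p1 - p0)

tags = [(0, 20.0), (1000, 26.0), (3000, 30.0)]
print(quality_at(500, tags))   # halfway between the first two tags
print(quality_at(2000, tags))  # halfway between the last two tags
```

A more complex interpolation (for example along a fitted rate-distortion model) could replace the linear step without changing the interface.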

In an embodiment of the invention, quality information is added to the encoded MPEG-4 FGS bit stream using a tag that is already defined in the standard; see N3315, cited above. In this way the information can be added without changes to the proposed standard, which is a major advantage.

For decoded media, quality information can be used for adaptive post-processing or for scalable video processing algorithms, also with non-scalable compression methods. For example, for post-processing of MPEG-compressed video, quality information can help determine the strength, extent or type (deblocking/deringing) of post-processing required. For scalable video processing algorithms, quality information can help to better estimate the number of CPU (central processing unit) cycles required to achieve a certain desired level of processing quality with a selected video processing algorithm.

Quality information can also be added as side information accompanying the bit stream, i.e. not included in the bit stream itself.

For encrypted bit streams, it is advantageous for the quality information to remain unencrypted. The quality of a given part of the bit stream (e.g., a layer) can then be determined at the decoder without decoding the bit stream.

Quality information can also be used advantageously in applications where source coding and channel coding are not performed at the same time or in the same place. The quality information is then used in channel coding, for example to determine protection levels.

In the method according to the present invention, the scalable bit stream includes more than one quality level, and quality information is added to the bit stream for each of these quality levels.

Also disclosed is a method of controlling at least one bit stream representing a media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the media object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value that characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag. The method includes the steps of: receiving the at least one bit stream; extracting the quality information from the bit stream; transcoding or truncating the at least one bit stream so that the current combination of bit rate and quality of the at least one received bit stream reaches a desired combination of bit rate and quality; and providing the at least one bit stream having the desired combination of bit rate and quality.

Also disclosed is a method of transmitting at least one media object, comprising the steps of: encoding the media object to obtain a bit stream; adding quality information to the bit stream, the quality information indicating the quality of the media object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value that characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag; and transmitting the bit stream to which the quality information has been added.

In a further method relating to at least one bit stream representing a media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the media object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value that characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag, the steps are: receiving the at least one bit stream; extracting the quality information from the bit stream; transcoding or truncating the at least one bit stream so that the current combination of bit rate and quality of the at least one received bit stream reaches a desired combination of bit rate and quality; providing the at least one bit stream having the desired combination of bit rate and quality; and decoding the at least one bit stream having the desired combination of bit rate and quality.

Also disclosed is a method of receiving at least one bit stream representing a media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value that characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag. The method includes the steps of: receiving the at least one bit stream; extracting the quality information from the bit stream; decoding the bit stream to obtain a decoded media object; and post-processing the decoded media object to reduce artifacts depending on the extracted quality information.

A device for encoding a media object according to the present invention includes: means for encoding the object to obtain a bit stream, and means for adding quality information to the bit stream, the quality information indicating the quality of the media object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value that characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag.

Also disclosed is a transmitter containing the above-mentioned device and means for transmitting the bit stream to which the quality information has been added.

Also disclosed is a controller for controlling at least one bit stream representing a media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value that characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag. The controller includes: means for receiving the at least one bit stream; means for extracting the quality information from the at least one bit stream received by the means for receiving; means for truncating the at least one received bit stream so that the current combination of bit rate and quality, as indicated by the quality information extracted by the means for extracting, reaches a desired combination of bit rate and quality for the at least one received bit stream; and means for providing the at least one bit stream truncated by the means for truncating and having the desired combination of bit rate and quality.

Also disclosed are a receiver, a multiplexer and a network node, each of which contains the above-mentioned controller.

According to the present invention, there is also disclosed a receiver for receiving at least one bit stream representing a media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value that characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag. The receiver includes: means for receiving the at least one bit stream; means for extracting the quality information from the at least one bit stream received by the means for receiving; means for decoding the at least one bit stream received by the means for receiving, to obtain a decoded media object; and means for post-processing the media object to reduce artifacts depending on the quality information extracted by the means for extracting.

The present application also discloses a signal carrying a bit stream that represents an encoded media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value that characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag.

The above and other aspects of the invention will be clear from the following description of embodiments of the invention.

In the drawings:

Figure 1 shows a system according to an embodiment of the invention, and

Figure 2 shows a preferred embodiment of the invention.

The drawings show only those elements that are necessary for understanding the invention.

Figure 1 shows a system according to an embodiment of the invention, comprising a transmitter 11 having an input unit 110 for obtaining an object, and an encoder 12. The encoder 12 includes a scalable encoder 120 and a quality information generation unit 121. The scalable encoder 120 encodes the objects received from the input unit 110 to obtain one or more scalable bit streams. The quality information generation unit 121 derives the quality information from signals received from the input unit 110 and from signals and/or parameters provided by the encoder 120. The quality information from the generation unit 121 is supplied to the encoder 120, which generates quality tags and inserts them into the scalable bit stream. The system also contains a truncation device 3 for truncating one or more scalable bit streams and a control unit 4 for the truncation device. The control unit 4 extracts the quality information from the scalable bit stream provided by the encoder 12 and controls the truncation device 3 depending on the received quality tags or quality information. If there is only a single scalable bit stream, this stream is truncated when the required quality is reached. The truncation device 3 and the control unit 4 may together form part of a multiplexer, a bit rate control unit, a network node, etc., and may be located in the channel as well as in a receiver. Unit 5 may alternatively be a reproduction unit and/or a decoder; for example, it may be located together with the truncation device 3 and the control unit 4 in a receiver according to an embodiment of the invention.

In a preferred application, shown in Figure 2, a plurality of scalable bit streams arrives from transmitters 21, 31, 41, at least some of these scalable bit streams having quality tags included in them. The transmitters 21, 31, 41 and their elements are similar to the transmitter 11 shown in Figure 1. Depending on the available bandwidth or memory capacity of the channel or storage medium 15, the scalable bit streams are truncated to a greater or lesser extent on the basis of the quality tags or quality information contained in the scalable bit streams. Such multiple truncation can be performed using the flexible memory principle described in European patent application 00200890.2, filed 13.03.2000 (our ref. PHNL000110), not published prior to the grant of a patent, which is incorporated herein by reference. The multiplexer 16 combines the streams of the transmitters. The truncation device 13 and the control unit 14 may together form part of a multiplexer (e.g., 16), a bit rate control unit, a network node, etc., and may be located in the channel as well as in a receiver. Unit 15 may alternatively be a reproduction unit and/or a decoder; for example, it may be located together with the truncation device 13 and the control unit 14 in a receiver according to an embodiment of the invention.

Below are some examples of applications of MPEG-4 FGS that require quality information. Although the following description refers specifically to MPEG-4 FGS, those skilled in the art will understand that the invention can be applied successfully to any scalable coding scheme. From an application point of view, the distortion is an essential parameter for MPEG-4 FGS. If the distortion is not available, the applicability of FGS is limited, as demonstrated below by various applications that require this information. According to an embodiment of the invention, the syntax of the FGS bit stream is improved by adding quality (distortion) information. This can be done with no increase, or only a slight increase, in bit rate, and it extends the range of applications for FGS.

A first application of the invention is encoding for constant-quality output (and hence variable bit rate). This can be used, for example, to record video at constant quality on a storage medium that allows a variable bit rate. By using the quality information, the final bit stream need not be produced during the initial encoding; it can be obtained later by processing the encoded bit stream.
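As a hedged illustration of obtaining a constant-quality stream after the fact, the following Python sketch picks the shortest truncation point whose tagged quality meets a target. The function name and the (bit position, quality in dB) tag representation are hypothetical and are not taken from the description.

```python
def truncation_for_quality(tags, target_db):
    """Return the smallest tagged bit position whose quality meets target_db.

    tags: list of (bit_position, quality_db) pairs in stream order, with
    quality non-decreasing along the stream (assumed representation).
    Returns None when even the full stream does not reach the target.
    """
    for bit_position, quality_db in tags:
        if quality_db >= target_db:
            return bit_position
    return None

# Two frames with different rate-distortion behaviour, same quality target.
easy_frame = [(800, 31.0), (1500, 35.5), (2600, 39.0)]
hard_frame = [(800, 24.0), (1900, 29.5), (3500, 35.2)]
for frame in (easy_frame, hard_frame):
    print(truncation_for_quality(frame, 35.0))
```

Applying the same target to every frame yields roughly constant quality, while the number of bits kept per frame varies.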

Selling the same content at different quality levels can be realized using a scalable (fine-grained or multi-rate) compression method such as described above, followed by encryption of one or more layers. A characteristic of many scalable compression methods is that, when the lowest scalability layer is not available, the higher layers are useless, i.e. they cannot be used to improve quality. When scalably compressed content is encrypted, it can still be used in a flexible memory, that is, the stored volume can be reduced by discarding one or more enhancement layers. For flexible memory, reference is made to the above-mentioned PHNL000110. To decide how much data should be removed, some information about the corresponding loss of quality should be available, because this information can no longer be extracted from the compressed bit stream without decrypting it. In this embodiment of the invention, the quality information is sent unencrypted, for example as side information. The business model of selling the same content at different quality levels is closely linked to flexible memory, since in both cases the same content is stored at many different quality levels using scalable compression. The quality levels offered to the consumer for purchase preferably correspond directly to the quality levels used in the flexible memory system. This implies that, when a flexible memory device wants to lower the quality of an item of content, it can remove the highest encrypted quality layer without having to decrypt it. Because the device thus does not decrypt any data, there is no risk of compromising the security of the information or of its being stolen.
To ensure security along the entire path from the information owner or service provider to the consumer, the information is preferably compressed (using scalable compression) and encrypted by the information owner at the required quality levels, and then distributed in encrypted form to the flexible memory device (either transmitted or downloaded directly, or conveyed via, for example, intermediate storage on an optical disc).

With a flexible memory, the user (or the device, based on what it knows about the user's preferences) may optionally select a required minimum quality level. That is, information currently stored at a quality level higher than the minimum required by the user can be "downgraded" to create space for other information, until it reaches the lowest level acceptable to the user. However, as long as the information is still stored at high quality, the user retains the option of buying the higher quality. Of course, the user (or the device) can also preset different required minimum quality levels for different types of information (sports, talk shows or movies).

Alternatively, the service provider can be allowed to manage the memory and to determine which quality levels should be deleted (in this case the service provider retains control over quality and can perform the memory function remotely). This can be useful, for example, when the information is placed by the provider in a box with a storage function (e.g., a hard disk). Initially, the user may be offered the information at high quality. If the user does not view or buy this information within a specific time, the quality level stored in the box can be lowered to create space for other information.

Quality data also make it possible to perform source coding and channel coding at different times and in different places. This is useful because the channel characteristics may not yet be known at encoding time. In addition, the same encoded bit stream can be supplied over different channels with different error characteristics. Finally, no memory is wasted on the overhead of storing error correction data. Such data can be generated when needed, because the quality fields provide the information required to add channel coding (using unequal error protection).

Next, there are applications in which multiple encoded FGS frames have to be processed together. This may occur, for example, in a congested network node that temporarily has less bandwidth than required. Such a network node can then use the quality information to truncate the bit streams optimally, with minimal loss of quality. Because a set of objects is involved, possibly with very different rate-distortion curves, this truncation cannot be performed satisfactorily without quality information.
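How a node might use the tags for joint truncation can be sketched as follows. This is only one possible heuristic (greedily extending the stream that offers the largest quality gain per extra bit), not a method specified in the description; the function name, the data layout and the use of summed dB gains as the objective are all assumptions.

```python
def joint_truncate(streams, budget_bits):
    """Choose one truncation point per stream under a shared bit budget.

    streams: dict mapping a stream name to a list of
    (cumulative_bits, quality_db) truncation points, strictly increasing
    in cumulative_bits (hypothetical layout derived from quality tags).
    Returns a dict mapping each stream name to the index of its chosen point.
    """
    choice = {name: 0 for name in streams}
    used = sum(points[0][0] for points in streams.values())
    if used > budget_bits:
        raise ValueError("budget too small for the first truncation points")
    while True:
        best = None
        for name, points in streams.items():
            i = choice[name]
            if i + 1 >= len(points):
                continue  # this stream is already kept in full
            extra = points[i + 1][0] - points[i][0]
            gain = points[i + 1][1] - points[i][1]
            if used + extra > budget_bits:
                continue  # next segment does not fit in the budget
            slope = gain / extra  # quality gain per additional bit
            if best is None or slope > best[0]:
                best = (slope, name, extra)
        if best is None:
            return choice
        _, name, extra = best
        choice[name] += 1
        used += extra

streams = {
    "a": [(100, 25.0), (200, 31.0), (400, 34.0)],
    "b": [(100, 30.0), (300, 32.0)],
}
print(joint_truncate(streams, 450))
print(joint_truncate(streams, 500))
```

Without the tagged (bits, quality) pairs the node would have to decode the streams to estimate these slopes, which is the difficulty the quality tags are meant to avoid.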

In addition, for streaming applications, quality information gives the server a good tool for rate control during transmission, and also for trading off temporal enhancement against SNR enhancement (FGS versus FGST; see the definitions in N3315).

In MPEG-4 FGS, the quality information required is in fact the rate-distortion curve of the scalable enhancement layer. Because the rate is evident, only distortion information needs to be added. There are two solutions for adding this information with no, or only minimal, modification of the current FGS bit stream syntax.

Solution A. The beginning of a bit plane is a good point at which to add quality fields or quality information, because it makes this information easy to access and also provides sufficient sampling of the rate-distortion curve. A "quality code" would be similar to the current fgs_bp_start_code, whose last 5 bits indicate the bit plane ID (identifier). Instead of the bit plane ID, the quality information can be carried in these 5 bits. In a preferred embodiment, some bits are added to the syntax to indicate whether the fgs_bp_start_code carries a bit plane ID or quality information.

Solution B. Alternatively, a new code could be defined for the quality information and inserted after the fgs_bp_start_code. In this case, the tag may have 8 bits.

The information that we propose to store in the quality field is the distortion after complete decoding of the bit plane that follows this quality field. In this case, when the stream is truncated inside a bit plane, the approximate quality can be obtained by interpolation. This is easier than extrapolation, which would be required if the quality field contained the distortion before decoding of the current bit plane. For example, let Q1 be the quality before decoding of a bit plane and Q2 the quality after decoding (as proposed above, Q2 is already known to the decoder at the beginning of the bit plane). If the stream is truncated inside the bit plane, the true quality Q is known to lie in the interval [Q1, Q2]. It can therefore simply be approximated as Q ≈ (Q1+Q2)/2. A more accurate approximation can be obtained by also taking into account the number of decoded DCT blocks. For example, if the enhancement information for the current bit plane has been received for n out of the N blocks in a frame, the true quality can be approximated as Q ≈ Q1+(Q2-Q1)·n/N.
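The block-count approximation Q ≈ Q1+(Q2-Q1)·n/N can be written as a short Python sketch; the function and parameter names are illustrative only.

```python
def quality_inside_bit_plane(q1, q2, blocks_received, blocks_total):
    """Approximate quality when a stream is truncated inside a bit plane.

    q1: quality (dB) before the current bit plane is decoded.
    q2: quality (dB) after the current bit plane is fully decoded.
    blocks_received / blocks_total: n and N from the text.
    """
    if blocks_total <= 0 or not 0 <= blocks_received <= blocks_total:
        raise ValueError("need 0 <= blocks_received <= blocks_total, blocks_total > 0")
    return q1 + (q2 - q1) * blocks_received / blocks_total

# A quarter of the blocks of the bit plane received: n = 99, N = 396.
print(quality_inside_bit_plane(30.0, 34.0, 99, 396))
```

With n = N/2 the expression reduces to the simpler midpoint estimate (Q1+Q2)/2.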

In the preferred embodiment of Solution A, the first quality field of an enhancement VOP, i.e. the field preceding the most significant (MSB) bit plane, contains the absolute quality (distortion), while the subsequent fields contain the quality improvement (reduction in distortion) relative to the previous quality. The absolute quality can be used to compare different objects. Carrying quality improvements in the subsequent fields allows these improvements to be represented with higher accuracy than would be possible with absolute quality values. This is especially important when each field has only 5 bits.

To quantify the quality, a PSNR value is preferably used. As described above, the first quality field contains the absolute quality. We propose to use 5 bits to give the PSNR after decoding of the first (MSB) bit plane in the range 18-49 dB in 1 dB steps. This range covers all practically relevant PSNR values: at a PSNR above 49 dB, the base layer already contains an almost lossless representation of the object. A PSNR below 18 dB would mean that the base layer provides extremely low quality, which is not very likely. Values outside this range are clipped to 18 dB or 49 dB, depending on whether they fall below or above the allowed range.
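A minimal Python sketch of this 5-bit absolute field follows. The function names are hypothetical, and rounding to the nearest dB is an assumption; the text only fixes the range, the step and the clipping.

```python
PSNR_MIN_DB = 18   # lowest representable value
FIELD_MAX = 31     # largest 5-bit code, i.e. 18 + 31 = 49 dB

def encode_absolute_psnr(psnr_db):
    """Map a PSNR in dB to a 5-bit code with 1 dB steps, clipping to 18..49 dB.

    Rounding to the nearest integer dB is an assumption of this sketch.
    """
    code = round(psnr_db) - PSNR_MIN_DB
    return max(0, min(FIELD_MAX, code))

def decode_absolute_psnr(code):
    """Recover the PSNR in dB represented by a 5-bit code."""
    if not 0 <= code <= FIELD_MAX:
        raise ValueError("code must fit in 5 bits")
    return PSNR_MIN_DB + code

for psnr in (33.4, 12.0, 55.3):
    code = encode_absolute_psnr(psnr)
    print(psnr, code, decode_absolute_psnr(code))
```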

Each subsequent quality field then contains the improvement in quality after complete decoding of the following bit plane, relative to the quality after decoding of the previous bit plane. These quality improvements are preferably given in the range 0-6.2 dB in steps of 0.2 dB, using 5 bits. Because only a single bit is added per bit plane, the improvement cannot exceed 6.02 dB, so this range is sufficient.
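The combination of one absolute field and subsequent differential fields can be sketched in Python as below. The names are hypothetical, and two design choices are assumptions of this sketch rather than part of the description: values are rounded to the nearest step, and each difference is taken against the reconstructed (quantized) previous quality so that rounding errors do not accumulate.

```python
def encode_quality_fields(psnr_per_plane):
    """Encode per-bit-plane PSNR values (dB) as 5-bit field codes.

    First code: absolute PSNR, 18..49 dB in 1 dB steps.
    Later codes: improvement over the previous plane, 0..6.2 dB in 0.2 dB steps.
    """
    first = max(0, min(31, round(psnr_per_plane[0]) - 18))
    codes = [first]
    reconstructed = 18.0 + first
    for psnr in psnr_per_plane[1:]:
        step = max(0, min(31, round((psnr - reconstructed) / 0.2)))
        codes.append(step)
        reconstructed += step * 0.2
    return codes

def decode_quality_fields(codes):
    """Rebuild the per-bit-plane PSNR values (dB) from the field codes."""
    qualities = [18.0 + codes[0]]
    for step in codes[1:]:
        qualities.append(qualities[-1] + step * 0.2)
    return qualities

codes = encode_quality_fields([30.0, 34.0, 39.2])
print(codes)
print(decode_quality_fields(codes))
```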

For Solution B, where 8 bits are used for the quality tag, the quality values (both absolute and differential) could be represented with an even finer step. However, Solution B would also allow only absolute (i.e. non-differential) values to be used. The quality range would then be 18-60.5 dB in steps of 1/6 ≈ 0.167 dB (or 18-49.875 dB in steps of 0.125 dB).
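The quoted ranges follow from the field width and step size, since an n-bit field offers 2^n - 1 steps above its minimum. The short Python check below reproduces them; the helper name is illustrative, and the 6.02 dB figure is computed as the standard 20·log10(2) gain for one additional bit of amplitude precision.

```python
import math

def range_max_db(min_db, step_db, bits):
    """Largest value representable by a field of `bits` bits starting at min_db."""
    return min_db + (2 ** bits - 1) * step_db

print(range_max_db(18, 1, 5))      # 5-bit absolute field, 1 dB steps
print(range_max_db(0, 0.2, 5))     # 5-bit differential field, 0.2 dB steps
print(range_max_db(18, 1 / 6, 8))  # 8-bit absolute field, 1/6 dB steps
print(range_max_db(18, 0.125, 8))  # 8-bit absolute field, 0.125 dB steps
print(20 * math.log10(2))          # gain per added bit, about 6.02 dB
```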

Various applications of MPEG-4 FGS that require quality information have been described above. Because this information is uniquely and fully available at the time of source coding, it is added to the bit stream to make it available for later use. This can be done with a slight increase in bit rate, or with no increase in bit rate, and with minimal modifications to the current bitstream syntax. Two detailed solutions have been presented for adding PSNR quality values. The Solution that uses absolute values is preferred.

The invention is applicable in all cases in which many scalably compressed media objects must be stored or transmitted together and some of these objects are compressed using MPEG-4 FGS incorporating the invention. Specific applications are those with elastic memory, as well as transmission channels and networks that serve multiple objects or users. When the memory, the channel or the network has to be shared among only a few objects or users, these receive high quality. The quality degrades automatically to accommodate a greater number of objects or users. This can be done efficiently, i.e. at low cost, thanks to the presence of quality tags according to embodiments of the invention.
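One way a controller might use the quality tags to share a fixed budget among several scalable objects is sketched below. The greedy policy (always extend the object that currently has the lowest quality) is an assumption chosen for illustration and is not prescribed by the text; the function name `allocate` and the input layout are likewise hypothetical. Each object is described by its truncation points, each a pair of stream size in bytes and the PSNR read from the quality tag at that point.

```python
def allocate(objects, budget_bytes):
    """Choose one truncation point per object within a shared byte budget.

    objects: {name: [(bytes_at_cut, psnr_db), ...]}, cuts in stream order.
    Returns {name: index of the chosen truncation point}.
    """
    choice = {name: 0 for name in objects}
    used = sum(cuts[0][0] for cuts in objects.values())
    if used > budget_bytes:
        raise ValueError("budget too small for the first truncation points")
    while True:
        # Objects whose next truncation point still fits in the budget.
        candidates = []
        for name, cuts in objects.items():
            i = choice[name]
            if i + 1 < len(cuts):
                extra = cuts[i + 1][0] - cuts[i][0]
                if used + extra <= budget_bytes:
                    candidates.append((cuts[i][1], name, extra))
        if not candidates:
            return choice
        # Extend the object with the lowest current quality first.
        _, name, extra = min(candidates)
        choice[name] += 1
        used += extra


streams = {
    "a": [(100, 30.0), (200, 35.0), (300, 39.0)],
    "b": [(100, 25.0), (250, 31.0), (400, 36.0)],
}
print(allocate(streams, 600))   # more budget, higher truncation points
print(allocate(streams, 300))   # tighter budget, quality degrades
```

No decoding is needed to make this decision: the sizes and tagged qualities alone determine where each stream is cut.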

The invention can also be used to advantage in applications in which source coding and channel coding are not performed at the same time or in the same place. The quality tags then provide the information needed to add channel coding (unequal error protection, for example increased protection for the parts of the bitstream that carry higher quality, or for the parts that carry high quality relative to their bit rate).

The invention can also be applied to advantage in the context of scalable image coding schemes such as JPEG2000; see document ISO/IEC JTC 1/SC 29/WG 1 N1646 of March 16, 2000, which is incorporated by reference. Quality information can conveniently be included in JPEG2000 because a Comment and Extension marker (CME) is already defined (see p. 51 of document N1646), which allows unstructured data in the header. The quality information is advantageously included in such a CME. For example, it can be included as binary data (Rcme=0). Moreover, according to an embodiment of the invention, a separate Rcme type is defined for quality tags.
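As a rough illustration of carrying quality tags as binary comment data, the sketch below builds a marker segment with a 16-bit marker, a 16-bit length and a 16-bit registration value of 0. The layout and the marker code 0xFF64 are taken from the comment (COM) segment of the published JPEG 2000 Part 1 standard and are only assumed to match the CME segment of draft N1646; the helper names and the choice of one 8-bit tag per byte are hypothetical.

```python
import struct

COMMENT_MARKER = 0xFF64   # assumption: COM marker code of the final standard
REG_BINARY = 0            # registration value 0 = general binary data


def pack_quality_tags(codes):
    """Pack 8-bit quality tag values, one per byte (hypothetical layout)."""
    return bytes(codes)   # raises ValueError if a value is outside 0..255


def build_comment_segment(payload: bytes, registration: int = REG_BINARY) -> bytes:
    """Build marker + length + registration + payload, big-endian."""
    # The length covers the length field, the registration field and the data.
    length = 2 + 2 + len(payload)
    if length > 0xFFFF:
        raise ValueError("payload too long for a single marker segment")
    return struct.pack(">HHH", COMMENT_MARKER, length, registration) + payload


segment = build_comment_segment(pack_quality_tags([0, 12, 255]))
print(segment.hex())
```

A dedicated registration value for quality tags, as the text proposes, would replace `REG_BINARY` with whatever value is eventually assigned.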

It should be noted that the above embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design many alternative embodiments without departing from the scope of the appended claims. Any reference signs placed in parentheses in the claims shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps other than those listed in a claim. The invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a device claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.

1. A method of encoding a media object, comprising the steps of: encoding the media object to obtain a bit stream, and adding quality information to the bit stream, wherein the quality information indicates the quality of the media object in relation to a given position in, or a given part of, the bit stream, the quality information is provided in quality tags, each quality tag provides a quality tag value, and the quality tag value characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag.

2. The method according to claim 1, in which the encoding step is a scalable encoding step that produces a scalable bit stream.

3. The method according to claim 1, in which the quality information is based on a signal-to-noise ratio value.

4. The method according to any of the preceding claims, in which the quality tags are added at given locations in the bit stream, and the quality tag value represents the distortion in the reproduced media object when the bit stream is truncated immediately after or, alternatively, immediately before the given location in the bit stream.

5. The method according to claim 1, wherein the media object is encoded according to a scalable coding standard, said scalable coding standard contains a predefined tag, and the quality information is included in said predefined tag.

6. The method according to claim 2, in which the scalable bit stream includes more than one quality level, and quality information is added to each of these quality levels of the bit stream.

7. The method according to claim 1, in which the bit stream is encrypted, and information about quality is not encrypted.

8. A method of controlling at least one bit stream representing a media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the media object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value, and said quality tag value characterizing the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag, the method comprising the steps of: receiving the at least one bit stream; extracting the quality information from the bit stream; transcoding or truncating said at least one bit stream if the current combination of bit rate and quality of said at least one received bit stream deviates from the desired combination of bit rate and quality of said at least one bit stream; and providing the at least one bit stream having the desired combination of bit rate and quality.

9. A method of transmitting at least one media object, comprising the steps of: encoding the media object to obtain a bit stream; adding quality information to the bit stream, wherein the quality information indicates the quality of the media object in relation to a given position in, or a given part of, the bit stream, the quality information is provided in quality tags, each quality tag contains a quality tag value, and said quality tag value characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag; and transmitting the bit stream to which the quality information has been added.

10. A method of receiving at least one bit stream representing a media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the media object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value, and said quality tag value characterizing the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag, the method comprising the steps of: receiving the at least one bit stream; extracting the quality information from the bit stream; transcoding or truncating said at least one bit stream if the current combination of bit rate and quality of said at least one received bit stream deviates from the desired combination of bit rate and quality of said at least one bit stream; providing the at least one bit stream having the desired combination of bit rate and quality; and decoding the at least one bit stream having the desired combination of bit rate and quality.

11. A method of receiving at least one bit stream representing a media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value, and said quality tag value characterizing the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag, the method comprising the steps of: receiving the at least one bit stream; extracting the quality information from the bit stream; decoding the bit stream to obtain a decoded media object; and post-processing the decoded media object to reduce artifacts in dependence on the extracted quality information.

12. A device for encoding a media object, comprising means for encoding the object to obtain a bit stream and means for adding quality information to the bit stream, wherein the quality information indicates the quality of the media object in relation to a given position in, or a given part of, the bit stream, the quality information is provided in quality tags, each quality tag contains a quality tag value, and said quality tag value characterizes the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag.

13. A transmitter comprising the device according to claim 12 and means for transmitting the bit stream to which the quality information has been added.

14. A controller for controlling at least one bit stream representing a media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value, and said quality tag value characterizing the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag, the controller comprising: means for receiving the at least one bit stream; means for extracting the quality information from the at least one bit stream received by the means for receiving; means for truncating the at least one received bit stream if the current combination of bit rate and quality of the at least one received bit stream, as indicated by the quality information extracted by the means for extracting, deviates from the desired combination of bit rate and quality of said at least one received bit stream; and means for providing the at least one bit stream, truncated by the means for truncating and having the desired combination of bit rate and quality.

15. A receiver comprising the controller according to claim 14.

16. A receiver for receiving at least one bit stream representing a media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value, and said quality tag value characterizing the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag, the receiver comprising: means for receiving the at least one bit stream; means for extracting the quality information from the at least one bit stream received by the means for receiving; means for decoding the at least one bit stream received by the means for receiving to obtain a decoded media object; and means for post-processing the media object to reduce artifacts in dependence on the quality information extracted by the means for extracting.

17. A multiplexer comprising the controller according to claim 14.

18. A network node comprising the controller according to claim 14.

19. A signal comprising a bit stream representing an encoded media object, in which quality information has been added to the bit stream, the quality information indicating the quality of the object in relation to a given position in, or a given part of, the bit stream, the quality information being provided in quality tags, each quality tag containing a quality tag value, and said quality tag value characterizing the distortion in the reproduced encoded media object when the bit stream is truncated at the point associated with the quality tag.

Priority by claims:

11.10.2000 - claim 2;

12.10.2000 - claims 1, 3-7, 9-19;

04.10.2001 - claim 8.



 
