The method of use of auxiliary data arrays in the conversion process and/or verification of computer codes in the form of symbols, and the corresponding portions of the image

 

(57) Abstract:

The invention relates to the field of electronics and is designed, for example, to use auxiliary data arrays in the conversion process and/or verification of computer codes in the form of symbols, and the corresponding portions of the image. The technical result is to reduce the error of conversion and/or verification. How is that make the production of semantic units recognizable image fragments containing n components of their elements, where n is chosen within 1n103. In selected samples of produce subject to verification of the totality of the portions of the image containing the n1elements, where n1choose within 1(n1+n)/n2. Search in the auxiliary dataset semantic units that differ from the selected sets of image fragments, with the error selected in the range of 0(n1- 1)/n1where is the experimental factor within 0,61,2 selected depending on the frequency fioccurrence of any of the semantic of the i-th unit in the admissible set of semantic units, which is defined as the number of n2repetition is noreste semantic units. Identify in the recognized semantic units elements that do not match the equivalent at the locations of the symbols in the semantic units found in the search process, and produce their replacement corresponding to location of the symbols of the identified semantic units. Form array for dynamic bitmap standards computer codes of elements comprising the recognized semantic units and with regard to the previous operations transform the auxiliary dataset to reduce the total error3method, which is chosen in relation to the intermediate error1within 1 (1+3)/12.

The invention relates to the field of electronics and can be applied, for example, to use auxiliary data arrays in the conversion process and/or verification of computer codes in the form of symbols, and the corresponding portions of the image.

There is a method of use of auxiliary data in the conversion process computer codes and corresponding portions of the image, including man-made and/or equivalent device, and/or computer program is atent USA N 5153927: Character reading system and method., IPC Oct. 6, 1992.].

There is also known a method of using auxiliary data arrays in the conversion process and/or verification of computer codes and their corresponding originals consisting implemented in a computer program using secondary datasets employed for recognition of their respective originals [user Manual Fine Reader 4.0ABBYY Software House, M. , 1998. Kazan factory software. Order F-377].

A disadvantage of the known methods are relatively low and their functional and technical characteristics, including high values achieved errors of the conversion.

Solved by the invention objective is the improvement of methods of use of auxiliary data arrays in the conversion process and/or verification of computer codes in the form of symbols, and the corresponding portions of the image with the achievement of the technical result in the decrease of the error of conversion and/or verification.

For convenience and unambiguous understanding would be appropriate interpretation and definitions hereinafter thereof, is a set of computer codes, the corresponding source object, for example raspoznavaniya picture.

Computer code (e.g., symbol) - computer representation of some piece of information (in particular, character).

The recognition process is the processing of the recognition system introduced in the computer graphics image of a certain character, resulting in the recognition system assigns the image of a computer code for that character.

The verification process is made by the person and/or equivalent device, and/or computer program comparison (adequacy) computer codes (symbols) with a graphical image that is entered into the computer.

An admissible set of semantic units includes the totality likely to recognize sets of semantic units.

A semantic unit is a set of computer codes, appropriate based on any practical use of the image, for example a letter, word, symbol, graphic, logical operations, together etc.

Auxiliary dataset is randomly generated set e">

Error of conformity between the original semantic units and their corresponding semantic units of volume n1additional data is determined as valid, the number of n1unmatched in their elements, corresponding to the n1: = n1/n1.

Frequency fioccurrence of any of the semantic of the i-th unit in the admissible set of semantic units defined as the number of n2repetitions of a particular semantic unit, correlated with the total number of semantic units in the admissible set of semantic units: f1= n2i/n3.

Error1auxiliary data array in relation to a valid set of semantic units is defined as the probability of not finding in the data array element njcorrelated with the total number of semantic units n4in the auxiliary data array.

Error2conversion is defined as the number of n5mistakenly converted items correlated with the total number of n6elements in the transformed set of semantic elements from the allowable set:2= n5/n6.

Error31,2.

As a quick information disclosing the invention, it should be noted that the technical result achieved provide using the proposed method using auxiliary data arrays in the conversion process and/or verification of computer codes in the form of symbols, and the corresponding portions of the image, namely, that produce a sample semantic units recognizable image fragments containing the n1their constituent elements, where n1- choose between 1 n 103. In selected samples of produce subject to verification of the totality of the portions of the image containing the n1elements, where n1choose within 1 (n1+n)/n 2. Search in the auxiliary dataset semantic units that differ from the selected sets of image fragments, with the error selected in the range of 0 (n1-1)/n1. Here is an experimental coefficient in the range of 0.6 to 1.2 selected depending on the frequency fioccurrence of any of the semantic of the i-th unit artisan units, correlated with the total number of n3semantic units in the admissible set of semantic units.

Identify in the recognized semantic units elements that do not match the equivalent at the locations of the symbols in the semantic units found in the search process, and produce their replacement corresponding to location of the symbols of the identified semantic units. Form array for dynamic bitmap standards computer codes of elements comprising the recognized semantic units by the number of n7, the value of which is chosen in the range 1 (n2+ n5+ n6+ n7+ n3)/n36,3. Here is an experimental coefficient in the range of 0.4 to 1.3 selected depending on specified error1auxiliary data array in relation to a valid set of semantic units, defined as the probability of not finding in the data array element njin the total number of semantic units n4in the auxiliary array data, and errors2conversion, defined as the number of n5mistakenly converted items correlated with the total number of n6elements in preobbect auxiliary data array to reduce the total error3method, which is chosen in relation to the error1within 1 (1+3)/12.

When presenting information, confirming the possibility of carrying out the invention it is expedient in more detail to describe the proposed method of use of auxiliary data arrays in the conversion process and/or verification of computer codes and the corresponding portions of the image. In detail it is appropriate to focus only on the essential features of the operations of the proposed method lies in the fact that produce a sample semantic units recognizable image fragments containing n components of their elements, where n is chosen in the range 1 n 103. Semantic units can be in an arbitrary case letters, mathematical and other symbols, words, sentence, graphic elements, and any combination of them. In selected samples of produce subject to verification of the totality of the portions of the image containing the n1elements, where n1choose within 1 (n1+n)/n 2. Search in the auxiliary dataset semantic units that differ from the selected sets of image fragments is 0,6 1,2, choose depending on frequency fioccurrence of any of the semantic of the i-th unit in the admissible set of semantic units, which is defined as the number of n2repetitions of a particular semantic unit, correlated with the total number of n3semantic units in the admissible set of semantic units. Fragments can be as a semantic unit as a whole and parts thereof, oriented, for example, on a stand-alone application. Error in conversion of mainly related to the quality of the original graphics, which is determined, in particular, those that have to recognize, for example, is made for Photocopying apparatus of the image, faxogram, typewritten or handwritten text.

Identify in the recognized semantic units elements that do not match the equivalent at the locations of the symbols in the semantic units found in the search process, and produce their replacement corresponding to location of the symbols of the identified semantic units. Form array for dynamic bitmap standards computer codes of elements comprising the recognized semantic units by the number of n7which size oefficient within 0,4 1,3, choose depending on specified error1auxiliary data array in relation to a valid set of semantic units, defined as the probability of not finding in the data array element njin the total number of semantic units n4in the auxiliary array data, and errors2conversion, defined as the number of n5mistakenly converted items correlated with the total number of n6elements in the transformed set of semantic elements from the valid set.

The process of building a dynamic bitmap standards can be identified as man-made and/or equivalent device, and/or computer program build additional data used to speed up the recognition process. Dynamic raster pattern is an optional array of data generated and used to speed up the recognition process. The concept of "dynamic" reflects the changing nature of the established standards, that is, means that in the process of using the proposed method constantly changing set of built templates by replenishing it with new standards, vidoe in the form of a set of elements, forming, for example, a periodic structure. To create a standard means for each found in the text symbol to store the raster subsystem couple: bitmap image of the symbol and its name (i.e., what letter is this image represents).

Then with regard to the previous operations transform the auxiliary dataset to reduce the total error3method, which is chosen in relation to the error1within 1 (1+3)/12. In practice, it is possible to use a separate logically complete sets of operations of the proposed method. If the allocation in accordance with the analytical ratios required values receive fractional, negative values and any other values that are incorrect based on the conditions of possibility of their future use, they are excluded from consideration and/or automatically removed.

As an example of practical implementation of the claimed method using auxiliary data arrays in the conversion process and/or verification of computer codes and the corresponding portions of the image, it is advisable to give the following, the person produced a sample semantic units recognized by the originals, containing n components of their elements, where n is chosen in the range 1 n 20. In selected samples of produce subject to verification of the totality of their fragments containing the n1elements, where n1choose from a condition of 1.8 (n1+n)/n 2. Search in the auxiliary dataset semantic units with an error different from the selected sets of fragments selected in the range of 0.1 at = 0,9 fi= 0.01 and 0.1. Identify in the recognized semantic units elements that do not match the equivalent at the locations of the symbols in the semantic units found in the search process, and produce their replacement corresponding to location of the symbols of the identified semantic units. Form array for dynamic bitmap standards computer codes of elements comprising the recognized semantic units by the number of n7, the value of which in relation to the total number of n3semantic units in the admissible set of semantic units chosen from the condition of n7/n3= 0.9 at = 1,1,1= 0.05 and 2= 0,05, neglecting in a particular case, the effect of n2n5and n6the value of n7. Convert the result of the subsidiary is B>)/1= 1,2.

Computer code in the claimed object, as already mentioned, is converted by a computer a set of electromagnetic signals, adequately relevant recognizable original characters, or any other recognizable fragments of the original information. Each of the standards of population dynamic bitmap patterns, forming a periodic structure that represents, for example, an ordered set of electromagnetic signals or appropriate relief magnetized portions of your hard disk. The dynamic properties of the raster standards determine the timing of their transformation.

In respect of technical means necessary for implementing the inventive method, it is advisable in addition to the above, be noted that they can be as specialized functional units and functional units of computer, managed a system-defined commands. In particular, some operations are carried out by the math coprocessor CPU system unit of a computer running specialized for operations with arrays of data and statistical computing functional software blocks (Biblioteca or in random access memory (RAM), or on a disk of the computer and are controlled by the system libraries of commands of the operating environment. Under the replacement person device refers to any device that may be necessary for implementing the method level to perform operations that were previously performed or that a person can do. In practice, the technical means implementing the method of constructing dynamic bitmap standards of computer codes in the recognition process corresponding originals can be, in particular, the system consisting of a scanner, computer, loaded into memory by the program scan, the program Fine Reader, subsystem synchronization of computer codes, as well as monitor or a printing device and a manipulator to control and process control. The criterion of industrial applicability of the proposed method is also proved by the absence of the stated claims of any practically difficult-to-implement features and known means for their implementation.

Specified in the claims of the differences, as already mentioned, give reason to conclude that the novelty of the proposed technical solution, but a set of requirements which can be found by the method. A practical method of achieving the above technical result interrelated set of essential features and characteristics, as reflected in the claims. Features use of the method and other objects that were not reflected in the description, well-known and are not the subject of the invention.

In addition to the above technical result, the practical realization of the declared object can significantly extend its use in, for example, to various documents, fillable handwritten characters, or documents of poor quality.

The method of use of auxiliary data arrays in the conversion process and/or verification of computer codes in the form of symbols, and the corresponding portions of the image, namely, that produce a sample semantic units recognizable image fragments containing n components of their elements, where n is chosen in the range 1 n 103in selected samples of produce subject to verification of the totality of the portions of the image containing the n1elements, where n1choose within 1 (n1+ n)/n 2 implementation of the image fragments with the uncertainty vybiraem within 0 (n1-1)/n1where is the experimental ratio in the range of 0.6 to 1.2 selected depending on the part of fi occurrence of any of the semantic of the i-th unit in the admissible set of semantic units, which is defined as the number of n2repetitions of a particular semantic unit, correlated with the total number of n3semantic units in the admissible set of semantic units, identified in the recognized semantic units elements that do not match the equivalent at the locations of the symbols in the semantic units found in the search process, and produce their replacement corresponding to location of the symbols of the identified semantic units, create additional dynamic bitmap array of standards computer codes of elements comprising the recognized semantic units by the number of n7, the value of which is chosen in the range 1(n2+ n5+ n6+ n7+ n3)/ n36,3, where experimental ratio in the range 0.4 to 1.3 selected depending on specified error1auxiliary data array in relation to a valid set of semantic units, defined as the probability of the tive is assive data and errors 2conversion, defined as the number of n5mistakenly converted items correlated with the total number of n6elements in the transformed set of semantic elements from the valid set and convert auxiliary array data to reduce error3method, which is chosen in relation to the error1within 1 (1+3)/12.

 

Same patents:
The invention relates to the field of data communications, and in particular to devices for translating information from one language to another, and may be applicable in various sectors of the national economy, in particular in the manufacture of products of the printing industry dictionaries

The invention relates to a system for translating phrases from a first language to a second language, in particular, but not exclusively, to such a system, which generates speech output in the second language of the speech input in the first language

The invention relates to computing

The invention relates to a computer system of creation and translation of documents, to prepare the text in the language limitations and translation into a foreign language
The invention relates to the field of electronics and can be used, for example, in the way of interrelated activation computer code in the form of symbols and corresponding portions of the image
The invention relates to the field of electronics and can be used, for example, in the way of interrelated activation computer code in the form of symbols and corresponding portions of the image
The invention relates to the field of electronics and can be used, for example, in the way of interrelated activation computer code in the form of symbols and corresponding portions of the image

The invention relates to the field of computer engineering and can be used to control the addressing e-mail messages when a subscriber in an open computer network with the ability to control on a formal or natural language

How ponasterone // 2140103
The invention relates to methods of reception of voice messages and can be used when fontanarrosa

The invention relates to means for Informatics and computer engineering and can be used to solve problems in symbolic processing using production systems programming

FIELD: the invention refers to the system of remote training.

SUBSTANCE: the system has an arrangement for providing training in rendering training services through a net; an arrangement for transmitting texts connected with training aids, an arrangement for evaluation of reception of the answer through a net; an arrangement for transmitting of evaluation of transmitting the result of evaluation to a user; a database about members supporting training; an arrangement for selection of supporting members for reception of inquiry about support from the user through a net and for selection of a member for training in required field of specialization; an intermediary arrangement for connection for fulfillment of the role of the mediator at connecting the contact address of the selected member supporting training and the user through a net.

EFFECT: allows to provide services in training with dynamically changing training changes depending from the evaluation of the degree of perception in remote system with corresponding support.

6 cl, 9 dwg

FIELD: computer science, in particular, engineering of automated system for distributed processing of text documents.

SUBSTANCE: system contains block for receiving text documents, blocks for identification of base address of text documents, block for selection of structure of text document, block for modifying record address for text document, block for selecting sections of text documents, block for addressing sections of text documents, block for modifying record address of text document, block for selecting sections of text documents, block fro addressing sections of text documents, block for modifying reading address of text document section, block for receiving text documents of executives, block for identification of base address of documenting of sections of text documents, block for recording number of completed tasks, block for modification of address of record of completed tasks, block for commutation of channels for dispensing text documents and block for dispensing data and control signals.

EFFECT: increased speed of operation of system by means of localization of addresses of text documents in system database by identifiers of the very text documents.

13 dwg

FIELD: physics, computer facilities.

SUBSTANCE: invention concerns computer facilities. The system of transformation of the files, having at least one file, associated with one or more non-structured properties is given. The output agent of properties of a file manipulates with non-structured properties according to one or several structured properties, associated with medium of storehouse of the structured objects. If not structured file be used in a context of medium of storehouse of the structured objects, unfolding operation is carried out for updating of not structured properties in a file in the structured properties approaching for operation in the environment of storehouse of structured objects. If concerning the developed device the manipulation in the environment of storehouse of the structured objects be executed, operation of compression or an inverse transformation is carried out for updating of properties in the file.

EFFECT: interaction and compatibility possibility between non-compatible data systems.

26 cl, 9 dwg

FIELD: physics, computer engineering.

SUBSTANCE: invention is related to processing of electronic ink. Method of the first data structure matching with the second data structure consists in the following: for every unit of the second data structure it is defined whether this unit received change from appropriate unit in the first data structure; for every unit in the second data structure, for which it has been defined that it received change from appropriate unit in the first data structure, attempt of access is realised to this unit in the first data structure; if mentioned unit in the first data structure is unachievable, realisation of mentioned change is prevented in the second data structure; if it is achievable - it is defined, when mentioned change in relation to the second data structure creates optional collision, and sometimes obligatory collision; if change creates optional collision, it is defined whether it is prohibited by collision criteria; if optional collision is not prohibited, mentioned change is performed; if it is prohibited - realisation of mentioned change is prevented, at that mentioned collision criteria prohibit removal of ink strokes from end unit under fixed unit.

EFFECT: expansion of method functional resources.

12 cl, 49 dwg

FIELD: physics; computer facilities.

SUBSTANCE: offered invention concerns ways and systems for transformation of object of one type in object of other type. Transformation can be carried out in an augmented agent of serialisation which carries out serialisation, deserialisation and transformation of objects of various types. Changes during performance are imported to operation of an agent of serialisation by means of one or more procedures of expansion which realise required configuring for specific needs or expansion, thus not demanding replacements of other available procedures. On the basis of the information on the type, identified for initial object, object will converse to the intermediate representation which supposes change during performance, including change of names of object, types of object and object data. The intermediate representation of initial object change according to procedures of expansion which make changes to operation of a resort of serialisation during performance, and the intermediate representation will converse to target object or type.

EFFECT: possibility of change or configuring for specific needs of operation of transformation process to performance time.

35 cl, 7 dwg

FIELD: information technologies.

SUBSTANCE: invention is related to facilities of training and research automation and may be used in interactive systems of research and development works automation in process of software (SW) verification in distributed computer complexes (DCC). Suggested method and device for its realisation provide complete manageability and observability of the main processes of SW initial code verification. At the same time processes of SW initial code input and processing are combined along dependent or independent interface channels, on the basis of application of sensor or mechanical manipulators of computer operator workplace, user interfaces of local or global network. Sections or points of SW initial code vulnerability are defined on the basis of SW initial code transformation into internal representation, which is stored in the form of databases and knowledge bases, and sections or points of SW initial code vulnerability are defined on the basis of automatic making and solving of according equation systems.

EFFECT: expansion of functional resources of DCC SW verification processes.

9 cl, 39 dwg, 26 tbl

FIELD: information technologies.

SUBSTANCE: invention is related to facilities of training and science research automation and may be used in interactive systems of research and development works automation in process of software (SW) verification in distributed computer complexes (DCC). Suggested method and system for its realisation provide complete manageability and observability of the main processes of SW initial code verification. At the same time at each level of DCC processes of SW initial code input and processing are combined along dependent or independent interface channels, on the basis of application of sensor or mechanical manipulators of computer operator workplace, network interfaces of local or global network Sections or points of SW initial code vulnerability are defined on the basis of SW initial code transformation into internal representation, which is stored in the form of databases and knowledge bases, and sections or points of SW initial code vulnerability are defined on the basis of automatic making and solving of according equation systems.

EFFECT: expansion of functional resources of DCC SW verification processes

9 cl, 40 dwg, 26 tbl

FIELD: information technologies.

SUBSTANCE: method includes receiving information entered in natural language and analysis of information entered in natural language to identify contained in it semantic information. For part of information entered in natural language, correspondence with "command" objects and "object" objects of scheme based on semantic information and entered in natural language information. The method also contains representation of data from data source in a table of columns and rows on the basis of scheme and corresponding parts of information which has been entered in natural language.

EFFECT: providing more effective interface for creation and representation of table with information from data source.

35 cl, 5 dwg

FIELD: information technologies.

SUBSTANCE: in invention it is automatically detected, which is the category of printed document, and unauthorised printing is prevented. In method printed document is analysed for availability of confidential information, system comprises user device, printing device, server of printing control service, converter unit, server of databases, file storage, unit of recognition, server of context analysis and alarm service.

EFFECT: provision of information safety, detection of document flows containing confidential information and requiring high extent of control.

2 cl

FIELD: information technology.

SUBSTANCE: method provides a preliminary presentation which automatically shows the intended outcome of applying one or another control to data. This is preferred when analysing electronic worksheet data by formatting certain data based on the control condition. The method involves identification of one or more data parametres subject to formatting based on the condition on display, selection of a predefined condition and automatic temporary application of that predefined condition to parametre(s), display of the temporary preliminary presentation on the display of the said predefined condition applied to data which correspond to the said predefined condition. The method also enables preliminary change of conditions and parametres applied to data, and automatically provide corresponding preliminary presentation of the effect of such application of the altered conditions with respect to displayed data.

EFFECT: faster formatting of displayed data.

27 cl, 28 dwg

Up!