System for identifying the readiness of text documents in a distributed data processing network

FIELD: computer science, in particular a system for identifying the readiness of text documents in a distributed data processing network.

SUBSTANCE: the system contains a block for receiving sections of text documents, a block for selecting the base addresses of text documents, a block for selecting the structure of text documents, a block for forming database write and read signals, a block for gating sections of text documents, a block for addressing text documents, a block for receiving sections of text documents from the server database, a block for switching the channels for issuing sections of text documents, a block for counting the number of finished sections of text documents, a comparator, and a counter.

EFFECT: increased operating speed of the system.

8 dwg

 

The invention relates to the field of computer technology, in particular to a system for identifying the readiness of text documents in a distributed data processing network.

The technical problem is that, in a network where the sections of a text document are processed by different performers, the moment at which all of its sections are ready for further processing must be recorded.

Similar problems arise, for example, in newspaper editorial offices, where different sections of the same issue are distributed among different specialists and must then all be submitted together for general editing at the automated workstation of the issue's chief editor.

Systems are known that could be used to solve this problem (1, 2).

The first of the known systems contains blocks for receiving and storing data, connected to control blocks and data blocks, and search and selection blocks connected to the data storage and display blocks, the clock inputs of which are connected to the outputs of the control block (1).

A significant disadvantage of this system is that it cannot solve the problem of updating the data stored in memory as relevant documents simultaneously with the task of issuing the contents of these documents to users in real time.

Another known system contains data blocks, the information inputs of which are connected to the data receiving blocks and the control outputs of which are connected to the first group of memory blocks, and a central processor, the inputs of which are connected to the outputs of the memory blocks of the first group and the data block, and the outputs of which are connected to the inputs of the memory blocks of the second group and the data display block (2).

The latter technical solution is the closest to the one described.

Its disadvantage is low system performance, because determining whether the sections of text documents assigned to various performers are ready is implemented by searching the entire database for the status data of those sections.

For large volumes of text documents, such an organization of data processing inevitably leads to unreasonably high time costs.

The purpose of the invention is to improve system performance by identifying the readiness of a text document at the moment its last finished section is written to the system database.

This objective is achieved in that the known system, which contains a block for receiving sections of text documents, the information and clock inputs of which are the first information and clock inputs of the system; a block for addressing text documents, the information and clock inputs of which are the second information and clock inputs of the system; a block for receiving sections of text documents from the server database, the information and clock inputs of which are the third information and clock inputs of the system; a block for switching the channels for issuing sections of text documents, one information input of which is connected to the output of the block for receiving sections of text documents from the server database, the other information input of which is connected to the output of the block for addressing text documents, the clock input of which is connected to the third clock input of the system, and the information outputs of which are the group information outputs of the system; and a block for forming database write and read signals, the address output of which is the address output of the system and the first and second clock outputs of which are the first and second clock outputs of the system, is supplemented with the following: a block for selecting the base address of text documents, the information input of which is connected to the first output of the block for receiving sections of text documents, the clock input of which is connected to the first clock input of the system, and the information and clock outputs of which are connected to one information input and one clock input of the block for forming database write and read signals; a block for selecting the structure of text documents, one information input of which is connected to the second information output of the block for receiving sections of text documents, and the information and clock outputs of which are connected to the other information and clock inputs of the block for forming database write and read signals, while the third information output of the block for receiving sections of text documents is the information output of the system; a block for gating sections of text documents, the clock input of which is connected to the second clock output of the block for forming database write and read signals, and the group control inputs of which are connected to the corresponding group control outputs of the block for selecting the base address of text documents; a block for counting the number of finished sections of text documents, the control inputs of which are connected to the corresponding group control outputs of the block for gating sections of text documents, and the counting inputs of which are connected to the corresponding counting outputs of the block for gating sections of text documents; a block for specifying the structure of text documents, the control inputs of which are connected to the corresponding group control outputs of the block for gating sections of text documents; a first comparator, one information input of which is connected to the output of the block for counting the number of finished sections of text documents, the other information input of which is connected to the output of the block for specifying the structure of text documents, and the clock input of which is connected to the synchronizing output of the block for gating sections of text documents, one output of the first comparator being connected to the first clock input of the block for selecting the structure of text documents, and the other output being connected to the setting inputs of the blocks for receiving sections of text documents, selecting the base address of text documents, gating sections of text documents, and forming database write and read signals; a counter, the counting input of which is connected to the first synchronizing output of the block for switching the channels for issuing sections of text documents, and the output of which is connected to the other information input of the block for selecting the structure of text documents; and a second comparator, one information input of which is connected to the output of the block for specifying the structure of text documents, the other information input of which is connected to the output of the counter, and the clock input of which is connected to the second output of the block for switching the channels for issuing sections of text documents, one output of the second comparator being connected to the second synchronizing input of the block for selecting the structure of text documents, and the other output being connected to the setting inputs of the blocks for addressing text documents, receiving sections of text documents from the server database, selecting the structure of text documents, and forming database write and read signals, and of the counter.

The invention is illustrated by drawings, where figure 1 shows the block diagram of the system; figure 2, the block diagram of the block for selecting the base address of text documents; figure 3, the block diagram of the block for selecting the structure of text documents; figure 4, the block diagram of the block for forming database write and read signals; figure 5, the block diagram of the block for gating sections of text documents; figure 6, the block diagram of the block for counting the number of finished sections of text documents; figure 7, the block diagram of the block for specifying the structure of text documents; and figure 8, the block diagram of the block for switching the channels for issuing sections of text documents.

The system (figure 1) contains block 1 for receiving sections of text documents, block 2 for selecting the base address of text documents, block 3 for selecting the structure of text documents, block 4 for forming database write and read signals, block 5 for gating sections of text documents, block 6 for counting the number of finished sections of text documents, block 7 for specifying the structure of text documents, the first 8 and second 9 comparators, block 10 for addressing text documents, block 11 for receiving sections of text documents from the server database, block 12 for switching the channels for issuing sections of text documents, and counter 13.

Figure 1 shows the first 20, second 21, and third 22 information inputs of the system, the first 23, second 24, and third 25 clock inputs of the system, the information 26 and address 27 outputs of the system, the first 28 and second 29 clock outputs of the system, and the group 30-32 information outputs of the system.

Block 1 for receiving sections of text documents is implemented as a register with information 20, clock 23, and setting 161 inputs, as well as first 162, second 163, and third 164 information outputs.

Block 2 (figure 2) for selecting the base address of text documents contains register 33, AND elements 34-36, decoder 37, memory block 38 implemented as a read-only memory, and delay elements 39-1, 39-2, 39-3. Figure 2 shows the information 41, clock 42, and setting 43 inputs, as well as the information 44 and clock 45 outputs and the group control outputs 46-48.

Block 3 (figure 3) for selecting the structure of text documents contains trigger 51, groups 52, 53 of AND elements, group 54 of OR elements, OR element 55, and delay element 56. Figure 3 shows the information 58, 59, clock 60, 62, and setting 61 inputs, as well as the information 63 and clock 64 outputs.

Block 4 (figure 4) for forming database write and read signals contains adder 70, OR elements 71, 72, and delay elements 73, 74. Figure 4 shows the information inputs 76, 77, the clock inputs 78, 79, and the setting inputs 80, 81, as well as the address output 27 and the first 28 and second 29 clock outputs.

Block 5 (figure 5) for gating sections of text documents contains triggers 83-85, AND elements 86-88, and delay element 89. Figure 5 shows the counting 90, setting 91, and control 92-94 inputs, the clock output 95, the group control outputs 96-98, and the group counting outputs 99-101.

Block 6 (figure 6) for counting the number of finished sections of text documents contains counters 105-107, groups 108-110 of AND elements, and group 111 of OR elements. Figure 6 shows the group control inputs 112-114, the group counting inputs 115-117, and the information output 118.

Block 7 (figure 7) for specifying the structure of text documents contains registers 121-123, groups 124-126 of AND elements, and group 127 of OR elements. Figure 7 shows the group control inputs 128-130 and the information output 131.

Block 10 for addressing text documents is implemented as a register with information 21, clock 24, and setting inputs and an information output.

Block 11 for receiving sections of text documents from the server database is implemented as a register with information 22 and clock 25 inputs and an information output.

Block 12 (figure 8) for switching the channels for issuing sections of text documents contains decoder 150, groups 151-153 of AND elements, and delay elements 154-155. Figure 8 shows the information inputs 156-157, the clock input 158, the first 159 and second 160 clock outputs, and the group 30-32 information outputs.

All nodes and elements of the system are implemented with standard potential switching elements.

The system works as follows.

While a job of processing sections of text documents is carried out in a distributed network, each performer working at a remote workstation, on finishing the job, forms a codogram that specifies the identifier of the text document whose section was processed, the number of that section, and its content.

The structure of this codogram has the following form:

Code             | Code           | Code
Text document ID | Section number | Section content
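
As an illustration only, the three-field codogram described above can be modeled in software as a small record. This is a minimal sketch, not the hardware register of block 1: the class name, field names, field types, and the sample values are assumptions introduced for the example.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class SectionCodogram:
    """Hypothetical software model of the performer's three-field codogram."""
    document_id: int     # identifier of the text document
    section_number: int  # number of the processed section
    content: str         # content of the processed section


def split_codogram(msg: SectionCodogram) -> Tuple[int, int, str]:
    """Return the three fields separately, loosely mirroring the three
    information outputs of the receiving register."""
    return msg.document_id, msg.section_number, msg.content


# Sample values are invented for the illustration.
msg = SectionCodogram(document_id=3, section_number=2, content="Sports page draft")
doc_id, section_no, body = split_codogram(msg)
```

The record is frozen because a received codogram is only read by the downstream blocks and never modified.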

This codogram is supplied from input 20 of the system to the information input of the register of block 1, where it is written by the synchronizing pulse from input 23 of the system.

From output 161 of block 1, the text document identifier is supplied to information input 41 of block 2 and from there to the input of decoder 37.

Decoder 37 decodes the text document identifier and prepares the path for the signal from input 42 by opening one of AND elements 34-36. For definiteness, suppose that a high potential is applied to one input of AND element 36.

In parallel, the clock pulse from input 23 of the system is fed to input 42 of block 2, is delayed by element 39-1 for the time needed to write the codogram into block 1 and for decoder 37 to operate, and then arrives at the inputs of AND elements 34-36.

Since only AND element 36 is open on one input, the clock pulse passes through it to the read input of a fixed cell of read-only memory 38, which stores the base address code of the server database corresponding to the text document identifier.

The base address code of the server database is read to the information input of register 33, into which it is written by the same clock pulse, delayed by element 39-2 for the time needed to read the code from memory block 38.

The base address from output 44 of block 2 is fed to input 77 of block 4, and the clock pulse from the output of delay element 39-2 is delayed again by element 39-3 for the time needed to write the base address code into register 33 and is supplied through output 45 of block 2 to clock input 78 of block 4.

In parallel, the code of the number of the finished section of the text document is fed from output 162 of block 1 to input 58 of block 3. Since in the initial state trigger 51 supplies a high potential at its inverted output, the AND elements of group 53 are open. The section number code therefore passes through the AND elements of group 53, then through the OR elements of group 54, and from output 63 of block 3 is supplied to input 76 of block 4 and on to one input of adder 70, the other input of which receives, through input 77 of block 4, the base address code of the text document from output 44 of block 2.

The clock signal from input 78 of block 4 passes through OR element 71 to the clock input of adder 70, which adds the number of the section processed by the performer to the base address of the text document and generates at output 27 of the system the server database address at which the finished section of the text document is to be written.
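
The address formation just described (a fixed lookup of the base address by document identifier, followed by addition of the section number) can be sketched in software as follows. This is an illustrative model only: the table contents and the function name are assumptions, and a dictionary stands in for the decoder and read-only memory.

```python
# Hypothetical contents of the read-only memory: document ID -> base address.
BASE_ADDRESS_ROM = {1: 1000, 2: 2000, 3: 3000}


def section_write_address(document_id: int, section_number: int) -> int:
    """Compute the database address for a finished section.

    The dictionary lookup plays the role of the decoder plus ROM,
    and the addition plays the role of the adder.
    """
    base_address = BASE_ADDRESS_ROM[document_id]
    return base_address + section_number


address = section_write_address(document_id=3, section_number=2)
```

Because the address is computed directly from the identifier and the section number, no search over the database is needed to locate where a section belongs.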

In addition, the same clock pulse is delayed by delay element 73 for the operating time of the adder and appears at output 29 of the system as the clock pulse for writing the received section of the text document. From output 29 of the system, this pulse is fed to the input of the first interrupt channel of the database server.

On this signal, the server enters the routine for writing the content of the text document section, which is supplied from output 163 of block 1 to output 26 of the system, at the address generated at output 27 of the system.

In addition, in parallel with the writing of the section of the text document into the server database, the clock pulse from output 29 of block 4 is fed to input 90 of block 5. By this time the clock pulse from the output of AND element 36 of block 2 has been supplied through output 48 of block 2 to input 94 of block 5 and on to the set input of trigger 85, placing it in the one state, in which a high potential at the direct output of trigger 85 opens AND element 88.

The clock pulse from input 90 of block 5 passes through AND element 88 to output 101 of block 5 and is then fed to counting input 117 of block 6, from where it reaches the counting input of counter 107, which records that one more finished section of the text document has been written to the server database.

In addition, since the high potential from trigger 85 at output 98 is also supplied through input 114 of block 6 to one input of the AND elements of group 110 and opens them, the code of counter 107 passes through the AND elements of group 110 and the OR elements of group 111 to output 118 of block 6, from where it is supplied to input 133 of comparator 8.

The high potential of trigger 85 is also supplied through output 98 of block 5 to input 130 of block 7, where it opens one input of the AND elements of group 126, thereby connecting the output of register 123 through the AND elements of group 126 and the OR elements of group 127 to output 131 of block 7 and on to input 134 of comparator 8.

Registers 121-123 are loaded in advance with the number of sections in each text document processed in the distributed system. The appropriate register is selected by the text document identifier in block 2, which controls the corresponding trigger in block 5.

When the clock pulse arrives at input 135, comparator 8 compares the codes at its inputs. Since the content of counter 107 is less than the number of sections at input 134, a pulse is formed at output 137 of comparator 8 and supplied to the setting inputs of blocks 1, 2, 4, 5, returning them to the initial state to await the next finished section of the text document.
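
The counting and comparison step can be sketched in software as a per-document counter checked against a preloaded section count. This is a minimal sketch under stated assumptions: the class and method names are hypothetical, and treating equality of the two values as the readiness event is an interpretation of the stated purpose (readiness is identified when the last finished section is written), since the passage above spells out only the "less than" case.

```python
from typing import Dict


class ReadinessTracker:
    """Hypothetical software analogue of the counting block, the
    structure registers, and the first comparator."""

    def __init__(self, sections_per_document: Dict[int, int]) -> None:
        # Preloaded section counts (the role of the structure registers).
        self._expected = dict(sections_per_document)
        # One counter of written sections per document.
        self._written = {doc_id: 0 for doc_id in sections_per_document}

    def section_written(self, document_id: int) -> bool:
        """Record one written section; return True once the count of
        written sections equals the preloaded number of sections."""
        self._written[document_id] += 1
        return self._written[document_id] == self._expected[document_id]


tracker = ReadinessTracker({3: 3})
results = [tracker.section_written(3) for _ in range(3)]
```

The check runs once per write and touches only one counter, which is the point of the design: readiness is detected at write time rather than by scanning the database.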

In addition, in parallel with writing the content of the first section of the text document into the server database, the clock pulse from the output of delay element 93 of block 4 is also fed to the counting input of counter 92, increasing its reading by one, and from output 98 of block 4 it is again fed to input 66 of block 2, where it passes through OR element 61, is delayed by element 63 for the time needed to write the content of the first section of the text document into the database memory, and is then supplied from output 69 of block 2 back to input 95 of block 4.

The described process of writing sections of the text document into the server database continues until all sections of the text document have been stored in the memory of the database server. This moment is detected by comparator 91 of block 4, whose signal is sent to output 99 and to the setting input of counter 92, returning it to its initial state.

In addition, the signal from output 99 of block 4 marking the end of writing the sections of the text document into the server database is supplied to setting input 67 of block 2, returning register 57 to the initial state, and to setting input 45 of block 1, where it first resets register 41 and second returns trigger 42 to the initial state, thereby removing the blocking signal from output 52 of block 1.

Thus, each text document is entered into the database server, where it will be located as follows:

Cell address in the server database           | Cell contents
Cell at the base address of the text document | Entire codogram of the text document
Cell at base address + 1                      | First section of the text document
Cell at base address + 2                      | Second section of the text document
...                                           | ...
Cell at base address + N                      | N-th section of the text document
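
The layout in the table above can be mirrored by a small in-memory model. This is a sketch only: a dictionary keyed by integer address stands in for the server database, and the function names and sample values are assumptions introduced for the illustration.

```python
from typing import Dict, List


def store_document(db: Dict[int, str], base: int, codogram: str,
                   sections: List[str]) -> None:
    """Place the document codogram at the base address and
    section n at base + n, following the table layout."""
    db[base] = codogram
    for n, section in enumerate(sections, start=1):
        db[base + n] = section


def read_section(db: Dict[int, str], base: int, n: int) -> str:
    """Fetch the n-th section directly by its computed address."""
    return db[base + n]


database: Dict[int, str] = {}
store_document(database, 1000, "document 3 codogram", ["front page", "sports"])
second = read_section(database, 1000, 2)
```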

The head of distributed processing of text documents, for example the chief editor, reviews the sections of text documents at an automated workstation (AWS) and appoints a performer to process each of them. To this end, the chief editor sequentially generates codograms of the following form:

Code             | Code                                 | Code
Text document ID | Section number of the text document | Performer (AWS) ID

Each of these codograms is fed from the output of the chief editor's AWS to input 24 of the system, and the clock signal from the AWS output is fed to input 29 of the system.

The clock signal from input 29 is fed through input 83 of block 3 to the input of AND element 74, which is opened, only after the text documents have been fully written to the server database, by the high potential from inverted output 52 of trigger 42. After passing AND element 74, the clock pulse is supplied from output 85 of block 3 to clock input 87 of block 7 and writes the input codogram into it.

The text document identifier code from output 88 of block 7 is supplied through input 81 of block 3 to the input of decoder 72. Decoder 72 decodes the text document identifier and prepares the path for the signal from the output of delay element 78 by opening one of AND elements 75-77. For definiteness, suppose that a high potential is applied to one input of AND element 75.

In parallel, the clock pulse from the output of AND element 74 is delayed by element 78 for the time needed to write the codogram into block 7 and for decoder 72 to operate, and then polls the state of AND elements 75-77.

Since only AND element 75 is open on one input, the clock pulse passes through it to the read input of a fixed cell of read-only memory 71, which stores the base address code of the server database corresponding to the text document identifier.

The base address code of the server database is read to the information input of register 73, into which it is written by the same clock pulse, delayed by element 79 for the time needed to read the code from memory block 71.

The base address from output 84 of block 3 is fed to input 117 of block 8, and the clock pulse from the output of delay element 79 is delayed again by element 80 for the time needed to write the base address code into register 73 and is supplied through output 86 of block 3 to clock input 119 of block 8, to whose other information input 118 the section number code of the text document is fed from output 89 of block 7.

Block 8 adds the base address to the section number of the text document and supplies the resulting address from output 120 to input 186 of block 15, where the final read address code passes through the OR elements of group 180 to address output 31 of the system.

In addition, the clock pulse from output 86 of block 3 is fed to input 192 of block 15, where it is delayed for the time needed to form the read address in block 8, and through output 34 of the system is fed to the input of the second interrupt channel of the database server.

On this signal, the server enters the subroutine for reading the contents of the section of the text document whose address is formed at output 31 of the system.

In parallel with this process, the performer identifier code from output 90 of block 7 is fed to input 176 of block 14 and on to the input of decoder 171, which decodes the performer identifier code and opens the corresponding group of AND elements 172-174, for example group 172, thereby connecting input 177 of block 14 to output 35.

As a result of reading the specified section of the text document, its contents are transferred to input 22 of block 9, into which they are written by the server's synchronizing pulse arriving at input 27 of the system.

At the same time, the pulse from input 27 of the system goes directly to the clock input of block 14, where it is delayed by element 175 for the time needed to write the code into block 9, and then arrives at the third input of the AND elements of group 172, transferring the contents of the section of the text document through output 35 of the system to the automated workstation of the corresponding performer, who, having received the section, begins processing it, for example translating or editing.

Similarly, the chief editor distributes the sections of the relevant documents among all performers working in the system.

In the course of the assigned work, each performer forms at the automated workstation a codogram with the following structure:

Code         | Code
Performer ID | Contents of the processed section of the text document

This codogram is fed to input 23 of the system and on to the input of block 10, where it is written by the clock signal received at input 28 of the system from the performer's AWS.

The performer identifier code from output 196 of block 10 is fed to input 136 of block 11 and on to the input of decoder 126. Decoder 126 decodes the code combination and prepares the path for the signal from input 137 by opening one of AND elements 128-130. For definiteness, suppose that a high potential is applied to one input of AND element 130.

In parallel, the clock pulse from input 28 of the system is fed to input 137 of block 11, where it is delayed by element 131 for the time needed to write the codogram into block 10 and for decoder 126 to operate, and then polls the state of AND elements 128-130.

Since only AND element 130 is open on one input, the clock pulse passes through it to the read input of a fixed cell of read-only memory 125, which stores the code of the reference address of the database area allocated for recording the results of the work of the corresponding performer.

The reference address code of the database area is read from memory block 125 to the information input of register 127. In parallel, the same pulse from the output of delay element 131, delayed for the time needed to read the contents of the fixed cell of ROM 125, is fed to the clock input of register 127, latching the code that was read. The address code from the output of register 127 is supplied to input 166 of block 13, the other input of which is connected to output 164 of block 12.

At the same time, the pulse from the output of AND element 130 is fed through output 141 of block 12 to input 161 of block 12 and then to the direct input of trigger 148, placing it in the one state, in which the high potential at the direct output of trigger 148 opens one input of AND element 151 and the AND elements of group 154, thereby connecting the output of counter 145 through the AND elements of group 154 and the OR elements of group 157 to output 164 of block 12 and on to input 167 of block 13.

In parallel, the pulse from the output of delay element 132 is delayed again by element 133 for the operating time of trigger 148 and is supplied to clock input 168 of block 13, which adds the address code from output 138 of block 11 to the reading of counter 145 of block 12 and generates the database address for recording the results of the work performed.

This address passes from output 169 of block 13 through input 187 of block 15 and the OR elements of group 180 to output 31 of the system, and the clock pulse from the output of delay element 133 is delayed again for the time needed to form the address in block 13 and, from output 140 of block 11, is first fed to input 191 of block 15, where it passes through OR element 182, and through output 33 of the system is fed to the input of the first interrupt channel of the server.

On the first interrupt signal, the server enters the subroutine for writing the contents of the work performed by the performer from output 32 of the system into the server database at the address generated at output 31 of the system.

To this end, the contents of the work performed are supplied from output 197 of block 10 to input 189 of block 15 and then through the OR elements of group 181 to information output 32 of the system.

In addition, in parallel with the process of writing the content of the completed work into the server database, the clock pulse from output 140 of block 11 is fed to input 160 of block 12, where it is delayed by element 158 for the time needed to write the content of the work into the server database and then passes through open AND element 151 to the counting input of counter 145, increasing its reading by one.

The next piece of work completed by the same performer will be written at the address following the previous one, since the reading of counter 145 is added to the performer's base address.
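
The per-performer addressing described above (a reference address plus a counter that advances after each write) can be sketched as follows. This is an illustrative model only: the class name, the sample reference address, and the assumption that the counter starts at zero are not stated in the description.

```python
class PerformerArea:
    """Hypothetical model of one performer's database area."""

    def __init__(self, reference_address: int) -> None:
        self.reference_address = reference_address  # read from the ROM in the description
        self.completed = 0  # per-performer counter, assumed to start at zero

    def next_write_address(self) -> int:
        """Return the address for the next completed piece of work,
        then advance the counter, as happens after each write."""
        address = self.reference_address + self.completed
        self.completed += 1
        return address


area = PerformerArea(reference_address=5000)
addresses = [area.next_write_address() for _ in range(3)]
```

Successive results from the same performer therefore land in consecutive cells, and the counter value doubles as a progress indicator for that performer.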

The readings of counters 145-147 are also displayed at the chief editor's automated workstation (not shown), so that the editor can determine from them the degree of readiness of the sections of text documents being processed at the performers' workstations.

Once the relevant sections are ready, the editor starts the program that formats the new contents of the text document as a whole.

Thus, the introduction of the new blocks significantly improves system performance by identifying the readiness of a text document at the moment its last finished section is written to the system database.

Sources of information

1. U.S. patent No. 5136708, MCL G 06 F 15/16, 1992.

2. U.S. patent No. 5129083, MCL G 06 F 12/00, 15/40, 1992 (prototype).

The system ID is tificatio ready text documents in a network of distributed data processing, containing block receiving sections of text documents from the database servers, data and clock inputs of which are the first information and the clock inputs of the system, block addressing text documents, information and clock inputs of which are the second information and the clock inputs of the system, the unit receiving sections of text documents from the database servers, data and clock inputs of which are the third information and the clock inputs of the system, the block switching channels issuance of text documents, one data input connected to the output unit receiving text documents from the database server, the other information input connected to the output of block addressing text documents, word clock input connected to the third clock input system and information outputs are outputs of the system group, the signal writing and reading the database, the address which is the address of the system output, and first and second clock outputs are first and second clock outputs of the system, characterized in that the system includes a block selection base address of text documents, information whose input is connected to the first output unit receiving sections of text documents, the clock input is connected to the first clock input of the system, and information and clock outputs are connected with some information and a synchronizing input signal of the read and write database, the block structure of text documents, one data input connected with the second information output unit receiving sections of text documents, and data and clock outputs connected to other information and clock inputs of the signal writing and reading of the database, while the third information output unit receiving sections of text documents is an information output system, the power Gating sections of text documents, word clock input connected with the second clock the output signal of 
the read and write databases, and the control inputs of the group are connected to the corresponding control outputs of the group of the block for selecting the base address of text documents; a block for counting the number of finished sections of text documents, the control inputs of which are connected to the corresponding control outputs of the group of the block for gating sections of text documents, and the counting input of which is connected to the corresponding counting output of the block for gating sections of text documents; a block for setting the structure of text documents, the control inputs of which are connected to the corresponding control outputs of the group of the block for gating sections of text documents; a first comparator, one information input of which is connected to the output of the block for counting the number of finished sections of text documents, the other information input to the output of the block for setting the structure of text documents, and the clock input to the synchronizing output of the block for gating sections of text documents, wherein one output of the first comparator is connected to the first clock input of the block for selecting the structure of text documents, and the other output is connected to the set inputs of the blocks for receiving sections of text documents, selecting the base address of text documents, gating sections of text documents, and forming signals for writing and reading the database; a counter, the counting input of which is connected to the first clock output of the block for switching the channels for issuing text documents, and the output of which is connected to another information input of the block for selecting the structure of text documents; and a second comparator, one information input of which is connected to the output of the block for setting the structure of text documents, the other information input to the output of the counter, and the clock input to the second output of the block for switching the channels for issuing text documents, wherein one output of the second comparator is connected to the second clock input of the block for selecting the structure of text documents, and the other output is connected to the set inputs of the blocks for addressing text documents, receiving sections of text documents from the database of the server, selecting the structure of text documents, and forming signals for writing and reading the database, and to the set input of the counter.
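
The role of the counting block and the two comparators can be illustrated by a minimal software sketch. It is not the claimed hardware: the names ReadinessTracker and issue_sections, the use of section identifiers, and the alphabetical issue order are assumptions made only for this illustration. The first comparison detects the moment when the number of finished sections equals the number set by the document structure; the second stops the issuance of sections once the counter reaches the same number.

```python
from dataclasses import dataclass, field


@dataclass
class ReadinessTracker:
    """Hypothetical analogue of the counting block plus the first comparator."""

    expected_sections: int  # number of sections set by the document structure
    ready: set = field(default_factory=set)  # identifiers of finished sections

    def mark_ready(self, section_id: str) -> bool:
        """Record a finished section; return True once all sections are ready."""
        self.ready.add(section_id)  # a set ignores repeated reports of one section
        return len(self.ready) == self.expected_sections


def issue_sections(sections: dict, expected_sections: int) -> list:
    """Hypothetical analogue of the counter plus the second comparator.

    Sections are issued one by one (here in alphabetical order, an assumption)
    and issuance stops when the number issued equals the expected number.
    """
    issued = []
    for name in sorted(sections):
        if len(issued) == expected_sections:
            break
        issued.append(sections[name])
    return issued


if __name__ == "__main__":
    tracker = ReadinessTracker(expected_sections=3)
    store = {}
    for name, text in [("politics", "..."), ("sport", "..."), ("culture", "...")]:
        store[name] = text  # the performer hands in a finished section
        if tracker.mark_ready(name):
            print("document ready:", len(issue_sections(store, 3)), "sections issued")
```

In this sketch a section reported twice is counted once, which is a design choice of the illustration and is not stated in the claims.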



 

Similar patents:

FIELD: technology for automated synthesis of text documents.

SUBSTANCE: method includes distinguishing, within the variable data, variable unified information (common word combinations), variable input data (details), and variable non-unified information (free word combinations); the variable unified information is separated into a plurality of support words, which constitute the lexicological skeleton of the document, and is recorded in a machine-readable database; a lexicological document tree and a document data control contour are formed; and during generation of a document all branches of the lexicological document tree are traversed to select the support words needed for inserting the matching word combinations into the generated document.

EFFECT: lower probability of errors, reduced labor intensity.

3 cl, 7 dwg

FIELD: computer science.

SUBSTANCE: method includes receiving text messages from a data channel; linguistic processing of words is performed, a thesaurus of each text message is formed, statistical processing of the words in the thesaurus is performed, and the text message and thesaurus are stored in storage. The membership of the text message in one of the categories from a list is determined; the starting data value of the text message is determined and stored in storage with the text message; the data values are periodically updated with consideration of the time passed since the messages appeared, and text messages with a data value below a preset threshold are erased; during processing of each message the values of the category classification signs are updated.

EFFECT: higher efficiency.

1 dwg

The invention relates to the processing of natural-language text materials.

The invention relates to computing, and in particular to working on the Internet.

The invention relates to the field of electronics and is intended, for example, for using auxiliary data arrays when converting and/or verifying computer codes in the form of symbols and the corresponding image portions.

The invention relates to computing.

The invention relates to a computer system for creating and translating documents, for preparing text in a constrained language and translating it into a foreign language.

FIELD: technology for recognizing text information from graphic file.

SUBSTANCE: in accordance with the method, the order of access to additional information is set in advance and a quality estimate is assigned to each type of additional information; different variants of dividing the image of the selected rows into fragments are constructed; for each row fragment a linear division graph is built; images of graphic elements are recognized using a classifier, and an estimate is assigned to each recognition variant; a transition is made from variants of recognition of graphic elements to variants of alphabet symbols; for each chain connecting the starting and ending vertices, chains are built corresponding to all variants of recognition of graphic elements and variants of transition from recognized graphic elements to alphabet symbols; the produced variants are ranked in order of decreasing recognition quality estimate; the produced variants are processed using information about the position of uppercase and lowercase letters; if more than one symbol variant is available from the recognition of a graphic element, the variants are processed with successive use of additional information and/or, when necessary, simultaneous use of all types of additional information; a quality estimate is assigned to each produced variant; symbol variants with an estimate below a predetermined value are discarded; the produced variants are sorted using pairwise comparison; and spaces erroneously recognized at previous stages are additionally corrected.

EFFECT: increased precision of recognition of text and increased interference resistance of text recognition.

9 cl, 2 dwg

FIELD: devices for recognition of written symbols.

SUBSTANCE: method contains a stage of receiving written symbols that are written on a sensor screen, where the sensor screen contains at least an area for writing symbols and an area for writing punctuation. Then a stage is performed of determining the ratio of the written symbols that fall in the punctuation writing area relative to the symbol writing area, followed by a stage of recognizing punctuation marks. The recognition stage is conducted for the written symbols when the ratio exceeds a threshold value, where the recognition of punctuation marks determines at least one possible punctuation mark, similar to the written symbols, from a set of punctuation marks.

EFFECT: automatic recognition of punctuation marks with increased precision.

8 cl, 8 dwg

FIELD: information technologies.

SUBSTANCE: invention relates to methods for validating extensible markup language (XML) documents and reporting schema violations in real time. A parallel tree is maintained with portions corresponding to the elements of an XML document. When errors occur in the XML document, the elements of the XML document that correspond to the errors are identified. The portions that correspond to the identified elements are validated against the XML schema that corresponds to the XML document. The elements and portions that correspond to the errors in the XML document are reported to the user by display indicators in the XML document and in the parallel tree.

EFFECT: validation of an XML document and reporting of schema violations in real time while the user edits the document.

20 cl, 8 dwg

FIELD: physics, computer equipment.

SUBSTANCE: present invention relates to a tree ordering component in a sentence realisation system. The component accepts an unordered syntax tree and generates from it a ranked list of alternative ordered syntax trees. The component also includes statistical models of constituent structure, which the tree ordering component uses to score the alternative ordered trees.

EFFECT: provision of proper word order in a tree structure.

24 cl, 11 dwg

FIELD: computer engineering.

SUBSTANCE: an application program interface (API) for import can be implemented to import content from a hierarchically structured document, such as an XML file. The import API works in conjunction with a syntax analyser to scan the document and extract content from selected elements, nodes, attributes and text. The import API also uses a callback component to process the extracted content. An export API can also be implemented to export data in order to create a hierarchically structured document, such as an XML file. The export API works in conjunction with an editor to receive data and export it in the form of elements, nodes, attributes and text in the hierarchically structured document.

EFFECT: providing selective data import and export in electronic document.

20 cl, 5 dwg

FIELD: physics, computer technology.

SUBSTANCE: invention concerns methods and systems of text segmentation. The method involves accessing a string of symbols (204), identifying a long lexeme (206), recording the adjoining symbols in the long lexeme (208), determining lexemes from the string of symbols by keeping the adjoining symbols together, and determining multiple lexeme combinations (210), the number of lexeme combinations being reduced by means of the recorded adjoining symbols.

EFFECT: enhanced speed of text segmentation.

22 cl, 3 dwg

FIELD: physics; computer engineering.

SUBSTANCE: present invention pertains to computer technology. The elements of each schema can be arbitrarily embedded in the elements of another schema and each set of elements remains correct within the limits of its intrinsic schema. Elements of the second schema are "transparent" to the elements of the first schema, when the text processor checks correctness of elements of the first schema. Elements of the second schema are verified separately so that, elements of the first schema are "transparent" for verification of elements, corresponding to the second schema.

EFFECT: provision for validity checking of an extensible mark-up language (XML) document, with elements, linked to two or more schemata, where elements of each schema can be arbitrarily embedded in the elements of another schema and each set of elements remains correct within the limits of its intrinsic schema.

16 cl, 6 dwg
