Text to speech device and method

FIELD: information technology.

SUBSTANCE: a navigation device has means for digitally processing sounds and transmitting them audibly; a memory which stores multiple data items, some in the form of text pointers, together with pre-recorded sounds; means for transmitting data between the processor of the device and the memory; and an operating system which controls processing and the flow of data between the processor and the memory and determines whether said sounds are reproduced audibly. It does so by repeatedly determining physical conditions and comparing them with reference values held in the memory, so that satisfaction of a condition causes the device to generate a sound by means of the pre-recorded sounds stored on the device, a sound digitally rendered by a text-to-speech (TTS) program component to which a text pointer corresponding to the event is passed, or a combination of the two. When an event is determined which requires a sound to be reproduced by the TTS program component, the operating system consults a set of options selected or deselected by the device user during configuration in order to determine the extent to which this event is to be audibly indicated.

EFFECT: user-predefined information can be audibly indicated during en-route navigation.

14 cl, 6 dwg

 

This invention relates to a device and method for performing text-to-speech (TTS) playback.

BACKGROUND OF THE INVENTION

TTS software is well known. Typically, a TTS engine is capable of decoding or interpreting plain text or a document created in a word processor (for example, with the extension ".txt", ".doc", etc.) and converting what is essentially a binary representation of the text into an alternative binary representation in the form of commands for a sound processor, which ultimately generates the corresponding electrical signals for a conventional loudspeaker. Interpretation of the source text document, whether it is discrete in that it contains only a short phrase or a name, or more extensive and contains one or more paragraphs of text, will typically involve analysis at a granular level, for example of consonants, vowels and syllables, and may also include analysis of grammar and punctuation, so that the resulting synthetic speech is produced with the correct inflections and intonations and thus sounds more realistic.

In general, there are two ways of synthesizing speech using electronic hardware and software. In concatenative synthesis, synthesized speech is created by concatenating pieces of pre-recorded speech stored in a database. Systems differ in the size of the stored speech units: a system that stores only the smaller phones or diphones provides the greatest output range but may lack clarity, whereas storing entire words or sentences provides high-quality output. Alternatively, in formant synthesis, the synthesizer incorporates a model of the vocal tract and other characteristics of the human voice to create a completely synthetic voice output. Parameters such as fundamental frequency, voicing and noise levels are varied over time to create a waveform of artificial speech. This method is sometimes called rule-based synthesis; however, many concatenative systems also have rule-based components.

One of the most common applications of speech synthesis since its inception has been to enable blind or partially sighted people to understand the written word. More recently, a plethora of modern devices, indeed any device with relatively modest processing power and memory, such as personal digital assistants (PDAs), more advanced mobile phones such as so-called smartphones, game consoles and in-car satellite navigation systems (SNS), provide some means of playing pre-recorded fragments of human voices or of running TTS software to interpret, in real time, any text or word-processed document stored on the device.

This invention has particular application to in-car SNS devices, and although the following description is almost exclusively directed to them, a person skilled in the art will readily appreciate that the invention may have much wider application and should not be considered limited by this particular description.

In-car SNS devices have become widespread in the previous 5 or so years, and most devices include one or more map databases for specific countries and the ability to store a number of pre-recorded phrases, possibly in a variety of different voices, such as male or female, at different pitches, or with different degrees of gravity or jollity. Moreover, many devices also allow users to record such phrases in their own voice, and the system software of the device may provide a simple procedure that instructs the user to record, in sequence, each and every phrase required for correct operation of the device. For example, the user may be asked to record a variety of different phrases or fragments of speech, such as "Turn left", "Turn right", "400 meters" and so on, and once the recording has been completed, the system software of the device ensures that the user's voice fragments are selected for playback at the appropriate time, as opposed to the default or previously selected pre-recorded fragments. Such technology has been available on mobile phones for some time, albeit in a simpler form: the user can record his or her own voice and use this recording in place of the default ringtone when a specific individual or, indeed, anyone calls the mobile phone.

The pre-recorded systems described above are usually more than adequate for most route navigation operations, but are limited in that they offer no means of audibly indicating unusual or country-specific information.

It is therefore an object of this invention to overcome this disadvantage and to provide a more comprehensive audible assistance solution for, among other devices, in-car SNS.

BRIEF SUMMARY OF THE INVENTION

According to the present invention there is provided a processor-enabled device for generating sounds from data, said device comprising:

means for processing sounds in digital form, and means for audible transmission,

a memory in which is stored a database of data sets, at least some of which are in the form of text pointers, and one or more pre-recorded sounds,

data transfer means by which data is transferred between the processor of the device and said memory, and

operating system software which controls the processing and flow of data between the processor and the memory, and whether said sounds are audibly reproduced,

said device being further capable of repeatedly determining one or more physical conditions, which are compared with one or more reference values provided in the memory, such that a positive result of the comparison gives rise to an event requiring a sound to be produced by the device,

characterized in that

the device additionally includes a TTS software component which interacts with the operating system or a program running on it, said operating system or program determining, in accordance with user input, whether the event is to be audibly indicated by

one or more pre-recorded sounds stored on the device,

a sound digitally rendered by the TTS component from a text pointer retrieved from the database and corresponding to the event, or

a combination of the above.

In a preferred embodiment, the operating system or the program running on it provides for additional, more precise user input to allow selection of the types of event which are to be audibly indicated to the user. In particular, the operating system or the program running on it preferably presents a set of options for different types of event which can be selected or deselected depending on user preference.

More preferably, the device is provided with global positioning system (GPS) means, including means for extracting time signals, said device thus being capable of determining its global physical location, velocity and acceleration (by averaging over a period of time), and the events which are ideally audibly indicated to the user are direction commands given while the device (and thus the user carrying said device or travelling in a vehicle in which said device is installed) moves along a predetermined or pre-programmed route.

Most preferably, the data is representative of one or more maps of a road network, such as the road network of a particular country or region.

Preferably, the map data is supplemented by a variety of additional data about which, during a journey, the user may or may not wish to be audibly informed, such as street names, road numbers, building numbers, points of interest (POIs) and road signs. Street names, for example, can be audibly indicated to the user only through the TTS component.

In a preferred embodiment, the device is further provided with means for determining environmental conditions such as temperature and pressure (which information may be present in the GPS signal), and the device may additionally be provided with secondary radio-telecommunication means, which gives the device the further ability to determine traffic conditions along specific segments of the road network represented by the data, and to receive messages and other information over pre-existing networks, such as mobile telecommunication networks or other networks.

In a desirable embodiment, the invention also allows the user to select whether warnings retrieved over such networks are audibly indicated, such as incoming SMS (short message service) messages or other messages, for example weather and traffic information.

In another, more preferred embodiment, the device also allows the user to select whether device-based operational events are audibly indicated, for example tips about the device and the text of device instruction manuals. Most preferably, the device includes a user interface, preferably a graphical user interface, and the operating system or the program running on it causes the display of one or more option pages through which the device can be told whether to render sound digitally via the TTS component for one or more different types of event that require an audible message to the user, whether to select one or more pre-recorded sounds to notify the user of such events, or whether to use a combination of these options.

In a second aspect of the invention there is provided a method for determining how a processor-enabled device is to generate sounds from data, said device comprising:

means for synthesizing sounds in digital form and pre-recorded sounds together with means for audible transmission,

a memory in which is stored a database of data sets, at least some of which are in the form of text pointers, and one or more pre-recorded sounds,

data transfer means by which data is transferred between the processor of the device and said memory, and

operating system software which controls the processing and flow of data between the processor and the memory, and whether said sounds are audibly reproduced,

said device being further capable of repeatedly determining one or more physical conditions, which are compared with one or more reference values provided in the memory, such that a positive result of the comparison gives rise to an event requiring a sound to be produced by the device,

characterized in that

the method includes the steps of

offering the user a choice of the type of sound to be audibly delivered and, depending on the user's selection,

enabling a TTS software component to communicate with the operating system or a program running on it, such that for at least one event sounds are digitally synthesized from one or more text pointers retrieved from the database, or

playing one or more pre-recorded sounds stored on the device for said at least one event, or

a combination of the above.

Preferably, in the case where the user selects that the sounds produced by the device in response to various events are to consist solely of pre-recorded sounds stored on the device, the method includes a step of warning the user that certain events, for example street names, will not be audibly indicated to said user.

Preferably, in an alternative embodiment, in the case where the user selects that one or more events are to be audibly reported to the user by means of synthesized sounds, the method includes an additional step of presenting a set of additional event options which, depending on the user's choice, the device can audibly indicate. Examples include street names, building numbers, road numbers, incoming radio-telecommunication messages such as traffic, weather or mobile network messages, POI warnings, tips about using the device, the text of device manuals, and road sign notices.

The invention also covers a computer program, whether an operating system or a program designed to run on a pre-existing operating system, which provides the above functionality to a processor-enabled device.

Specific embodiments of the invention will be described below by way of example, with reference to the accompanying drawings, in which:

BRIEF DESCRIPTION OF THE DRAWINGS

Figures 1 to 6 show exemplary screenshots from an in-car SNS adapted according to the present invention.

DETAILED DESCRIPTION

An in-car SNS device will typically include a touch-screen graphical user interface (GUI) which, in normal operation, displays in real time the route being taken by the user. During configuration of such a device, however, the user is presented with a number of different option screens, some of which are shown in Figures 1 to 6.

Figure 1 shows a "Select a voice" screen on which various graphics and text, shown generally at 2, indicate to the user that a pre-recorded voice (identified as "Kate") can be selected for audible event notification. The screen also displays graphical "buttons" 4, 6, 8, 10, 12 which, when the user touches the screen in the region of these buttons, allow the user to test the voice, to delete the voice (along with the previously stored pre-recorded sounds), to select the voice shown, or to scroll backward and forward through other pre-recorded voice profiles. Once the user selects a particular pre-recorded human voice, a screen is displayed as shown in Figure 2, warning the user that street names retrieved from the map database and telecommunication messages received by the device cannot be read aloud by the device. Of course, it would be impractical, not to say impossible, to pre-record a human voice speaking each and every street name in a particular country.

Alternatively, if the user selects a computer-generated voice (which in practice means instructing the device to synthesize speech digitally from text pointers retrieved from the map database, in the case of street names and the like, or from plain text, such as that included in messages received by the device), a screen such as that shown in Figure 3 may be displayed by the device.

In this case, additional option screens, such as those shown in Figures 4 and 5 (entitled "Voice preferences"), are presented to the user, and various options can be selected or deselected by the user, so that the device audibly indicates the specific events selected by the user, but not others.

The advanced options screen of Figure 6 shows that the user can optionally specify not only the type of event about which the device should audibly inform him or her, but also the length and type of notification the device is to issue. For example, a full or abbreviated spoken warning can be given, a simple sound effect can be played, or the user may wish to type in some text which the device converts into synthesized speech.

In general operation, route directions, i.e. the commands which the device gives the user to facilitate navigation from one position to a predetermined destination, will be audibly indicated in accordance with the user's selection of a human voice or a computer-generated (synthesized) voice. A computer-generated voice will read aloud at least some of the sentences in place of the human voice. Depending on the settings, street names can be appended to route directions, and road numbers, road signs and other signs can be read aloud to the user.

The following pages give examples of specific phrases that may be uttered by the device, and how they can be constructed:

Single commands: "After 500 metres turn left - Rembrandplein"; "Turn left - A4"; "Turn left - A4, Utrecht, 31 miles";

Combined commands: "After 500 metres turn left - Rembrandplein - turn right - Amstelstraat"; "Take the exit - 6 - then take the motorway - A9";

A single command: "Turn left - A4".

Where the device is to indicate distances audibly, and non-metric units are preferred (miles instead of kilometers) and the current voice is an American TTS voice, the following distances can be read aloud:

[150 feet, 250 feet, a mile, a quarter of a mile].

Example:

"After half a mile turn left".

When joining a highway on which the user will travel for some time, the TTS voice audibly informs the user how long he or she will stay on the same road, by reading aloud the following: "Follow <road> for <distance>", for example:

"Follow A10 for 3.6 km";

"Follow A10 for 37 miles."

Conditions for issuing this command are:

- The "read aloud road numbers" speech preference must be set.

- The last (final) command was issued at least 15 seconds ago.

- You are on the road to which the command relates (not on a motorway and not on a driveway).

- The distance from this point to the next command is at least double the distance at which that next command will be issued, or there are at least 1500 meters to the next warning. These constraints can simply be applied in software.
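The four conditions above can be sketched as a single predicate. This is only an illustration: the `FollowContext` type and all field names are hypothetical, and the last condition encodes one possible reading of the distance rule (at least twice the issue distance of the next command, or at least 1500 meters to the next warning).

```python
from dataclasses import dataclass

MIN_SECONDS_SINCE_FINAL_COMMAND = 15   # value taken from the text
MIN_DISTANCE_TO_NEXT_WARNING_M = 1500  # value taken from the text


@dataclass
class FollowContext:
    """Hypothetical snapshot of the state needed to decide on a 'Follow' command."""
    read_road_numbers: bool               # "read aloud road numbers" preference
    seconds_since_final_command: float    # time since the last (final) command
    on_referenced_road: bool              # device is on the road the command names
    distance_to_next_command_m: float     # distance from here to the next command
    next_command_issue_distance_m: float  # how far ahead the next command is spoken
    distance_to_next_warning_m: float     # distance from here to the next warning


def should_announce_follow(ctx: FollowContext) -> bool:
    """Return True when 'Follow <road> for <distance>' may be spoken."""
    # Assumed reading of the fourth condition in the text.
    far_enough = (
        ctx.distance_to_next_command_m >= 2 * ctx.next_command_issue_distance_m
        or ctx.distance_to_next_warning_m >= MIN_DISTANCE_TO_NEXT_WARNING_M
    )
    return (
        ctx.read_road_numbers
        and ctx.seconds_since_final_command >= MIN_SECONDS_SINCE_FINAL_COMMAND
        and ctx.on_referenced_road
        and far_enough
    )
```

Keeping the thresholds as named constants makes it straightforward to tune them per embodiment.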

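The variable <distance> is the distance to the next command, unless the next command is one of the following: "go right", "keep right", "keep left". In that case, as the example below shows, the distance runs to the first later command that is not of these types.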

Example:

The user joins a road called A2:

After 10 kilometers there is a reminder at a fork in the road (keep left), and 20 kilometers later the planned route requires taking an exit.

In this case the <distance> announced on joining the A2 is 30 kilometers, even though the next command ("keep left - A2") is issued after just 10 kilometers.
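The choice of <distance> in this example can be sketched as follows. The `Command` type and function names are hypothetical; the set of skipped command types is taken from the text, and the behaviour of skipping past them to the next other command is inferred from the A2 example.

```python
from dataclasses import dataclass
from typing import List, Optional

# Command types that do not end a "Follow <road>" stretch (from the text).
SKIPPED_COMMANDS = {"go right", "keep right", "keep left"}


@dataclass
class Command:
    kind: str           # e.g. "keep left", "take exit"
    distance_km: float  # distance from the current position


def follow_distance_km(upcoming: List[Command]) -> Optional[float]:
    """Distance to the first upcoming command that is not a keep/go-right type."""
    for cmd in upcoming:
        if cmd.kind not in SKIPPED_COMMANDS:
            return cmd.distance_km
    return None  # no qualifying command ahead


def follow_phrase(road: str, distance_km: float) -> str:
    """Build the spoken phrase, e.g. 'Follow A10 for 3.6 km'."""
    return f"Follow {road} for {distance_km:g} km"
```

With a "keep left" command at 10 km and an exit at 30 km, `follow_distance_km` returns 30, matching the example.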

This operation, implemented in the computer control program, is designed to reduce the amount of information conveyed to the user. When street names/road numbers/exits/road signs are read aloud, they may be spoken too often. To reduce the amount of text read aloud by the TTS voice, the following rules are implemented:

- The same <name> will not be spoken again within 30 seconds, where <name> = road sign/road number/exit/street name.

For roads that have an early warning, the street name/road number/exit/road sign is already spoken in the early warning. It is normally also spoken in the following warning, unless the 30-second rule above suppresses it. If it is suppressed there, it is spoken again in the final warning (which may itself be suppressed by the 30-second rule).

Example:

- "Exit 6 ahead, Wuten" (early warning).

- "After 800 yards, take the exit" (name not spoken because of the 30-second rule).

- "Take exit 6, Wuten".

Or

- "Keep right ahead, A2, Utrecht" (early warning).

- "After 800 meters keep right, A2, Utrecht" (more than 30 seconds between them).

- "Keep right".
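A minimal sketch of the 30-second rule, assuming a hypothetical `NameThrottle` helper that remembers when each name was last spoken. Timestamps are passed in explicitly so the behaviour is easy to follow; a real device would presumably use its own clock.

```python
from typing import Dict

REPEAT_WINDOW_S = 30.0  # value taken from the text


class NameThrottle:
    """Remembers when each road sign/road number/exit/street name was last spoken."""

    def __init__(self, window_s: float = REPEAT_WINDOW_S) -> None:
        self.window_s = window_s
        self._last_spoken: Dict[str, float] = {}

    def allow(self, name: str, now_s: float) -> bool:
        """Return True (and record the time) if the name may be spoken now."""
        last = self._last_spoken.get(name)
        if last is not None and now_s - last < self.window_s:
            return False
        self._last_spoken[name] = now_s
        return True


def build_warning(base: str, name: str, throttle: NameThrottle, now_s: float) -> str:
    """Append the name to the command only when the throttle permits it."""
    if throttle.allow(name, now_s):
        return f"{base}, {name}"
    return base
```

A name that is suppressed does not reset the timer, so it becomes eligible again 30 seconds after it was last actually spoken.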

For other road types (inner-city roads) it is often desirable to hear the street name at the intersection itself. In this case, the rule is to speak the name only in the warning given at the intersection. However, this warning is sometimes not read out if the previous warning took too long. In some embodiments, therefore, the information can also be read aloud if the distance stated in the warning is less than or equal to 200 meters. In that case the command at the intersection is either not given at all, or is given without the additional information.

Examples:

- "After 300 meters, keep right" (street name not spoken).

- "Keep right, Mr. Visserplein" (street name spoken at the intersection).

- "After 100 meters turn left, Amstelstraat" (street name spoken if there is no command at the intersection).

- "After 100 meters turn left, Amstelstraat" (no warning is given at the intersection because the previous warning took too long).

- "After 200 meters turn left, Amstelstraat" (because the warning distance is <= 200 meters).

- "Turn left" (no street name, because of the 30-second rule).

For combined commands such as "After <distance> turn left, <street>, then turn right, <street>", the second street name is not spoken unless the combined command is issued at the intersection. It is spoken there because a combined command issued at the intersection is unlikely to be repeated, so the street name would otherwise never be heard.

Examples:

"After 100 meters turn left, Amstel, then turn right" (second street name not spoken);

"Turn right, Herengracht";

"Turn left, Spiegelgracht, then turn right, Lijnbaansgracht";

"Turn left, Spiegelgracht, then turn right, Lijnbaansgracht" (no further command is issued, because the right turn follows immediately).

For POI warnings, the following applies.

When this setting is enabled (check box selected) and the currently selected voice is a computer-generated voice, a dialog with radio buttons is displayed, offering the following alert types:

(*) Full warning ("Gas station: 800 meters. The next opportunity: 20 kilometers")

( ) Brief warning ("Gas station, 800 meters")

( ) Sound effect

( ) Type in your own warning

1. Full warning

When the full warning is selected for a particular POI type, a sentence composed of the following elements is read aloud (at the same points and distances at which the current POI warning sounds are given):

<POI type>: the POI type for which the user selects the alert;

<distance units>: one of the distance units meters/kilometers/yards/miles;

<X>: the distance from the current position to the POI;

<Y>: the distance from the current position to the next POI of that type.

Example:

"Gas station: 800 meters. The next opportunity: 20 kilometres."

2. Brief warning

When the brief warning is selected for a particular POI type, the following sentence is read aloud (at the same points and distances at which the current POI warning sounds are given):

<POI type>: <X> <distance units>.

Example:

"Restaurant: 100 meters."

3. Sound effect

When the sound effect is selected, the user is presented with a standard selector, and the selected sound is played at the same times and distances at which POI warning sounds are currently given.

4. Type in your own warning

When this setting is selected for a particular POI type, the user can type in text that will be read aloud when a warning is issued for that POI.
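The four alert types can be sketched as a small dispatch function. The names `AlertType` and `poi_alert_text` are hypothetical, and the sentence layouts are modelled on the quoted examples ("Gas station: 800 meters. The next opportunity: 20 kilometres." and "Restaurant: 100 meters.") rather than on a confirmed template.

```python
from enum import Enum
from typing import Optional


class AlertType(Enum):
    FULL = "full"
    BRIEF = "brief"
    SOUND_EFFECT = "sound_effect"
    CUSTOM_TEXT = "custom_text"


def poi_alert_text(
    alert_type: AlertType,
    poi_type: str,
    x: float,
    x_units: str,
    y: Optional[float] = None,
    y_units: Optional[str] = None,
    custom_text: Optional[str] = None,
) -> Optional[str]:
    """Return the text to hand to the TTS component, or None if nothing is spoken."""
    if alert_type is AlertType.FULL:
        # <X> is the distance to this POI, <Y> the distance to the next one of its type.
        return f"{poi_type}: {x:g} {x_units}. The next opportunity: {y:g} {y_units}."
    if alert_type is AlertType.BRIEF:
        return f"{poi_type}: {x:g} {x_units}."
    if alert_type is AlertType.CUSTOM_TEXT:
        return custom_text  # text typed in by the user
    return None  # sound effect: a sound is played instead of speech
```

Returning `None` for the sound-effect case keeps the decision about which sound file to play outside the text-building step.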

Urgent messages: if this setting is enabled (check box selected), certain flashing notifications (urgent messages) are read aloud, for example:

- No reliable GPS signal!

- Not able to set a traffic checkpoint

- Your GPS location is not on the route, so it is impossible to determine which roads should be followed. No route has been planned!

- You have already passed this point

- You cannot avoid your destination

- The route has changed!

- Route found ...

- Traffic information is not available for this area

- It is impossible to connect to the service ...

- "Running the demo"

- No destinations are allowed to visit

- The following road location is not on this map

- Is there a phone number for the POI

- Your outbox is empty

- Unable to call, no dial tone

- Download <xxx> was canceled

- Download <xxx> was successful

Other messages are within the scope of this application.

Tutorials: when this option is enabled (check box selected), the text of tutorial (guided tour) pages is read aloud when they are displayed on the screen. When the user moves to the next page, or when the whole page has been read, the TTS voice stops reading the current page.

Tips: when this option is enabled (check box selected), tip text is read aloud when it is displayed on the screen. When the tip is dismissed by the user, or when the whole tip has been read aloud, the computer-generated voice stops reading the tip.

SMS: SMS messages can be read aloud at two different levels: i. as soon as the SMS arrives (automatic reading), and ii. when "read aloud" is pressed in the inbox/outbox.

1. Automatic reading: when this option is enabled (check box selected), an incoming message is read aloud automatically as soon as it arrives, for example:

If the sender is known from the address book, the following sentence is read aloud:

"Incoming message from Johnny: Today I will sooner!"

If the sender is unknown, the following sentence is read aloud:

"Incoming message from <number>: <message>".

Example:

"Incoming message from number 06557 40775: Today I will sooner!"

When the message is dismissed by the user, or when the whole message has been read aloud, the computer-generated voice stops reading the message.

2. Read aloud (inbox/outbox): when the user selects a message via Mobile phone => Read/write message => Read inbox message, the following sentence is read aloud:

"Message from <name or number of the sender>: <message>".

3. Special messages: messages containing the location of the sender may be sent between devices; in such cases, the following sentence may be read aloud:

"This message contains a location."

Weather: when this setting is enabled (check box selected), weather information is read aloud when it is displayed on the screen.

Full sentence: "Report on the weather for the day: <description>, <X> degrees <Celsius/Fahrenheit>".

Example:

"Report on the weather for the day: Sunny, 19 degrees Celsius";

"Report on the weather for the day: Sunny, 67 degrees Fahrenheit".

For a more detailed weather report, the following sentences may be read aloud:

"Report on the weather for the day: <description>. Minimum temperature: <X> degrees <Celsius/Fahrenheit>, maximum temperature: <Y> degrees <Celsius/Fahrenheit>".

When the weather page is dismissed by the user, or when the whole text has been read aloud, the computer-generated voice stops reading the weather.
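The two weather sentence forms can be sketched as one helper. The function name and parameters are hypothetical, and the punctuation follows a cleaned-up reading of the templates quoted above.

```python
from typing import Optional

PREFIX = "Report on the weather for the day"  # wording taken from the text


def weather_sentence(
    description: str,
    unit: str,                        # "Celsius" or "Fahrenheit"
    temp: Optional[float] = None,     # used by the short form
    min_temp: Optional[float] = None, # used by the detailed form
    max_temp: Optional[float] = None,
) -> str:
    """Build the short form, or the detailed form when min and max are given."""
    if min_temp is not None and max_temp is not None:
        return (
            f"{PREFIX}: {description}. "
            f"Minimum temperature: {min_temp:g} degrees {unit}, "
            f"maximum temperature: {max_temp:g} degrees {unit}"
        )
    if temp is None:
        raise ValueError("either temp or both min_temp and max_temp are required")
    return f"{PREFIX}: {description}, {temp:g} degrees {unit}"
```

For example, `weather_sentence("Sunny", "Celsius", temp=19)` yields the first example sentence.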

Traffic information: traffic information can be read aloud at two different levels: as soon as new traffic information is available after an update (automatic reading), and when a "Read traffic information aloud" function provided in the software is activated:

1. Automatic reading: when this option is enabled (check box selected) and the check box "Give a sound signal when traffic information on the route changes" is selected, traffic information is read aloud when new traffic information is available for the current route.

If a setting such as "automatic optimization after each update" is switched off, the following may be read aloud: "The traffic situation on your route has changed."

And, if the delay is known:

"The total delay due to traffic at the moment is <delay>", where <delay> is a time, for example 11 minutes.

If a setting such as "automatic optimization after each update" is switched on, the following may be read aloud: "The traffic situation on your route has changed", then "Route recalculation", and then, when the calculation is complete, either "Your route has been calculated again. New arrival: 11.45" if the route has changed, or "Your route has been calculated again. It was not modified." otherwise.

2. A soft key entitled "Read traffic info aloud" may be displayed: when the "traffic" setting is in use and this key is pressed, a submenu is displayed. The "Read traffic information aloud" key can then be selected, which causes the details of the incidents on the route to be loaded, while the message:

"Retrieving traffic information. One moment, please"

is read aloud.

As soon as the details of all incidents have been loaded, the following can be read aloud for each incident on the route:

<description> (<road number>) between <A> and <B>;

where

- (<road number>) might not be available;

- "between <A> and <B>" could be replaced by "<A>" (if there is only one location).

Example:

"Retrieving traffic information. One moment, please....";

"Slow traffic on the A1 between Harwich and Reading. Accident on the A1 motorway between London and Hemel-Hempstead ...".

The first sentence ("Retrieving traffic information. One moment, please") could be dropped if all the details are loaded before the sentence has been spoken in full. For example, when the key is pressed a second time, the sentence is dropped immediately because all the details will already have been loaded.
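The automatic-reading behaviour described under item 1 can be sketched as a function that returns the sentences to speak, in order. The function and parameter names are hypothetical, and the sentence wording is a lightly cleaned-up version of the phrases quoted in the text.

```python
from typing import List, Optional


def traffic_update_sentences(
    auto_optimise: bool,                # "automatic optimization after each update"
    delay: Optional[str] = None,        # e.g. "11 minutes", if known
    route_changed: bool = False,        # result of the recalculation
    new_arrival: Optional[str] = None,  # e.g. "11.45"
) -> List[str]:
    """Sentences read aloud when new traffic information arrives for the route."""
    sentences = ["The traffic situation on your route has changed."]
    if not auto_optimise:
        # No recalculation: optionally report the known delay.
        if delay is not None:
            sentences.append(f"The total delay due to traffic at the moment is {delay}.")
        return sentences
    sentences.append("Route recalculation.")
    if route_changed:
        sentences.append(f"Your route has been calculated again. New arrival: {new_arrival}.")
    else:
        sentences.append("Your route has been calculated again. It was not modified.")
    return sentences
```

In a real device the final sentence would only be spoken once the recalculation has finished; the sketch returns all sentences at once for simplicity.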

Priorities: the features are assigned priorities in the following order, meaning that route directions take precedence over POI warnings, and so on:

1. Route directions

2. Warning POI

3. Accrual fare

4. Traffic

5. SMS

6. Tutorials/tips/weather

7. Key validation

8. Urgent messages

Of course, any other order of priority can be applied.
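One way to apply such an ordering is a priority queue of pending announcements. This is a sketch only: the `AnnouncementQueue` class and the feature keys are hypothetical, with the numeric ranks copied from the list above (keys such as `accrual_fare` and `key_validation` simply mirror the list entries).

```python
import heapq
from itertools import count
from typing import List, Tuple

# Lower number = spoken first; ranks follow the list in the text.
PRIORITY = {
    "route": 1,
    "poi_warning": 2,
    "accrual_fare": 3,
    "traffic": 4,
    "sms": 5,
    "tutorial": 6,
    "tips": 6,
    "weather": 6,
    "key_validation": 7,
    "urgent_message": 8,
}


class AnnouncementQueue:
    """Pending announcements, ordered by feature priority, then arrival order."""

    def __init__(self) -> None:
        self._heap: List[Tuple[int, int, str]] = []
        self._seq = count()  # tie-breaker that preserves arrival order

    def push(self, feature: str, text: str) -> None:
        heapq.heappush(self._heap, (PRIORITY[feature], next(self._seq), text))

    def pop(self) -> str:
        return heapq.heappop(self._heap)[2]

    def __len__(self) -> int:
        return len(self._heap)
```

Because the table is plain data, a different order of priority only requires changing the numbers.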

If the user deselects the computer-generated voice and switches back to a "human voice", some features become unavailable because their text simply cannot be spoken:

- Tips

- Urgent messages

- Tutorial

- Weather

- SMS (inbox/outbox)

Other features revert to their default state, in which the user is audibly informed by a "sound signal":

- SMS (automatic reading)

- Traffic

- Selected POI alert (a "boing" in this case)

Finally, other features revert to audible messages delivered via a pre-recorded ".wav" or ".mp3" file:

- Route directions.
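The fallback behaviour for a human voice can be captured in a lookup table. The enum, the feature keys and the function name below are hypothetical; the grouping into silent, sound-signal and pre-recorded features follows the three lists above.

```python
from enum import Enum


class OutputMode(Enum):
    TTS = "tts"                    # synthesized speech
    PRERECORDED = "prerecorded"    # ".wav" or ".mp3" file
    SOUND_SIGNAL = "sound_signal"  # simple sound signal
    SILENT = "silent"              # not audibly indicated


# What each feature falls back to when a pre-recorded human voice is selected.
HUMAN_VOICE_FALLBACK = {
    "tips": OutputMode.SILENT,
    "urgent_message": OutputMode.SILENT,
    "tutorial": OutputMode.SILENT,
    "weather": OutputMode.SILENT,
    "sms_inbox": OutputMode.SILENT,
    "sms_automatic": OutputMode.SOUND_SIGNAL,
    "traffic": OutputMode.SOUND_SIGNAL,
    "poi_alert": OutputMode.SOUND_SIGNAL,
    "route": OutputMode.PRERECORDED,
}


def output_mode(feature: str, computer_voice_selected: bool) -> OutputMode:
    """Pick how a feature is announced for the currently selected voice type."""
    if computer_voice_selected:
        return OutputMode.TTS
    return HUMAN_VOICE_FALLBACK[feature]
```

The sketch assumes that every feature is spoken via TTS when a computer-generated voice is selected; per-feature user options would be checked separately.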

1. A navigation device equipped with a processor, capable of calculating a route between a starting point and a destination and of generating sounds from data, said device comprising:
means for digitally processing sounds and means for audible delivery thereof,
a memory in which is stored a plurality of data, at least some of which are in the form of text pointers, and one or more pre-recorded sounds,
data transfer means by which data is transferred between the processor of the device and said memory,
an operating system or a program running on it, which controls the processing and flow of data between the processor and the memory, and whether said sounds are audibly reproduced, by repeatedly determining one or more physical conditions which are compared with one or more reference values provided in the memory, such that satisfaction of a condition gives rise to an event requiring a sound to be produced by the device by means of
one or more pre-recorded sounds stored on the device,
a sound digitally rendered by a text-to-speech (TTS) software component which is provided with the device and interacts with the operating system or the program running on it, by passing to it a text pointer retrieved from the data and corresponding to the event, or
a combination of the above,
characterized in that, on determining the existence of an event which requires a sound to be played through said TTS software component, said operating system or the program running on it refers to a set of one or more options selected or deselected by the user of said device during configuration, in order to determine the extent to which the event is to be audibly indicated.

2. The device according to claim 1, wherein the device is equipped with global positioning (GPS) means that include means for retrieving time signals, and the events that are ideally audibly identified to the user are direction instructions, issued in a timely manner as the device moves along a predetermined or pre-programmed route.

3. The device according to claim 1, wherein the data includes data representing one or more maps of a road network.

4. The device according to claim 3, wherein the map data is supplied with a variety of different additional data which, during a planned trip, may be included in or excluded from the audibly identified direction instructions, such additional data being selected from the group consisting of: street names, building numbers, road numbers, points of interest (POI), road sign information.

5. The device according to any one of claims 1 to 4, wherein the device is further provided with means for determining environmental conditions.

6. The device according to claim 1, wherein the device is equipped with long-range radio communication means that enable the device to determine traffic conditions on specific segments of the road network represented by the data, and to receive messages and other information through said long-range radio communication means.

7. The device according to claim 6, wherein the device also allows the user to select whether warnings retrieved through said long-range radio communication means are audibly identified.

8. The device according to claim 1, wherein the device also gives the user the choice of whether device-based operational events are audibly indicated in a timely manner.

9. The device according to claim 1, wherein the device includes a graphical user interface, and the operating system or the program running on it causes the display of one or more option pages through which the user can indicate to the device whether a sound is to be digitally rendered by the TTS component for one or more different types of event requiring an audible message to the user, whether one or more pre-recorded sounds are to be selected to notify the user of such events, or whether a combination of these options is to be used.

10. A method of determining the manner in which a navigation device, equipped with a processor capable of calculating a route between a start location and a destination, should produce sounds from data, said device comprising:
means for digital sound synthesis and for playback of pre-recorded sounds, together with means for audibly rendering them,
a memory in which a plurality of data is stored, at least some of which is in the form of text pointers, together with one or more pre-recorded sounds,
data transfer means by which data is transferred between the processor of the device and said memory, and
operating system software, or a program running on it, which controls the processing and flow of data between the processor and the memory, and whether said sounds are audibly reproduced, by repeatedly determining one or more physical conditions that are compared with one or more reference values provided in the memory, such that satisfaction of a condition gives rise to an event requiring a sound to be produced by the device by means of:
one or more pre-recorded sounds stored on the device,
a sound digitally rendered by a text-to-speech (TTS) software component, which is supplied with the device and interacts with the operating system or the program running on it, by passing to it a text pointer retrieved from the data and corresponding to the event, or
a combination of the above,
characterized in that, on determining that an event requiring a sound to be rendered by said TTS software component has occurred, said operating system or the program running on it refers to a set of one or more options selected or deselected by the user of said device during its configuration, in order to determine the extent to which the event should be audibly identified.

11. The method according to claim 10, wherein the method includes the step of warning the user that certain events may not be audibly identified to said user when the exclusive use of pre-recorded sounds is selected.

12. The method according to claim 10, wherein the method includes the further step of providing a plurality of additional event notification options when TTS is selected, such additional event options being selected from the group consisting of: street names, building numbers, road numbers, incoming long-range radio communications such as traffic, weather, and mobile network messages, POI warnings, tips on using the device, text information, device manuals, road sign notifications.

13. A computer-readable medium containing code which, when executed, causes a computer to perform any of the steps of the method according to claims 10 to 12.

14. A computer containing the computer-readable medium of claim 13 and executing the code of claim 13.
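The decision flow recited in claims 1 and 10 can be sketched roughly as follows. This is an illustrative sketch only: the physical condition is assumed to be the distance to the next turn, the reference value of 200 m is arbitrary, and the names `Options`, `poll`, `use_tts`, `speak_street_names`, and `turn_left.wav` are all hypothetical rather than taken from the claimed device.

```python
from dataclasses import dataclass


@dataclass
class Options:
    """User choices made during configuration (hypothetical names)."""
    use_tts: bool = True
    speak_street_names: bool = False


REFERENCE_DISTANCE_M = 200.0  # assumed reference value held in memory


def poll(distance_to_turn_m, street_name, options):
    """One pass of the repeated condition check.

    Returns None when no event arises, otherwise a string describing
    the sound the device would produce.
    """
    # Compare the measured physical condition with the reference value.
    if distance_to_turn_m > REFERENCE_DISTANCE_M:
        return None  # condition not satisfied, so no event

    if not options.use_tts:
        # Pre-recorded sound stored on the device.
        return "play:turn_left.wav"

    # Text pointer corresponding to the event, passed to the TTS component.
    text = "Turn left"
    # The user's options determine the extent of the spoken message.
    if options.speak_street_names:
        text += f" onto {street_name}"
    return f"tts:{text}"
```

In a real device this check would run repeatedly as new position fixes arrive; the sketch shows a single pass so that the role of the user's options in shaping the spoken message is easy to follow.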



 

Similar patents:

FIELD: physics.

SUBSTANCE: at least one part is selected in text. Intonation of each part is determined. Target speech sounds are associated with each part. Physical parameters of the target speech sounds are determined. Speech sounds which are closest on physical parameters to the target speech sounds are found in the speech base. Speech is synthesised in form of a sequence of the found speech sounds, wherein physical parameters of said target speech sounds are determined in accordance with the determined intonation.

EFFECT: improved quality of synthesised speech owing to accurate transmission of intonation.

12 cl, 1 dwg

FIELD: radio engineering.

SUBSTANCE: device comprises source (S) of input sound signal, memory device (MD), analyzing device (AD), producing device (PD) and synthesising device (SD). PD is arranged on the basis of characteristics analyser (CA) and correcting processor (CP). Switch (S) of training/operation mode is introduced, as well as input signal analyser (ISA). S is connected to input of S. MD is equipped with unit of audio records (U). Input/output of S is connected to input/output of ISA, and its output to input of U. The first output of U data is connected to input of ISA, and the second output of U data to AD input. ISA is arranged to decompose the input voice signal into sinusoid components of signal (Sig), noise components of signal (N) and residual components of signal (R), and to generate sets of characteristic vectors and conversion functions for each of the specified components and transmit them into MD. AD is arranged to decompose the input voice signal from U into Sig, N and R. CA and CP are arranged to process the specified components.

EFFECT: invention makes it possible to perform a song by user's voice, but in manner and with quality level of professional singer performance with minimisation of performance errors and improvement of its quality.

6 cl, 5 dwg

FIELD: information technologies.

SUBSTANCE: invention makes it possible to produce all possible versions of initial text transcriptions, not resorting to analysis of text sounding. Rules of transcription modeling are applied to ideal transcriptions produced on the text basis, additional versions of transcriptions are obtained, to which rules of transcription modeling are also applied. Identical transcriptions are excluded from produced list of transcriptions, and transcriptions remained in the list are saved for further use.

EFFECT: improvement of preliminary text processing.

4 cl, 8 dwg, 1 tbl, 3 ex

FIELD: communications.

SUBSTANCE: on a VoIP server, the encoded voice signal of each subscriber, received from a data transfer network, is decoded. The volume level of the voice signal of each subscriber is measured. Signals whose volume level exceeds a preset level are summed up, and the obtained sum is encoded and transferred to each subscriber. When transferring the obtained sum to a subscriber, the volume level of the voice signal of which exceeds the preset level, the current subscriber signal is subtracted from the sum.

EFFECT: prevention of amplification of acoustic echo; reduced expenditure.

11 cl, 3 dwg

FIELD: technology for synthesizing speech from text.

SUBSTANCE: in accordance with the method, a word extracted from a received text string is divided into sub-words, which constitute a series of sub-words, wherein at least one sub-word contains at least two letters, and each of the possible sub-words has a predetermined weight; to create the series of sub-words, the sub-words with maximal combined weights are selected; phonemes are determined for the sub-words by means of a table of phonemic identifiers; the phonemes are combined into a series of phonemes; and speech is synthesized on the basis of the series of phonemes.

EFFECT: accenting of consonants depending on other neighboring letters and position in text fragment being synthesized.

4 cl, 6 dwg

FIELD: technology for generating sounds similar to sounds of voice of healthy people using noise-like sounds of esophageal voice of people without a larynx.

SUBSTANCE: in accordance with the method, original signal is demodulated and resulting envelope curve is corrected and then multiplied by oscillation with transformed instant frequency, from input signal, signals of consonant sounds of speech are extracted as well as low frequency part of frequency range, which is demodulated, as well as spectral components adjacent to frequency of main voice tone, which are amplified, enhanced with the Hilbert-conjugate signal, conjugated signals are multiplied, the Hilbert-conjugate signal limited in amplitude is added to amplitude-limited signal resulting from multiplication, and multiplied with amplitude-limited signal which contains spectral components close to frequency of main tone of voice, signals of consonant sounds of speech and transformed signals of consonant sounds of speech are added and speech signal output is composed.

EFFECT: transformation of noise-like "vowel" sounds of esophageal voice to vowel sounds with discrete spectrum of harmonics, similar to the voice of a healthy person.

2 cl, 1 dwg

FIELD: technology for synthesizing speech from text.

SUBSTANCE: in accordance with the invention, analysis of at least one word from a text string is performed to determine whether a natural speech pause adjacent to the aforementioned word is present, where the analysis is based on at least one predetermined threshold value for that word, and where the aforementioned predetermined threshold value is connected to the number of syllables between the word and one of the two ends of the text string.

EFFECT: increased precision of realized identification of natural speech pauses for various speech patterns at input.

6 cl, 5 dwg

FIELD: speech-related computer engineering and tool making industry for synthesizing speech messages from text in systems for acoustic communication of man and machine.

SUBSTANCE: the element composition of the acoustic database for compilation includes consonant-vowel syllables, vowel-consonant syllables, and separate vowels and consonants. Methods for connecting these are direct connection or mixing for phoneme combinations of the types consonant-vowel-consonant-consonant and consonant-vowel-end consonant. The device for compilation phoneme synthesis of Russian speech contains a text processor, connected to the acoustic database and to a sound signal generation block, which is connected to a reproduction block, and a block for generation of consonant-vowel-consonant, the input of which is connected to the appropriate outputs of the acoustic database and the text processor, while its output is connected to the input of the sound signal generation block.

EFFECT: increased naturalness of speech and speed of text-based synthesis due to improved structure of compilation elements and usage of method for their connection with consideration of phonetic particularities of Russian language.

2 cl, 2 dwg

FIELD: method and system for adaptation of speech synthesizer using data received in real time scale.

SUBSTANCE: during realization of method and system for dynamically modifying synthesized speech on basis of inputted text and a set of values of dynamic control parameters, synthesized speech is generated. Then on basis of input signal, characterizing legibility of speech by listener perceiving it, data received in real time scale are generated, on basis of which one or several values of dynamic control parameters are modified.

EFFECT: increased legibility of synthesized speech.

3 cl, 6 dwg

The invention relates to techniques for digital processing of speech signals transmitted over a communication line by the PCM method.

FIELD: instrument making.

SUBSTANCE: there introduced are adaptive modules and connections between them, which allow combining current data on road traffic, weather and time with information on driving habits of particular driver. This information is used during profile formation of particular driver. This driver profile is used for adaptation of navigation instructions. Submission of adaptive instructions to a particular driver can contribute to safer road traffic.

EFFECT: enlarging functional capabilities.

19 cl, 6 dwg

FIELD: physics.

SUBSTANCE: destinations of a trip are based on at least one of a prior and a likelihood based at least in part on the received input data. The destination estimator component can use one or more of a personal destinations prior, time of day and day of week, a ground cover prior, driving efficiency associated with possible locations, and a trip time likelihood to probabilistically predict the destination. In addition, data gathered from a population about the likelihood of visiting previously unvisited locations and the spatial configuration of such locations may be used to enhance the predictions of destinations and routes. The group of inventions makes probabilistic prediction of destinations easier.

EFFECT: output of distributions of probabilities on destinations and routes of a user from observations on content and partial trajectories.

FIELD: physics.

SUBSTANCE: route guidance system includes: a unit for detecting current location; processing apparatus for compiling a lane list, which compiles a lane list (Ls1) taking into account connections between lanes for groups of lanes (from Lk1 to Lk3) at road junctions in the list display area; processing apparatus for determining the display region, which determines whether the number of lanes in the lane list (Ls1) is greater than the number of lanes that can be shown on the display unit; and apparatus for processing and controlling the display region, which selects predetermined lanes in the lane list (Ls1) and displays the selected lanes only. Lanes which may not be displayed can be deleted.

EFFECT: possibility of displaying a lane guide map which takes into account connections between the lanes, thereby preventing deterioration of visibility of the guide map.

4 cl, 21 dwg

FIELD: physics, navigation.

SUBSTANCE: invention relates to a vehicle navigation system. The navigation system includes a vehicle, an information display (40) fitted in the vehicle, a portable GPS unit (10) and an interface (30) for transmitting data between the portable GPS unit and the information display fitted in the vehicle. The information display (40) is mounted on the vehicle and is visible to the driver. The portable GPS unit (10) includes a GPS sensor for determining location of the GPS unit and a portable information display (20). The portable GPS unit (10) is fitted in a positioning unit in the vehicle such that the portable information display is visible to the driver. Data from the portable GPS unit (10) can be displayed on the information display fitted in the vehicle. In the first version, the portable GPS unit (10) includes a central processing unit (15) for storing several locations. The information display (40) fitted in the vehicle and the portable information display (20) display different information on location of the GPS unit relative the stored locations. An input device (50) is designed for transmitting a signal from the portable GPS unit (10) through the data transmission interface (30). The input device (50) is fitted as an alternative solution on the information display (40) fitted in the vehicle or is fitted such that the driver can operate it without taking hands off vehicle control elements. The input device (50) is designed for transmitting a signal to the portable GPS unit (10) for storing the location of the GPS unit in the central processing unit (15). In the second version, the information display (40) fitted in the vehicle displays data from the portable GPS unit (10) when receiving data from the data transmission interface (30) and displays data from a sensor fitted in the vehicle when the data transmission interface (30) and the portable GPS unit (10) are interrupted.

EFFECT: easy vehicle control.

31 cl, 6 dwg

FIELD: physics; navigation.

SUBSTANCE: invention relates to navigation equipment of vehicles. The proposed navigation device can display directions on a display, receive a video signal from a camera and display a combination of the image from the camera and directions on the display. The device, which is a portable navigation device, includes a built-in camera. The device can provide an option from the menu which enables the user to regulate relative position of the displayed image from the camera with respect to the directions.

EFFECT: using the proposed device, instructions which can be quickly and easily interpreted are displayed for the user.

15 cl, 12 dwg

FIELD: physics; measurement.

SUBSTANCE: invention relates to portable navigation systems particularly for installation in an automobile. The portable personal navigation device is programmed with possibility of linking any function, related to a basic set of functions, with a non-overlapping input sensory area, which is sufficiently large for reliable activation by touching with a finger. The invention is based on understanding that, a set of basic functions can be identified, and can then be reliably selected/activated by touching the input sensory area with a finger, where the input sensory area is sufficiently large for reliable activation. This is especially preferable for a navigation device installed in an automobile, in which the basic functions are those functions which are likely to be activated by the driver when driving the automobile.

EFFECT: design of a portable navigation device with a non-overlapping input sensory area, which is sufficiently large for reliable activation by touching with a finger.

18 cl, 4 dwg

FIELD: physics, measurement.

SUBSTANCE: the information provision device enables relevant confirmation of information content which facilitates movement of a moving object and is presented by an image display unit, even when the image display unit is affected by vibration at a level not lower than a given value. The equipment includes an image display unit mounted in a vehicle and able to display information facilitating movement of the vehicle, a vibration sensor which detects vibration equal to or exceeding a specified level applied to the image display unit and transmits a detection output signal, and an operation control unit which changes the display mode of the information shown by the image display unit to one whose data content can still be recognised, if the detection output signal of the vibration sensor indicates that the image display unit is affected by vibration equal to or exceeding the specified level for a time period longer than or equal to a specified period.

EFFECT: device of information provision enabling relevant confirmation of information content, facilitating movement of moving object.

8 cl, 6 dwg
