Text To Speech Using Labview
1. Binarization is the process of converting a grayscale image (pixel values 0 to 255)
into a binary image (pixel values 0 or 1) by applying a threshold value. Pixels lighter
than the threshold are set to white and the remaining pixels to black.
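The thresholding rule above can be sketched in a few lines of Python. This is only an illustration, not the LabVIEW implementation: the function name binarize and the default threshold of 128 are assumptions, since the text does not state the threshold used.

```python
def binarize(gray, threshold=128):
    """Map each grayscale pixel (0-255) to 1 (white) if it is lighter than
    the threshold, otherwise to 0 (black).

    The default threshold of 128 is an assumed value for illustration.
    """
    return [[1 if pixel > threshold else 0 for pixel in row] for row in gray]


# A tiny 2x3 grayscale image given as a list of rows.
gray = [
    [250, 240, 30],
    [20, 200, 10],
]
binary = binarize(gray)
```

Here binary holds one 0 or 1 per input pixel, in the same row and column layout as gray.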
Line segmentation
Line segmentation is the first step of the segmentation process. It takes the image array as
input and scans the image horizontally to find the first ON pixel, recording that coordinate as
y1. The system continues to scan the image horizontally and finds many ON pixels, since the
characters have started. When the first OFF pixel is finally detected, the system
records that coordinate as y2 and checks the surroundings of that pixel for the required
number of OFF pixels.
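A minimal Python sketch of this row scan follows, under some assumptions the text leaves open: ON pixels are taken to be 1, a row counts as ON if it contains any ON pixel, and the "required number of OFF pixels" is modelled by a hypothetical min_gap parameter counting consecutive all-OFF rows.

```python
def segment_lines(binary, min_gap=1):
    """Return (y1, y2) row ranges, with y2 exclusive, for each text line.

    A row is ON if it contains at least one ON pixel (assumed to be 1).
    A line ends once `min_gap` consecutive all-OFF rows follow it;
    `min_gap` is a hypothetical parameter, not a value from the text.
    """
    lines = []
    y1 = None      # start row of the current line, or None between lines
    off_run = 0    # consecutive all-OFF rows seen since the last ON row
    for y, row in enumerate(binary):
        if any(row):
            if y1 is None:
                y1 = y          # first ON row: remember it as y1
            off_run = 0
        elif y1 is not None:
            off_run += 1
            if off_run >= min_gap:
                # y2 is the first OFF row after the line
                lines.append((y1, y - off_run + 1))
                y1 = None
                off_run = 0
    if y1 is not None:
        # The image ended while still inside a line.
        lines.append((y1, len(binary) - off_run))
    return lines


image = [
    [0, 0, 0],
    [0, 1, 0],
    [1, 1, 0],
    [0, 0, 0],
    [0, 0, 1],
    [0, 0, 0],
]
line_ranges = segment_lines(image)
```

Each returned pair can be used to slice the image, for example image[y1:y2], to obtain one line for word segmentation.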
Word Segmentation
In the word segmentation process, each line-segmented image is scanned vertically
to find the first ON pixel. When this happens, the system records the coordinate
of this point as x1. This is the starting coordinate of the word. The system continues
scanning until fifteen successive OFF pixels (the assumed word distance)
have been found.
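The column scan can be sketched in Python in the same way. The sketch assumes ON pixels are 1 and interprets "fifteen successive OFF pixels" as fifteen consecutive columns with no ON pixel; the function and parameter names are hypothetical.

```python
def segment_words(line_image, word_gap=15):
    """Return (x1, x2) column ranges, with x2 exclusive, for each word.

    A column is ON if any pixel in it is ON (assumed to be 1). A word ends
    once `word_gap` consecutive all-OFF columns follow it; 15 is the word
    distance assumed in the text.
    """
    words = []
    if not line_image:
        return words
    width = len(line_image[0])
    x1 = None      # start column of the current word, or None between words
    off_run = 0    # consecutive all-OFF columns since the last ON column
    for x in range(width):
        if any(row[x] for row in line_image):
            if x1 is None:
                x1 = x          # first ON column: start of the word
            off_run = 0
        elif x1 is not None:
            off_run += 1
            if off_run >= word_gap:
                words.append((x1, x - off_run + 1))
                x1 = None
                off_run = 0
    if x1 is not None:
        # The line ended while still inside a word.
        words.append((x1, width - off_run))
    return words


# One-row line image: a word with a 1-column gap inside it, then a
# 15-column gap, then a second word.
line_image = [[1, 1, 0, 1] + [0] * 15 + [1, 1]]
word_ranges = segment_words(line_image)
```

Gaps shorter than word_gap, such as the single OFF column inside the first word, do not split a word; they would typically correspond to spacing between characters.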
Matching and Recognition
In this process, the correlation between the stored templates and each segmented
character is computed using the correlation VI. The template with the highest
correlation value identifies the character. In this way, every segmented character is
compared with the predefined data stored in the system in order to recognize it.
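The matching step can be illustrated in Python with a normalized (Pearson) correlation as a stand-in for the correlation VI. The text does not say which correlation measure the VI computes, so this choice is an assumption, as are the function names and the tiny 3x3 templates.

```python
import math


def correlation(a, b):
    """Pearson correlation between two equally sized 2-D pixel grids."""
    flat_a = [p for row in a for p in row]
    flat_b = [p for row in b for p in row]
    if len(flat_a) != len(flat_b):
        raise ValueError("images must have the same number of pixels")
    n = len(flat_a)
    mean_a = sum(flat_a) / n
    mean_b = sum(flat_b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(flat_a, flat_b))
    var_a = sum((x - mean_a) ** 2 for x in flat_a)
    var_b = sum((y - mean_b) ** 2 for y in flat_b)
    denom = math.sqrt(var_a * var_b)
    # A constant image has zero variance; treat its correlation as 0.
    return cov / denom if denom else 0.0


def recognize(segment, templates):
    """Return the label of the template with the highest correlation."""
    return max(templates, key=lambda label: correlation(segment, templates[label]))


# Hypothetical 3x3 templates; real templates would be full character images.
templates = {
    "I": [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
    "L": [[1, 0, 0], [1, 0, 0], [1, 1, 1]],
}
segment = [[0, 1, 0], [0, 1, 0], [0, 1, 1]]  # an "I" with one noisy pixel
label = recognize(segment, templates)
```

The sketch requires the segmented character and the templates to have the same dimensions, so in practice each segment would need to be resized to the template size before comparison.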
Text to speech synthesis
In the text to speech module, the text recognised by the OCR system is the input to
the speech synthesis system, which converts it into speech in .wav file format
and creates a wave file named output.wav that can be listened to using a wave file
player. Two steps are involved in text to speech synthesis:
i) Text to speech conversion
ii) Play speech in .wav file format
Text to speech conversion
In text to speech conversion, the input text is converted into speech (in LabVIEW) by
using Automation Open, Invoke Node and Property Node.
Results and Discussion
Experiments were performed to test the proposed system, which was developed using
LabVIEW version 7.1. The developed OCR-based speech synthesis system has two
steps:
a. Optical Character Recognition
b. Speech Synthesis
Speech Synthesis
A wave file, output.wav, is created containing the text converted into speech, which can
be listened to using a wave file player.
Conclusion