Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
112 views

Text To Speech Using Labview

The document discusses the development of a text-to-speech system using LabVIEW. The system uses optical character recognition to convert scanned text images into machine-readable text, then performs text-to-speech conversion to create an audio .wav file of the synthesized speech. The OCR process involves image acquisition, preprocessing, segmentation, and character recognition. Text is then converted to speech using automation and property nodes in LabVIEW and played back as an audio file. The system was tested and able to successfully convert scanned text into synthesized speech.

Uploaded by

SHUBHAM YADAV
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
112 views

Text To Speech Using Labview

The document discusses the development of a text-to-speech system using LabVIEW. The system uses optical character recognition to convert scanned text images into machine-readable text, then performs text-to-speech conversion to create an audio .wav file of the synthesized speech. The OCR process involves image acquisition, preprocessing, segmentation, and character recognition. Text is then converted to speech using automation and property nodes in LabVIEW and played back as an audio file. The system was tested and able to successfully convert scanned text into synthesized speech.

Uploaded by

SHUBHAM YADAV
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 12

Text to speech using Labview

AIM – TO CONVERT TEXT SIGNAL TO


SPEECH SIGNAL
Optical Character Recognition
Optical character recognition (OCR) is the
mechanical or electronic translation of images of
hand-written or printed text into machine-
editable
text [12]. The OCR based system consists of
following process steps:
a) Image Acquisition
b) Image Pre-processing (Binarization)
c) Image Segmentation
d) Matching and Recognition
Image Acquisition
1. The configuration of the Image has been done with the help of Imaq
create subvi function of LabVIEW.
Image Pre-processing (Binarization)

1. Binarization is the process of converting a gray scale image (0 to 255 pixel values)
into binary image (0 to1 pixel values) by using a threshold value. The pixels lighter
than the threshold are turned to white and the remainder to black pixels.
Line segmentation
Line segmentation is the first step of the segmentation process. It takes the array of the image as
an input and scans the image horizontally to find first ON pixel and remember that coordinate as
y1. The system continues to scan the image horizontally and found lots of ON pixel since the
characters would have started. When finally first OFF pixel has been detected the system
remembers that coordinate as y2 and check the surrounding of the pixel to find out required
number of OFF pixels.
Word Segmentation
In the word segmentation process the line segmented images have been vertically
scanned to find first ON pixel. When this happen the system remember the coordinate
of this point as x1. This is the starting coordinate for the word. The system continues
the scanning process until fifteen (this is assumed word distance) successive OFF pixels
have been obtained.
Matching and Recognition

In this process, correlation between stored templates and segmented character has been
obtained by using correlation VI. The correlation VI determines the correlation between
segmented character and stored templates of each character. The value of the highest
correlation recognizes a particular character. In this way in order to recognize the character
every segmented character has been compared with the pre defined data stored in the
system.
Text to speech synthesis

In text to speech module text recognised by OCR system will be the inputs of
speech synthesis system which is to be converted into speech in .wav file format
and creates a wave file named output wav, which can be listen by using wave file
player. Two steps are involved in text to speech synthesis
i) Text to speech conversion
ii) Play speech in .wav file format
Text to speech conversion

In the text speech conversion input text is converted speech (in LabVIEW) by
using automation open, invoke node and property node.
Results and Discussion

Experiments have been performed to test the proposed system developed using
LabVIEW 7.1 version. The developed OCR based speech synthesis system has two
steps:
a. Optical Character Recognition
b. Speech Synthesis
Speech Synthesis

A wave file output.wav is created containing text converted into speech which can
listen using wave file player.
Conclusion

In this paper, an OCR based speech synthesis system (which can


be used as a good mode of communication between people)
has been discussed.

You might also like