In this paper we describe the development of Mandi Information System, a Telugu spoken dialogue system for obtaining price information of agricultural commodities like vegetables, fruits, pulses, spices, etc.. The target users of MIS are... more
In this paper we describe the development of Mandi Information System, a Telugu spoken dialogue system for obtaining price information of agricultural commodities like vegetables, fruits, pulses, spices, etc.. The target users of MIS are primarily the farmers in rural and semi-urban areas. Speech recognition is error prone and it is necessary for the dialogue system to make minimum number of errors while acquiring information from a user and also to detect errors (if not correctable) and adopt appropriate strategies. In this paper we suggest an approach to improve the performance and usability of the system by using multiple decoders and contextual information.
This workshop paper describes the experiments conducted for spoken web search at MediaEval 2011 evaluations. The task consists of searching for audio segments within audio content using an audio query. The current approach uses a broad... more
This workshop paper describes the experiments conducted for spoken web search at MediaEval 2011 evaluations. The task consists of searching for audio segments within audio content using an audio query. The current approach uses a broad articulatory phonetic units for indexing the audio files. Once the appropriate audio segments are obtained for the query, time instants of the audio segment is determined using a sliding DTW search.
This paper describes the experiments conducted for spoken web search (SWS) at MediaEval 2013 evaluations. A conventional approach is to train a multi-layer perceptron using high resource languages and then use it in the low resource... more
This paper describes the experiments conducted for spoken web search (SWS) at MediaEval 2013 evaluations. A conventional approach is to train a multi-layer perceptron using high resource languages and then use it in the low resource scenario. However, phone posteriorgrams have been found to under-perform when the language they were trained on differs from the target language. In this paper, we use bottle-neck features derived from MLP to generate Gaussian posteriorgrams. We also use a variant of dynamic time warping (DTW) based technique which exploits the redundancy in speech signal and thus averages the successive Gaussian posteriorgrams to reduce the length of the spoken query and spoken reference.
ABSTRACT User authentication is necessary to secure the data and process on Internet and in digital devices. Static text based authentication are most widely employed authentication systems for being inexpensive and highly scalable. But... more
ABSTRACT User authentication is necessary to secure the data and process on Internet and in digital devices. Static text based authentication are most widely employed authentication systems for being inexpensive and highly scalable. But they are prone to various types of active and passive attacks. The constant need of extending them to increase security is making them less usable. One promising alternative is Graphical authentication systems, which if implemented properly are more secure but have their own drawbacks.