Design Considerations of an Interactive Robotic Agent for Public Libraries

Potirakis, Stelios M. and Ganchev, Todor and Tuna, Gürkan and Tatlas, Nicolas-Alexander and Zogo, Recep. Design Considerations of an Interactive Robotic Agent for Public Libraries. Journal of Balkan Libraries Union, 2013, vol. 1, n. 1, pp. 1-6. [Journal article (Paginated)]

BLUJ_v1_n1_paper1.pdf - Published version


English abstract

The role of public libraries has long been recognized, making them an enduring form of public service. Users of a public library should have easy access to catalogs and to the full text of printed and electronic books, magazines, and periodicals, as well as to multimedia databases. Because most public libraries are in service for several hours each day, computer-based library service applications provide a valuable supplement. This study proposes a robotic agent that guides users in libraries. The agent is equipped with sound acquisition and reproduction chains and can understand a set of specific commands and guide users accordingly. It currently understands commands and responds in English, so it may be useful for public libraries that are visited or used remotely by foreign, English-speaking users. Future work consists of implementing language packages for Turkish and evaluating field tests to be held at the library and documentation center of Trakya University, Edirne, Turkey.

Item type: Journal article (Paginated)
Keywords: Interactive robotic agent, public libraries, automatic speech recognition, speech-to-text.
Subjects: L. Information technology and library technology > LD. Computers.
L. Information technology and library technology > LL. Automated language processing.
L. Information technology and library technology > LP. Intelligent agents.
L. Information technology and library technology > LQ. Library automation systems.
Depositing user: Dr. Gurkan Tuna
Date deposited: 07 Aug 2014 11:31
Last modified: 02 Oct 2014 12:30

