Speech to Text Processing for Interactive Agent of Virtual Tour Navigation

  • Universitas Merdeka Pasuruan
  • Universitas Merdeka Pasuruan
Keywords: Sound, Linear Predictive Coding (LPC), Backpropagation

Abstract

Advances in science and technology are changing how humans interact with computers, and voice input is one such alternative. Speech can be converted to text with the Backpropagation method once features have been extracted from the signal, for example with Linear Predictive Coding (LPC). LPC is one way to represent the signal so that the features of each sound pattern can be obtained. In brief, the speech recognition system took human voice through a microphone as an analog signal, which the computer's sound card then sampled at a rate of 8000 Hz to produce a digital signal. The digital signal was then preprocessed with LPC, yielding several LPC coefficients. These LPC outputs were trained using the Backpropagation learning method. Each learned pattern was then labeled with a word and stored in a database. The result was a recognition program that can display plots of the voice signal. In real-time testing on 100 samples, the recognition rate for respondents whose voices were in the database was 80%.
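The LPC feature extraction step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the common autocorrelation method with the Levinson-Durbin recursion, and the LPC order (10), frame length (240 samples, 30 ms at 8000 Hz), hop size (80 samples), and Hamming window are assumptions that the abstract does not state. The function names are hypothetical.

```python
import math

SAMPLE_RATE = 8000  # Hz, the sampling rate stated in the abstract
LPC_ORDER = 10      # assumption: the abstract does not give the LPC order


def autocorrelation(frame, max_lag):
    """Return r[0..max_lag], the autocorrelation of one frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc(frame, order=LPC_ORDER):
    """Levinson-Durbin recursion on the frame's autocorrelation.

    Returns (a, err) with a[0] == 1.0, where the predictor is
    x[n] ~ -(a[1]*x[n-1] + ... + a[order]*x[n-order])
    and err is the remaining prediction error energy.
    """
    r = autocorrelation(frame, order)
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 1e-12:  # silent or perfectly predictable frame: stop early
            break
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient for this order
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err


def hamming(n):
    """Hamming window of length n (assumed; the abstract names no window)."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def lpc_features(signal, frame_len=240, hop=80, order=LPC_ORDER):
    """Split the signal into overlapping frames and return one
    list of `order` LPC coefficients per frame."""
    window = hamming(frame_len)
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [s * w for s, w in zip(signal[start:start + frame_len], window)]
        a, _ = lpc(frame, order)
        feats.append(a[1:])  # drop the leading 1.0
    return feats
```

Each per-frame coefficient list would then serve as one input vector to the network trained with Backpropagation; how the authors combined frames into a fixed-size input for a whole word is not described in the abstract.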

Published
2019-11-30
How to Cite
, & . (2019). Speech to Text Processing for Interactive Agent of Virtual Tour Navigation. International Journal of Artificial Intelligence & Robotics (IJAIR), 1(1), 33-38. https://doi.org/10.25139/ijair.v1i1.2030