琉球大学教務システム

科目番号

開講年度	期間	曜日時限	開講学部等
2016	後学期	火4	理工学研究科総合知能工学専攻
講義コード	科目名[英文名]			単位数
R0015500	音響信号処理特論			2
担当教員[ローマ字表記]
アシャリフモハマッド

授業内容と方法

音響信号処理特論
Advanced Acoustic Signal Processing：In this lecture we are study about Acoustic Speech Signal Processing. First, the mechanism of human speech production and acoustic phonetics such as vowels, the vowel triangle, front, mid, back, diphthongs, semivowels, liquids, glides, consonants, nasals, stops (Voiced, Unvoiced), fricatives (voiced, unvoiced), whisper, afficates are explained. Then, the acoustic theory of speech production, sound propagation Portnoff-Sondhi differential equations, uniform lossless tube and formants, effects of losses in the vocal tract, the effect of nasal coupling, excitation of sound in the vocal tract, lossless tube models, wave propagation in concatenated lossless tubes, relationship to digital filter and digital model for speech signals are explained. Next, time-domain methods for speech processing, short-time energy, zero-crossing rate, speech vs. silence discrimination, pitch period estimation, short-time autocorrelation function, short-time average magnitude difference function (AMDF), pitch period estimation using the autocorrelation function, median smoothing are explained. Then, digital representation of the speech waveform, PCM, MPCM, Adaptive quantization, Delta modulation DPCM, ADPCM will be discussed. Next, Homomorphic speech processing, Cepstrum, pitch detection, formal estimation, Homomorphic vocoder are aimed for studies. Also, Linear Predictive Coding (LPC) of speech, LPC analysis, the autocorrelation method, the covariance method, computation of gain for LPC model, Cholesky decomposition solution of LPC equations, Durbin’s recursive solution, lattice formulations, the prediction error signal, relation between the various speech parameters, synthesis of speech from LP parameters, pitch and formant analysis using LPC, LPC vocoder are presented. At last, digital speech processing for Man-Machine Communication by voice and special projects in Acoustic Echo and Noise Cancellation , etc. will be studied.

達成目標

Digital Processing of Speech Signal：
To know about the mechanism of human speech production and acoustic phonetics.
To know about the acoustic theory of speech production and relationship to digital filter
To know about time-domain methods for speech processing, pitch period estimation.
To know about digital representation of the speech waveform.
To know about Homomorphic speech processing & Linear Predictive Coding, synthesis of speech from LP parameters
To know about Man-Machine Communication by voice, Acoustic Echo and Noise Cancellation by Adaptive Digital Filter Algorithms etc

評価基準と評価方法

Project & Presentation & Report

履修条件

Some Digital Signal Processing and Communications knowledge are necessary.

授業計画

1Introduction to speech and signal processing
2The mechanism of human speech production.
3Phonems in American English, Vowel triangle, Diphthongs, Liquids, Glides.
4Consonants, Nasals, Stops, Fricatives, Whisper, Afficates.
5Sound propagation Portnoff-Sondhi equations, Lossless tube, Formants, Losses.
6Nasal coupling, Excitation, Digital model for speech signals.
7Energy, Zero Crossing, Silence, Pitch.
8Autocorrelation, AMDF.
9Pitch period estimation, Median smoothing.
10PCM, Adaptive quantization, Delta modulation.
11Homomorphic Speech Processing, Cepstrum, Pitch detection, Formal estimation, The Homomorphic Vocoder
12LPC of speech, Autocorrelation method
13Covariance method, Computation of gain for LPC model.
14Cholesky, Durbin, Lattice formulations
15The Prediction Error Signal, Synthesis of speech from LP parameters, Pitch and formant by LPC, LPC vocoder.
16Man-Machine Communication by voice, Special Projects in Acoustic Echo and Noise Cancellation, etc.

事前学習

Students should have knowledge about Digital Signal Processing and digital filtering. So they should study the basics theory of DSP.
Then get some knowledge for some real applications.

事後学習

After each class students should study that part from text book. Later they should make some real application of
Man-Machine Interface such as recognition and speech parameters estimation or making Echo & Noise Cancellation by Adaptive Filtering.

教科書にかかわる情報

教科書	書名	Digital Processing of Speech Signal, L.R. Rabiner/R.W. Schafer, Prentice Hall		ISBN	0132136031	備考
	著者名
	出版社		出版年	NCID

教科書全体備考

Digital Processing of Speech Signal, L.R. Rabiner/R.W. Schafer, Pre

参考書にかかわる情報

参考書全体備考

使用言語

英語

メッセージ

This course will give you basic knowledge about Acoustic and Speech Processing. Mostly it is theoretical but gives know-how to do researches in this area.

オフィスアワー

Tue 3:00-5:00 P.M., Fri 3:00-5:00 P.M.

メールアドレス

asharif@ie.u-ryukyu.ac.jp

URL

https://ie.u-ryukyu.ac.jp/~asharif/pukiwiki/

ページの先頭へ