Last updated: August 2011
Carlos Toshinori Ishi, PhD in Engineering
Speech Science and Technology Researcher
Office Address
ATR – IRC (Intelligent Robotics and Communication) Laboratories
Phone: +81-774-95-2457 Fax: +81-774-95-1408
E-mail: carlos at atr dot jp
Academic background
Doctoral course, Oct. 1998 ~ Sep. 2001
University of Tokyo (Graduate School of Engineering – Dept. of Information and Communication Engineering)
PhD dissertation: "Japanese Prosody Analysis and its Application for Computer-Aided Language Learning (CALL) Systems". With the aim of constructing a CALL system that can reliably detect pronunciation errors, the acoustic-prosodic correlates of linguistic features of Japanese, such as tokushuhaku (special morae), mora rhythm, accent, and intonation, were investigated from both the production and the perception viewpoints.
Master course, Jan. 1997 ~ Feb. 1998
"Instituto Tecnológico de Aeronáutica" (Electronic Engineering – Dept. of Telecommunications)
Master thesis: "Analysis of Brazilian Portuguese Phonemes for Speech Recognition". Acoustic properties of Brazilian Portuguese phonemes were analyzed for automatic segmentation purposes. Neural networks were also implemented to discriminate some devoiced vowels that are frequent in phrase-final position in Brazilian Portuguese.
College, Jan. 1992 ~ Dec. 1996
"Instituto Tecnológico de Aeronáutica" (Electronic Engineering)
BA thesis: "DSP-Implementation of an Isolated Word Speech Recognition System". A DTW-based algorithm using mel-cepstral coefficients as features was implemented in DSP assembly language for the recognition of isolated words.
High school, Jan. 1987 ~ Dec. 1990
"Colégio Industrial Liceu de Artes e Ofícios de São Paulo" (Electronic Technical school)
Vocational background
ATR/IRC Labs., Jan. 2005 ~
Research in Speech Science and Technology for Verbal and Non-Verbal Communication in Communication Robots
- Acoustic analysis of vocal fry in pressed voice (“rikimi”): proposal of an algorithm for automatic detection of vocal fry.
- Use of prosodic and voice quality parameters for automatic extraction of paralinguistic information (speech acts, attitudes, emotions).
- Use of prosodic and linguistic cues for automatic detection of turn-taking and dialog acts.
- Evaluation of a robust speech recognition system for communication robots in real environments (joint work with ATR/SLC Labs.).
- Acoustic and electro-glottographic analysis of pressed voice and other voice qualities.
- Analysis of head motions and linguistic and paralinguistic information carried by speech.
- Head motion control in humanoid robots (androids).
- Sound source localization and utterance interval detection using microphone arrays; sound environment.
- Audio-visual speech interval detection.
- Integration of speech recognition and paralinguistic information extraction (speech act recognition).
- Robust F0 extraction.
- Speech-driven lip motion generation for teleoperation of humanoid robots.
JST/CREST at ATR/HIS Labs., Feb. 2002 ~ Dec. 2004
Research in Speech Science and Technology for Expressive Speech Processing
- Acoustic-prosodic analysis of expressive speech: Principal Component Analysis on global acoustic features and impressions about emotional states, attitudes, and speaking styles.
- Analysis focused on pitch movements of phrase finals → automatic identification of phrase-final tones.
- Acoustical analysis of creaky voice → automatic detection of creaky segments.
- Acoustical analysis of breathy/whispery voices → automatic detection of aspiration noise segments.
- Development of algorithms for automatic speech utterance detection, algorithms for pitch extraction, and software tools for prosodic labeling.
ITA-LASD, Jan. 1997 ~ Feb. 1998
Implementation of software in assembly for Digital Signal Processors
- ADPCM algorithms for audio compression, using ADSP-21XX processors
- FFT-based algorithms for telephone tone detection, using Motorola ESM processors
Matec, Jan. 1991 ~ Jan. 1992
Repair of faulty telephone exchange boards, power supply modules, and telephone devices.
Grants
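- Grant-in-Aid for Scientific Research, Young Scientists (A) (Apr. 2011 ~ Mar. 2014): “Construction of a speech intention recognition system considering dynamic features of prosody and voice quality, morphemes, and parts of speech”
- Grant-in-Aid for Scientific Research, Young Scientists (A) (Apr. 2008 ~ Mar. 2011): “Construction of the structure relating head motions and facial expressions accompanying speech to linguistic and paralinguistic information”
- Grant-in-Aid for Scientific Research, Young Scientists (A) (Apr. 2006 ~ Mar. 2008): “Construction of a speaking style detection mechanism considering prosody and voice quality, and its application to real environments”
- Japanese Government (Ministry of Education) Scholarship for International Students (Apr. 1998 ~ Sep. 2001)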
Lecture
2008 ~
Visiting Professor (part-time lecturer) at Osaka Prefecture University: “Advanced Intelligent Media Processing”
Language skills
- Native language: Brazilian Portuguese
- Second languages: Japanese, English.
Programming skills
- C++, Basic, Pascal
- Visual C++, Visual Basic, Java
- Matlab
- Assembly (Analog Devices ADSP-21XX, Motorola ESM, 386)
Research interests
Current research topics:
- Analysis of head motions and speech in spoken dialogue: automatic generation of head motions from speech.
- Analysis of laryngeal voice qualities: automatic detection of creaky voice; automatic detection of aspiration noise in breathy and whispery voices.
- Mapping of prosodic and voice quality features to linguistic and paralinguistic functions (intentions, emotions, and attitudes) in Japanese.
- Transcription of prosodic events: automatic extraction of perceptually meaningful prosodic events for automatic prosody labeling, with a focus on phrase-final prosody and voice quality.
- Pitch perception: correspondence between acoustically observed F0 and perceived pitch movements.
- Robust F0 extraction.
- Multi-modal dialogue processing.
- Lip motion generation/synchronization for humanoid robots (including androids) based on speech acoustics.
- Head motion generation from speech acoustics and linguistic information.
Topics related to Robot Audition
- Microphone array for audio source localization and separation.
- Improvement of speech recognition and understanding in noisy environments.
- Utterance interval detection based on sound directivity.
- Utterance interval detection based on audio-visual information.
Other topics of interest:
Topics related to Speech Perception and Recognition
- Auditory representation of speech signals: acoustic parameters related to auditory perception; masking functions.
- Prosodic modeling applied to recognition of linguistic and paralinguistic information.
Topics related to Speech Production and Synthesis
- Mapping between physiological and acoustic features for laryngeal voice quality control.
- Prosodic control and voice quality control for speech synthesis.
Ishi, C.T., Ishiguro, H., Hagita, N.
(2010). Analysis of the roles and the dynamics of breathy and whispery voice
qualities in dialogue speech. EURASIP
Journal on Audio, Speech, and Music Processing 2010, ID 528193, 1-12 Jan.
2010.
Ishi, C.T., Ishiguro, H., Hagita, N.
(2008). Automatic extraction of paralinguistic information using prosodic
features related to F0, duration and voice quality. Speech
Communication 50(6), 531-543, June 2008.
Ishi, C.T., Matsuda, S., Kanda, T., Jitsuhiro, T., Ishiguro, H., Nakamura, S., Hagita, N. (2008). A robust speech recognition system for
communication robots in noisy environments. IEEE
Transactions on Robotics, Vol. 24, No. 3, 759-763, June 2008.
Ishi, C.T., Sakakibara,
K-I., Ishiguro, H., Hagita, N. (2008). A method for
automatic detection of vocal fry. IEEE
Transactions on Audio, Speech and Language Processing, Vol. 16, No. 1,
47-56, Jan. 2008.
Ishi, C.T. (2006). The functions of
phrase final tones in Japanese: Focus on turn-taking. Journal of Phonetic Society of
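Ishi, C.T., Sakakibara, K-I., Ishiguro, H., Hagita, N. (2006). A method for automatic detection of vocal fry phonation. IEICE Transactions on Information and Systems (Japanese Edition), Vol. J89-D, No. 12, 2679-2687, Dec. 2006 (in Japanese).
Ishi, C.T., Ishiguro, H., Hagita, N. (2006). Relationship between acoustic features representing prosody and voice quality and the perception of paralinguistic information in dialogue speech. IPSJ Journal, Vol. 47, No. 6, 1782-1793, June 2006 (in Japanese).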
Ishi, C.T. (2005) Perceptually-related
F0 parameters for automatic classification of phrase final tones. IEICE Trans. Inf. & Syst., Vol. E88-D, No. 3, 481-488
Ishi,
C.T., Hirose, K. & Minematsu, N. (2003). Mora F0
representation for accent type identification in continuous speech and
considerations on its relation with perceived pitch values. Speech
Communication, Vol. 41, Nos. 2-3, 441-453
Invited article
Ishi, C.T. (2010). Research issues on audition and speech understanding in ATR's communication robots. Journal of the Robotics Society of Japan, Vol. 28, No. 1, pp. 27-30, Jan. 2010 (in Japanese).
PhD dissertation
Ishi,
C.T. (2001). “Japanese prosody analysis and its applications to Computer-Aided
Language Learning systems,” PhD dissertation,
Ishi, C.T. (2004). “Analysis of autocorrelation-based
parameters in creaky voice,” Acoustical Science and Technology, Vol.
25, No. 4, 299-302.
International Conference Papers
Ishi, C., Dong, L., Ishiguro, H.,
and Hagita, N. (2011). “The effects of microphone
array processing on pitch extraction in real noisy environments,” Proc. of IEEE/RSJ International Conference on
Intelligent Robots and Systems (IROS
2011), accepted.
Ishi, C., Liu, C., Ishiguro, H.
and Hagita, N. (2011). “Speech-driven lip motion
generation for tele-operated humanoid robots,” Proc. International Conference on Auditory-Visual
Speech Processing (AVSP2011), accepted.
Ishi, C.T., Ishiguro, H., and Hagita, N. (2011). “Analysis of acoustic-prosodic features
related to paralinguistic information carried by interjections in dialogue
speech,” Proceedings of The 12th Annual
Conference of the International Speech Communication Association (Interspeech’ 2011), accepted.
Ishi, C.T., Ishiguro, H., and Hagita, N. (2011). “Improved acoustic characterization of breathy
and whispery voices,” Proceedings of The 12th
Annual Conference of the International Speech Communication Association (Interspeech’ 2011), accepted.
Ishi, C., Dong, L., Ishiguro, H.,
and Hagita, N. (2010). “Sound interval detection of
multiple sources based on sound directivity,” Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010), 1982-1987.
Ishi, C., Sato, M., Lao, S., and Hagita, N. (2010). “Real-time audio-visual voice activity
detection for speech recognition in noisy environments,” Proc. International Conference on Auditory-Visual
Speech Processing (AVSP2010),
81-84.
Heracleous, P., Sato, M., Ishi, C., and Hagita,
N. (2010). “Investigating the role of the Lombard reflex in visual and
audiovisual speech recognition,” Proc. International
Conference on Auditory-Visual Speech Processing (AVSP2010), 69-72.
Even, J., Ishi, C., Saruwatari,
H., Hagita, N. (2010). “Close speaker cancellation
for suppression of non-stationary background noise for hands-free speech
interface,” Proc. of The 11th Annual
Conference of the International Speech Communication Association (Interspeech2010).
Ishi, C., Ishiguro, H., and Hagita, N. (2010). “Acoustic, electroglottographic
and paralinguistic analyses of “rikimi” in expressive
speech,” Proceedings of Speech Prosody
2010 (SP2010), ID 100139, 1-4.
Ishi, C.T., Liu, C., Ishiguro,
H., and Hagita, N. (2010). “Head motion during dialogue
speech and nod timing control in humanoid robots,” Proceedings of IEEE/RSJ Human Robot Interaction (HRI 2010),
293-300.
Ishi, C.T., Chatot,
O., Ishiguro, H., and Hagita, N. (2009). “Evaluation
of a MUSIC-based real-time sound localization of multiple sound sources in real
noisy environments,” Proceedings of IEEE/RSJ
International Conference on Intelligent Robots and Systems (IROS 2009),
2027-2032.
Ishi, C.T., Ishiguro, H., and Hagita, N. (2008). “Analysis of inter- and intra-speaker
variability of head motions during spoken dialogue,” Proceedings of the International Conference on Auditory-Visual
Speech Processing 2008 (AVSP’ 2008),
37-42.
Ishi, C.T., Ishiguro, H., and Hagita, N. (2008). “The meanings of interjections in
spontaneous speech,” Proceedings of The
9th Annual Conference of the International Speech Communication Association (Interspeech’ 2008), 1208-1211.
Ishi, C.T., Ishiguro, H., and Hagita, N. (2008). “The roles of breathy/whispery voice
qualities in dialogue speech,” Proceedings of Speech Prosody 2008, 45-48.
Ishi, C.T., Haas, J., Wilbers, F.P., Ishiguro, H., and Hagita,
N. (2007). “Analysis of head motions and speech, and head motion control in an
android,” Proceedings of IEEE/RSJ
International Conference on Intelligent Robots and Systems (IROS 2007),
548-553.
Wilbers, F.P., Ishi, C.T., Ishiguro, H. (2007). “A blendshape
model for mapping facial motions to an android,” Proceedings of IEEE/RSJ International Conference on
Intelligent Robots and Systems (IROS 2007), 542-547.
Ishi, C.T., Ishiguro, H., and Hagita, N. (2007). “Analysis of head motions and speech in
spoken dialogue,” Proceedings of The 8th
Annual Conference of the International Speech Communication Association (Interspeech’ 2007), 670-673.
Ishi, C.T., Ishiguro, H., and Hagita, N. (2007). “Acoustic analysis of pressed
phonation,” Proceedings of International
Conference on Phonetic Sciences (ICPhS’2007),
2057-2060.
Ishi, C.T.,
Matsuda, S., Kanda, T., Jitsuhiro, T., Ishiguro, H., Nakamura, S., and Hagita,
N. (2006). “Robust speech recognition system for
communication robots in real environments,” Proceedings of 2006 IEEE-RAS International Conference on
Humanoid Robots (Humanoids’06),
340-345.
Ishi, C.T., Ishiguro, H., and Hagita, N. (2006). “Evaluation
of prosodic and voice quality features on automatic extraction of
paralinguistic information,” Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems
(IROS 2006), 374-379.
Ishi, C.T., Ishiguro, H., and Hagita, N. (2006). “Analysis
of prosodic and linguistic cues of phrase finals for turn-taking and dialog
acts,” Proceedings of The Ninth
International Conference of Speech and Language Processing 2006
(Interspeech’2006 - ICSLP), 2006-2009.
Ishi, C.T., Ishiguro, H., and Hagita, N. (2006). “Using
Prosodic and Voice Quality Features for Paralinguistic Information Extraction,”
CD-ROM Proceedings of The 3rd International Conference on Speech Prosody
(SP2006).
Ishi,
C.T., Ishiguro, H., and Hagita, N. (2005). “Proposal of Acoustic Measures for
Automatic Detection of Vocal Fry,” Proceedings of The 9th European Conference
on Speech Communication and Technology (Interspeech’
2005 - Eurospeech), 481-484.
Ishi, C.T. (2004). “A New Acoustic Measure for Aspiration Noise
Detection,” Proceedings of The 8th
International Conference of Speech and Language Processing 2004 (ICSLP 2004),
Vol. II, 941-944.
Ishi, C.T. (2004). “Analysis of Autocorrelation-based parameters for
Creaky Voice Detection,” Proceedings of The 2nd International
Conference on Speech Prosody (SP2004), 643-646.
Ishi,
C.T., Mokhtari, P., and Campbell, N. (2003). “Perceptually-related acoustic-prosodic
features of phrase finals in spontaneous speech,” Proceedings of The 8th
European Conference on Speech Communication and Technology (Eurospeech'
03), 405-408.
Mokhtari, P., Pfitzinger, H. R. and Ishi,
C. T. (2003). “Principal components of glottal waveforms: towards parameterisation and manipulation of laryngeal
voice-quality,” Proceedings of the ISCA Tutorial and Research Workshop on
"Voice Quality: Functions, Analysis and Synthesis" (Voqual'03),
133-138.
Ishi,
C.T., Campbell, N. (2002). “Analysis of Acoustic-Prosodic
Features of Spontaneous Expressive Speech,” Proceedings of 1st
International Congress of Phonetics and Phonology, 19.
Ishi,
C.T., Hirose, K., Minematsu, N. (2002). “Using Perceptually-related F0- and Power-based
Parameters to Identify Accent Types of Accentual Phrases,” Proceedings of 1st
International Conference on Speech Prosody (SP2002), 407-410.
Ishi,
C.T., Minematsu, N., Hirose, K., Nishide
R. (2001). “Identification of Accent
and Intonation in sentences for CALL systems,” Proceedings of The 7th
European Conference on Speech Communication and Technology (Eurospeech'01),
2455-2458.
Ishi,
C.T., Minematsu, N., Hirose, K. (2001). “Recognition
of accent and intonation types of Japanese using F0 parameters related to human
pitch perception,” Proceedings of ISCA Tutorial and Research Workshop on
Prosody in Speech Recognition and Understanding, 71-76.
Ishi,
C.T., Minematsu, N., Hirose, K. (2001). “Investigation on perceived pitch and observed F0
features to represent Japanese pitch accent patterns,” Proceedings of International
Conference of Speech Processing, 437-442.
Ishi,
C.T., Hirose, K. & Minematsu, N. (2000). “Identification of Japanese Double-Mora Phonemes
Considering Speaking Rate for the Use in CALL Systems,” Proceedings of The 6th International Conference of Speech
and Language Processing 2000 (ICSLP 2000), Vol. I, 786-789.
Watanabe, M. & Ishi, C.T. (2000). “The distribution of fillers
in lectures in the Japanese Language,” Proceedings of The 6th International Conference of Speech and Language Processing
2000 (ICSLP 2000), vol. III, 167-170.
Ishi,
C.T. & Hirose, K. (2000). “Influence of speaking rate on segmental duration
and its formulation for the use in CALL systems,” Proceedings of Integrating Speech Technology In Language
Learning 2000 (InSTiL 2000), 106-108.
Kawai, G & Ishi, C.T. (1999). “A system for learning the
pronunciation of Japanese Pitch Accent,” Proceedings of The 6th European
Conference on Speech Communication and Technology (Eurospeech' 99), Vol.1, 177-181.
Non-refereed domestic conferences and workshops
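Ishi, C.T., Ishiguro, H., and Hagita, N. (2011). “Improved acoustic representation of breathy phonation,” Proceedings of the 2010 Spring Meeting of the Acoustical Society of Japan, Vol. I, 269-270 (in Japanese).
Ishi, C.T., Dong, L., Ishiguro, H., and Hagita, N. (2010). “A study on pitch extraction in real robot environments,” JSAI SIG on AI Challenges (SIG-Challenge-10), 36-40 (in Japanese).
Ishi, C.T., Sato, M., Akimoto, T., and Hagita, N. (2010). “A study on speech recognition modules for communication intelligence,” Annual Conference of the Robotics Society of Japan, ID RSJ2010AC3P3-6 (in Japanese).
Ishi, C.T., Arai, J., and Hagita, N. (2010). “An attempt at recognizing the speech intentions of interjections appearing in dialogue speech,” Proceedings of the 2010 Autumn Meeting of the Acoustical Society of Japan, Vol. I, 251-252 (in Japanese).
Ishi, C.T., Dong, L., Ishiguro, H., and Hagita, N. (2010). “A study on utterance interval detection of multiple sound sources using sound directivity,” Proceedings of the 2010 Spring Meeting of the Acoustical Society of Japan, Vol. I, 731-734 (in Japanese).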
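Ishi, C.T., Dong, L., Ishiguro, H., and Hagita, N. (2009). “A study on utterance interval detection of multiple sound sources using MUSIC spatial spectrograms,” 30th JSAI SIG on AI Challenges (SIG-Challenge-09), 8-13 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2009). “Analysis of paralinguistic information conveyed by changes in voice quality,” Proceedings of the 2009 Autumn Meeting of the Acoustical Society of Japan, Vol. I, 475-476 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2009). “Analysis of acoustic parameters related to voice quality,” Proceedings of the 2009 Autumn Meeting of the Acoustical Society of Japan, Vol. I, 327-328 (in Japanese).
Ishi, C.T., Chatot, O., Ishiguro, H., and Hagita, N. (2009). “Evaluation of three-dimensional sound source localization using the MUSIC method in real environments,” 28th JSAI SIG on AI Challenges (SIG-Challenge-08) (in Japanese).
Ishi, C.T., Chatot, O., Ishiguro, H., and Hagita, N. (2009). “Evaluation of sound source direction estimation in three-dimensional space in real environments and of its real-time performance,” Proceedings of the 2009 Spring Meeting of the Acoustical Society of Japan, Vol. I, 699-702 (in Japanese).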
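Ishi, C.T., Ishiguro, H., and Hagita, N. (2008). “Analysis of the speaking styles and functions of interjections appearing in spontaneous speech,” Proceedings of the 2008 Autumn Meeting of the Acoustical Society of Japan, Vol. I, 269-270 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2008). “Acoustic features of breathy/whispery phonation and their roles in speech communication,” IEICE Technical Report, Vol. 108, No. 116, 127-132 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2008). “The roles of breathy/whispery phonation in speech communication,” Proceedings of the 2008 Spring Meeting of the Acoustical Society of Japan, Vol. I, 357-358 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2007). “Analysis of head motions related to speech and head control of an android robot,” 26th JSAI SIG on AI Challenges (SIG-Challenge-07), 46-51 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2007). “Analysis of head motions accompanying speech,” Proceedings of the 2007 Autumn Meeting of the Acoustical Society of Japan, Vol. I, 109-110 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2007). “Acoustic analysis of “rikimi” (pressed) phonation using EGG,” Proceedings of the 2007 Spring Meeting of the Acoustical Society of Japan, Vol. I, 221-222 (in Japanese).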
Ishi, C.T., Ishiguro, H., Hagita, N. (2006). “Acoustic analysis of pressed voice,” Fourth Joint Meeting: ASA and ASJ, J. Acoust. Soc. Am., Vol. 120, No. 5, Pt. 2, p. 3374, Nov. 2006.
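Ishi, C.T., Matsuda, S., Kanda, T., Jitsuhiro, T., Ishiguro, H., Nakamura, S., and Hagita, N. (2006). “Evaluation of a speech recognition system for communication robots in real environments,” 24th JSAI SIG on AI Challenges (SIG-Challenge-06), 23-28 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2006). “Acoustic analysis for automatic detection of “rikimi” (pressed voice),” IEICE Technical Report, Vol. 106, No. 178, 1-6 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2006). “Analysis of acoustic features of phonation with a pressed larynx,” Proceedings of the 2006 Spring Meeting of the Acoustical Society of Japan, Vol. I, 227-228 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2005). “A study on the extraction of paralinguistic information using prosodic and voice quality features in dialogue speech,” 22nd JSAI SIG on AI Challenges (SIG-Challenge-05), 71-76 (in Japanese).
Ishi, C.T., Ishiguro, H., and Hagita, N. (2005). “A study on the extraction of paralinguistic information using acoustic parameters related to prosody and voice quality,” Proceedings of the 2005 Autumn Meeting of the Acoustical Society of Japan, 233-234 (in Japanese).
Ishi, C.T. (2004). “A study on acoustic parameters related to breath leakage in vowel segments,” Proceedings of the 2004 Autumn Meeting of the Acoustical Society of Japan, Vol. I, 295-296 (in Japanese).
Ishi, C.T. and Campbell, N. (2004). “The functional roles of phrase finals,” Proceedings of the 2004 Spring Meeting of the Acoustical Society of Japan, Vol. I, 235-236 (in Japanese).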
Mokhtari, P., Pfitzinger, H. R., Ishi, C. T. and Campbell, N. (2004).
"Laryngeal voice quality conversion by glottal waveshape
PCA", in Proceedings of the Spring2004 Meeting of the Acoustical
Society of Japan, Atsugi, Japan, Paper 2-P-6, pp.341-342.
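Ishi, C.T. (2003). “Analysis of acoustic features of creaky phonation,” Proceedings of the 2003 Autumn Meeting of the Acoustical Society of Japan, Vol. I, 235-236 (in Japanese).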
Ishi,
C.T., Campbell, N. (2003). “Acoustic-Prosodic Analysis of Phrase Finals in
Expressive Speech,” Proceedings of The 1st JST/CREST
International Workshop on Expressive Speech Processing, 85-88.
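Ishi, C.T. and Campbell, N. (2003). “Analysis of acoustic-prosodic features of phrase finals in daily conversation,” Proceedings of the 2003 Spring Meeting of the Acoustical Society of Japan, Vol. I, 311-312 (in Japanese).
Ishi, C.T. and Campbell, N. (2002). “Analysis of prosodic features of expressive speaking styles,” Proceedings of the 2002 Autumn Meeting of the Acoustical Society of Japan, Vol. I, 275-276 (in Japanese).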
Ishi,
C.T., Hirose, K., Minematsu, N. (2002).
“Investigations on a quantified representation of pitch movements in syllable
units,” Proceedings of The 2002 Spring Meeting of the Acoustical Society of
Japan, Vol. I, 419-420.
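Ishi, C.T., Minematsu, N., and Hirose, K. (2001). “Automatic extraction of mora pitch corresponding to pitch perception,” Proceedings of the Annual Convention of the Phonetic Society of Japan, 13-18 (in Japanese).
Ishi, C.T., Minematsu, N., and Hirose, K. (2001). “Automatic extraction of mora pitch corresponding to pitch perception in Japanese accent and intonation,” Proceedings of the 2001 Autumn Meeting of the Acoustical Society of Japan, Vol. I, 445-446 (in Japanese).
Ishi, C.T., Minematsu, N., and Hirose, K. (2001). “Accent type identification in Japanese continuous speech considering pitch perception,” IEICE Technical Report, Vol. 101, No. 270, 23-30 (in Japanese).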
Ishi,
C.T., Minematsu, N., Hirose, K. (2001). “Relationship
between acoustically observed F0 and perceived pitch for Japanese accent and
intonation,” Technical Report of
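Ishi, C.T., Hirose, K., and Minematsu, N. (2001). “Automatic classification of intonation in a pronunciation training system,” Proceedings of the 2001 Spring Meeting of the Acoustical Society of Japan, Vol. I, 327-328 (in Japanese).
Nishide, R., Ishi, C.T., Minematsu, N., and Hirose, K. (2001). “A study on the construction of a pronunciation training system targeting Japanese accent,” Proceedings of the 2001 Spring Meeting of the Acoustical Society of Japan, Vol. I, 269-270 (in Japanese).
Ishi, C.T., Nishide, R., Minematsu, N., and Hirose, K. (2001). “A study on the construction of a pronunciation training system targeting Japanese accent and intonation,” IEICE Technical Report, Vol. 100, No. 594, 33-40 (in Japanese).
Ishi, C.T., Hirose, K., and Minematsu, N. (2000). “Considerations on Japanese mora timing from the viewpoint of isochrony,” Proceedings of the 2000 Autumn Meeting of the Acoustical Society of Japan, Vol. I, 199-200 (in Japanese).
Ishi, C.T., Fujimoto, K., and Hirose, K. (2000). “Discrimination of Japanese special morae considering speaking rate,” IEICE Technical Report, Vol. 100, No. 97, 17-24 (in Japanese).
Ishi, C.T., Hirose, K., and Minematsu, N. (2000). “Analysis of changes in segment duration with speaking rate,” Proceedings of the 2000 Spring Meeting of the Acoustical Society of Japan, Vol. I, 235-236 (in Japanese).
Ishi, C.T., Kawai, G., and Hirose, K. (1999). “A pronunciation learning system for pitch accent types of Japanese words,” Proceedings of the 1999 Spring Meeting of the Acoustical Society of Japan, Vol. I, 245-246 (in Japanese).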