SHENG LI

SHENG LI (李勝) English version

研究員
京都大学大学院情報学研究科知能情報学専攻
メディアアーカイブ分野河原研究室

居室: 〒606-8501 京都市左京区吉田本町京都大学総合研究7号館
E-mail: lisheng[at]sap.ist.i.kyoto-u.ac.jp

学歴

2002年7月江蘇省無錫(むしゃく)市第一高等学校卒業
2006年7月南京大学（国立中央大学，内戦により中国共産党に接収され国立南京大学と改称）学士修了(理学)
2009年7月南京大学大学院修士課程修了，（中国科学院，香港中文大学，南京大学连携项目課程同期修了）
2016年3月京都大学大学院情報学研究科知能情報学専攻博士後期課程修了

学位論文: 音響モデルの準教師付き及び半教師付き学習による音声認識，指導教授: 河原達也先生，Feb. 2016.

研究歴, 職歴

2009年9月メディア処理技術を用いた外国語学習，聴覚障害者支援研究に従事（2012年4月まで，中国科学院深セン先端技術研究所[広東省深セン市]）
2012年4月音響モデルの音素誤り最小化学習（2012年8月まで，Sogouピン音入力方法[株，中国北京市]研究員）
2012年10月音声認識の活用による講演・講義の字幕付与に従事（現在に至る，京都大学）
2015年10月 ERATOプロジェクトに従事（現在に至る，京都大学）

発表文献

GoogleScholar | ResearchGate

博士論文

Sheng Li (李勝).
Speech Recognition Enhanced by Lightly-supervised and Semi-supervised Acoustic Model Training.
Ph.D. Thesis, Kyoto University, 2016.

学術論文誌掲載論文

S.Li, Y.Akita, and T.Kawahara.
Semi-supervised acoustic model training by discriminative data selection from multiple ASR systems' hypotheses.
IEEE Trans. Audio, Speech \& Language Process., Vol.24, No.9, pp.1520--1530, 2016. (text) (PDF)
S.Li, Y.Akita, and T.Kawahara.
Automatic lecture transcription based on discriminative data selection for lightly supervised acoustic model training.
IEICE Trans., Vol.E98-D, No.8, pp.1545--1552, 2015. (text) (PDF)
L. Wang, H. Chen, S. Li and H. Meng.
Phoneme-level articulatory animation in pronunciation training,
Speech Communication, Vol. 54, Issue 7, Sept. pp. 845–856, 2012. (text) (PDF)

国際会議発表論文

S.Li, X.Lu, S.Sakai, M.Mimura and T.Kawahara
SEMI-SUPERVISED ENSEMBLE DNN ACOUSTIC MODEL TRAINING
Accepted in IEEE-ICASSP, 2017.
S.Li, X.Lu, S.Mori, Y.Akita, T.Kawahara.
Confidence Estimation for Speech Recognition Systems using Conditional Random Fields Trained with Partially Annotated Data,
ISCSLP, 2016. (PDF)
S.Li, Y.Akita, and T.Kawahara.
Data selection from multiple ASR systems' hypotheses for unsupervised acoustic model training.
In Proc. IEEE-ICASSP, pp.5875--5879, 2016. (text) (PDF)
S.Li, Y.Akita, and T.Kawahara.
Discriminative data selection for lightly supervised training of acoustic model using closed caption texts.
In Proc. INTERSPEECH, pp.3526--3530, 2015.(oral) (text) (PDF)
S.Li, X.Lu, Y.Akita, and T.Kawahara.
Ensemble speaker modeling using speaker adaptive training deep neural network for speaker adaptation.
In Proc. INTERSPEECH, pp.2892--2896, 2015. (text) (PDF)
S.Li, Y.Akita, and T.Kawahara.
Corpus and transcription system of Chinese Lecture Room.
In Proc. Int'l Sympo. Chinese Spoken Language Processing (ISCSLP), pp.442--445, 2014. (text) (PDF)
S. Li and L. Wang.
Cross Linguistic Comparison of Mandarin and English EMA Articulatory Data,
In Proc. INTERSPEECH, 2012. (Travel granted by IBM research) (text) (PDF)
S. Li, L. Wang and E. Qi.
The Phoneme-level Articulator Dynamics for Pronunciation Animation,
In Proc. IALP, Nov.15-17, Pages 283-286, 2011. (text) (PDF)
J. Chen, L. Wang, C. Li, J. Hu and S. Li.
IELS: A Computer-aided Pronunciation Training System for Undergraduate Students,
ICETC, Vol.1, pp.338-342, 2010. (text) (PDF)

発表・研究会

S. Li, X. Lu, S. Sakai, T. Kawahara,
Diversity-driven Semi-supervised Ensemble DNN Acoustic Model Training,
ASJ autumn, 2016.
S. Li, X. Lu, S. Sakai, T. Kawahara,
Diversity-driven Semi-supervised Ensemble DNN Acoustic Model Training,
IPSJ IEICE-SP2016-40, 2016. (oral)
S. Li, Y. Akita, T. Kawahara,
Discriminative data selection from multiple ASR systems' hypotheses for unsupervised acoustic model training,
IPSJ SIG-SLP-109-8, 2015.(oral)
S. Li, Y. Akita, T. Kawahara,
Effective combination of multiple ASR hypotheses with CRF-based classifiers,
ASJ autumn, 2015.
S. Li, Y. Akita, T. Kawahara,
Incorporating divergences from hypotheses of multiple ASR systems to improve unsupervised acoustic model training,
ASJ spring, 2015.
S. Li, Y. Akita, T. Kawahara,
Unsupervised Training of Deep Neural Network Acoustic Models for Lecture Transcriptions,
ASJ autumn, 2014.
S. Li, Y. Akita, T. Kawahara,
Classifier-based data selection for lightly-supervised training of acoustic model for lecture transcription,
IPSJ SIG-SLP-102-4, 2014.(oral)
S. Li, Y. Akita, T. Kawahara,
Data Selection Assisted by Caption to Improve Acoustic Modeling for Lecture Transcription,
ASJ spring, 2014.(oral)
S. Li, M. Mimura, T. Kawahara,
Automatic Transcription of Chinese Spoken Lectures,
ASJ autumn, 2013.
S. Li, K. Luo and L. Wang,
The Phoneme-level Articulator Dynamics for 3D Pronunciation Animation for Chinese,
Bulletin of Advanced Technology Research, Vol.5 No.10/Otc.2011, Pages 5-7.
S. Li and C. Li,
Application of the RFID based audio service in regional navigation system,
Bulletin of Advanced Technology Research, Vol.3 No.2/Feb.2009, Pages 44-47.

奖賞

2002年中国江蘇省化学オリンピック二等賞,生物学オリンピック三等賞
2002年 Chen Yinchuan大学新入生優秀者奨学金
2004年南京大学人民奨学金
2011年中国科学院職員優秀賞
2011年香港青年起業家プログラムの創造的な企画賞
2012年日本政府（文部科学省）奨学金
2012年ポートランド，Interspeech会議へIBM 旅行補助賞金

Academic Services

[1] Reviewer for NAACL-HLT 2016.
[2] Reviewer for Doctoral Consortium at Interspeech2015.

学会所属

IEEE-SPS (Signal Processing Society),
ISCA (International Speech Communication Association)，
ASJ (日本音響学会)，
SIG-CSLP (Chinese Spoken Language Processing),
APSIPA (Asia Pacific Signal and Information Processing Association)

校友会

南京大学(元国立中央大学)日本関西校友会
京都大学情報学研究科同窓会

歴史上の人物

三国志の登場人物

SHENG LI (李 勝) English version

研究員 京都大学大学院 情報学研究科 知能情報学専攻 メディアアーカイブ分野 河原研究室

学歴