Xingyu Na

Xingyu Na

Senior Speech R&D Engineer, Apple
Mailing Address: No. 2 Kexueyuan South Street, Beijing, China
Office: Raycom Tower A
Email: asr.naxingyu -at- gmail.com

A printable version CV is here.

I received my Ph.D. degree from Beijing Institute of Technology in 2014 under supervision of Prof. Jingming Kuang and Prof. Xiang Xie. I was a visiting Ph.D. student in Dr. Philip N. Garner's group at Idiap Research Institute in 2012 and 2013.

News

07/09/2020: I'm joining Apple!
30/03/2020: Our book "Speech Recognition with Kaldi" is available on JoyBuy, Amazon and DangDang
18/09/2017: Our paper "AIShell-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline" is accepted by Oriental COCOSDA 2017 as an oral presentation!
~~21/08/2017: I'm joining Microsoft China!~~
~~02/12/2016: I'm joining Alibaba Robotics Corp. as a Senior Staff Engineer!~~
05/09/2016: Attending Interspeech 2016 at San Francisco!
~~10/12/2015: I'm joining Letv as a Senior Researcher on speech recognition~~.
19/03/2015: Our paper "Incremental Syllable-Context Phonetic Vocoding" was accepted by TASLP.
14/07/2014: I gave a talk about "Real-Time Speech Synthesis" at IEEE ICME 2014.
~~07/03/2014: I'm joining Chinese Academy of Sciences, Institute of Acoustics as an Assistant Researcher~~.

Projects

Playing with Kaldi

contributions

created components for convolutional neural network in nnet2
created and tuned left-biphone setups for Chain model
modified transition model and HMM topology kernel
maintainer of aishell, fisher_swbd, hkust, gale_mandarin and thchs30 benchmarks

Playing with HTS

extentions

HTS_PDFparser: a lite parser for hts_engine model
StreamGenerator: a single stream speech parameter generator for customizable hts_engine
MGETraining: HTS training scripts supporting minimum-generation-error training

Voxforge for Chinese