About this Event
The development of speech recognition technologies in the past decade has dramatically shaped certain industries, such as customer service. In this talk, we start with the basics of speech recognition: What is speech recognition? How does it work and what are the popular toolkits? We then get down to the details of speech recognition technologies, covering state-of-the-art speech recognition algorithms and developments.
Last but not least, you will gain some insight about speech recognition-enabled cutting-edge applications, including some proof-of-concepts that we have been working on with SeaSalt.
Speaker: Guoguo Chen, Co-founder of Seasalt.ai and Vobil.com
Guoguo Chen holds a Ph.D. in Electrical and Computer Engineering from the Johns Hopkins University and a B.E. in Electronic Engineering from Tsinghua University. During his Ph.D., he spent 5 years at the Johns Hopkins Center for Language and Speech Processing, where he worked on various aspects of speech recognition and regularly contributed code to the open-source speech recognition toolkit Kaldi, as well as the open-source deep learning toolkit CNTK. He also spent two summers at Google Inc. where he developed the prototype of Android's wake word detection engine for "Okay Google". In 2016, a freshly graduated Dr. Chen co-founded KITT.AI, a CBInsights AI 100 company, funded by Amazon’s Alexa Fund, Paul Allen’s Allen Institute for Artificial Intelligence, Madrona Venture Group, Founders’ Co-op, and A Level Capital. The company released two products: a customizable wake word engine and a conversation AI toolkit. It had more than 100,000 developers and customers over 20 countries on 4 continents. In 2017, KITT.AI was acquired by Baidu, which set up its first Seattle office with the KITT.AI deal. In 2020, Dr. Chen co-founded Seasalt.ai and Vobil.com.
|