Carnegie Mellon University


Speech Processing

By Alexander Rudnicky

The Sphinx project is an umbrella for research in basic speech technologies. Current activities include systems for real-time recognition and multi-modal interaction. Research projects include multi-party conversation management, learning through spoken language, long-term user adaptation and the development of human-robot interfaces (such as avatars). Research in speech recognition includes out-of-vocabulary word detection and representation, and the use of conversational structure to enhance spoken term detection. The Sphinx recognition codebase and the Olympus dialog codebase are open source and are used by a number of projects in LTI and elsewhere in the university, as well as by a large number of other sites.