Journal Papers and Book Chapters
M. Picheny and D. Nahamoo Towards Superhuman Speech Recognition in Springer Handbook on Speech Processing,L. Rabiner and B. Juang, eds. Springer (November 2007)
J. Pitrelli, R. Bakis, E. Eide, R. Fernandez, W. Hamza, M. Picheny, The IBM expressive text-to-speech synthesis system for American English IEEE Transactions on Speech and Audio Processing, July 2006, vol. 14, no. 4, pg. 1099
H Erdogan , R Sarikaya, SF Chen, Y Gao, M Picheny, Using semantic analysis to improve speech recognition performance Computer Speech and Language, July 2005, vol.19, no. 3, pg. 321
R Sarikaya, Y Gao, M Picheny, H Erdogan, Semantic confidence measurement for spoken dialog systems IEEE Transactions on Speech and Audio Processing, July 2005, vol. 13, no. 4, pg. 534
W. Byrne, D. Doermann, M. Franz, S. Gustman, J. Hajic, D. Oard, M. Picheny, J. Psutka, B. Ramabhadran, D. Soergel, T. Ward, and W.-J. Zhu. Automatic recognition of spontaneous speech for access to multilingual oral history archives. IEEE Transactions on Speech and Audio Processing, July 2004, vol. 12, no. 4 pg. 420
M. Padmanbhan and M. Picheny, Large Vocabulary Speech Recognition Algorithms, IEEE Computer, March 2002 Vol. 35 No. 4 pg. 42
L. Bahl, R. Bakis, S. Das and M. Picheny, Speech Recognition, to appear in Wiley Encyclopedia of Electrical and Electronics Engineering, John Webster, ed. Wiley (spring 1999).
M. Padmanabhan, L. Bahl, D. Nahamoo, and M. Picheny, Speaker Clustering and Transformation for Speaker Adaptation in Speech Recognition Systems, IEEE Transactions on Speech and Audio Processing, January 1998, Volume 6, Number 1, pg. 71
S. K. Das and M. Picheny, Issues in Practical Large Vocabulary Isolated Word Recognition Systems: The IBM Tangora System, in Automatic Speech and Speaker Recognition: Advanced Topics, Chin-Hui Lee, Frank K. Soong, and Kuldip K. Paliwal, eds. Kluwer Academic Publishers Boston, 1996, pg. 457
J. Bellegarda, P. deSouza, A. Nadas, D. Nahamoo, M. Picheny and L. Bahl, The Metamorphic Algorithm: A Speaker Mapping Approach to Data Augmentation, IEEE Transactions on Speech and Audio Processing, July 1994, Volume 2, Number 3, pg. 413
D. Rtischev, D. Nahamoo, and M. Picheny, Speaker Adaptation via VQ Prototype Modification, IEEE Transactions on Speech and Audio Processing, January 1994, Volume 2, Number 1, pg. 94
L. Bahl, P. Brown, P. deSouza, R. Mercer, and M. Picheny, A Method for the Construction of Acoustic Markov Models for Words, IEEE Transactions on Speech and Audio Processing, October 1993, Volume 1, Number 4, pg. 443
L. Bahl, J. Bellegarda, P. deSouza, P. Gopalakrishnan, D. Nahamoo, and
M. Picheny, Multonic Word Models for Large Vocabulary Continuous Speech
Recognition, IEEE Transactions on Speech and Audio Processing, July
1993, Volume 1, Number 3, pg. 334
Selected Conference Papers
U.Chaudhari, M. Picheny Improvements in Phone Based Audio Search via Constrained Match with High Order Confusion Estimates ASRU 2007 December 2007, Kyoto, Japan
D. Jiang, M. Picheny, Y. Qin, Voice-Melody Transcription Under a Speech Recognition Framework International Conference on Acoustics, Speech, and Signal Processing, April 2007, Honolulu, Hawaii
R. Fernandez, W.Zhang, E. Eide, R. Bakis, W. Hamza, Y. Liu, M.Picheny, J. Pitrelli, Y. Qin, Z. Shuang, L. Shen, Toward Multiple-Language TTS: Experiments in English and Mandarin , Interspeech 2005, September 2005, Lisbon, Portugal.
M. Franz, B.Ramabhadran, T.Ward, M.Picheny Automated Transcription and Topic Segmentation of Large Spoken Archives Eurospeech 2003, September 2003, Geneva, Switzerland
B. Kingsbury, L. Mangu, G. Saon, G. Zweig, S. Axelrod, V. Goel, K. Visweswariah, M. Picheny Toward Domain-Independent Conversational Speech Recognition Eurospeech 2003, September 2003, Geneva, Switzerland
E. Eide, A. Aaron, R. Bakis, P. Cohen, R. Donovan, W. Hamza, T. Mathes,M. Picheny, M. Polkosky, M. Smith, M. Viswanathan Recent Improvements to the IBM Trainable Speech Synthesis System International Conference on Acoustics, Speech, and Signal Processing, April 2003, Hong Kong
B. Ramabhadran, J. Huang, M. Picheny Towards Automatic Transcription of Large Spoken Archives - English ASR for the MALACH Project International Conference on Acoustics, Speech, and Signal Processing, April 2003, Hong Kong
R. Sarikaya Y. Gao H. Erdogan, M. Picheny TurnBased Language Modeling for Spoken Dialog Systems International Conference on Acoustics, Speech, and Signal Processing May 2002, Orlando, Florida
A Aaron, S Chen, P Cohen, S Dharanipragada, E Eide, M Franz, J-M Leroux, X Luo B Maison, L Mangu, T Mathes, M Novak, P Olsen, M Picheny, H Printz, B Ramabhadran A Sakrajda, G Saon, B Tydlitat, K Visweswariah, D Yuk Speech Recognition for DARPA Communicator International Conference on Acoustics, Speech, and Signal Processing May 2001, Salt Lake City, Utah
Y. Gao, B. Ramabhadran, J. Chen, H. Erdogan, M. Picheny Innovation Approaches for Large Vocabulary Name Recognition International Conference on Acoustics, Speech, and Signal Processing May 2001, Salt Lake City, Utah
M. Picheny, Heredity and Environment in Speech Recognition: The Role of A Priori Information vs. Data, ICSLP '00 - The 6th International Conference on Spoken Language Processing, Oct. 2000, Beijing, China
M. Padmanabhan and M. Picheny, Towards Super-Human Speech Recognition, ASR2000 - ISCA Tutorial and Research Workshop Sept. 2000, Paris, France
M. Picheny, Challenges in Real-time Implementations of Large Vocabulary Dictation Systems: Past, Present And Future, ASRU '99 IEEE International Workshop on Automatic Speech Recognition and Understanding, Dec. 1999, Keystone, Colorado
P. de Souza, B. Ramabhadran, Y. Gao, and M. Picheny, Enhanced Likelihood Computation Using Regression, Eurospeech '99 - 6TH European Conference on Speech Communication and Technology, Sept. 1999, Budapest, Hungary
M. Novak and M. Picheny, Speed Improvement of the Time-Asynchronous Acoustic Fast Match, Eurospeech '99 - 6TH European Conference on Speech Communication and Technology, Sept. 1999, Budapest, Hungary
F. Liu and M. Picheny, On Variable Sampling Frequencies in Speech Recognition ICSLP '98 - The 5th International Conference on Spoken Language Processing, Dec. 1998, Sydney, Australia
E. Jan, R. Bakis, F. Liu and M. Picheny Telephone Band LVCSR for Hearing-Impaired Users, ICSLP '98 - The 5th International Conference on Spoken Language Processing, Dec. 1998, Sydney, Australia
Q. Lin, S. Das, D. Lubensky, M. Picheny A New Confidence Measure Based on Rank-Ordering Subphone Scores ICSLP '98 - The 5th International Conference on Spoken Language Processing, Dec. 1998, Sydney, Australia
S. Das, D. Nix, and M. Picheny, Improvements in Children's Speech Recognition Performance, International Conference on Acoustics, Speech and Signal Processing, May, 1998 Seattle, WA
Q. Lin, D. Lubensky, M. Picheny, and Srinivasa Rao, Key-Phrase Spotting Using and Integrated Language Model of N-Grams and Finite-State Grammar Eurospeech '97 - 5th European Conference on Speech Communication and Technology, September, 1997 Vol. I pg. 255
J. Chen, R. Gopinath, M. Monkowski, M. Picheny, and K. Shen, New Methods in Continuous Mandarin Speech Recognition, Eurospeech '97 - 5th European Conference on Speech Communication and Technology, September, 1997 Vol. 3 pg. 1543
Y. Gao, M. Padmanabhan, and M. Picheny, Speaker Adaptation Based
on Pre-Clustering Training Speakers Eurospeech '97 - 5th European
Conference on Speech Communication and Technology, September, 1997
Vol. 3 pg. 2091
Selected Patents
R. Bakis, H. Chittaluru, E. Epstein, S. Friedland, A. Ittycheriah, S. Lawrence, M. Picheny, C. Rutherfoord, M. Smith Method and system for text-to-speech caching US Patent 7043432, 2006.
Y Gao, B Ramabhadran, J Chen, H Erdogan, M Picheny Methods and apparatus for conversational name dialing systems US Patent 6925154, 2005.
M. Padmanabhan, M. Picheny, D. Nahamoo, S. Roukos Telephone messaging and editing system US Patent 6219638, 2001
J. Chen, F. Liu and M. Picheny, Automatic segmentation of continuous text using statistical approaches, US Patent 5806021, 1998.
J. Chen, R. Gopinath, M. Monkowski, and M. Picheny, Statistical acoustic processing method and apparatus for speech recognition using a toned phoneme system, US Patent 5751905, 1998
P. Gopalakrishnan, D. Nahamoo, M. Padmanbhan, and M. Picheny, Method and apparatus for estimating phone class probabilities a-posteriori using a decision tree US Patent 5680509, 1997
H. Ellozy, D. Kanevsky, M. Kim, D. Nahamoo, M. Picheny, and W. Zadrozny, Automatic indexing and aligning of audio and text using speech recognition US Patent 5649060, 1997
L. Bahl, P. deSouza, P. Gopalakrishnan, and M. Picheny, Speech recognition using dynamic features US Patent 5615299, 1997]
