Giri Iyengar's Publications

IBM publications

Arnon Amir, Shih-Fu Chang, Martin Franz, Giridharan Iyengar, John R. Kender, Ching-Yung Lin, Milind R. Naphade, Apostol Natsev, John R. Smith and Jelena Tesic. IBM Research TRECVID-2004 Video Retrieval System. NIST TRECVID-2004. NIST, March 2005.

Jintao Jiang, Gerasimos Potamianos and Giridharan Iyengar. Improved Face Finding in Visually Challenging Environments. ICME-2005, Int. Conf. Multimedia and Expo. IEEE, March 2005.

Janne Argillander, Harriet J. Nock and Giridharan Iyengar. Semantic Annotation of Multimedia Using Maximum Entropy Models. ICASSP 2005 - International Conference on Acoustics, Speech and Signal Processing. IEEE Signal Processing Society, March 2005.

A. Amir, G. Iyengar, Ching-Yung Lin, Milind R. Naphade, Apostol Natsev, Chalapathy Neti, Harriet J. Nock, John R. Smith and B. Tseng. Multimodal Video Search Techniques: Late Fusion of Speech-Based Retrieval and Visual Content-Based Retrieval. ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing. May 2004.

Bhuvana Ramabhadran, Jing Huang, Upendra V. Chaudhari, Giridharan Iyengar and Harriet J. Nock. Guess Who's Speaking: Audio Segmentation for the Automated Transcription of Large Spoken Archives. Eurospeech - Eurospeech 2003 (Interspeech 2003). September 2003.

Harriet J. Nock, W. Adams and Giridharan Iyengar. User-trainable Video Annotation using Multimodal Cues. SIGIR 2003. July 2003.

Harriet J. Nock, Giridharan Iyengar and Chalapathy Neti. Speaker Localisation using Audio-Visual Synchrony: An Empirical Study. CIVR 2003 - International Conference on Image and Video Retrieval. July 2003.

Harriet J. Nock, Giridharan Iyengar and Chalapathy Neti. Issues in Speech-based Retrieval of Video. MSDR 2003 - ISCA Workshop on Multilingual Spoken Document Retrieval. March 2003.

Giridharan Iyengar, Harriet J. Nock and Chalapathy Neti. Audio-Visual Synchrony for Detection of Monologues in Video Archives. ICASSP 2003 - International Conference on Acoustics, Speech and Signal Processing. IEEE, February 2003.

Harriet J. Nock, Giridharan Iyengar and Chalapathy Neti. Assessing Face and Speech Consistency for Monologue Detection in Video. ACM Multimedia MM02. December 2002.

John R. Smith, William H. Adams, Arnon Amir, Chitra Dorai, Sugata Ghosal, Giridharan Iyengar, Alejandro Jaimes, Christian Lang, Ching-Yung Lin, Apostol Natsev, Chalapathy Neti, Harriet J. Nock, Haim Permuter, Raghav Singh, Savitha Srinivasan, Belle L. Tseng, Ashwin T Varadaraju and Dongqing Zhang. IBM Research TREC-2002 Video Retrieval System. NIST TREC-2002 - Text Retrieval Conference. NIST, November 2002.

Giridharan Iyengar, Harriet J. Nock, Chalapathy Neti and Martin Franz. Semantic Indexing of Multimedia Using Audio, Text and Visual Cues. ICME 2002 - IEEE International Conference on Multimedia and Expo. August 2002.

Arnon Amir, Sankar Basu, Giridharan Iyengar, Ching-Yung Lin, Milind R. Naphade, John R. Smith, Savitha Srinivasan and Belle L. Tseng. A Multi-Modal System for the Retrieval of Semantic Video Events. Computer Vision and Image Understanding 96(2):216-36, February 2004.

Harriet J. Nock, Giridharan Iyengar and Chalapathy Neti. Multimodal Processing by Finding a Common Cause. Communications of the ACM 47(1):51-56, January 2004.

Hugh W. Adams Jr, Giridharan Iyengar, Ching-Yung Lin, Milind R. Naphade, Chalapathy Neti, Harriet J. Nock and John R. Smith. Semantic Indexing of Multimedia Content Using Visual, Audio and Text Cues. EURASIP Journal on Applied Signal Processing 2003(2):170-185, February 2003.

Other publications

Iyengar G., Neti C. and Verma A. “Robust detection of visual ROI for visual speechreading”. Multimedia Signal Processing, Cannes, France. January 2001.

Potamianos G., Verma A., Basu S. and Iyengar G. “A cascade image transform for speaker independent automatic speechreading”. IEEE Conference on Multimedia and Expo, New York. July 2000.

Potamianos G., Neti C., Iyengar G., Senior A. W. and Verma A. “A cascade visual front end for speaker independent automatic speechreading”. International Journal of Speech Technology, April 2001.