Gerasimos (Makis) Potamianos

About me


Research lab: Watson Research Center (Yorktown)


Gerasimos Potamianos has joined the Institute of Informatics and Telecommunications at the National Center for Scientific Research (NCSR), "Demokritos", in Athens, Greece, as a Research Director.

He can be contacted at gpotam@ieee.org.

Updated information can be found at http://www.iit.demokritos.gr/~gpotam

This web page is no longer being maintained.



Gerasimos (Makis) Potamianos

Manager, Multimodal Conversational Solutions Department
Human Language Technologies / Multilingual Analytics and User Technologies, IBM T.J. Watson Research Center

Gerasimos (Makis) Potamianos

CONTACT INFO
RESUME
ONGOING PROJECTS AND RESEARCH
PUBLICATIONS BY TOPIC | BY TYPE

Short Biography:

Gerasimos (Makis) Potamianos was born in Athens, Greece, in 1965. He received the Diploma degree in Electrical and Computer Engineering from the National Technical University of Athens, Greece in 1988, and the M.S.E. and Ph.D. degrees in Electrical and Computer Engineering from the Johns Hopkins University, Baltimore, Maryland, in 1990 and 1994, respectively. His thesis focused on Markov random fields for image processing, and was completed under the supervision of Professor John Goutsias, faculty with the Image Analysis and Communications Laboratory (currently with the Center for Imaging Science).

From the fall of 1994 till the summer of 1996, he has been a Postdoctoral Fellow with the Center for Language and Speech Processing (CLSP), where together with Professor Frederick Jelinek he investigated decision tree based language models. From August 1996 till August 1999, he has been a Senior Member of Technical Staff with the then Speech and Image Processing Services Laboratory at AT&T Labs-Research, in Murray Hill and Florham Park, New Jersey, working on audio-visual automatic speech recognition and synthesis.

In September 1999, he joined the Human Language Technologies Department (currently Multilingual Analytics and User Technologies) at IBM Research at the IBM Thomas J. Watson Research Center, where he currently manages the Multimodal Conversational Solutions Department. At IBM, he has continued research on audio-visual speech, with recent emphasis placed on multisensory and multimodal speech processing in smart spaces and ambient intelligence environments, within the framework of European Union funded projects CHIL, DICIT, and NETCARITY. In his management role, he has also been leading development and integration efforts of natural language processing and dialog management into conversational platforms and solutions.

Several recent highlights include his participation at the Johns Hopkins CLSP Summer Workshop (WS'00) on audio-visual speech recognition, teaching at the 2001 ELSNET summer school, a tutorial at ICIP 2003, plenary talks at AVSP 2003 and VisHCI 2006, panel participation at MMSP 2006, and guest editor of special issues of the EURASIP JASP 2002 and IEEE TASLP 2008 journals. He also received the best paper award at ICME 2005 and was co-author of the best student paper at Interspeech 2007.

His research interests span the areas of multimodal speech processing with applications to human-computer interaction and ambient intelligence, with particular emphasis on audio-visual speech processing, automatic speech recognition, multimedia signal processing and fusion, as well as computer vision for human detection and tracking. He has published over 75 articles in these areas that have received over 400 citations and has five patents granted. He is a member of IEEE and a member of the Technical Chamber of Greece.





Last updated 29 Apr 2009