New York City
I am postdoctoral researcher with the IBM Thomas J. Watson Research Center, Yorktown Heights, NY. At IBM, I focus on improving IBM Statistical Machine Translation system. Prior to that, I was a postdoctoral researcher working in the Computational Linguistics and Information Processing (CLIP) lab at the University of Maryland, Institute for Advanced Computer Studies. I worked with Philip Resnik also on Statistical Machine Translation. Prior to UMD, I was a postgraduate student at the School of Computing, National University of Singapore, co-advised by two wonderful supervisors: Haizhou Li (I2R) and Min-Yen Kan. At NUS, I was affiliated to the Web Information Retrieval / Natural Language Processing (WING) Group. My broad research interest is the application of statistical methods to natural language problems. Specifically, I've been looking at practical ways to approximate linguistic knowledge (both syntactic and semantic) using easy-to-obtain and readily-available statistics, rather depending on linguistically-annotated data that is often impractical to obtain. My research motto has always been: "Beat the data until it confesses". So far, I survive -- check out my publications on approximating syntactic knowledge via models based on function words which are approximated with most frequent words in the corpus. Having said that, I now am looking at ways to utilize existing annotated-data to be applied to word alignment modeling.