About

The main goal of Andrew McCallum's research is to dramatically increase our ability to mine actionable knowledge from unstructured text. He is especially interested in information extraction from the Web, understanding the connections among people and between organizations, expert finding, social network analysis, and mining the scientific literature. To this end, he and his group develop and employ various methods in statistical machine learning, natural language processing, information retrieval, and data mining — tending toward probabilistic approaches and graphical models. He is one of the pioneers in the development of conditional random fields. As a demonstration of his research, his group has created the research paper search engine at http://www.rexa.info.

McCallum joined the faculty of the Manning College of Information and Computer Sciences (CICS) as a research associate professor in 2002 and became an associate professor in 2003 (awarded tenure in 2007). Previously, he was vice president of research and development at WhizBang Labs, and director of its 30-person research and development lab in Pittsburgh, Pa. Prior to joining WhizBang, he was a research scientist and research coordinator at Just Research (Justsystem Pittsburgh Research Center), where he spearheaded the development of technology for statistical text processing. In 1996; he was a postdoctoral fellow at Carnegie Mellon University.

McCallum has over 50 research publications spanning machine learning, natural language processing, and reinforcement learning. He is the PI on a prestigious NSF Medium ITR grant and Co-PI on another. In 2004 and 2007, he received the IBM Faculty Partnership award. In 2004, one of his papers won Honorable Mention at AAAI. McCallum was selected as a UMass Amherst Lilly Teaching Fellow for the 2005-2006 academic year, and received the College of Natural Sciences and Mathematics' Outstanding Faculty Research Award in 2007. He is an action editor on the board of the Journal of Machine Learning Research, and has served on the program committees for many technical conferences, including IJCAI, AAAI, ICML, NIPS, UAI, ACL, and HLT. He has given invited talks in academia and industry, including at MIT, Stanford, CMU, UT Austin, Xerox PARC, IBM Research, Microsoft Research, AT&T Research, and Google. In 2003, he gave a tutorial, "Information Extraction From the World Wide Web," at the Neural Information Processing Systems conference in Vancouver, Canada, and at the KDD Conference.