Yi Chen's Home Page

Yi Chen is a Professor and the Henry J. Leir Chair in Healthcare in the School of Management with a joint appointment in the College of Computing Sciences at New Jersey Institute of Technology (NJIT). Prior to joining NJIT, she was an associate professor in Arizona State University. She received her Ph.D. degree in Computer Science from the University of Pennsylvania in 2005 and B.S. from Central South University in 1999. Her research interests span many aspects of data management. Her current research focuses on information discovery on big data, social computing and social network analysis, workflow management, and information integration, with applications in business, Web and healthcare domains. She has served in the organization and program committees for various conferences, including SIGMOD, VLDB, ICDE and WWW, served as an associate editor for DAPD and a guest editor for TKDE and PVLDB. She is serving as a general co-chair for SIGMOD'2012. Yi Chen is a recipient of Outstanding Researcher in Computer Science and Engineering in ASU (2011), a Google Research Award (2011), IBM Faculty Award (2010) and an NSF CAREER Award (2009).

Towards Effective Search and Knowledge Discovery in Online Health Forums
We are investigating a patient-centered approach for information extraction, classification, and integration to support effective search and knowledge discovery in healthcare forums.
See more

XSEEK : an Intelligent Search Engine for Semi-Structured Data
We are developing a search engine for databases. We are identifying a spectrum of problem space for supporting keyword search on structured/semi-structured data, ranging from evaluation framework, generating high-quality results, to helping users analyze results, and developing techniques to address the open challenges. More Information about XSEEK

SIGMOD 2009 Tutorial:

pptx

ppt

ICDE 2011 Tutorial:

pptx

Querying Incomplete and Inconsistent Web Databases
We are developing techniques for querying web databases in the presence of the imprecise nature of user queries as well as inconsistence in the data. More Information about the Project

ExpertNet: Collaboration Network for Intelligent Social Computing
We are developing computational foundations and quantitative frameworks to model, optimize, and search collaborative social networks to expedite problem-solving and innovation. More Information about ExpertNet

SWAN: Smart Workflow Management
We are developing techniques for workflow management, including workflow modeling, provenance reasoning, workflow search, and optimization, for both scientific workflows and business processes, for regular workflows as well as ad-hoc workflows. More Information about SWAN and its sub-project SmartFlow for managing ad-hoc workflows specifically.

Information Extraction -- A Database Centric Approach
In collaboration with Prof. Chitta Baral, Graciela Gonzalez

Overview: Traditionally information extraction systems are implemented as a pipeline of special-purpose processing modules, which necessitates extraction to be re-applied from scratch to the entire text corpus whenever the data, processing modules, or extraction goals change. we propose an innovative paradigm for information extraction: the parse trees that are output by natural language processing on textual documents are stored in a database, and then extraction is expressed as queries using our proposed structured query language on databases. Such a paradigm have several advantages:

avoiding writing special-purpose extraction programs,
leveraging query optimization in databases,
allowing incremental extraction upon changes.

Furthermore, to allow ordinary users to easily perform information extraction or keyword search on corpus without learning the structured query language, we are investigating techniques that automatically generate structured queries based on the user keyword query and its pseudo-relevance feedback to obtain high-quality results.

Publications: TKDE'12, ICDE'10 (demo), ICDE'06

Completed Projects

XML Stream Processing

XML Databases

XML Constraints

Querying Linguistic Databases

A Complete List of Publications

Yi Chen

Short Bio

Research Projects

Teaching

Recent Professional Services