BIOKDD03: Workshop on Data Mining in Bioinformatics.

BIOKDD03: Workshop on Data Mining in Bioinformatics. Mohammed J. Zaki Jason T. L. Wang Hannu T. T. Toivonen Bioinformatics is the science of managing, mining, and interpreting information from biological sequences and structures. Genome sequencing projects have contributed to an exponential growth in complete and partial sequence databases. The structural genomics initiative aims to catalog the structure- function information for proteins. Advances in technology such as microarrays have launched the subfield of genomics and proteomics to study the genes, proteins, and the regulatory gene expression circuitry inside the cell. What characterizes the state of the field is the flood of data that exists today or that is anticipated in the future; data that needs to be mined to help unlock the secrets of the cell. While tremendous progress has been made over the years, many of the fundamental problems in bioinformatics, such as protein structure prediction or gene finding, are still open. Data mining will play a fundamental role in understanding gene expression, drug design and other emerging problems in genomics and proteomics. Furthermore, text mining will be fundamental in extracting knowledge from the growing literature in bioinformatics. The goal of this workshop is to encourage KDD researchers to take on the numerous challenges that Bioinformatics of- fers. The workshop features keynote talks from noted ex- perts in the field, and the latest data mining research in bioinformatics. Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY 08/27/2003 cs-03-11

BIOKDD03: Workshop on Data Mining in Bioinformatics.

Mohammed J. Zaki

Jason T. L. Wang

Hannu T. T. Toivonen

Bioinformatics is the science of managing, mining, and interpreting information from biological sequences and structures. Genome sequencing projects have contributed to an exponential growth in complete and partial sequence databases. The structural genomics initiative aims to catalog the structure- function information for proteins. Advances in technology such as microarrays have launched the subfield of genomics and proteomics to study the genes, proteins, and the regulatory gene expression circuitry inside the cell. What characterizes the state of the field is the flood of data that exists today or that is anticipated in the future; data that needs to be mined to help unlock the secrets of the cell. While tremendous progress has been made over the years, many of the fundamental problems in bioinformatics, such as protein structure prediction or gene finding, are still open. Data mining will play a fundamental role in understanding gene expression, drug design and other emerging problems in genomics and proteomics. Furthermore, text mining will be fundamental in extracting knowledge from the growing literature in bioinformatics. The goal of this workshop is to encourage KDD researchers to take on the numerous challenges that Bioinformatics of- fers. The workshop features keynote talks from noted ex- perts in the field, and the latest data mining research in bioinformatics.

Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY

08/27/2003

cs-03-11