Journal of Heredity 2003:94(1):15-22
© 2003 The American Genetic Association
A Survey of Canine Expressed Sequence Tags and a Display of Their Annotations Through a Flexible Web-Based Interface
From the Cold Spring Harbor Laboratory, Genome Research Center, 500 Sunnyside Blvd., Woodbury, NY 11797 (Palmer, O'Shaughnessy, Preston, Santos, Balija, Nascimento, Zutavern, and McCombie); the Department of Clinical Studies, University of Pennsylvania, School of Veterinary Medicine, 3900 Delancey St., Philadelphia, PA 19104-6010 (Henthorn); and Cold Spring Harbor Laboratory, 1 Bungtown Rd., Cold Spring Harbor, NY 11724 (Hannon).
Address correspondence to W. R. McCombie at the address above, or e-mail: mccombie@cshl.org.
We have initially sequenced approximately 8,000 canine expressed sequence tags (ESTs) from several complementary DNA (cDNA) libraries derived from testes, whole brain, and Madin-Darby canine kidney (MDCK) cells. Analysis of these sequences shows that they provide partial sequence information for about 5%–10% of canine genes. An analysis pipeline has been created to cluster the ESTs and to map both individual and clustered ESTs to the human genome and the human proteome. Gene ontology (GO) terms have been assigned to the ESTs and clusters based on their top matches to the International Protein Index (IPI) set of human proteins. The generated data are stored in a MySQL relational database for analysis and display. A Web-based Perl script has been written to display the analyzed data to the scientific community.
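The GO assignment step described above can be illustrated with a minimal sketch. It is not the pipeline reported here, which was built around Perl and MySQL. It is a small Python example assuming that each EST or cluster already has a list of scored protein matches and that a protein-to-GO lookup is available. The function names, the identifiers `EST001` and `IPI_A`, the scores, and the GO pairings are hypothetical placeholders chosen for illustration.

```python
def top_hits(hits):
    """Return {est_id: protein_id}, keeping the highest-scoring match per EST.

    `hits` is an iterable of (est_id, protein_id, score) tuples; the tuple
    layout is an assumption of this sketch, not a format from the paper.
    """
    best = {}
    for est_id, protein_id, score in hits:
        if est_id not in best or score > best[est_id][1]:
            best[est_id] = (protein_id, score)
    return {est_id: protein_id for est_id, (protein_id, _) in best.items()}


def assign_go_terms(hits, protein_go):
    """Transfer GO terms to each EST from its top-scoring protein match.

    ESTs whose top match has no GO annotation receive an empty list.
    """
    return {
        est_id: sorted(protein_go.get(protein_id, ()))
        for est_id, protein_id in top_hits(hits).items()
    }


# Hypothetical toy data: identifiers, scores, and GO pairings are placeholders.
hits = [
    ("EST001", "IPI_A", 120.0),
    ("EST001", "IPI_B", 85.5),
    ("EST002", "IPI_B", 60.0),
    ("EST003", "IPI_C", 42.0),
]
protein_go = {
    "IPI_A": {"GO:0005524"},
    "IPI_B": {"GO:0003677", "GO:0005634"},
}

annotations = assign_go_terms(hits, protein_go)
```

In this sketch an EST with several matches inherits terms only from the best-scoring one, which mirrors the "top match" rule stated above. How ties and unannotated proteins are handled is a design choice of the example, not something the abstract specifies.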