Exploiting PubChem for virtual screening期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Exploiting PubChem for virtual screening

Abstract:	*Importance of the field:* PubChem is a public molecular information repository, a scientific showcase of the National Institutes of Health Roadmap Initiative. The PubChem database holds > 27 million records of unique chemical structures of compounds (compound ID) derived from nearly 70 million substance depositions (substance ID), and contains > 449,000 bioassay records with thousands of in vitro biochemical and cell-based screening bioassays established targeting > 7000 proteins and genes linking to > 1.8 million of substances. *Areas covered in this review:* This review builds on recent PubChem-related computational chemistry research reported by other authors while providing readers with an overview of the PubChem database, focusing on its increasing role in cheminformatics, virtual screening and toxicity prediction modeling. *What the reader will gain:* These publicly available data sets in PubChem provide great opportunities for scientists to perform cheminformatics and virtual screening research for computer-aided drug design. However, the high volume and complexity of the data sets, in particular the bioassay-associated false positives/negatives and highly imbalanced data sets in PubChem, also create major challenges. Several approaches regarding the modeling of PubChem data sets and development of virtual screening models for bioactivity and toxicity predictions are also reviewed. *Take home message:* Novel data-mining cheminformatics tools and virtual screening algorithms are being developed and used to retrieve, annotate and analyze the large-scale and highly complex PubChem biological screening data for drug design.

Keywords:	cheminformatics data-mining polypharmacology PubChem toxicity virtual screening

设为首页 | 免责声明 | 关于勤云 | 加入收藏