PubChem Database

Overview

PubChem is the world's largest freely available chemical database, with 110M+ compounds and 270M+ bioactivities. Using the PUG-REST API and PubChemPy, you can query chemical structures by name, CID, or SMILES; retrieve molecular properties; perform similarity and substructure searches; and access bioactivity data.
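To make the PUG-REST side concrete, here is a minimal sketch of how a property request URL can be assembled. It assumes the standard `https://pubchem.ncbi.nlm.nih.gov/rest/pug` base path and the property names `MolecularWeight`, `XLogP`, and `TPSA`. The helper `pug_rest_property_url` is a hypothetical name introduced for illustration; it only builds the URL and sends no request.

```python
from urllib.parse import quote

# Assumed base path for PUG-REST requests
PUG_REST_BASE = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def pug_rest_property_url(namespace, identifier, properties, output="JSON"):
    """Build <base>/compound/<namespace>/<identifier>/property/<props>/<output>."""
    # Percent-encode the identifier so characters such as '(' or '=' in a
    # SMILES string cannot be mistaken for URL syntax.
    encoded = quote(str(identifier), safe="")
    props = ",".join(properties)
    return f"{PUG_REST_BASE}/compound/{namespace}/{encoded}/property/{props}/{output}"


url = pug_rest_property_url("name", "aspirin", ["MolecularWeight", "XLogP", "TPSA"])
print(url)
```

SMILES strings that contain `/` (used for stereochemistry) may need to be sent as a POST body or query argument rather than in the path; check this against the PUG-REST documentation before relying on it.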

When to Use This Skill

This skill should be used when:

Searching for chemical compounds by name, structure (SMILES/InChI), or molecular formula
Retrieving molecular properties (MW, LogP, TPSA, hydrogen bonding descriptors)
Performing similarity searches to find structurally related compounds
Conducting substructure searches for specific chemical motifs
Accessing bioactivity data from screening assays
Converting between chemical identifier formats (CID, SMILES, InChI)
Batch processing multiple compounds for drug-likeness screening or property analysis
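As an illustration of the drug-likeness use case, the sketch below applies Lipinski's rule of five to a dictionary of properties. The function name `lipinski_violations` is hypothetical. The dictionary keys mirror PubChemPy `Compound` attribute names, and the aspirin values are typed in by hand for illustration rather than fetched from PubChem.

```python
# Lipinski rule-of-five upper limits, keyed by PubChemPy-style attribute names
LIPINSKI_LIMITS = {
    "molecular_weight": 500,       # molecular weight <= 500 Da
    "xlogp": 5,                    # LogP <= 5
    "h_bond_donor_count": 5,       # hydrogen bond donors <= 5
    "h_bond_acceptor_count": 10,   # hydrogen bond acceptors <= 10
}


def lipinski_violations(props):
    """Return the names of the rule-of-five criteria that the compound exceeds."""
    # float() tolerates values that arrive as strings (an assumption about
    # how some API responses encode numeric fields).
    return [
        name
        for name, limit in LIPINSKI_LIMITS.items()
        if float(props[name]) > limit
    ]


# Illustrative, hand-entered values for aspirin (not retrieved from PubChem)
aspirin = {
    "molecular_weight": 180.16,
    "xlogp": 1.2,
    "h_bond_donor_count": 1,
    "h_bond_acceptor_count": 4,
}

violations = lipinski_violations(aspirin)
print("passes rule of five" if not violations else f"violations: {violations}")
```

For batch screening, the same function could be mapped over a list of such dictionaries, one per compound.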

Core Capabilities

1. Chemical Structure Search

Search for compounds using multiple identifier types:

By Chemical Name:

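import pubchempy as pcp

# get_compounds returns a list of Compound objects; the list is empty if nothing matches
compounds = pcp.get_compounds('aspirin', 'name')
compound = compounds[0]  # first match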

By CID (Compound ID):

compound = pcp.Compound.from_cid(2244)  # Aspirin

By SMILES:

compound = pcp.get_compounds('CC(=O)OC1=CC=CC=C1C(=O)O', 'smiles')[0]
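The lookups above index the result list with `[0]`, which raises `IndexError` when a query matches nothing. One possible way to guard against that is sketched below. The helper `find_first` and its `search` parameter are hypothetical names introduced here, not part of PubChemPy. The `pubchempy` import is deferred so that the helper can be exercised with a stub and without network access.

```python
# Namespaces accepted by this sketch; PubChemPy's get_compounds accepts these
SUPPORTED_NAMESPACES = {"name", "cid", "smiles", "inchi", "inchikey", "formula"}


def find_first(identifier, namespace="name", search=None):
    """Return the first matching compound, or None if the search finds nothing."""
    if namespace not in SUPPORTED_NAMESPACES:
        raise ValueError(f"unsupported namespace: {namespace!r}")
    if search is None:
        # Deferred import: only needed when querying PubChem for real
        import pubchempy as pcp
        search = pcp.get_compounds
    results = search(identifier, namespace)
    return results[0] if results else None


# Stub standing in for pcp.get_compounds, so this example runs offline
def fake_search(identifier, namespace):
    return ["<Compound 2244>"] if identifier == "aspirin" else []


print(find_first("aspirin", "name", search=fake_search))
print(find_first("not-a-compound", "name", search=fake_search))
```

Against the live service, `find_first('CC(=O)OC1=CC=CC=C1C(=O)O', 'smiles')` would call `pcp.get_compounds` directly and return either a `Compound` or `None`.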
