scanpy.queries.biomart_annotations#
- scanpy.queries.biomart_annotations(org, attrs, *, host='www.ensembl.org', use_cache=False)[source]#
Retrieve gene annotations from ensembl biomart.
- Parameters:
- org
str
Organism to query. Must be an organism in ensembl biomart. “hsapiens”, “mmusculus”, “drerio”, etc.
- attrs
Iterable
[str
] Attributes to query biomart for.
- host
str
(default:'www.ensembl.org'
) A valid BioMart host URL. Alternative values include archive urls (like “grch37.ensembl.org”) or regional mirrors (like “useast.ensembl.org”).
- use_cache
bool
(default:False
) Whether pybiomart should use a cache for requests. Will create a
.pybiomart.sqlite
file in current directory if used.
- org
- Return type:
- Returns:
Dataframe containing annotations.
Examples
Retrieve genes coordinates and chromosomes
>>> import scanpy as sc >>> annot = sc.queries.biomart_annotations( ... "hsapiens", ... ["ensembl_gene_id", "start_position", "end_position", "chromosome_name"], ... ).set_index("ensembl_gene_id") >>> adata.var[annot.columns] = annot