Search the gprofiler database for species that match the input text string,
then translate the match to a standardised species ID.
map_species(
species = NULL,
search_cols = c("display_name", "id", "scientific_name", "taxonomy_id"),
output_format = c("scientific_name", "id", "display_name", "taxonomy_id", "version",
"scientific_name_formatted"),
method = c("homologene", "gprofiler", "babelgene"),
remove_subspecies = TRUE,
remove_subspecies_exceptions = c("Canis lupus familiaris"),
use_local = TRUE,
verbose = TRUE
)
Species query (e.g. "human", "homo sapiens", "hsapiens", or 9606).
If given a list, will iterate queries for each item.
Set to NULL to return all species.
Which columns to search for the species substring in the metadata API.
Which column to return.
R package to use for gene mapping:
"gprofiler": Slower but more species and genes.
"homologene": Faster but fewer species and genes.
"babelgene": Faster but fewer species and genes. Also gives consensus scores for each gene mapping based on several different data sources.
Only keep the first two taxonomic levels: e.g. "Canis lupus familiaris" --> "Canis lupus"
Selected species to ignore when remove_subspecies=TRUE,
e.g. "Canis lupus familiaris" --> "Canis lupus familiaris"
If TRUE (default), map_species uses a locally stored version of the species metadata table
instead of pulling directly from the gprofiler API.
The local version may not be fully up to date,
but should suffice for most use cases.
Print messages.
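The interaction between remove_subspecies and remove_subspecies_exceptions can be illustrated with a minimal sketch. The helper trim_subspecies below is hypothetical and is not part of the package; it only mimics the rule described above (keep the first two taxonomic levels unless the name is listed as an exception), and the actual internals of map_species may differ.

```r
# Hypothetical helper (not part of the package): illustrates the trimming rule only.
trim_subspecies <- function(species,
                            exceptions = c("Canis lupus familiaris")) {
  vapply(species, function(sp) {
    # Names listed as exceptions are returned unchanged.
    if (sp %in% exceptions) return(sp)
    # Otherwise split on whitespace and keep at most the first two words.
    words <- strsplit(sp, "\\s+")[[1]]
    paste(utils::head(words, 2), collapse = " ")
  }, character(1), USE.NAMES = FALSE)
}

# The default exception is kept whole; other trinomials are shortened.
trim_subspecies(c("Canis lupus familiaris", "Gorilla gorilla gorilla"))

# With no exceptions, every name is cut to two levels.
trim_subspecies("Canis lupus familiaris", exceptions = character(0))
```

Names that already have two words or fewer pass through unchanged in this sketch.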
Species ID of type output_format
ids <- map_species(species = c(
"human", 9606, "mus musculus",
"fly", "C elegans"
))
#> Retrieving all organisms available in homologene.
#> Mapping species name: human
#> Common name mapping found for human
#> 1 organism identified from search: Homo sapiens
#> Mapping species name: 9606
#> 1 organism identified from search: Homo sapiens
#> Mapping species name: mus musculus
#> 1 organism identified from search: Mus musculus
#> Mapping species name: fly
#> Common name mapping found for fly
#> 1 organism identified from search: Drosophila melanogaster
#> Mapping species name: C elegans
#> 1 organism identified from search: Caenorhabditis elegans
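To make the role of search_cols more concrete, here is a rough sketch of a case-insensitive substring search across several metadata columns. Both the toy table meta and the helper search_species are assumptions made up for illustration; the real function also handles common-name mapping (as the messages above show) and may match differently.

```r
# Toy metadata table (illustration only; not the real species metadata table).
meta <- data.frame(
  display_name = c("Human", "Mouse"),
  id = c("hsapiens", "mmusculus"),
  scientific_name = c("Homo sapiens", "Mus musculus"),
  taxonomy_id = c("9606", "10090"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: return rows where any searched column contains the query.
search_species <- function(query, meta,
                           search_cols = c("display_name", "id",
                                           "scientific_name", "taxonomy_id")) {
  # Coerce numeric queries such as 9606 to text and ignore case.
  q <- tolower(as.character(query))
  hits <- vapply(search_cols, function(col) {
    grepl(q, tolower(meta[[col]]), fixed = TRUE)
  }, logical(nrow(meta)))
  # Arrange hits as rows x columns and keep rows with at least one match.
  hits <- matrix(hits, nrow = nrow(meta))
  meta[rowSums(hits) > 0, , drop = FALSE]
}

search_species("homo sapiens", meta)$id
search_species(9606, meta)$scientific_name
```

Restricting search_cols to a single column, for example "taxonomy_id", narrows the search to that column in the same way.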