scanpy.pp.filter_genes
- scanpy.pp.filter_genes(data, min_counts=None, min_cells=None, max_counts=None, max_cells=None, inplace=True, copy=False)
Filter genes based on number of cells or counts.
Keep genes that have at least
min_countscounts or are expressed in at leastmin_cellscells or have at mostmax_countscounts or are expressed in at mostmax_cellscells.Only provide one of the optional parameters
min_counts,min_cells,max_counts,max_cellsper call.- Parameters:
- data :
AnnData An annotated data matrix of shape
n_obs×n_vars. Rows correspond to cells and columns to genes.- min_counts :
Optional[int] (default:None) Minimum number of counts required for a gene to pass filtering.
- min_cells :
Optional[int] (default:None) Minimum number of cells expressed required for a gene to pass filtering.
- max_counts :
Optional[int] (default:None) Maximum number of counts required for a gene to pass filtering.
- max_cells :
Optional[int] (default:None) Maximum number of cells expressed required for a gene to pass filtering.
- inplace :
bool(default:True) Perform computation inplace or return result.
- data :
- Return type:
- Returns:
: Depending on
inplace, returns the following arrays or directly subsets and annotates the data matrix- gene_subset
Boolean index mask that does filtering.
Truemeans that the gene is kept.Falsemeans the gene is removed.- number_per_gene
Depending on what was tresholded (
countsorcells), the array storesn_countsorn_cellsper gene.