scanpy.external.tl.phate

scanpy.external.tl.phate(adata, n_components=2, k=5, a=15, n_landmark=2000, t='auto', gamma=1.0, n_pca=100, knn_dist='euclidean', mds_dist='euclidean', mds='metric', n_jobs=None, random_state=None, verbose=None, copy=False, **kwargs)

PHATE [Moon17].

Potential of Heat-diffusion for Affinity-based Trajectory Embedding (PHATE) embeds high dimensional single-cell data into two or three dimensions for visualization of biological progressions.

For more information and access to the object-oriented interface, read the PHATE documentation. For tutorials, bug reports, and R/MATLAB implementations, visit the PHATE GitHub page. For help using PHATE, go here.

Parameters:

adata : AnnData: Annotated data matrix.
n_components : int (default: 2): number of dimensions in which the data will be embedded
k : int (default: 5): number of nearest neighbors on which to build kernel
a : int (default: 15): sets decay rate of kernel tails. If None, alpha decaying kernel is not used
n_landmark : int (default: 2000): number of landmarks to use in fast PHATE
t : Union[int, str] (default: 'auto'): power to which the diffusion operator is powered sets the level of diffusion. If ‘auto’, t is selected according to the knee point in the Von Neumann Entropy of the diffusion operator
gamma : float (default: 1.0): Informational distance constant between -1 and 1. gamma=1 gives the PHATE log potential, gamma=0 gives a square root potential.
n_pca : int (default: 100): Number of principal components to use for calculating neighborhoods. For extremely large datasets, using n_pca < 20 allows neighborhoods to be calculated in log(n_samples) time.
knn_dist : str (default: 'euclidean'): recommended values: ‘euclidean’ and ‘cosine’ Any metric from scipy.spatial.distance can be used distance metric for building kNN graph
mds_dist : str (default: 'euclidean'): recommended values: ‘euclidean’ and ‘cosine’ Any metric from scipy.spatial.distance can be used distance metric for MDS
mds : Literal['classic', 'metric', 'nonmetric'] (default: 'metric'): Selects which MDS algorithm is used for dimensionality reduction.
n_jobs : Optional[int] (default: None): The number of jobs to use for the computation. If None, sc.settings.n_jobs is used. If -1 all CPUs are used. If 1 is given, no parallel computing code is used at all, which is useful for debugging. For n_jobs below -1, (n_cpus + 1 + n_jobs) are used. Thus for n_jobs = -2, all CPUs but one are used
random_state : Union[None, int, RandomState] (default: None): Random seed. Defaults to the global numpy random number generator
verbose : Union[bool, int, None] (default: None): If True or an int/Verbosity ≥ 2/hint, print status messages. If None, sc.settings.verbosity is used.
copy : bool (default: False): Return a copy instead of writing to adata.
kwargs: Additional arguments to phate.PHATE

Return type:

Optional[AnnData]

Returns:

: Depending on copy, returns or updates adata with the following fields.

X_phatenp.ndarray, (adata.obs, shape=[n_samples, n_components], dtype float): PHATE coordinates of data.

Examples

>>> from anndata import AnnData
>>> import scanpy.external as sce
>>> import phate
>>> tree_data, tree_clusters = phate.tree.gen_dla(
...     n_dim=100,
...     n_branch=20,
...     branch_length=100,
... )
>>> tree_data.shape
(2000, 100)
>>> adata = AnnData(tree_data)
>>> sce.tl.phate(adata, k=5, a=20, t=150)
>>> adata.obsm['X_phate'].shape
(2000, 2)
>>> sce.pl.phate(adata)