scanpy.read

scanpy.read(filename, backed=None, sheet=None, ext=None, delimiter=None, first_column_names=False, backup_url=None, cache=False, cache_compression=<Empty.token: 0>, **kwargs)

Read file and return AnnData object.

To speed up reading, consider passing cache=True, which creates an hdf5 cache file.

Parameters
filename : Path | strUnion[Path, str]

If the filename has no file extension, it is interpreted as a key for generating a filename via sc.settings.writedir / (filename + sc.settings.file_format_data). This is the same behavior as in sc.read(filename, ...).

backed : {‘r’, ‘r+’} | NoneOptional[Literal[‘r’, ‘r+’]] (default: None)

If 'r', load AnnData in backed mode instead of fully loading it into memory (memory mode). If you want to modify backed attributes of the AnnData object, you need to choose 'r+'.

sheet : str | NoneOptional[str] (default: None)

Name of sheet/table in hdf5 or Excel file.

ext : str | NoneOptional[str] (default: None)

Extension that indicates the file type. If None, uses extension of filename.

delimiter : str | NoneOptional[str] (default: None)

Delimiter that separates data within text file. If None, will split at arbitrary number of white spaces, which is different from enforcing splitting at any single white space ' '.

first_column_names : boolbool (default: False)

Assume the first column stores row names. This is only necessary if these are not strings: strings in the first column are automatically assumed to be row names.

backup_url : str | NoneOptional[str] (default: None)

Retrieve the file from an URL if not present on disk.

cache : boolbool (default: False)

If False, read from source, if True, read from fast ‘h5ad’ cache.

cache_compression : {‘gzip’, ‘lzf’} | None | EmptyUnion[Literal[‘gzip’, ‘lzf’], None, Empty] (default: <Empty.token: 0>)

See the h5py Filter pipeline. (Default: settings.cache_compression)

kwargs

Parameters passed to read_loom().

Return type

AnnDataAnnData

Returns

An AnnData object