icechunk.session#

Sessions for reading and writing data. Includes ForkSession for distributed writes and SessionMode.

icechunk.session #

Classes:

Name	Description
`Session`	A session object that allows for reading and writing data from an Icechunk repository.
`SessionMode`	Enum for session access modes

Session #

A session object that allows for reading and writing data from an Icechunk repository.

Methods:

Name	Description
`all_virtual_chunk_locations`	Return the location URLs of all virtual chunks.
`all_virtual_chunk_locations_async`	Return the location URLs of all virtual chunks (async version).
`amend`	Commit the changes in the session to the repository, by amending/overwriting the previous commit.
`amend_async`	Commit the changes in the session to the repository, by amending/overwriting the previous commit.
`chunk_coordinates`	Return an async iterator to all initialized chunks for the array at array_path
`chunk_type`	Return the chunk type for the specified coordinates
`chunk_type_async`	Return the chunk type for the specified coordinates
`commit`	Commit the changes in the session to the repository.
`commit_async`	Commit the changes in the session to the repository (async version).
`discard_changes`	When the session is writable, discard any uncommitted changes.
`flush`	Save the changes in the session to a new snapshot without modifying the current branch.
`flush_async`	Save the changes in the session to a new snapshot without modifying the current branch.
`fork`	Create a child session that can be pickled to a worker job and later merged.
`get_node_id`	Return the node ID for the array or group at the given path.
`get_node_id_async`	Return the node ID for the array or group at the given path.
`merge`	Merge the changes for this session with the changes from another session.
`merge_async`	Merge the changes for this session with the changes from another session (async version).
`move`	Move or rename a node (array or group) in the hierarchy.
`move_async`	Async version of :meth:`move`.
`rebase`	Rebase the session to the latest ancestry of the branch.
`rebase_async`	Rebase the session to the latest ancestry of the branch (async version).
`reindex_array`	Reindex chunks in an array by applying a transformation function.
`shift_array`	Shift all chunks in an array by the given chunk offset.
`status`	Compute an overview of the current session changes

Attributes:

Name	Type	Description
`branch`	`str \| None`	The branch that the session is based on. This is only set if the session is writable.
`config`	`RepositoryConfig`	Get the repository configuration.
`has_uncommitted_changes`	`bool`	Whether the session has uncommitted changes. This is only possibly true if the session is writable.
`mode`	`SessionMode`	The mode of this session.
`read_only`	`bool`	Whether the session is read-only.
`snapshot_id`	`str`	The base snapshot ID of the session.
`store`	`IcechunkStore`	Get a zarr Store object for reading and writing data from the repository using zarr python.

Source code in icechunk-python/python/icechunk/session.py

class Session:
    """A session object that allows for reading and writing data from an Icechunk repository."""

    _session: PySession

    def __init__(self, session: PySession):
        self._session = session

    def __repr__(self) -> str:
        return repr(self._session)

    def __str__(self) -> str:
        return str(self._session)

    def _repr_html_(self) -> str:
        return self._session._repr_html_()

    def __eq__(self, value: object) -> bool:
        if not isinstance(value, Session):
            return False
        return self._session == value._session

    def __getstate__(self) -> object:
        if not self.read_only:
            raise ValueError(
                "You must opt-in to pickle writable sessions in a distributed context "
                "using Session.fork(). "
                "See https://icechunk.io/en/stable/parallel/#distributed-writes for more. "
                "If you are using xarray's `Dataset.to_zarr` method to write dask arrays, "
                "please use `icechunk.xarray.to_icechunk` instead. "
                "If you are using dask & distributed or multi-processing to read/write from the same repository, "
                "then pass a readonly session created using Repository.readonly_session for the read step. "
                "Alternatively, make sure to pass the ForkSession created by Session.fork() for the read step. "
            )
        state = {
            "_session": self._session.as_bytes(),
        }
        return state

    def __setstate__(self, state: object) -> None:
        if not isinstance(state, dict):
            raise ValueError("Invalid state")
        self._session = PySession.from_bytes(state["_session"])

    @property
    def read_only(self) -> bool:
        """
        Whether the session is read-only.

        Returns
        -------
        bool
            True if the session is read-only, False otherwise.
        """
        return self._session.read_only

    @property
    def mode(self) -> SessionMode:
        """
        The mode of this session.

        Returns
        -------
        SessionMode
            The session mode - one of READONLY, WRITABLE, or REARRANGE.
        """
        return self._session.mode

    @property
    def snapshot_id(self) -> str:
        """
        The base snapshot ID of the session.

        Returns
        -------
        str
            The base snapshot ID of the session.
        """
        return self._session.snapshot_id

    @property
    def branch(self) -> str | None:
        """
        The branch that the session is based on. This is only set if the session is writable.

        Returns
        -------
        str or None
            The branch that the session is based on if the session is writable, None otherwise.
        """
        return self._session.branch

    @property
    def has_uncommitted_changes(self) -> bool:
        """
        Whether the session has uncommitted changes. This is only possibly true if the session is writable.

        Returns
        -------
        bool
            True if the session has uncommitted changes, False otherwise.
        """
        return self._session.has_uncommitted_changes

    def status(self) -> Diff:
        """
        Compute an overview of the current session changes

        Returns
        -------
        Diff
            The operations executed in the current session but still not committed.
        """
        return self._session.status()

    def discard_changes(self) -> None:
        """
        When the session is writable, discard any uncommitted changes.
        """
        self._session.discard_changes()

    @property
    def store(self) -> IcechunkStore:
        """
        Get a zarr Store object for reading and writing data from the repository using zarr python.

        Returns
        -------
        IcechunkStore
            A zarr Store object for reading and writing data from the repository.
        """
        return IcechunkStore(self._session.store)

    @property
    def config(self) -> RepositoryConfig:
        """
        Get the repository configuration.

        Notice that changes to the returned object won't be impacted. To change configuration values
        use `Repository.reopen`.

        Returns
        -------
        RepositoryConfig
            The config for the repository that owns this session.
        """
        return self._session.config

    def move(self, from_path: str, to_path: str) -> None:
        """Move or rename a node (array or group) in the hierarchy.

        This is a metadata-only operation—no data is copied. Requires a rearrange session:

            session = repo.rearrange_session("main")
            session.move("/data/raw", "/data/v1")

        Parameters
        ----------
        from_path : str
            The current path of the node (e.g., "/data/raw").
        to_path : str
            The new path for the node (e.g., "/data/v1").
        """
        return self._session.move_node(from_path, to_path)

    async def move_async(self, from_path: str, to_path: str) -> None:
        """Async version of :meth:`move`."""
        return await self._session.move_node_async(from_path, to_path)

    def get_node_id(self, path: str) -> str:
        """
        Return the node ID for the array or group at the given path.

        Each node is assigned an opaque ID when it is created. This ID is
        stable across moves and renames — a node keeps the same ID for its
        entire lifetime. See the `icechunk spec <https://icechunk.io/en/stable/spec/>`_
        for details on node identity.

        Parameters
        ----------
        path : str
            Absolute path to the node (e.g., "/data/temperature").

        Returns
        -------
        str
            The node ID as an opaque string.

        Raises
        ------
        IcechunkError
            If no node exists at the given path.
        """
        return self._session.get_node_id(path)

    async def get_node_id_async(self, path: str) -> str:
        """
        Return the node ID for the array or group at the given path.

        Each node is assigned an opaque ID when it is created. This ID is
        stable across moves and renames — a node keeps the same ID for its
        entire lifetime. See the `icechunk spec <https://icechunk.io/en/stable/spec/>`_
        for details on node identity.

        Parameters
        ----------
        path : str
            Absolute path to the node (e.g., "/data/temperature").

        Returns
        -------
        str
            The node ID as an opaque string.

        Raises
        ------
        IcechunkError
            If no node exists at the given path.
        """
        return await self._session.get_node_id_async(path)

    def all_virtual_chunk_locations(self) -> list[str]:
        """
        Return the location URLs of all virtual chunks.

        Returns
        -------
        list of str
            The location URLs of all virtual chunks.
        """
        return self._session.all_virtual_chunk_locations()

    def reindex_array(
        self,
        array_path: str,
        forward: Callable[[Iterable[int]], Iterable[int] | None],
        backward: Callable[[Iterable[int]], Iterable[int] | None] | None = None,
    ) -> None:
        """Reindex chunks in an array by applying a transformation function.

        Only existing (non-empty) chunks are visited — empty positions are
        skipped. This means that if an empty chunk would have shifted into an
        occupied position, that position retains stale data unless a backward
        function is also provided.

        Parameters
        ----------
        array_path : str
            Path to the array.
        forward : Callable[[Iterable[int]], Iterable[int] | None]
            Function that maps old chunk coordinates to new coordinates. Receives
            a list of non-negative integers (the current chunk index) and must return
            either a new index (as a list/tuple of non-negative integers within the
            array's chunk grid bounds) or ``None`` to skip the chunk (leave it in place).
        backward : Callable[[Iterable[int]], Iterable[int] | None], optional
            Inverse of ``forward``: given a chunk position, returns the position
            that would have mapped there under ``forward``. Must follow the same
            return conventions as ``forward``. When provided, each existing chunk
            position is checked to determine whether it should be cleared — if
            ``backward`` returns ``None`` (out of bounds) or points to a position
            with no chunk, that position is reset to the fill value.
        """
        return self._session.reindex_array(array_path, forward, backward)

    def shift_array(
        self,
        array_path: str,
        chunk_offset: Iterable[int],
    ) -> None:
        """Shift all chunks in an array by the given chunk offset.

        Out-of-bounds chunks are discarded. To preserve them, resize the array first
        to make room. Vacated source positions are cleared (reset to fill value).

        Parameters
        ----------
        array_path : str
            The path to the array to shift.
        chunk_offset : Iterable[int]
            The number of chunks to shift by in each dimension. Positive values
            shift right/down, negative values shift left/up.

        Notes
        -----
        To shift right while preserving all data, first resize the array using zarr's
        array.resize(), then shift.
        """
        self._session.shift_array(array_path, list(chunk_offset))

    async def all_virtual_chunk_locations_async(self) -> list[str]:
        """
        Return the location URLs of all virtual chunks (async version).

        Returns
        -------
        list of str
            The location URLs of all virtual chunks.
        """
        return await self._session.all_virtual_chunk_locations_async()

    async def chunk_coordinates(
        self, array_path: str, batch_size: int = 1000
    ) -> AsyncIterator[tuple[int, ...]]:
        """
        Return an async iterator to all initialized chunks for the array at array_path

        Returns
        -------
        an async iterator to chunk coordinates as tuples
        """
        # We do unbatching here to improve speed. Switching to rust to get
        # a batch is much faster than switching for every element
        async for batch in self._session.chunk_coordinates(array_path, batch_size):
            for coord in batch:
                yield tuple(coord)

    def chunk_type(
        self,
        array_path: str,
        chunk_coordinates: Sequence[int],
    ) -> ChunkType:
        """
        Return the chunk type for the specified coordinates

        Parameters
        ----------
        array_path : str
            The path to the array inside the Zarr store. Example: "/groupA/groupB/outputs/my-array".
        chunk_coordinates: Sequence[int]
            A sequence of integers (list or tuple) used to locate the chunk. Example: [0, 1, 5].

        Returns
        -------
        ChunkType
            One of the supported chunk types.
        """
        return self._session.chunk_type(array_path, chunk_coordinates)

    async def chunk_type_async(
        self,
        array_path: str,
        chunk_coordinates: Sequence[int],
    ) -> ChunkType:
        """
        Return the chunk type for the specified coordinates

        Parameters
        ----------
        array_path : str
            The path to the array inside the Zarr store. Example: "/groupA/groupB/outputs/my-array".
        chunk_coordinates: Sequence[int]
            A sequence of integers (list or tuple) used to locate the chunk. Example: [0, 1, 5].

        Returns
        -------
        ChunkType
            One of the supported chunk types.
        """
        return await self._session.chunk_type_async(array_path, chunk_coordinates)

    def merge(self, *others: "ForkSession") -> None:
        """
        Merge the changes for this session with the changes from another session.

        Parameters
        ----------
        others : ForkSession
            The forked sessions to merge changes from.
        """
        for other in others:
            self._session.merge(other._session)

    async def merge_async(self, *others: "ForkSession") -> None:
        """
        Merge the changes for this session with the changes from another session (async version).

        Parameters
        ----------
        others : ForkSession
            The forked sessions to merge changes from.
        """
        for other in others:
            await self._session.merge_async(other._session)

    def commit(
        self,
        message: str,
        metadata: dict[str, Any] | None = None,
        *,
        rebase_with: ConflictSolver | None = None,
        rebase_tries: int = 1_000,
        allow_empty: bool = False,
    ) -> str:
        """
        Commit the changes in the session to the repository.

        When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

        If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

        Parameters
        ----------
        message : str
            The message to write with the commit.
        metadata : dict[str, Any] | None, optional
            Additional metadata to store with the commit snapshot.
        rebase_with : ConflictSolver | None, optional
            If other session committed while the current session was writing, use Session.rebase with this solver.
        rebase_tries : int, optional
            If other session committed while the current session was writing, use Session.rebase up to this many times in a loop.
        allow_empty : bool, optional
            If True, allow creating a commit even if there are no changes. Default is False.

        Returns
        -------
        str
            The snapshot ID of the new commit.

        Raises
        ------
        icechunk.ConflictError
            If the session is out of date and a conflict occurs.
        icechunk.NoChangesToCommitError
            If there are no changes to commit and allow_empty is False.
        """
        return self._session.commit(
            message,
            metadata,
            rebase_with=rebase_with,
            rebase_tries=rebase_tries,
            allow_empty=allow_empty,
        )

    async def commit_async(
        self,
        message: str,
        metadata: dict[str, Any] | None = None,
        *,
        rebase_with: ConflictSolver | None = None,
        rebase_tries: int = 1_000,
        allow_empty: bool = False,
    ) -> str:
        """
        Commit the changes in the session to the repository (async version).

        When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

        If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

        Parameters
        ----------
        message : str
            The message to write with the commit.
        metadata : dict[str, Any] | None, optional
            Additional metadata to store with the commit snapshot.
        rebase_with : ConflictSolver | None, optional
            If other session committed while the current session was writing, use Session.rebase with this solver.
        rebase_tries : int, optional
            If other session committed while the current session was writing, use Session.rebase up to this many times in a loop.
        allow_empty : bool, optional
            If True, allow creating a commit even if there are no changes. Default is False.

        Returns
        -------
        str
            The snapshot ID of the new commit.

        Raises
        ------
        icechunk.ConflictError
            If the session is out of date and a conflict occurs.
        icechunk.NoChangesToCommitError
            If there are no changes to commit and allow_empty is False.
        """
        return await self._session.commit_async(
            message,
            metadata,
            rebase_with=rebase_with,
            rebase_tries=rebase_tries,
            allow_empty=allow_empty,
        )

    def amend(
        self,
        message: str,
        *,
        metadata: dict[str, Any] | None = None,
        allow_empty: bool = False,
    ) -> str:
        """
        Commit the changes in the session to the repository, by amending/overwriting the previous commit.

        When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

        If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

        This operation doesn't create a new commit in the repo ancestry. It replaces the previous commit.

        The first commit to the repo cannot be amended.

        Parameters
        ----------
        message : str
            The message to write with the commit.
        metadata : dict[str, Any] | None, optional
            Additional metadata to store with the commit snapshot.
        allow_empty : bool, optional
            If True, allow amending even if no data changes have been made to the session.
            This is useful when you only want to update the commit message. Default is False.

        Returns
        -------
        str
            The snapshot ID of the new commit.

        Raises
        ------
        icechunk.ConflictError
            If the session is out of date and a conflict occurs.
        """
        return self._session.amend(message, metadata, allow_empty=allow_empty)

    async def amend_async(
        self,
        message: str,
        *,
        metadata: dict[str, Any] | None = None,
        allow_empty: bool = False,
    ) -> str:
        """
        Commit the changes in the session to the repository, by amending/overwriting the previous commit.

        When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

        If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

        This operation doesn't create a new commit in the repo ancestry. It replaces the previous commit.

        The first commit to the repo cannot be amended.

        Parameters
        ----------
        message : str
            The message to write with the commit.
        metadata : dict[str, Any] | None, optional
            Additional metadata to store with the commit snapshot.
        allow_empty : bool, optional
            If True, allow amending even if no data changes have been made to the session.
            This is useful when you only want to update the commit message. Default is False.

        Returns
        -------
        str
            The snapshot ID of the new commit.

        Raises
        ------
        icechunk.ConflictError
            If the session is out of date and a conflict occurs.
        """
        return await self._session.amend_async(message, metadata, allow_empty=allow_empty)

    def flush(
        self,
        message: str,
        *,
        metadata: dict[str, Any] | None = None,
    ) -> str:
        """
        Save the changes in the session to a new snapshot without modifying the current branch.

        When successful, the writable session is completed and the session is now read-only and based on the new snapshot. The ID of the new snapshot is returned.

        Parameters
        ----------
        message : str
            The message to write with the commit.
        metadata : dict[str, Any] | None, optional
            Additional metadata to store with the commit snapshot.

        Returns
        -------
        str
            The ID of the new snapshot.
        """
        return self._session.flush(message, metadata)

    async def flush_async(
        self,
        message: str,
        *,
        metadata: dict[str, Any] | None = None,
    ) -> str:
        """
        Save the changes in the session to a new snapshot without modifying the current branch.

        When successful, the writable session is completed and the session is now read-only and based on the new snapshot. The ID of the new snapshot is returned.

        Parameters
        ----------
        message : str
            The message to write with the commit.
        metadata : dict[str, Any] | None, optional
            Additional metadata to store with the commit snapshot.

        Returns
        -------
        str
            The ID of the new snapshot.
        """
        return await self._session.flush_async(message, metadata)

    def rebase(self, solver: ConflictSolver) -> None:
        """
        Rebase the session to the latest ancestry of the branch.

        This method will iteratively crawl the ancestry of the branch and apply the changes from the branch to the session. If a conflict is detected, the conflict solver will be used to optionally resolve the conflict. When complete, the session will be based on the latest commit of the branch and the session will be ready to attempt another commit.

        When a conflict is detected and a resolution is not possible with the provided solver, a RebaseFailed exception will be raised. This exception will contain the snapshot ID that the rebase failed on and a list of conflicts that occurred.

        Parameters
        ----------
        solver : ConflictSolver
            The conflict solver to use when a conflict is detected.

        Raises
        ------
        RebaseFailedError
            When a conflict is detected and the solver fails to resolve it.
        """
        self._session.rebase(solver)

    async def rebase_async(self, solver: ConflictSolver) -> None:
        """
        Rebase the session to the latest ancestry of the branch (async version).

        This method will iteratively crawl the ancestry of the branch and apply the changes from the branch to the session. If a conflict is detected, the conflict solver will be used to optionally resolve the conflict. When complete, the session will be based on the latest commit of the branch and the session will be ready to attempt another commit.

        When a conflict is detected and a resolution is not possible with the provided solver, a RebaseFailed exception will be raised. This exception will contain the snapshot ID that the rebase failed on and a list of conflicts that occurred.

        Parameters
        ----------
        solver : ConflictSolver
            The conflict solver to use when a conflict is detected.

        Raises
        ------
        RebaseFailedError
            When a conflict is detected and the solver fails to resolve it.
        """
        await self._session.rebase_async(solver)

    def fork(self) -> "ForkSession":
        """
        Create a child session that can be pickled to a worker job and later merged.

        This method supports Icechunk's distributed, collaborative jobs. A coordinator task creates a new session using
        `Repository.writable_session`. Then `Session.fork` is called repeatedly to create as many serializable sessions
        as worker jobs. Each new `ForkSession` is pickled to the worker that uses it to do all its writes.
        Finally, the `ForkSessions` are pickled back to the coordinator that uses `ForkSession.merge` to merge them
        back into the original session and `commit`.

        Learn more about collaborative writes at https://icechunk.io/en/latest/parallel/

        Raises
        ------
        ValueError
            When `self` already has uncommitted changes.
        ValueError
            When `self` is read-only.
        """
        # TODO: Do we still need ForkSession?
        return ForkSession(self._session.fork())

branch `property` #

branch

The branch that the session is based on. This is only set if the session is writable.

Returns:

Type	Description
`str or None`	The branch that the session is based on if the session is writable, None otherwise.

config `property` #

config

Get the repository configuration.

Notice that changes to the returned object won't be impacted. To change configuration values use Repository.reopen.

Returns:

Type	Description
`RepositoryConfig`	The config for the repository that owns this session.

has_uncommitted_changes `property` #

has_uncommitted_changes

Whether the session has uncommitted changes. This is only possibly true if the session is writable.

Returns:

Type	Description
`bool`	True if the session has uncommitted changes, False otherwise.

mode `property` #

mode

The mode of this session.

Returns:

Type	Description
`SessionMode`	The session mode - one of READONLY, WRITABLE, or REARRANGE.

read_only `property` #

read_only

Whether the session is read-only.

Returns:

Type	Description
`bool`	True if the session is read-only, False otherwise.

snapshot_id `property` #

snapshot_id

The base snapshot ID of the session.

Returns:

Type	Description
`str`	The base snapshot ID of the session.

store `property` #

store

Get a zarr Store object for reading and writing data from the repository using zarr python.

Returns:

Type	Description
`IcechunkStore`	A zarr Store object for reading and writing data from the repository.

all_virtual_chunk_locations #

all_virtual_chunk_locations()

Return the location URLs of all virtual chunks.

Returns:

Type	Description
`list of str`	The location URLs of all virtual chunks.

Source code in icechunk-python/python/icechunk/session.py

def all_virtual_chunk_locations(self) -> list[str]:
    """
    Return the location URLs of all virtual chunks.

    Returns
    -------
    list of str
        The location URLs of all virtual chunks.
    """
    return self._session.all_virtual_chunk_locations()

all_virtual_chunk_locations_async `async` #

all_virtual_chunk_locations_async()

Return the location URLs of all virtual chunks (async version).

Returns:

Type	Description
`list of str`	The location URLs of all virtual chunks.

Source code in icechunk-python/python/icechunk/session.py

async def all_virtual_chunk_locations_async(self) -> list[str]:
    """
    Return the location URLs of all virtual chunks (async version).

    Returns
    -------
    list of str
        The location URLs of all virtual chunks.
    """
    return await self._session.all_virtual_chunk_locations_async()

amend #

amend(message, *, metadata=None, allow_empty=False)

Commit the changes in the session to the repository, by amending/overwriting the previous commit.

When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

This operation doesn't create a new commit in the repo ancestry. It replaces the previous commit.

The first commit to the repo cannot be amended.

Parameters:

Name	Type	Description	Default
`message`	`str`	The message to write with the commit.	required
`metadata`	`dict[str, Any] \| None`	Additional metadata to store with the commit snapshot.	`None`
`allow_empty`	`bool`	If True, allow amending even if no data changes have been made to the session. This is useful when you only want to update the commit message. Default is False.	`False`

Returns:

Type	Description
`str`	The snapshot ID of the new commit.

Raises:

Type	Description
`ConflictError`	If the session is out of date and a conflict occurs.

Source code in icechunk-python/python/icechunk/session.py

def amend(
    self,
    message: str,
    *,
    metadata: dict[str, Any] | None = None,
    allow_empty: bool = False,
) -> str:
    """
    Commit the changes in the session to the repository, by amending/overwriting the previous commit.

    When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

    If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

    This operation doesn't create a new commit in the repo ancestry. It replaces the previous commit.

    The first commit to the repo cannot be amended.

    Parameters
    ----------
    message : str
        The message to write with the commit.
    metadata : dict[str, Any] | None, optional
        Additional metadata to store with the commit snapshot.
    allow_empty : bool, optional
        If True, allow amending even if no data changes have been made to the session.
        This is useful when you only want to update the commit message. Default is False.

    Returns
    -------
    str
        The snapshot ID of the new commit.

    Raises
    ------
    icechunk.ConflictError
        If the session is out of date and a conflict occurs.
    """
    return self._session.amend(message, metadata, allow_empty=allow_empty)

amend_async `async` #

amend_async(message, *, metadata=None, allow_empty=False)

Commit the changes in the session to the repository, by amending/overwriting the previous commit.

When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

This operation doesn't create a new commit in the repo ancestry. It replaces the previous commit.

The first commit to the repo cannot be amended.

Parameters:

Name	Type	Description	Default
`message`	`str`	The message to write with the commit.	required
`metadata`	`dict[str, Any] \| None`	Additional metadata to store with the commit snapshot.	`None`
`allow_empty`	`bool`	If True, allow amending even if no data changes have been made to the session. This is useful when you only want to update the commit message. Default is False.	`False`

Returns:

Type	Description
`str`	The snapshot ID of the new commit.

Raises:

Type	Description
`ConflictError`	If the session is out of date and a conflict occurs.

Source code in icechunk-python/python/icechunk/session.py

async def amend_async(
    self,
    message: str,
    *,
    metadata: dict[str, Any] | None = None,
    allow_empty: bool = False,
) -> str:
    """
    Commit the changes in the session to the repository, by amending/overwriting the previous commit.

    When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

    If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

    This operation doesn't create a new commit in the repo ancestry. It replaces the previous commit.

    The first commit to the repo cannot be amended.

    Parameters
    ----------
    message : str
        The message to write with the commit.
    metadata : dict[str, Any] | None, optional
        Additional metadata to store with the commit snapshot.
    allow_empty : bool, optional
        If True, allow amending even if no data changes have been made to the session.
        This is useful when you only want to update the commit message. Default is False.

    Returns
    -------
    str
        The snapshot ID of the new commit.

    Raises
    ------
    icechunk.ConflictError
        If the session is out of date and a conflict occurs.
    """
    return await self._session.amend_async(message, metadata, allow_empty=allow_empty)

chunk_coordinates `async` #

chunk_coordinates(array_path, batch_size=1000)

Return an async iterator to all initialized chunks for the array at array_path

Returns:

Type	Description
`an async iterator to chunk coordinates as tuples`

Source code in icechunk-python/python/icechunk/session.py

async def chunk_coordinates(
    self, array_path: str, batch_size: int = 1000
) -> AsyncIterator[tuple[int, ...]]:
    """
    Return an async iterator to all initialized chunks for the array at array_path

    Returns
    -------
    an async iterator to chunk coordinates as tuples
    """
    # We do unbatching here to improve speed. Switching to rust to get
    # a batch is much faster than switching for every element
    async for batch in self._session.chunk_coordinates(array_path, batch_size):
        for coord in batch:
            yield tuple(coord)

chunk_type #

chunk_type(array_path, chunk_coordinates)

Return the chunk type for the specified coordinates

Parameters:

Name	Type	Description	Default
`array_path`	`str`	The path to the array inside the Zarr store. Example: "/groupA/groupB/outputs/my-array".	required
`chunk_coordinates`	`Sequence[int]`	A sequence of integers (list or tuple) used to locate the chunk. Example: [0, 1, 5].	required

Returns:

Type	Description
`ChunkType`	One of the supported chunk types.

Source code in icechunk-python/python/icechunk/session.py

def chunk_type(
    self,
    array_path: str,
    chunk_coordinates: Sequence[int],
) -> ChunkType:
    """
    Return the chunk type for the specified coordinates

    Parameters
    ----------
    array_path : str
        The path to the array inside the Zarr store. Example: "/groupA/groupB/outputs/my-array".
    chunk_coordinates: Sequence[int]
        A sequence of integers (list or tuple) used to locate the chunk. Example: [0, 1, 5].

    Returns
    -------
    ChunkType
        One of the supported chunk types.
    """
    return self._session.chunk_type(array_path, chunk_coordinates)

chunk_type_async `async` #

chunk_type_async(array_path, chunk_coordinates)

Return the chunk type for the specified coordinates

Parameters:

Name	Type	Description	Default
`array_path`	`str`	The path to the array inside the Zarr store. Example: "/groupA/groupB/outputs/my-array".	required
`chunk_coordinates`	`Sequence[int]`	A sequence of integers (list or tuple) used to locate the chunk. Example: [0, 1, 5].	required

Returns:

Type	Description
`ChunkType`	One of the supported chunk types.

Source code in icechunk-python/python/icechunk/session.py

async def chunk_type_async(
    self,
    array_path: str,
    chunk_coordinates: Sequence[int],
) -> ChunkType:
    """
    Return the chunk type for the specified coordinates

    Parameters
    ----------
    array_path : str
        The path to the array inside the Zarr store. Example: "/groupA/groupB/outputs/my-array".
    chunk_coordinates: Sequence[int]
        A sequence of integers (list or tuple) used to locate the chunk. Example: [0, 1, 5].

    Returns
    -------
    ChunkType
        One of the supported chunk types.
    """
    return await self._session.chunk_type_async(array_path, chunk_coordinates)

commit #

commit(
    message,
    metadata=None,
    *,
    rebase_with=None,
    rebase_tries=1000,
    allow_empty=False,
)

Commit the changes in the session to the repository.

When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

Parameters:

Name	Type	Description	Default
`message`	`str`	The message to write with the commit.	required
`metadata`	`dict[str, Any] \| None`	Additional metadata to store with the commit snapshot.	`None`
`rebase_with`	`ConflictSolver \| None`	If other session committed while the current session was writing, use Session.rebase with this solver.	`None`
`rebase_tries`	`int`	If other session committed while the current session was writing, use Session.rebase up to this many times in a loop.	`1000`
`allow_empty`	`bool`	If True, allow creating a commit even if there are no changes. Default is False.	`False`

Returns:

Type	Description
`str`	The snapshot ID of the new commit.

Raises:

Type	Description
`ConflictError`	If the session is out of date and a conflict occurs.
`NoChangesToCommitError`	If there are no changes to commit and allow_empty is False.

Source code in icechunk-python/python/icechunk/session.py

def commit(
    self,
    message: str,
    metadata: dict[str, Any] | None = None,
    *,
    rebase_with: ConflictSolver | None = None,
    rebase_tries: int = 1_000,
    allow_empty: bool = False,
) -> str:
    """
    Commit the changes in the session to the repository.

    When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

    If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

    Parameters
    ----------
    message : str
        The message to write with the commit.
    metadata : dict[str, Any] | None, optional
        Additional metadata to store with the commit snapshot.
    rebase_with : ConflictSolver | None, optional
        If other session committed while the current session was writing, use Session.rebase with this solver.
    rebase_tries : int, optional
        If other session committed while the current session was writing, use Session.rebase up to this many times in a loop.
    allow_empty : bool, optional
        If True, allow creating a commit even if there are no changes. Default is False.

    Returns
    -------
    str
        The snapshot ID of the new commit.

    Raises
    ------
    icechunk.ConflictError
        If the session is out of date and a conflict occurs.
    icechunk.NoChangesToCommitError
        If there are no changes to commit and allow_empty is False.
    """
    return self._session.commit(
        message,
        metadata,
        rebase_with=rebase_with,
        rebase_tries=rebase_tries,
        allow_empty=allow_empty,
    )

commit_async `async` #

commit_async(
    message,
    metadata=None,
    *,
    rebase_with=None,
    rebase_tries=1000,
    allow_empty=False,
)

Commit the changes in the session to the repository (async version).

When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

Parameters:

Name	Type	Description	Default
`message`	`str`	The message to write with the commit.	required
`metadata`	`dict[str, Any] \| None`	Additional metadata to store with the commit snapshot.	`None`
`rebase_with`	`ConflictSolver \| None`	If other session committed while the current session was writing, use Session.rebase with this solver.	`None`
`rebase_tries`	`int`	If other session committed while the current session was writing, use Session.rebase up to this many times in a loop.	`1000`
`allow_empty`	`bool`	If True, allow creating a commit even if there are no changes. Default is False.	`False`

Returns:

Type	Description
`str`	The snapshot ID of the new commit.

Raises:

Type	Description
`ConflictError`	If the session is out of date and a conflict occurs.
`NoChangesToCommitError`	If there are no changes to commit and allow_empty is False.

Source code in icechunk-python/python/icechunk/session.py

async def commit_async(
    self,
    message: str,
    metadata: dict[str, Any] | None = None,
    *,
    rebase_with: ConflictSolver | None = None,
    rebase_tries: int = 1_000,
    allow_empty: bool = False,
) -> str:
    """
    Commit the changes in the session to the repository (async version).

    When successful, the writable session is completed and the session is now read-only and based on the new commit. The snapshot ID of the new commit is returned.

    If the session is out of date, this will raise a ConflictError exception depicting the conflict that occurred. The session will need to be rebased before committing.

    Parameters
    ----------
    message : str
        The message to write with the commit.
    metadata : dict[str, Any] | None, optional
        Additional metadata to store with the commit snapshot.
    rebase_with : ConflictSolver | None, optional
        If other session committed while the current session was writing, use Session.rebase with this solver.
    rebase_tries : int, optional
        If other session committed while the current session was writing, use Session.rebase up to this many times in a loop.
    allow_empty : bool, optional
        If True, allow creating a commit even if there are no changes. Default is False.

    Returns
    -------
    str
        The snapshot ID of the new commit.

    Raises
    ------
    icechunk.ConflictError
        If the session is out of date and a conflict occurs.
    icechunk.NoChangesToCommitError
        If there are no changes to commit and allow_empty is False.
    """
    return await self._session.commit_async(
        message,
        metadata,
        rebase_with=rebase_with,
        rebase_tries=rebase_tries,
        allow_empty=allow_empty,
    )

discard_changes #

discard_changes()

When the session is writable, discard any uncommitted changes.

Source code in icechunk-python/python/icechunk/session.py

def discard_changes(self) -> None:
    """
    When the session is writable, discard any uncommitted changes.
    """
    self._session.discard_changes()

flush #

flush(message, *, metadata=None)

Save the changes in the session to a new snapshot without modifying the current branch.

When successful, the writable session is completed and the session is now read-only and based on the new snapshot. The ID of the new snapshot is returned.

Parameters:

Name	Type	Description	Default
`message`	`str`	The message to write with the commit.	required
`metadata`	`dict[str, Any] \| None`	Additional metadata to store with the commit snapshot.	`None`

Returns:

Type	Description
`str`	The ID of the new snapshot.

Source code in icechunk-python/python/icechunk/session.py

def flush(
    self,
    message: str,
    *,
    metadata: dict[str, Any] | None = None,
) -> str:
    """
    Save the changes in the session to a new snapshot without modifying the current branch.

    When successful, the writable session is completed and the session is now read-only and based on the new snapshot. The ID of the new snapshot is returned.

    Parameters
    ----------
    message : str
        The message to write with the commit.
    metadata : dict[str, Any] | None, optional
        Additional metadata to store with the commit snapshot.

    Returns
    -------
    str
        The ID of the new snapshot.
    """
    return self._session.flush(message, metadata)

flush_async `async` #

flush_async(message, *, metadata=None)

Save the changes in the session to a new snapshot without modifying the current branch.

When successful, the writable session is completed and the session is now read-only and based on the new snapshot. The ID of the new snapshot is returned.

Parameters:

Name	Type	Description	Default
`message`	`str`	The message to write with the commit.	required
`metadata`	`dict[str, Any] \| None`	Additional metadata to store with the commit snapshot.	`None`

Returns:

Type	Description
`str`	The ID of the new snapshot.

Source code in icechunk-python/python/icechunk/session.py

async def flush_async(
    self,
    message: str,
    *,
    metadata: dict[str, Any] | None = None,
) -> str:
    """
    Save the changes in the session to a new snapshot without modifying the current branch.

    When successful, the writable session is completed and the session is now read-only and based on the new snapshot. The ID of the new snapshot is returned.

    Parameters
    ----------
    message : str
        The message to write with the commit.
    metadata : dict[str, Any] | None, optional
        Additional metadata to store with the commit snapshot.

    Returns
    -------
    str
        The ID of the new snapshot.
    """
    return await self._session.flush_async(message, metadata)

fork #

fork()

Create a child session that can be pickled to a worker job and later merged.

This method supports Icechunk's distributed, collaborative jobs. A coordinator task creates a new session using Repository.writable_session. Then Session.fork is called repeatedly to create as many serializable sessions as worker jobs. Each new ForkSession is pickled to the worker that uses it to do all its writes. Finally, the ForkSessions are pickled back to the coordinator that uses ForkSession.merge to merge them back into the original session and commit.

Learn more about collaborative writes at https://icechunk.io/en/latest/parallel/

Raises:

Type	Description
`ValueError`	When `self` already has uncommitted changes.
`ValueError`	When `self` is read-only.

Source code in icechunk-python/python/icechunk/session.py

def fork(self) -> "ForkSession":
    """
    Create a child session that can be pickled to a worker job and later merged.

    This method supports Icechunk's distributed, collaborative jobs. A coordinator task creates a new session using
    `Repository.writable_session`. Then `Session.fork` is called repeatedly to create as many serializable sessions
    as worker jobs. Each new `ForkSession` is pickled to the worker that uses it to do all its writes.
    Finally, the `ForkSessions` are pickled back to the coordinator that uses `ForkSession.merge` to merge them
    back into the original session and `commit`.

    Learn more about collaborative writes at https://icechunk.io/en/latest/parallel/

    Raises
    ------
    ValueError
        When `self` already has uncommitted changes.
    ValueError
        When `self` is read-only.
    """
    # TODO: Do we still need ForkSession?
    return ForkSession(self._session.fork())

get_node_id #

get_node_id(path)

Return the node ID for the array or group at the given path.

Each node is assigned an opaque ID when it is created. This ID is stable across moves and renames — a node keeps the same ID for its entire lifetime. See the icechunk spec <https://icechunk.io/en/stable/spec/>_ for details on node identity.

Parameters:

Name	Type	Description	Default
`path`	`str`	Absolute path to the node (e.g., "/data/temperature").	required

Returns:

Type	Description
`str`	The node ID as an opaque string.

Raises:

Type	Description
`IcechunkError`	If no node exists at the given path.

Source code in icechunk-python/python/icechunk/session.py

def get_node_id(self, path: str) -> str:
    """
    Return the node ID for the array or group at the given path.

    Each node is assigned an opaque ID when it is created. This ID is
    stable across moves and renames — a node keeps the same ID for its
    entire lifetime. See the `icechunk spec <https://icechunk.io/en/stable/spec/>`_
    for details on node identity.

    Parameters
    ----------
    path : str
        Absolute path to the node (e.g., "/data/temperature").

    Returns
    -------
    str
        The node ID as an opaque string.

    Raises
    ------
    IcechunkError
        If no node exists at the given path.
    """
    return self._session.get_node_id(path)

get_node_id_async `async` #

get_node_id_async(path)

Return the node ID for the array or group at the given path.

Each node is assigned an opaque ID when it is created. This ID is stable across moves and renames — a node keeps the same ID for its entire lifetime. See the icechunk spec <https://icechunk.io/en/stable/spec/>_ for details on node identity.

Parameters:

Name	Type	Description	Default
`path`	`str`	Absolute path to the node (e.g., "/data/temperature").	required

Returns:

Type	Description
`str`	The node ID as an opaque string.

Raises:

Type	Description
`IcechunkError`	If no node exists at the given path.

Source code in icechunk-python/python/icechunk/session.py

async def get_node_id_async(self, path: str) -> str:
    """
    Return the node ID for the array or group at the given path.

    Each node is assigned an opaque ID when it is created. This ID is
    stable across moves and renames — a node keeps the same ID for its
    entire lifetime. See the `icechunk spec <https://icechunk.io/en/stable/spec/>`_
    for details on node identity.

    Parameters
    ----------
    path : str
        Absolute path to the node (e.g., "/data/temperature").

    Returns
    -------
    str
        The node ID as an opaque string.

    Raises
    ------
    IcechunkError
        If no node exists at the given path.
    """
    return await self._session.get_node_id_async(path)

merge #

merge(*others)

Merge the changes for this session with the changes from another session.

Parameters:

Name	Type	Description	Default
`others`	`ForkSession`	The forked sessions to merge changes from.	`()`

Source code in icechunk-python/python/icechunk/session.py

def merge(self, *others: "ForkSession") -> None:
    """
    Merge the changes for this session with the changes from another session.

    Parameters
    ----------
    others : ForkSession
        The forked sessions to merge changes from.
    """
    for other in others:
        self._session.merge(other._session)

merge_async `async` #

merge_async(*others)

Merge the changes for this session with the changes from another session (async version).

Parameters:

Name	Type	Description	Default
`others`	`ForkSession`	The forked sessions to merge changes from.	`()`

Source code in icechunk-python/python/icechunk/session.py

async def merge_async(self, *others: "ForkSession") -> None:
    """
    Merge the changes for this session with the changes from another session (async version).

    Parameters
    ----------
    others : ForkSession
        The forked sessions to merge changes from.
    """
    for other in others:
        await self._session.merge_async(other._session)

move #

move(from_path, to_path)

Move or rename a node (array or group) in the hierarchy.

This is a metadata-only operation—no data is copied. Requires a rearrange session:

session = repo.rearrange_session("main")
session.move("/data/raw", "/data/v1")

Parameters:

Name	Type	Description	Default
`from_path`	`str`	The current path of the node (e.g., "/data/raw").	required
`to_path`	`str`	The new path for the node (e.g., "/data/v1").	required

Source code in icechunk-python/python/icechunk/session.py

def move(self, from_path: str, to_path: str) -> None:
    """Move or rename a node (array or group) in the hierarchy.

    This is a metadata-only operation—no data is copied. Requires a rearrange session:

        session = repo.rearrange_session("main")
        session.move("/data/raw", "/data/v1")

    Parameters
    ----------
    from_path : str
        The current path of the node (e.g., "/data/raw").
    to_path : str
        The new path for the node (e.g., "/data/v1").
    """
    return self._session.move_node(from_path, to_path)

move_async `async` #

move_async(from_path, to_path)

Async version of :meth:move.

Source code in icechunk-python/python/icechunk/session.py

async def move_async(self, from_path: str, to_path: str) -> None:
    """Async version of :meth:`move`."""
    return await self._session.move_node_async(from_path, to_path)

rebase #

rebase(solver)

Rebase the session to the latest ancestry of the branch.

This method will iteratively crawl the ancestry of the branch and apply the changes from the branch to the session. If a conflict is detected, the conflict solver will be used to optionally resolve the conflict. When complete, the session will be based on the latest commit of the branch and the session will be ready to attempt another commit.

When a conflict is detected and a resolution is not possible with the provided solver, a RebaseFailed exception will be raised. This exception will contain the snapshot ID that the rebase failed on and a list of conflicts that occurred.

Parameters:

Name	Type	Description	Default
`solver`	`ConflictSolver`	The conflict solver to use when a conflict is detected.	required

Raises:

Type	Description
`RebaseFailedError`	When a conflict is detected and the solver fails to resolve it.

Source code in icechunk-python/python/icechunk/session.py

def rebase(self, solver: ConflictSolver) -> None:
    """
    Rebase the session to the latest ancestry of the branch.

    This method will iteratively crawl the ancestry of the branch and apply the changes from the branch to the session. If a conflict is detected, the conflict solver will be used to optionally resolve the conflict. When complete, the session will be based on the latest commit of the branch and the session will be ready to attempt another commit.

    When a conflict is detected and a resolution is not possible with the provided solver, a RebaseFailed exception will be raised. This exception will contain the snapshot ID that the rebase failed on and a list of conflicts that occurred.

    Parameters
    ----------
    solver : ConflictSolver
        The conflict solver to use when a conflict is detected.

    Raises
    ------
    RebaseFailedError
        When a conflict is detected and the solver fails to resolve it.
    """
    self._session.rebase(solver)

rebase_async `async` #

rebase_async(solver)

Rebase the session to the latest ancestry of the branch (async version).

This method will iteratively crawl the ancestry of the branch and apply the changes from the branch to the session. If a conflict is detected, the conflict solver will be used to optionally resolve the conflict. When complete, the session will be based on the latest commit of the branch and the session will be ready to attempt another commit.

When a conflict is detected and a resolution is not possible with the provided solver, a RebaseFailed exception will be raised. This exception will contain the snapshot ID that the rebase failed on and a list of conflicts that occurred.

Parameters:

Name	Type	Description	Default
`solver`	`ConflictSolver`	The conflict solver to use when a conflict is detected.	required

Raises:

Type	Description
`RebaseFailedError`	When a conflict is detected and the solver fails to resolve it.

Source code in icechunk-python/python/icechunk/session.py

async def rebase_async(self, solver: ConflictSolver) -> None:
    """
    Rebase the session to the latest ancestry of the branch (async version).

    This method will iteratively crawl the ancestry of the branch and apply the changes from the branch to the session. If a conflict is detected, the conflict solver will be used to optionally resolve the conflict. When complete, the session will be based on the latest commit of the branch and the session will be ready to attempt another commit.

    When a conflict is detected and a resolution is not possible with the provided solver, a RebaseFailed exception will be raised. This exception will contain the snapshot ID that the rebase failed on and a list of conflicts that occurred.

    Parameters
    ----------
    solver : ConflictSolver
        The conflict solver to use when a conflict is detected.

    Raises
    ------
    RebaseFailedError
        When a conflict is detected and the solver fails to resolve it.
    """
    await self._session.rebase_async(solver)

reindex_array #

reindex_array(array_path, forward, backward=None)

Reindex chunks in an array by applying a transformation function.

Only existing (non-empty) chunks are visited — empty positions are skipped. This means that if an empty chunk would have shifted into an occupied position, that position retains stale data unless a backward function is also provided.

Parameters:

Name	Type	Description	Default
`array_path`	`str`	Path to the array.	required
`forward`	`Callable[[Iterable[int]], Iterable[int] \| None]`	Function that maps old chunk coordinates to new coordinates. Receives a list of non-negative integers (the current chunk index) and must return either a new index (as a list/tuple of non-negative integers within the array's chunk grid bounds) or `None` to skip the chunk (leave it in place).	required
`backward`	`Callable[[Iterable[int]], Iterable[int] \| None]`	Inverse of `forward`: given a chunk position, returns the position that would have mapped there under `forward`. Must follow the same return conventions as `forward`. When provided, each existing chunk position is checked to determine whether it should be cleared — if `backward` returns `None` (out of bounds) or points to a position with no chunk, that position is reset to the fill value.	`None`

Source code in icechunk-python/python/icechunk/session.py

def reindex_array(
    self,
    array_path: str,
    forward: Callable[[Iterable[int]], Iterable[int] | None],
    backward: Callable[[Iterable[int]], Iterable[int] | None] | None = None,
) -> None:
    """Reindex chunks in an array by applying a transformation function.

    Only existing (non-empty) chunks are visited — empty positions are
    skipped. This means that if an empty chunk would have shifted into an
    occupied position, that position retains stale data unless a backward
    function is also provided.

    Parameters
    ----------
    array_path : str
        Path to the array.
    forward : Callable[[Iterable[int]], Iterable[int] | None]
        Function that maps old chunk coordinates to new coordinates. Receives
        a list of non-negative integers (the current chunk index) and must return
        either a new index (as a list/tuple of non-negative integers within the
        array's chunk grid bounds) or ``None`` to skip the chunk (leave it in place).
    backward : Callable[[Iterable[int]], Iterable[int] | None], optional
        Inverse of ``forward``: given a chunk position, returns the position
        that would have mapped there under ``forward``. Must follow the same
        return conventions as ``forward``. When provided, each existing chunk
        position is checked to determine whether it should be cleared — if
        ``backward`` returns ``None`` (out of bounds) or points to a position
        with no chunk, that position is reset to the fill value.
    """
    return self._session.reindex_array(array_path, forward, backward)

shift_array #

shift_array(array_path, chunk_offset)

Shift all chunks in an array by the given chunk offset.

Out-of-bounds chunks are discarded. To preserve them, resize the array first to make room. Vacated source positions are cleared (reset to fill value).

Parameters:

Name	Type	Description	Default
`array_path`	`str`	The path to the array to shift.	required
`chunk_offset`	`Iterable[int]`	The number of chunks to shift by in each dimension. Positive values shift right/down, negative values shift left/up.	required

Notes

To shift right while preserving all data, first resize the array using zarr's array.resize(), then shift.

Source code in icechunk-python/python/icechunk/session.py

def shift_array(
    self,
    array_path: str,
    chunk_offset: Iterable[int],
) -> None:
    """Shift all chunks in an array by the given chunk offset.

    Out-of-bounds chunks are discarded. To preserve them, resize the array first
    to make room. Vacated source positions are cleared (reset to fill value).

    Parameters
    ----------
    array_path : str
        The path to the array to shift.
    chunk_offset : Iterable[int]
        The number of chunks to shift by in each dimension. Positive values
        shift right/down, negative values shift left/up.

    Notes
    -----
    To shift right while preserving all data, first resize the array using zarr's
    array.resize(), then shift.
    """
    self._session.shift_array(array_path, list(chunk_offset))

status #

status()

Compute an overview of the current session changes

Returns:

Type	Description
`Diff`	The operations executed in the current session but still not committed.

Source code in icechunk-python/python/icechunk/session.py

def status(self) -> Diff:
    """
    Compute an overview of the current session changes

    Returns
    -------
    Diff
        The operations executed in the current session but still not committed.
    """
    return self._session.status()

SessionMode #

Bases: Enum

Enum for session access modes

Attributes:

Name	Type	Description
`readonly`	`int`	Session can only read data
`writable`	`int`	Session can read and write data
`rearrange`	`int`	Session can only move nodes and reindex arrays

Source code in icechunk-python/python/icechunk/_icechunk_python.pyi

class SessionMode(Enum):
    """Enum for session access modes

    Attributes
    ----------
    readonly: int
        Session can only read data
    writable: int
        Session can read and write data
    rearrange: int
        Session can only move nodes and reindex arrays
    """

    readonly = 0
    writable = 1
    rearrange = 2

icechunk.session#