Delta Sharing

Delta Sharing lets Amigo provide governed, read-only access to selected tables so you can consume data directly within your analytics platform or warehouse using an open sharing protocol.

New to Delta Sharing?

Delta Sharing is an open protocol for secure data exchange across platforms and clouds. Providers expose read-only tables via a simple REST endpoint; recipients use a small profile file (credentials + endpoint) to query those tables directly from tools like pandas or Apache Spark—without copying data.

Key Concepts

  • Share — Container your organization is granted access to; exposes one or more schemas.

  • Schema — Logical grouping of tables within a share.

  • Table — Read-only dataset you can query from supported tools.

  • Profile file — JSON with endpoint and credentials used by clients.

  • Open protocol — REST-based access to underlying Parquet/Delta data.

Access & Provisioning

  • Ask your Amigo representative (via Slack) to provision a Delta Share for your organization and specify which tables you need.

  • Beta: Amigo's Delta Sharing integration is in active development; schemas and operational policies may evolve.

  • You will receive either:

    • A share profile file (containing the share endpoint and credentials), or

    • An endpoint URL plus recipient credentials, along with the list of shared schemas/tables.
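
If you receive an endpoint URL and credentials rather than a ready-made profile file, you can assemble the profile yourself. The sketch below assumes bearer-token credentials and uses the field names from the open Delta Sharing profile format (`shareCredentialsVersion`, `endpoint`, `bearerToken`, optional `expirationTime`). The endpoint, token, and file name shown are placeholders, not real Amigo values, and Amigo may issue a different credential type.

```python
import json
from pathlib import Path


def build_profile(endpoint, bearer_token, expiration_time=None):
    """Return a profile dict in the Delta Sharing bearer-token layout."""
    profile = {
        "shareCredentialsVersion": 1,
        "endpoint": endpoint,
        "bearerToken": bearer_token,
    }
    # expirationTime is optional; include it only if one was provided to you.
    if expiration_time is not None:
        profile["expirationTime"] = expiration_time
    return profile


def write_profile(path, profile):
    """Write the profile as JSON and restrict it to the current user."""
    path = Path(path)
    path.write_text(json.dumps(profile, indent=2), encoding="utf-8")
    # The file holds a secret, so limit permissions where the OS supports it.
    path.chmod(0o600)
    return path


# Placeholder values (assumptions for illustration only):
# write_profile(
#     "amigo.share",
#     build_profile("https://sharing.example.com/delta-sharing/", "<recipient-token>"),
# )
```

Treat the resulting file like any other credential: keep it out of version control and store it in your secrets manager where possible.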

How You Use It

  • Most platforms support Delta Sharing natively or via an open-source connector.

  • Import the provided share profile or configure the endpoint and credentials per your platform’s instructions.

  • Browse the shared schemas/tables and query them like native read-only tables in your environment.
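
As one way to browse what has been shared, the sketch below uses the open-source `delta-sharing` Python package (`pip install delta-sharing`). `SharingClient` and `list_all_tables()` come from that package; the helper names and the `amigo.share` file name are illustrative assumptions.

```python
def open_client(profile_path):
    """Create a Delta Sharing client from a profile file path."""
    # Imported lazily so the helper below can be used without the package.
    import delta_sharing

    return delta_sharing.SharingClient(profile_path)


def qualified_table_names(client):
    """Return sorted '<share>.<schema>.<table>' names visible to the client."""
    return sorted(
        f"{t.share}.{t.schema}.{t.name}" for t in client.list_all_tables()
    )


# Example (assumes a profile file named amigo.share in the working directory):
# for name in qualified_table_names(open_client("amigo.share")):
#     print(name)
```

The names returned here are the same `<share>.<schema>.<table>` identifiers used when loading a table.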

Example workflows

  • Connect your warehouse or lakehouse to read shared tables for BI dashboards.

  • Consume curated datasets for ML feature engineering or offline evaluation.

  • Join Amigo-shared tables with your internal datasets without copying data.

Supported Clients

  • Python: delta-sharing library for pandas and PySpark.

  • Apache Spark: Delta Sharing Spark connector (SQL, Python, Scala, Java, R).

  • BI/ETL: Many tools integrate via the open protocol or vendor connectors.
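
For the Spark connector, a read might look like the sketch below. It assumes an existing `SparkSession` started with the Delta Sharing Spark connector on its classpath (for example via `--packages`, using the connector version that matches your Spark and Scala build). The `deltaSharing` format name is the one the connector registers; the helper name and the table URL are placeholders.

```python
def read_shared_table(spark, table_url):
    """Return a Spark DataFrame backed by a shared, read-only table.

    `spark` is an existing SparkSession with the Delta Sharing connector loaded.
    `table_url` has the form '<profile_file>#<share>.<schema>.<table>'.
    """
    return spark.read.format("deltaSharing").load(table_url)


# Example (placeholder profile path and table name):
# df = read_shared_table(spark, "/path/to/amigo.share#my_share.my_schema.my_table")
# df.createOrReplaceTempView("shared_table")  # then query it with Spark SQL
```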

Quick Start

Trying it quickly?

Use the public demo profile file to explore example datasets: Open datasets profile

Table URL format: <profile_file>#<share>.<schema>.<table>. The profile file can reside locally or in cloud storage (e.g., s3a://...). See connector docs for supported path schemes.
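
The table URL format above can be sketched in Python as follows. The two string helpers are illustrative and not part of any library; `load_as_pandas` and its `limit` argument come from the `delta-sharing` package. The share, schema, and table names in the example are placeholders.

```python
def table_url(profile_path, share, schema, table):
    """Build '<profile_file>#<share>.<schema>.<table>'."""
    return f"{profile_path}#{share}.{schema}.{table}"


def split_table_url(url):
    """Split a table URL into (profile_path, share, schema, table)."""
    # Split on the last '#' so a profile path containing '#' still works.
    profile_path, sep, name = url.rpartition("#")
    parts = name.split(".")
    if not sep or not profile_path or len(parts) != 3 or not all(parts):
        raise ValueError(
            "expected <profile_file>#<share>.<schema>.<table>, got: " + url
        )
    return (profile_path, *parts)


def load_preview(url, limit=10):
    """Load up to `limit` rows of a shared table into a pandas DataFrame."""
    import delta_sharing  # pip install delta-sharing

    return delta_sharing.load_as_pandas(url, limit=limit)


# Example (placeholder names):
# url = table_url("amigo.share", "my_share", "my_schema", "my_table")
# print(load_preview(url).head())
```

Passing a small `limit` while exploring keeps the first read fast; drop it once you know which tables you need in full.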

Governance & Operations

  • Scope: Shares are read-only and limited to the schemas/tables you request.

  • Security: Credentials or recipient tokens can be rotated; IP allowlisting is available upon request.

  • Observability: Access and query activity are logged by Amigo for security and auditing.
