How to use dataset resources

Use the resources below to wrap data from any source, allowing you to log your dataset's columnar data and artifacts in Vectice.

For instructions on using these resources, refer to the Vectice API Reference guide's Resources section.

Vectice stores the artifacts of your datasets, not your actual datasets.

Resources and their descriptions:

Resource()

Wrap your dataset's columnar data and artifacts from your storage location. It can be extended to any data source (for example, Redshift or RDS).

FileResource(...)

Wrap your dataset's columnar data and artifacts from a local file.

GCSResource(...)

Wrap your dataset's columnar data and artifacts from your Google Cloud Storage (GCS) source.

S3Resource(...)

Wrap your dataset's columnar data and artifacts from your AWS S3 source.

BigQueryResource(...)

Wrap your dataset's columnar data and artifacts from your BigQuery source.

DatabricksTableResource(...)

Wrap your dataset's columnar data and artifacts from your Databricks source.

Resource usage examples

Below we highlight how you can use the available resources to wrap your dataset's columnar data and artifacts so that you can later log your dataset to Vectice.

A Custom Data Source

To wrap data from a custom data source, create a custom resource inherited from the base Resource class and implement your own _build_metadata() and _fetch_data() methods.
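The subclassing pattern described above can be sketched without the Vectice library installed. In this minimal sketch, `BaseResource` is a hypothetical stand-in for Vectice's base Resource class and is not the real API. Only the method names `_build_metadata()` and `_fetch_data()` come from the text above. The class `InMemoryCsvResource`, its constructor arguments, and the return types (a list of row dictionaries and a plain dictionary) are assumptions made for illustration.

```python
import csv
import io
from abc import ABC, abstractmethod


class BaseResource(ABC):
    """Hypothetical stand-in for Vectice's base Resource class (NOT the real API)."""

    def __init__(self):
        # Fetch the data first so the metadata can be derived from it.
        self.data = self._fetch_data()
        self.metadata = self._build_metadata()

    @abstractmethod
    def _fetch_data(self):
        """Return the columnar data held by the source."""

    @abstractmethod
    def _build_metadata(self):
        """Return a description of the source (name, size, columns, ...)."""


class InMemoryCsvResource(BaseResource):
    """Illustrative custom resource that wraps CSV text held in memory."""

    def __init__(self, name, csv_text):
        self.name = name
        self._csv_text = csv_text
        super().__init__()

    def _fetch_data(self):
        # Parse the CSV text into a list of row dictionaries.
        reader = csv.DictReader(io.StringIO(self._csv_text))
        return list(reader)

    def _build_metadata(self):
        # Summarize the source; the keys used here are assumptions.
        rows = self.data
        columns = list(rows[0].keys()) if rows else []
        return {
            "name": self.name,
            "rows": len(rows),
            "columns": columns,
            "size_bytes": len(self._csv_text.encode("utf-8")),
        }


resource = InMemoryCsvResource("customers", "id,country\n1,FR\n2,US\n")
print(resource.metadata)
```

A real custom resource would inherit from the library's own Resource class and return whatever types that class expects, so treat the shapes above as placeholders.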

See our guide, "How to add a custom data source", for more information and examples.
