External Stages (S3, Azure, GCS) β Real Company Setup Example
Welcome back to RetailCo, our fictional retail company.
Alice, the data engineer, now wants to load large amounts of historical sales data stored in AWS S3 into Snowflake.
βInternal stages are great for small loads,β she explains, βbut for large datasets or multi-team projects, we use external stages.β
Letβs explore Snowflake external stages and see how RetailCo sets them up.
ποΈ What Are External Stages?β
An external stage is a reference to a cloud storage location outside Snowflake, such as:
- AWS S3
- Azure Blob Storage
- Google Cloud Storage (GCS)
Benefits:
- Handle large datasets efficiently
- Share data between multiple Snowflake accounts
- Integrate with data lakes or pipelines
- Secure access with cloud credentials
RetailCo example: Historical sales CSVs and JSON files sit in an S3 bucket. Alice wants to stage them in Snowflake without copying them internally.