Data Pipelines Made Simple

A declarative, configuration-driven Snowpark framework with built-in geospatial capabilities. Focus on what needs to happen, not how everything is wired together.

Why Choose SnowFlow?

🧩

Declarative Pipelines

Define pipelines using YAML instead of scattered Python files. Easy to read, review, and reason about with minimal code required.

πŸ“

Schema Management

Version-controlled schemas in YAML/JSON. Schema evolution is explicit, reviewable, and never hidden in code.

🧠

Clean SQL

Keep SQL clean and readable with parameter substitution. Business logic stays close to data logic without string chaos.

πŸ—οΈ

Automated Infrastructure

Databases, schemas, and tables auto-created when needed. Remove repetitive setup work with safer deployments.

πŸ–₯️

Simple CLI

Run pipelines via a simple CLI interface designed for both local development and automation.

⚑

Meta-Driven

Configuration over code. Clear separation of logic, schema, and infrastructure concerns for maintainable pipelines.

🌍

Geospatial Ready

Built-in Apache Sedona integration for spatial data processing. Work with geometries, perform spatial joins, and analyze location data natively.

See It In Action

# Define your pipeline in YAML - that's it!

pipeline:
  name: "customer_analytics"
  source: "northwind_api"
  destination:
    database: "analytics"
    schema: "gold"

transformations:
  - name: "aggregate_sales"
    sql_file: "sql/customer_sales.sql"
    output_table: "customer_sales_summary"

# No boilerplate. No hidden logic. Just clear, declarative configuration.

Built for Real Teams

1

Faster Onboarding

New team members can understand and contribute to pipelines in hours, not weeks. Clear structure means less time untangling code.

2

Consistency Across Projects

Opinionated structure encourages best practices. Every pipeline follows the same patterns, making collaboration seamless.

3

Easy to Review & Debug

Configuration-based pipelines are easier to review in pull requests and simpler to debug when issues arise.

4

Scale Without Pain

Add new pipelines without reinventing the wheel. The framework handles infrastructure, you handle business logic.

Powered by Apache Sedona

Snowdeck seamlessly integrates Apache Sedona, bringing enterprise-grade geospatial capabilities to your Snowflake data pipelines.

πŸ“ Spatial Functions

Access 100+ spatial SQL functions for geometry creation, transformation, and analysis.

πŸ—ΊοΈ Spatial Joins

Perform efficient spatial joins on large datasets with optimized query execution.

🎯 Location Analytics

Analyze point-in-polygon, distance calculations, and proximity queries at scale.

πŸ”§ Auto-Setup

Framework handles Sedona installation and configuration automatically.

Example: Spatial Query with Sedona

-- Find all stores within 5km of customer locations
SELECT s.store_name, c.customer_name,
       SEDONA.ST_Distance(s.location, c.location) AS distance_km
FROM stores s
CROSS JOIN customers c
WHERE SEDONA.ST_DWithin(s.location, c.location, 5000)
ORDER BY distance_km;

Ready to Build Better Pipelines?

Join teams who are making data engineering predictable, maintainable, and dare we say... boring (in the best way).