A comprehensive guide to Snowpark Python, the developer framework for writing Python that executes directly on the Snowflake platform. This project covers the DataFrame API, User-Defined Functions (UDFs), and Stored Procedure deployment.
- DataFrame API: Familiar, pandas-like syntax for pushdown data processing.
- UDFs: Extending Snowflake with custom Python-based logic.
- Stored Procedures: Encapsulating complex ELT workflows in Python.
- Server-Side Execution: Running code securely inside Snowflake's governed environment.
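
The three building blocks listed above can be sketched roughly as follows. This is a minimal illustration, not the contents of the project scripts: the column names `AMOUNT` and `REGION`, the table names passed in, and the function names `double_it`, `summarize`, `register_double_it`, and `load_summary` are all hypothetical, and each function assumes it is handed an already-open Snowpark session. The Snowpark imports are done lazily inside the functions so the plain Python logic can be exercised without a Snowflake connection.

```python
def double_it(x: int) -> int:
    """Plain Python logic; a candidate for registration as a UDF."""
    return x * 2


def summarize(session, table_name: str):
    """DataFrame API: build a lazy query that Snowflake executes as SQL.

    AMOUNT and REGION are hypothetical column names.
    """
    from snowflake.snowpark.functions import col, sum as sum_

    return (
        session.table(table_name)
        .filter(col("AMOUNT") > 0)
        .group_by("REGION")
        .agg(sum_("AMOUNT").alias("TOTAL_AMOUNT"))
    )


def register_double_it(session):
    """UDF: register double_it as a temporary UDF callable from SQL."""
    from snowflake.snowpark.types import IntegerType

    return session.udf.register(
        double_it,
        return_type=IntegerType(),
        input_types=[IntegerType()],
        name="double_it",  # hypothetical UDF name
        replace=True,
    )


def load_summary(session, source_table: str, target_table: str) -> str:
    """Stored procedure handler: a small ELT step written in Python.

    Snowflake passes the session as the first argument when the
    procedure runs server-side.
    """
    summarize(session, source_table).write.mode("overwrite").save_as_table(
        target_table
    )
    return f"Wrote summary to {target_table}"
```

Nothing runs against Snowflake until an action such as `collect()`, `show()`, or `save_as_table()` is called on the DataFrame; until then `summarize` only describes the query. A handler like `load_summary` could be registered through `session.sproc.register`, which is left out here to keep the sketch short.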
Snowflake_Snowpark_Python/
├── README.md # You are here
├── Tutorial/
│ └── SNOWPARK_TUTORIAL.md # Step-by-step walkthrough
├── scripts/ # Extracted Python and SQL scripts
│ ├── 01_session_connection.py
│ ├── 02_active_session.py
│ ├── 03_dataframe_basics.py
│ ├── 04_udf_definition.py
│ ├── 05_udf_sql_invocation.sql
│ ├── 06_stored_procedure.py
│ └── 07_sproc_call.sql
└── data/ # Initialised as empty
Environment Setup: Before beginning, execute scripts/00_setup_environment.sql to provision the necessary databases, schemas, and sample data for this project.
- Review the Snowpark Python Guide.
- Set up your environment using the connection parameters in scripts/01_session_connection.py.
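
As a rough idea of what supplying connection parameters can look like, here is a minimal sketch that assumes credentials are kept in environment variables. The variable names (`SNOWFLAKE_ACCOUNT` and so on) and the helper names `build_connection_parameters` and `create_session` are assumptions made for illustration; the actual script in this project may be organised differently.

```python
import os


def build_connection_parameters(env=None):
    """Collect Snowflake connection settings from environment variables.

    The SNOWFLAKE_* variable names are a hypothetical convention.
    """
    env = os.environ if env is None else env
    required = ("SNOWFLAKE_ACCOUNT", "SNOWFLAKE_USER", "SNOWFLAKE_PASSWORD")
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise KeyError(f"Missing environment variables: {', '.join(missing)}")

    params = {
        "account": env["SNOWFLAKE_ACCOUNT"],
        "user": env["SNOWFLAKE_USER"],
        "password": env["SNOWFLAKE_PASSWORD"],
    }
    # Optional settings are included only when they are provided.
    for key in ("role", "warehouse", "database", "schema"):
        value = env.get(f"SNOWFLAKE_{key.upper()}")
        if value:
            params[key] = value
    return params


def create_session(params):
    """Open a Snowpark session from a dictionary of connection parameters."""
    # Imported lazily so the helper above works without Snowpark installed.
    from snowflake.snowpark import Session

    return Session.builder.configs(params).create()
```

Reading credentials from the environment keeps them out of version control. Typical usage would be `session = create_session(build_connection_parameters())`, followed by `session.close()` when finished.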
Elliott Fairhall Data Engineering Educational Series