[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"skill-ff88be05-21f1-466b-a5a7-8861f3cf7c3c":3,"$flU610fuV7gtpCNEi8enkA8S5Myu5PBp7esRJJnjyXV8":42},{"id":4,"title":5,"description":6,"categoryId":7,"moduleId":8,"tags":9,"prompt":10,"icon":11,"source":12,"sourceUrl":13,"authorId":14,"authorName":15,"isPublic":16,"stars":17,"runs":18,"createdAt":19,"updatedAt":19,"module":20,"category":27,"packages":33},"ff88be05-21f1-466b-a5a7-8861f3cf7c3c","snowflake-development","全面覆盖SQL最佳实践、数据管道设计（动态表、流、任务、Snowpipe）、Cortex AI功能、Cortex代理、Snowpark Python、dbt集成、性能调优和安全加固的雪花开发助手。","cat_coding_backend","mod_coding","sickn33,coding","---\nname: snowflake-development\ndescription: \"Comprehensive Snowflake development assistant covering SQL best practices, data pipeline design (Dynamic Tables, Streams, Tasks, Snowpipe), Cortex AI functions, Cortex Agents, Snowpark Python, dbt integration, performance tuning, and security hardening.\"\ncategory: data-engineering\nrisk: safe\nsource: community\ndate_added: \"2026-03-24\"\n---\n\n# Snowflake Development\n\nYou are a Snowflake development expert. Apply these rules when writing SQL, building data pipelines, using Cortex AI, or working with Snowpark Python on Snowflake.\n\n## When to Use\n- When the user asks for help with Snowflake SQL, data pipelines, Cortex AI, or Snowpark Python.\n- When you need Snowflake-specific guidance for dbt, performance tuning, or security hardening.\n\n## SQL Best Practices\n\n### Naming and Style\n\n- Use `snake_case` for all identifiers. Avoid double-quoted identifiers — they create case-sensitive names requiring constant quoting.\n- Use CTEs (`WITH` clauses) over nested subqueries.\n- Use `CREATE OR REPLACE` for idempotent DDL.\n- Use explicit column lists — never `SELECT *` in production (Snowflake's columnar storage scans only referenced columns).\n\n### Stored Procedures — Colon Prefix Rule\n\nIn SQL stored procedures (BEGIN...END blocks), variables and parameters **must** use the colon `:` prefix inside SQL statements. Without it, Snowflake raises \"invalid identifier\" errors.\n\nBAD:\n```sql\nCREATE PROCEDURE my_proc(p_id INT) RETURNS STRING LANGUAGE SQL AS\nBEGIN\n    LET result STRING;\n    SELECT name INTO result FROM users WHERE id = p_id;\n    RETURN result;\nEND;\n```\n\nGOOD:\n```sql\nCREATE PROCEDURE my_proc(p_id INT) RETURNS STRING LANGUAGE SQL AS\nBEGIN\n    LET result STRING;\n    SELECT name INTO :result FROM users WHERE id = :p_id;\n    RETURN result;\nEND;\n```\n\n### Semi-Structured Data\n\n- VARIANT, OBJECT, ARRAY for JSON\u002FAvro\u002FParquet\u002FORC.\n- Access nested fields: `src:customer.name::STRING`. Always cast: `src:price::NUMBER(10,2)`.\n- VARIANT null vs SQL NULL: JSON `null` is stored as `\"null\"`. Use `STRIP_NULL_VALUE = TRUE` on load.\n- Flatten arrays: `SELECT f.value:name::STRING FROM my_table, LATERAL FLATTEN(input => src:items) f;`\n\n### MERGE for Upserts\n\n```sql\nMERGE INTO target t USING source s ON t.id = s.id\nWHEN MATCHED THEN UPDATE SET t.name = s.name, t.updated_at = CURRENT_TIMESTAMP()\nWHEN NOT MATCHED THEN INSERT (id, name, updated_at) VALUES (s.id, s.name, CURRENT_TIMESTAMP());\n```\n\n## Data Pipelines\n\n### Choosing Your Approach\n\n| Approach | When to Use |\n|----------|-------------|\n| Dynamic Tables | Declarative transformations. **Default choice.** Define the query, Snowflake handles refresh. |\n| Streams + Tasks | Imperative CDC. Use for procedural logic, stored procedure calls. |\n| Snowpipe | Continuous file loading from S3\u002FGCS\u002FAzure. |\n\n### Dynamic Tables\n\n```sql\nCREATE OR REPLACE DYNAMIC TABLE cleaned_events\n    TARGET_LAG = '5 minutes'\n    WAREHOUSE = transform_wh\n    AS\n    SELECT event_id, event_type, user_id, event_timestamp\n    FROM raw_events\n    WHERE event_type IS NOT NULL;\n```\n\nKey rules:\n- Set `TARGET_LAG` progressively: tighter at top, looser at bottom.\n- Incremental DTs **cannot** depend on Full refresh DTs.\n- `SELECT *` breaks on schema changes — use explicit column lists.\n- Change tracking must stay enabled on base tables.\n- Views cannot sit between two Dynamic Tables.\n\n### Streams and Tasks\n\n```sql\nCREATE OR REPLACE STREAM raw_stream ON TABLE raw_events;\n\nCREATE OR REPLACE TASK process_events\n    WAREHOUSE = transform_wh\n    SCHEDULE = 'USING CRON 0 *\u002F1 * * * America\u002FLos_Angeles'\n    WHEN SYSTEM$STREAM_HAS_DATA('raw_stream')\n    AS INSERT INTO cleaned_events SELECT ... FROM raw_stream;\n\n-- Tasks start SUSPENDED — you MUST resume them\nALTER TASK process_events RESUME;\n```\n\n## Cortex AI\n\n### Function Reference\n\n| Function | Purpose |\n|----------|---------|\n| `AI_COMPLETE` | LLM completion (text, images, documents) |\n| `AI_CLASSIFY` | Classify into categories (up to 500 labels) |\n| `AI_FILTER` | Boolean filter on text\u002Fimages |\n| `AI_EXTRACT` | Structured extraction from text\u002Fimages\u002Fdocuments |\n| `AI_SENTIMENT` | Sentiment score (-1 to 1) |\n| `AI_PARSE_DOCUMENT` | OCR or layout extraction |\n| `AI_REDACT` | PII removal |\n\n**Deprecated (do NOT use):** `COMPLETE`, `CLASSIFY_TEXT`, `EXTRACT_ANSWER`, `PARSE_DOCUMENT`, `SUMMARIZE`, `TRANSLATE`, `SENTIMENT`, `EMBED_TEXT_768`.\n\n### TO_FILE — Common Error Source\n\nStage path and filename are **SEPARATE** arguments:\n\n```sql\n-- BAD: TO_FILE('@stage\u002Ffile.pdf')\n-- GOOD:\nTO_FILE('@db.schema.mystage', 'invoice.pdf')\n```\n\n### Use AI_CLASSIFY for Classification (Not AI_COMPLETE)\n\n```sql\nSELECT AI_CLASSIFY(ticket_text,\n    ['billing', 'technical', 'account']):labels[0]::VARCHAR AS category\nFROM tickets;\n```\n\n### Cortex Agents\n\n```sql\nCREATE OR REPLACE AGENT my_db.my_schema.sales_agent\nFROM SPECIFICATION $spec$\n{\n    \"models\": {\"orchestration\": \"auto\"},\n    \"instructions\": {\n        \"orchestration\": \"You are SalesBot...\",\n        \"response\": \"Be concise.\"\n    },\n    \"tools\": [{\"tool_spec\": {\"type\": \"cortex_analyst_text_to_sql\", \"name\": \"Sales\", \"description\": \"Queries sales...\"}}],\n    \"tool_resources\": {\"Sales\": {\"semantic_model_file\": \"@stage\u002Fmodel.yaml\"}}\n}\n$spec$;\n```\n\nAgent rules:\n- Use `$spec$` delimiter (not `$$`).\n- `models` must be an object, not an array.\n- `tool_resources` is a separate top-level object, not nested inside tools.\n- Do NOT include empty\u002Fnull values in edit specs — clears existing values.\n- Tool descriptions are the #1 quality factor.\n- Never modify production agents directly — clone first.\n\n## Snowpark Python\n\n```python\nfrom snowflake.snowpark import Session\nimport os\n\nsession = Session.builder.configs({\n    \"account\": os.environ[\"SNOWFLAKE_ACCOUNT\"],\n    \"user\": os.environ[\"SNOWFLAKE_USER\"],\n    \"password\": os.environ[\"SNOWFLAKE_PASSWORD\"],\n    \"role\": \"my_role\", \"warehouse\": \"my_wh\",\n    \"database\": \"my_db\", \"schema\": \"my_schema\"\n}).create()\n```\n\n- Never hardcode credentials.\n- DataFrames are lazy — executed on `collect()`\u002F`show()`.\n- Do NOT use `collect()` on large DataFrames — process server-side.\n- Use **vectorized UDFs** (10-100x faster) for batch\u002FML workloads instead of scalar UDFs.\n\n## dbt on Snowflake\n\nDynamic table materialization (streaming\u002Fnear-real-time marts):\n```sql\n{{ config(materialized='dynamic_table', snowflake_warehouse='transforming', target_lag='1 hour') }}\n```\n\nIncremental materialization (large fact tables):\n```sql\n{{ config(materialized='incremental', unique_key='event_id') }}\n```\n\nSnowflake-specific configs (combine with any materialization):\n```sql\n{{ config(transient=true, copy_grants=true, query_tag='team_daily') }}\n```\n\n- Do NOT use `{{ this }}` without `{% if is_incremental() %}` guard.\n- Use `dynamic_table` materialization for streaming\u002Fnear-real-time marts.\n\n## Performance\n\n- **Cluster keys**: Only multi-TB tables, on WHERE\u002FJOIN\u002FGROUP BY columns.\n- **Search Optimization**: `ALTER TABLE t ADD SEARCH OPTIMIZATION ON EQUALITY(col);`\n- **Warehouse sizing**: Start X-Small, scale up. `AUTO_SUSPEND = 60`, `AUTO_RESUME = TRUE`.\n- **Separate warehouses** per workload.\n- Estimate AI costs first: `SELECT SUM(AI_COUNT_TOKENS('claude-4-sonnet', text)) FROM table;`\n\n## Security\n\n- Follow least-privilege RBAC. Use database roles for object-level grants.\n- Audit ACCOUNTADMIN regularly: `SHOW GRANTS OF ROLE ACCOUNTADMIN;`\n- Use network policies for IP allowlisting.\n- Use masking policies for PII columns and row access policies for multi-tenant isolation.\n\n## Common Error Patterns\n\n| Error | Cause | Fix |\n|-------|-------|-----|\n| \"Object does not exist\" | Wrong context or missing grants | Fully qualify names, check grants |\n| \"Invalid identifier\" in proc | Missing colon prefix | Use `:variable_name` |\n| \"Numeric value not recognized\" | VARIANT not cast | `src:field::NUMBER(10,2)` |\n| Task not running | Forgot to resume | `ALTER TASK ... RESUME` |\n| DT refresh failing | Schema change or tracking disabled | Use explicit columns, check change tracking |\n\n## Limitations\n- Use this skill only when the task clearly matches the scope described above.\n- Do not treat the output as a substitute for environment-specific validation, testing, or expert review.\n- Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.\n","","imported","https:\u002F\u002Fgithub.com\u002Fsickn33\u002Fantigravity-awesome-skills","user_system_seed","SkillOPIC",true,163,898,"2026-05-16 13:41:21",{"id":8,"name":21,"slug":22,"icon":23,"description":24,"sort":25,"createdAt":26},"编程开发","coding","mdi-code-braces","代码生成、调试、审查，提升开发效率",2,"2026-05-16 12:53:40",{"id":7,"name":28,"slug":29,"icon":30,"description":31,"moduleId":8,"sort":25,"skillCount":32,"createdAt":26},"后端开发","backend","mdi-server","API、数据库、服务端架构",296,[34],{"id":35,"skillId":4,"version":36,"fileName":37,"fileSize":38,"filePath":39,"fileHash":40,"manifest":41,"createdAt":19},"f39302d0-1333-4b33-96d9-a9e2e05fbf5b","1.0.0","snowflake-development.zip",4107,"uploads\u002Fskills\u002Fff88be05-21f1-466b-a5a7-8861f3cf7c3c\u002Fsnowflake-development.zip","7ec58d19bb9a405142ef437cffb7ccb52c02c4eada7d7a9e636709bb57159200","[{\"path\":\"SKILL.md\",\"isDirectory\":false,\"size\":8345}]",{"code":43,"message":44,"data":45},200,"success",{"items":46,"stats":47,"page":50},[],{"averageRating":48,"totalRatings":48,"ratingCounts":49},0,[48,48,48,48,48],{"limit":51,"offset":48,"hasMore":52,"nextOffset":51,"ratedOnly":16},15,false]