ClipMarts

Data Engineer

Builds the pipelines that turn raw data into trusted, analytics-ready assets.

$29 · Operator Pack · For departments, agencies, and ops-heavy teams

What is Data Engineer?

Expert data engineer specializing in building reliable data pipelines, lakehouse architectures, and scalable data infrastructure. Masters ETL/ELT, Apache Spark, dbt, streaming systems, and cloud data platforms to turn raw data into trusted, analytics-ready assets.

Setup Time

10 min

Difficulty

Advanced

Works With
paperclip, claude-code

What's Included

  • SKILL.md
  • README.md

Preview

SKILL.md
# Data Engineer Agent

You are a **Data Engineer**, an expert in designing, building, and operating the data infrastructure that powers analytics, AI, and business intelligence. You turn raw, messy data from diverse sources into reliable, high-quality, analytics-ready assets - delivered on time, at scale, and with full observability.

## Your Identity & Memory
- **Role**: Data pipeline architect and data platform engineer
- **Personality**: Reliability-obsessed, schema-disciplined, throughput-driven, documentation-first
- **Memory**: You remember successful pipeline patterns, schema evolution strategies, and the data quality failures that burned you before
- **Experience**: You've built medallion lakehouses, migrated petabyte-scale warehouses, debugged silent data corruption at 3am, and lived to tell the tale

## Your Core Mission

### Data Pipeline Engineering
- Design and build ETL/ELT pipelines that are idempotent, observable, and self-healing
- Implement Medallion Architecture (Bronze → Silver → Gold) with clear data contracts per layer
- Automate data quality checks, schema validation, and anomaly detection at every stage
- Build incremental and CDC (Change Data Capture) pipelines to minimize compute cost
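
The bullets above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not part of the packaged skill: `SilverStore`, `validate`, `bronze_to_silver`, and `silver_to_gold` are hypothetical names, the in-memory dict stands in for a real Silver table, and `updated_at` is assumed to be an ISO-8601 UTC string so that string comparison matches time order.

```python
from dataclasses import dataclass, field


@dataclass
class SilverStore:
    """Hypothetical stand-in for a Silver table plus its load watermark."""
    rows: dict = field(default_factory=dict)  # primary key -> cleaned row
    watermark: str = ""  # highest updated_at seen so far (ISO-8601, UTC)


def validate(row):
    """Return the data-quality problems found in one Bronze row."""
    problems = []
    if row.get("id") is None:
        problems.append("missing id")
    amount = row.get("amount")
    if not isinstance(amount, (int, float)) or amount < 0:
        problems.append("invalid amount")
    return problems


def bronze_to_silver(bronze_rows, store):
    """Upsert valid rows newer than the watermark; return rejected rows."""
    rejected = []
    start = store.watermark  # only rows strictly newer than this are processed
    # Process oldest first so the latest change for a key wins.
    for row in sorted(bronze_rows, key=lambda r: r["updated_at"]):
        if row["updated_at"] <= start:
            continue  # already handled in an earlier run
        problems = validate(row)
        if problems:
            rejected.append((row, problems))  # quarantine, do not load
        else:
            # Keyed upsert: replaying a row overwrites it with the same values.
            store.rows[row["id"]] = {
                "customer": row["customer"],
                "amount": float(row["amount"]),
                "updated_at": row["updated_at"],
            }
        store.watermark = max(store.watermark, row["updated_at"])
    return rejected


def silver_to_gold(store):
    """Aggregate Silver rows into a Gold total per customer."""
    totals = {}
    for row in store.rows.values():
        totals[row["customer"]] = totals.get(row["customer"], 0.0) + row["amount"]
    return totals
```

Because the load is a keyed upsert guarded by a watermark, rerunning the same batch leaves the Silver rows unchanged, which is the idempotency property the first bullet asks for. One known limitation of this simple watermark: a late-arriving row whose `updated_at` equals or precedes the stored watermark is skipped, so a production pipeline would typically add a lookback window or rely on CDC log positions instead.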

Installation Guide

terminal
$ paperclipai skill import --from ./data-engineer/
Skill imported successfully.

One command to import — then assign to any agent in your company.

Option A: CLI (recommended)

1. Download and extract the ZIP

unzip data-engineer.zip

2. Import the skill

paperclipai skill import --from ./data-engineer/

3. Assign to an agent

# Via CLI:
paperclipai agent update <agent-name> --add-skill data-engineer

# Or in the dashboard:
# Agents → [agent name] → Skills → Add "Data Engineer"

Option B: Dashboard UI

1. Open Skills page

Navigate to Skills → Import Skill

2. Upload the product folder

From the extracted ZIP, upload the data-engineer/ directory containing SKILL.md.

3. Assign to agents

Go to Agents → [agent] → Skills and add "Data Engineer" from the list.


Tags

engineering, automation, data, infrastructure, maintenance, systems, design, analytics