Data Engineer building scalable data infrastructure for US tech companies. 8+ years architecting ETL pipelines, data platforms, and real-time processing systems — from fintech to gaming to social media at scale.
flowchart LR
S["🗄️ Sources<br/>APIs · DBs · streams"] --> P["⚙️ Ingest → Transform<br/>Airflow · dbt · Go/Python"]
P --> W["📦 Warehouse<br/>Databricks · BigQuery"]
W --> A["📊 Analytics"]
W --> M["🤖 ML · datatrax"]
A --> C(("Clients"))
M --> C
click M "https://github.com/rbmuller/datatrax" _blank
classDef stage fill:#161b22,stroke:#30363d,color:#c9d1d9
classDef proc fill:#0d1117,stroke:#00B341,stroke-width:2px,color:#00B341,font-weight:bold
classDef clients fill:#00B341,stroke:#00B341,color:#ffffff,font-weight:bold
class S,A,M stage
class P,W proc
class C clients
- Building Bonavia — crowdsourced road quality mapping platform for Brazilian roads (React + Flask + React Native)
- Applying to Georgia Tech OMSCS — MS in Computer Science (AI + Distributed Systems + Robotics)
— Data engineering & ML toolkit for Go. Zero dependencies, 7 ML algorithms, generics-first.
— Zero-config data quality monitoring. Connect, profile, detect anomalies — 3 commands, no YAML.
Golang Python Databricks Airflow BigQuery dbt PySpark Cloud Composer Airbyte Superset PostgreSQL React TypeScript AWS GCP Azure
Climber, trekker, triathlete. Endurance sports taught me that the boring, incremental approach is slower to start — but it actually finishes.
German citizen | Based in Brazil | EST-aligned | 5 languages (EN, PT, ES, IT, DE)



