Staff / Lead Data Engineer

MacroXStudio, Inc

San Francisco, CA (Hybrid)

Salary: $200,907 per year

About MacroX

MacroX is building the next generation of AI-powered macroeconomic intelligence. We combine alternative data, machine learning, and large language models to help investors, businesses, and policymakers understand economic trends in real time.

The Opportunity

We are looking for a Staff / Lead Data Engineer to architect and scale our end-to-end data platform. You will lead the development of the infrastructure powering our AI, machine learning, and macroeconomic forecasting systems.

This role is ideal for an experienced engineer who enjoys building large-scale data systems, working with cutting-edge AI technologies, and shaping the technical direction of a rapidly growing company.

What You'll Do

Data Platform & Infrastructure

Design and manage scalable data ingestion, transformation, and consumption pipelines.
Build infrastructure supporting high-frequency economic and financial datasets.
Ensure compatibility with downstream AI, ML, and LLM applications.

Machine Learning & MLOps

Define and implement MLOps standards and best practices.
Develop automated feature stores and model input pipelines.
Build CI/CD workflows for model retraining and deployment.
Collaborate with Data Scientists and ML Engineers to productionize AI models.

AI & Data Quality

Implement AI-powered data validation and anomaly detection systems.
Develop monitoring solutions for schema drift and data quality issues.
Create observability and lineage frameworks across the data stack.
Build intelligent systems that automatically detect and report pipeline issues.

LLM & Retrieval Systems

Design and maintain vector database infrastructure.
Develop Retrieval-Augmented Generation (RAG) architectures.
Support internal research workflows and customer-facing AI products.

Leadership

Recruit, mentor, and develop Data Engineering and ML Infrastructure teams.
Partner with Product, Legal, and Go-to-Market teams on AI and data strategy.
Drive technical excellence and responsible AI practices across the organization.

Minimum Qualifications

Master's degree (or foreign equivalent) in Data Science, Computer Science, Engineering, or a related field.
Two (2) years of experience as a Senior Data Engineer or related occupation.

Required Technical Skills

Programming Languages

Python
SQL
R

Data Engineering

Pandas
NumPy
Spark (PySpark)
Airflow
dbt

Machine Learning

scikit-learn
XGBoost
LightGBM
TensorFlow
PyTorch

Model Management

MLflow
Weights & Biases

Databases

PostgreSQL
MySQL
BigQuery
Redshift
MongoDB
Cassandra

Cloud Platforms

Google Cloud Platform (GCP)
Amazon Web Services (AWS)

Development & Deployment

Git
GitHub
GitLab
Docker
REST APIs

Development Tools

Jupyter Notebook
JupyterLab

Work Location

Hybrid Schedule:

Four (4) days per week in our San Francisco office
One (1) day per week remote

Apply Now

Interested candidates should submit their resume to:

communications@macroxstudio.com