Hi, my name is Ray!
Web Developer | Educator | Data Enthusiast
About me

Hello, I'm Ray!
Data Engineer | Machine Learning Enthusiast | Remote-Ready | Skilled in Python, Airflow, dbt, SQL, & Cloud | Bridging Engineering & Data to Deliver Scalable Solutions
I'm a Data Engineer with a strong foundation in engineering and applied mathematics, now focused on building scalable data pipelines, analytics workflows, and AI/ML applications. With experience across remote teams and global startups, I bring both technical expertise and adaptability to fast-paced environments.
I specialize in Python, SQL, dbt, Airflow, and cloud data platforms (GCP, AWS, Snowflake, BigQuery), using them to design pipelines, automate workflows, and support data-informed decision-making. My background as a Mechanical Engineering graduate and former university instructor brings a structured, problem-solving mindset to my tech career.
Currently, I'm focused on transforming raw data into actionable insights and exploring how machine learning can improve business workflows. My career goal is to grow into leading data teams and mentoring future consultants, while contributing to startups and organizations that value innovation, efficiency, and collaboration.
Data Engineering and Data Science/Machine Learning Projects

Stack Overflow End-to-End Data Pipeline
This project analyzes 14 years of Stack Overflow Developer Survey data to track technology trends, developer experiences, and industry shifts. The analysis covers programming languages, salary distribution, education demographics, job roles, and predictions for future tech trends.
Stack:
- ✅ Python
- ✅ Docker
- ✅ Apache Airflow
- ✅ dbt (data build tool)
- ✅ Terraform
- ✅ Google Cloud Storage (GCS)
- ✅ Google BigQuery
- ✅ PySpark
- ✅ Pandas
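
As a rough illustration of the kind of aggregation behind the programming-language trends, here is a minimal plain-Python sketch. It is not the project's actual code (which uses the stack listed above); the `year` and `languages` keys are hypothetical, and it assumes multi-select answers arrive as semicolon-separated strings.

```python
from collections import Counter, defaultdict


def language_counts_by_year(responses: list[dict]) -> dict[int, Counter]:
    """Count how many respondents mention each language, grouped by survey year.

    Assumes each response is a dict with hypothetical keys:
      "year"      - survey year as an int
      "languages" - semicolon-separated language names, or None if unanswered
    """
    counts: dict[int, Counter] = defaultdict(Counter)
    for resp in responses:
        langs = resp.get("languages") or ""  # treat a missing answer as empty
        for lang in langs.split(";"):
            lang = lang.strip()
            if lang:
                counts[resp["year"]][lang] += 1
    return dict(counts)
```

Calling `language_counts_by_year(responses)[2023].most_common(5)` would then list the five most-mentioned languages for that year, provided the input contains 2023 responses with answers.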

Amazon Sales Data Analysis
This project processes and analyzes Amazon sales data to produce metrics and visualizations for sales performance, return rates, profit margins, and fees.
Features:
- ✅ Data Processing: Extracts detailed fee information from raw Amazon sales data.
- ✅ Sales Metrics: Calculates total sales, net proceeds, return rates, and profit margins.
- ✅ Visualizations: Generates bar charts for sales, return rates, profit margins, and fee analysis.
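
The sales metrics listed above could be computed along these lines. This is a minimal sketch under stated assumptions, not the project's implementation: the `OrderLine` fields are hypothetical, net proceeds are taken as sales minus fees, profit margin as profit over sales, and return rate as the share of returned order lines.

```python
from dataclasses import dataclass


@dataclass
class OrderLine:
    # Hypothetical fields; real Amazon report columns may differ.
    sales: float     # gross sale amount
    fees: float      # total fees charged on the line
    cost: float      # cost of goods sold
    returned: bool   # whether the line was returned


def summarize(lines: list[OrderLine]) -> dict[str, float]:
    """Compute total sales, net proceeds, return rate, and profit margin."""
    total_sales = sum(line.sales for line in lines)
    total_fees = sum(line.fees for line in lines)
    total_cost = sum(line.cost for line in lines)
    net_proceeds = total_sales - total_fees   # assumed definition
    profit = net_proceeds - total_cost        # assumed definition
    returned = sum(1 for line in lines if line.returned)
    return {
        "total_sales": total_sales,
        "net_proceeds": net_proceeds,
        "return_rate": returned / len(lines) if lines else 0.0,
        "profit_margin": profit / total_sales if total_sales else 0.0,
    }
```

The guards on empty input and zero sales avoid division errors when a report has no rows.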

Weather Data Pipeline
This project collects, processes, and analyzes weather data for dashboards and trend analysis, using Python, PostgreSQL, Docker, and Metabase.
Features:
- ✅ Data Collection: Fetches real-time and historical weather data from a public API.
- ✅ Data Storage and Processing: Cleans and stores data in PostgreSQL using Python ETL scripts within Docker containers.
- ✅ Visualizations: Uses Metabase to create dashboards showing temperature trends, humidity levels, and weather anomalies.
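
The cleaning step of such an ETL script might look like the sketch below. It is illustrative only: the record keys (`timestamp`, `temp_c`, `humidity`) are hypothetical rather than taken from the actual API, and the rows it returns would still need to be written to PostgreSQL by a separate load step.

```python
from datetime import datetime, timezone


def clean_readings(raw: list[dict]) -> list[tuple[datetime, float, float]]:
    """Turn raw API records into typed rows, skipping incomplete or invalid ones.

    Assumes hypothetical keys: "timestamp" (Unix seconds), "temp_c", "humidity".
    """
    rows = []
    for rec in raw:
        try:
            ts = datetime.fromtimestamp(int(rec["timestamp"]), tz=timezone.utc)
            temp_c = float(rec["temp_c"])
            humidity = float(rec["humidity"])
        except (KeyError, TypeError, ValueError):
            continue  # missing or malformed field: drop the record
        if not 0.0 <= humidity <= 100.0:
            continue  # humidity is a percentage; out-of-range values are dropped
        rows.append((ts, temp_c, humidity))
    return rows
```

Each returned tuple lines up with a parameterized `INSERT`, which keeps the load step free of string formatting.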

Machine Learning Zoomcamp Coursework
This project is a collection of coursework from the Machine Learning Zoomcamp, covering various topics in machine learning and data science.
Modules:
- ✅ Module 1: Introduction to Machine Learning
- ✅ Module 2: Machine Learning for Regression
- ✅ Module 3: Machine Learning for Classification
- ✅ Module 4: Evaluation Metrics
- ✅ Module 5: Deploying ML Models
- ✅ Module 6: Decision Trees & Ensemble Learning
- ✅ Module 7: Neural Networks & Deep Learning
- ✅ Module 8: Serverless Deep Learning
- ✅ Module 9: Kubernetes & TensorFlow Serving
Certificates & Achievements

