AIMAN MAZLAN
PORTFOLIO

Data Professional | Python, SQL, Tableau, Power BI | @aimmazlan

Real-Time ETL with Kafka

This project identify faulty IoT sensor data in real-time using Kafka Stream in Confluent Cloud. Analyze the device telemetry data with ease and load the results for in-depth insights.

Pipeline testing with Python

In this project, we used Pytest to test ETL pipelines in Python. Ensure accurate data delivery, detect errors in transformation logic while automatically runs tests and provides results and debugging information.

Automate ETL Pipeline with Airflow

In this project, we will use Apache Airflow to automate our Python ETL pipeline. Airflow is a popular open-source workflow management system that provides data engineers with an intuitive platform to create, schedule, monitor, and maintain complex data pipelines.

Clustering with Python

In this project, we compare 2 different method of clustering method (K-means & DBSCAN) and determine which method is best suited for the dataset.