Data Exploration using SQL
In this project, we explore COVID-19 data using SQL.
In this project, we explore and analyse the distribution of monthly income at Company A and its correlations.
Tableau Dashboards for projects on COVID-19, Airbnb and Customers Analysis.
This project identifies faulty IoT sensor data in real time using Kafka Streams in Confluent Cloud. It analyzes the device telemetry data and loads the results for in-depth insights.
In this project, we build an ETL pipeline in Python to upload data from an on-premises database to an AWS S3 bucket. We connect to the database and use the AWS S3 API to upload the data.
In this project, we build two machine learning models to assess how alcohol consumption affects student performance in Mathematics.
In this project, we use Pytest to test ETL pipelines in Python. The tests help ensure accurate data delivery and detect errors in transformation logic, while Pytest runs them automatically and provides results and debugging information.
In this project, we use text data to build three ML models (Naive Bayes, Random Forest, XGBoost) and determine which model is most accurate.
In this project, we will use Apache Airflow to automate our Python ETL pipeline. Airflow is a popular open-source workflow management system that provides data engineers with an intuitive platform to create, schedule, monitor, and maintain complex data pipelines.
In this project, we build a telco churn model and explore the correlations in the data.
In this project, we use the SVD algorithm to build a recommender system (RS) based on multiple users' movie ratings.
In this project, we compare two clustering methods (K-means and DBSCAN) and determine which is best suited to the dataset.
In this project, we take raw housing data and transform it in MySQL to make it more usable for analysis.
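As an illustration of the Pytest project listed above, here is a minimal sketch of what testing ETL transformation logic might look like. The function `clean_record` and its fields (`name`, `amount`) are hypothetical and are not taken from the project itself. Pytest collects any function whose name starts with `test_` and reports each failing assertion with the compared values.

```python
# Hypothetical transformation step; the name and fields are assumptions
# made for illustration, not code from the actual project.
def clean_record(record):
    """Normalise one raw row: tidy the name and convert the amount to float."""
    return {
        "name": record["name"].strip().title(),
        "amount": round(float(record["amount"]), 2),
    }


def test_clean_record_normalises_fields():
    # Whitespace is trimmed, the name is title-cased, the amount is rounded.
    raw = {"name": "  alice smith ", "amount": "19.999"}
    assert clean_record(raw) == {"name": "Alice Smith", "amount": 20.0}


def test_clean_record_rejects_bad_amount():
    # A non-numeric amount should surface as an error, not pass silently.
    try:
        clean_record({"name": "bob", "amount": "n/a"})
    except ValueError:
        pass
    else:
        raise AssertionError("expected ValueError for a non-numeric amount")
```

Saved under a name such as `test_transform.py` (also an assumption), the tests would run with the `pytest` command from the same directory.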