Welcome to My Projects! 🔮
Hello! My projects right now are divided into sections of:
- Data Engineering to Data Visualizations
- Exploratory Data Analysis
- Machine Learning / Gen AI
Machine Learning & AI
Archaeology News Insight Engine 🛠️🏺📰
Click here to view on Github or visit the app: https://archaeology-news-insight-engine

An automated news aggregation and analysis system for archaeological discoveries and research, featuring real-time updates, location mapping, and AI-powered insights.
Tech Stack/Architecture Doodle

- Real-time News Collection: Automated collection of archaeology news from multiple sources
- Content Enrichment: Full-text extraction and location detection
- Interactive Map: Geographical visualization of archaeological discoveries
- AI-Powered Q&A thru LLM: Natural language queries about recent archaeological findings
- Automated Orchestration: Scheduled data collection and processing pipelines
Google Advanced Capstone: An HR Solution using Predictive Attrition Model and Analytics 📊

- Applied the Principles of EDA to discover insights on Salifort Motors employees leaving
- Created and evaluated Logistic Regression Model, Tuned Random Forest (Gridsearch) and Tuned XGBoost for a classification task in employee’s attrition rate
- Used test data as means to deciding Champion Model based on F1 score
- Created recommendations for Salifort Motor’s HR about employee satisfaction Levels
UFO Sightings Predictor 🛸👽👾
Click here to view on Github or visit the app: https://ufo-predictor.streamlit.app/

-
Used label encoding on countries.
-
Applied Logistic Regression modeling on the following features: seconds, latitude and longitude.
-
Used streamlit to show prediction results as a simple webapp that loaded the pickled model.
Data Visualizations
Google Capstone: Cyclistic Bikeshare Dashboard and Interactive Map 🗾🚲
Click here to view in Tableau Public

King County Washington Sales Dashboard 🏡
Click here to view in Tableau Public

Data Analysis
Google Capstone: Cyclistic Bikeshare Exploratory Data Analysis🚵♀️

- Used pandas & missingno (for easy visuals of null values)
- Used datetime for conversting objects to correct date format
- Seaborn & matplotlib for Visuals
- Used data to understand difference between annual vs casual riders to determine Marketing Strategy
Google Activity: Waze User Churn Model🚙

- EDA on Waze users
- Classification of Waze data: Built decision tree, random forest, and XGBoost to predict Waze user churn
Cereals! Exploratory Data Analysis 🥣

-
Leveraged six principles of EDA from the Google Analytics Course.
-
Explored data distributions across variables.
-
Applied Pearson’s Correlation Matrix and Feature Importance to understand what factors relate to higher ratings for cereals
Data Cleaning and Transformation
Data Exploration on Mayan Sites 🗺️

-
Explored the data in Google Sheets and pre cleaned the csv file
-
Used pandas to tidy up left dirty data, and visualized in Jupyter Notebook, used bar charts and an interactive map of Archaeological sites using folium
What should I Major? Data Exploration on the College Dataset 🏫

-
Cleaned (duplicates/nulls) and transformed data through merges and concatenations, addressing datatype mismatches.
-
Created simple insightful visualizations with seaborn & matplotlib
Data Exploration on Original Netflix Shows🍿

- Cleaning nulls
- Converting objects to pd.datetime
- Restructuring Columns
- Gained insights on Netflix Show genres, and produced seasons over time through visuals from matplotlib & seaborn
Simple App Guide

The idea behind the Guidebook for Computer/Mobile related tasks is to provide users with a comprehensive, easy-to-follow resource for navigating and efficiently completing common activities or troubleshooting issues on computers and mobile devices.
The site utilizes the Jekyll Theme: JustTheDocs to provide an easy navigation menu to list all the tutorials.