Atlanta, GA (Open to Relocate)

Entry-Level Data Analyst

Python, SQL, & Power BI

Turning Raw Data into Clear Business Insights

Data AnalysisSQLPythonMicrosoft Power BITableau
3+
Years Experience
95%
Reporting Accuracy
100K+
Records Processed
Scroll to explore

About Me

Data Analyst with 3+ years of experience working across analytics, data pipelines, and reporting workflows in finance, IT, and academic environments. Skilled in Python, SQL, Power BI, and Excel to support data-driven decision-making and operational insights.

Work Experience

ABM Industries
Data Analyst Intern
June 2025 – August 2025
United States
  • Migrated and optimized Oracle stored procedures from Oracle Fusion to Azure SQL, improving performance by 50% from 1 hour to 30 minutes and reducing FP&A close-cycle delays through PL/SQL-to-T-SQL conversion, query rewrites, and indexing strategies.
  • Validated 200K financial records during Oracle-to-Azure SQL migration, reducing data discrepancies from 35% to under 5% and enabling production deployment for FP&A reporting through Python validation scripts and systematic SQL mismatch resolution
  • Developed automated pipeline for ABM operational documentation, generating 735 Q&A pairs from 30 unstructured PDFs into structured training data and enabling future Azure OpenAI chatbot training through PDF text extraction, document chunking, and GPT-4o-mini prompt engineering.
Habib University
Research Assistant
July 2022 – July 2024
Pakistan
  • Designed 34 Python and SQL lab assignments and exercises, reducing TA workload from 8 to 4 hours per week and lowering instructional support costs across 100+ students through structured documentation, modular code templates, and iterative design refinements.
  • Facilitated weekly 3-hour Python and SQL labs for 30+ students, achieving 95%+ lab assignment completion rates and preparing students for advanced coursework and technical projects through hands-on coding guidance, troubleshooting, and personalized feedback.
  • Mentored 10 student research projects in Deep Learning as Research Assistant across two course offerings, resulting in two first-time IEEE conference paper acceptances and establishing a reusable research pipeline for future student cohorts through weekly checkpoint meetings, Python code reviews, experimental methodology guidance, technical debugging, and iterative documentation feedback.
Parents Voice Association – UJALA Centre
Web Developer (Part-time & Remote)
June 2021 – May 2022
Pakistan
  • Built full-stack management system for special education NGO, digitizing student/teacher records, fee tracking, sponsorships, and certificate workflows into centralized database and replacing manual paper-based administrative processes with web-based CRUD interface through MERN stack with MongoDB, role-based authentication, and RESTful API design.
Ismail Industries Limited
IT Intern
July 2021 – August 2021
Pakistan
  • Parsed 100K+ daily unstructured Trend Micro syslog records into six CSV files by security event category, replacing vendor-limited reports with structured datasets and enabling custom analysis of firewall, malware, and monitoring events through Python-based signature classification.
  • Built automated Python ETL pipeline loading parsed security event CSV files into an MS SQL database daily, replacing manual file searches with instant SQL queries and enabling cross-category trend analysis through relational data modeling and scheduled batch processing.
  • Developed Power BI dashboard connected to SQL security database, automating daily security reporting for IT leadership to monitor threats, investigate incidents, and inform antivirus security policy decisions through interactive visualizations and event filtering.

Technical Skills

💻

Programming Languages

Python, SQL, R, JavaScript, TypeScript, C++

📈

Data Analysis Tools

Excel, Power BI, Tableau

🗄️

Databases

PostgreSQL, MS SQL Server, Azure SQL Database, MongoDB, SQLite

⚙️

Data Processing

Data Cleaning, Data Validation, Data Ingestion, Data Migrations

🔎

Statistical Analysis

Descriptive Statistics, Hypothesis Testing, Regression Analysis

📊

Data Visualization

Matplotlib, D3.js

🤖

Machine Learning & AI

TensorFlow, PyTorch, Scikit-learn

🛠️

Software Engineering & Tools

Node.js, Express.js, REST APIs, JWT, React, Git, GitHub, VS Code

Education

Emory University
Masters, Computer Science
GPA: 4.0/4.0 | August 2024 - December 2025
Habib University
Bachelors, Computer Science
GPA: 3.85/4.0 | August 2018 - June 2022

My Projects

A showcase of projects demonstrating machine learning, visualization, and end-to-end pipeline development.

🚕

NYC Yellow Taxi Analytics Platform

August 2025 – December 2025

Built full-stack analytics platform ingesting 20M+ NYC taxi trip records from January to August 2025 into PostgreSQL, implementing five materialized views and eleven strategic indexes to enable interactive analysis of fare trends, peak demand patterns, and vendor performance.

PostgreSQLPython
📊

Gender Wage Gap Scrollytelling

Jan 2025 – May 2025

Built wage visualization platform processing 344K records, creating multi-dimensional aggregations across year, age, occupation, race, and education, and delivering comparative gender and racial pay-gap insights through an interactive scrollytelling interface using Python and D3.js.

D3.js
🔗

Graph-Based Fraud Detection in Cryptocurrency Networks

Jan 2025 – Apr 2025

Conducted a comparative study of traditional and graph-based machine learning models for detecting illicit cryptocurrency transactions, achieving up to 0.97 accuracy and 0.88 macro F1 on the Elliptic dataset and 0.9857 accuracy with 0.9312 macro F1 on the Ethereum dataset.

PythonScikit-learnPyTorch GeometricGCNGATGraphSAGEGraph Transformers
📰

Fake News Detection Using NLP Models

August 2024 – December 2024

Trained and evaluated NLP models on the ISOT Fake News Dataset (44,898 articles), achieving up to 100% accuracy, precision, recall, and F1-score using RoBERTa for real versus fake news classification.

PythonScikit-learnLSTMBERTRoBERTa
🔐

Membership Inference Attacks on Personalized Differential Privacy Models

August 2024 – December 2024

Evaluated empirical privacy risks of Personalized Differential Privacy using membership inference attacks, achieving up to 98.59% test accuracy on MNIST while observing AUC values closely aligned with individualized privacy budgets across MNIST, CIFAR-10, and SVHN.

PythonPyTorchDifferential PrivacyIDP-SGDCNNMembership Inference Attacks
🦎

Camouflaged Animal Detection Using YOLOv5

January 2022 – May 2022

Built a data-centric object detection pipeline for camouflaged animals, achieving up to 69.78% mAP@0.5 on real-world wildlife data despite extreme foreground–background similarity.

PythonPyTorchYOLOv5Computer VisionOpenCV
🧠

Compression-Based Perceiver for Image Classification

August 2021 – May 2022

Evaluated a Perceiver model trained on compressed image embeddings instead of raw pixels, achieving up to 94.4% classification accuracy while reducing the computational cost of vision transformers.

PythonPyTorchPerceiverAutoencodersSupervised Contrastive LearningResNet
🧵

Textile Design Generation Using Generative Adversarial Networks

August 2021 - December 2021

Trained and compared DCGAN, StyleGAN, and VAE models on approximately 15,000 textile images across six pattern categories, generating designs at 64×64 and 256×256 resolutions for automated pattern synthesis.

PythonPyTorchDCGANStyleGANVariational Autoencoder
🏥

Pharmacy Management System

January 2021 – August 2021

Built a full-stack pharmacy management system implementing 4 role-based workflows, 8 REST API modules, and 5 MongoDB data models using dual Angular frontends and a Node.js backend.

Node.jsExpress.jsMongoDBAngular 9TypeScriptRestful APIs
❤️

Cardiac Arrest Risk Prediction Using Decision Trees

August 2020 – December 2020

Built a decision tree classifier from scratch to predict cardiac arrest risk, achieving 81.11% accuracy on real clinical data with an interactive diagnostic interface.

PythonNumPyPandasDecision TreesTkinterMatplotlibSeaborn

Resume

Download or view my complete professional resume with detailed experience, skills, and achievements.

Resume Preview

PDF viewer may not work on mobile

Open Resume in Browser

Quick Access

Scan these QR codes with your phone to quickly access my resume and LinkedIn profile

Resume PDF

LinkedIn Profile

Get In Touch

Interested in collaborating on data analytics projects or discussing opportunities? I'd love to hear from you. Let's connect and explore how we can work together.

Contact Information

Phone

(943) 241-3640

Location

Atlanta, GA (Open to Relocate)

Quick Response

I typically respond within 24 hours.

Best times to reach me:

Best times: Mon–Fri, 9 AM – 6 PM.

Response time: Usually within 24 hours