Google-certified Data Analyst • ML Engineer • Building real-world AI solutions
Toronto, Canada 🍁 • Open to internships and full-time roles
I’m a Data Science student and Google Certified Data Analyst focused on applying Data analysis and ML to real-world problems—especially in healthcare. I combine hands-on ML engineering with analytics rigor (SQL, experimentation, visualization) and have shipped end-to-end projects from data prep to deployment (MLOps).
- Google Data Analytics Professional Certificate
- Complete Machine Learning & NLP Bootcamp with MLOps & Deployment (Udemy)
- Strengths: Computer Vision, Predictive Modeling, NLP, SQL Analytics, MLOps
- Currently exploring: transformers, model monitoring, cloud-first ML
- Machine Learning: supervised/unsupervised, deep learning, CNNs, transformers, feature engineering, model tuning, evaluation
- Computer Vision: image classification, medical imaging, OpenCV
- NLP: text classification, sentiment, spam/phishing detection, embeddings, BERT
- Analytics: SQL, A/B testing, EDA, dashboards, storytelling
- Tools: Python, scikit-learn, TensorFlow, PyTorch, Pandas, NumPy, Matplotlib/Seaborn, Plotly
- Platforms: Git, Docker, CI/CD, Jupyter/Colab, BigQuery, Tableau, Power BI, Excel
Tech I use frequently: Python SQL TensorFlow PyTorch scikit-learn Pandas
Docker Git BigQuery Tableau Power BI
- Built a CNN to classify MRI brain scans for early-stage detection
- Emphasis on robust evaluation and class-imbalance handling
- Stack: Python · TensorFlow/Keras · OpenCV · Medical Imaging
Repo: https://github.com/Naeem1144/alzheimer-disease-detection
- Deep learning models to predict telecom churn with feature engineering and tuning
- Focus on actionable insights and retention levers
- Stack: Python · DL · Pandas · Feature Engineering
Repo: https://github.com/Naeem1144/customer-churn-prediction
- Regression models for Toronto housing prices with geo-feature enrichment
- Comprehensive EDA and interpretability
- Stack: Python · Regression · GeoPandas · Visualization
Repo: https://github.com/Naeem1144/greater-toronto-area-house-price-prediction
- NLP-based phishing detection with preprocessing and pipeline design
- Compared classical ML vs. modern embeddings
- Stack: Python · NLP · NLTK · Text Classification
Repo: https://github.com/Naeem1144/spam-email-detection-system
- Unsupervised clustering for market segmentation and persona discovery
- Stack: Python · K-Means · DBSCAN · PCA
Repo: https://github.com/Naeem1144/segmentation-project
- Complex SQL for train booking and retail analytics; schema design + optimization
- Stack: SQL · Data Modeling · Query Optimization
Repo: https://github.com/Naeem1144/sql-analysis
-
Google Data Analytics Professional Certificate
Skills: spreadsheets, SQL, Tableau, R, business cases, capstones -
Complete Machine Learning & NLP Bootcamp with MLOps & Deployment (Udemy)
End-to-end pipelines, NLP, Docker, deployment, monitoring
- Deep Learning: transformers, GANs, advanced CV
- Cloud + MLOps: model deployment, CI/CD, Docker
- Big Data: Spark fundamentals
- Math: linear algebra, stats, optimization
- Industry: AI/ML or Data oriented systems for measurable business impact
- Research: LLM pre/post-training, reinforcement learning, computer vision and Classical ML
- Startups: 0→1 product experimentation with rapid iteration
- Open Source: Commitment to contribute into ML/AI/DL tools that elevate community practice
- Read: papers, blogs, and docs daily
- Code: ship something small every day
- Build: focus on projects that solve real problems
- Share: document learnings to help others
- Collaborations on AI/ML/CV/NLP projects
- Paper discussions and idea jams
- startup ideas
- Hackathons and community learning
Reach out on LinkedIn or visit my
Portfolio.
Ask me about: machine learning, deep learning, computer vision, or my learning journey!
“In AI, we’re all students—the field evolves faster than any one person can master it.”
Currently coding from Toronto, Canada 🍁 • Open to opportunities worldwide 🌍



