About
I'm a passionate Machine Learning Engineer and Data Engineer with expertise in building intelligent AI systems and scalable data pipelines. My work focuses on transforming complex data into actionable insights and deploying production-ready ML solutions.
Currently, I specialize in Large Language Models (LLMs), Natural Language Processing, and MLOps. I have extensive experience with cloud platforms (AWS, Azure), building data engineering solutions with technologies like Apache Spark, Databricks, and Informatica.
My expertise spans the full ML lifecycle: from data engineering and model development to deployment and monitoring. I've worked on projects involving predictive modeling, computer vision, recommendation systems, and generative AI applications.
When I'm not coding, you can find me exploring the latest advancements in AI research, contributing to open-source projects, or sharing knowledge through technical writing and mentoring.
Experience
2023 — Present Lead the development and deployment of machine learning models for production systems. Build scalable ML pipelines using MLOps best practices. Mentor junior engineers and drive technical decisions for AI initiatives.
- Python
- TensorFlow
- PyTorch
- AWS SageMaker
- MLflow
- Kubernetes
- Docker
2021 — 2023 Designed and implemented large-scale data pipelines processing terabytes of data daily. Built ETL workflows using Apache Spark and Databricks. Optimized data warehouse performance and reduced query latency by 60%.
- Apache Spark
- Databricks
- AWS
- Python
- SQL
- Airflow
- Informatica
2020 — 2021 Developed computer vision models for object detection and image classification. Implemented NLP solutions for text analysis and sentiment classification. Collaborated with product team to integrate ML features into production applications.
- Python
- TensorFlow
- OpenCV
- NLTK
- FastAPI
- PostgreSQL
2019 — 2020 Performed exploratory data analysis and created interactive dashboards for business intelligence. Built predictive models for customer churn and revenue forecasting. Automated reporting processes using Python and SQL.
- Python
- SQL
- Tableau
- Power BI
- Pandas
- Scikit-learn
Projects
AI-Powered Chatbot with RAG
Built an intelligent conversational AI system using Retrieval-Augmented Generation (RAG) with LangChain and OpenAI. Implemented vector databases for efficient document retrieval and context-aware responses. Deployed on AWS with scalable infrastructure handling 1000+ daily queries.
- Python
- LangChain
- OpenAI
- Pinecone
- FastAPI
- Docker
- AWS
Featured ProjectReal-time Data Pipeline
Designed and implemented a real-time data processing pipeline using Apache Kafka and Spark Streaming. Processed millions of events per day with sub-second latency. Built monitoring dashboards and alerting systems for pipeline health.
- Apache Kafka
- Spark Streaming
- AWS
- Python
- Scala
- Grafana
Featured ProjectComputer Vision for Quality Control
Developed a deep learning model for automated defect detection in manufacturing. Achieved 98% accuracy using custom CNN architecture. Reduced quality inspection time by 75% and deployed to edge devices for real-time processing.
- Python
- TensorFlow
- OpenCV
- ONNX
- Flask
- Raspberry Pi
Featured ProjectRecommendation Engine
Built a hybrid recommendation system combining collaborative filtering and content-based approaches. Implemented using matrix factorization and neural networks. Increased user engagement by 40% and improved click-through rates.
- Python
- TensorFlow
- Scikit-learn
- Redis
- PostgreSQL
- Docker
Featured ProjectNLP Sentiment Analysis Platform
Created a multi-language sentiment analysis system for social media monitoring. Fine-tuned BERT models for domain-specific classification. Processed 100K+ posts daily with real-time sentiment tracking and trend analysis.
- Python
- Transformers
- BERT
- FastAPI
- MongoDB
- Kubernetes
Featured ProjectMLOps Platform
Established end-to-end MLOps infrastructure for model training, deployment, and monitoring. Implemented CI/CD pipelines for ML models with automated testing and validation. Reduced model deployment time from weeks to hours.
- MLflow
- Kubeflow
- Jenkins
- Docker
- Kubernetes
- AWS
- Python
Featured Project
Certifications & Awards
Certifications
- 2024
AWS Certified AI Practitioner
Amazon Web Services
- 2024
Informatica CDI R41 Professional Certification
Informatica
- 2024
IBM Data Science Professional Certificate
IBM
- 2024
Applied Data Science Capstone Specialization
Coursera
- 2024
DataBricks Fundamentals
Databricks
- 2024
Intel AI Specialization
Intel
Awards & Recognition
- 2024
MVP Award
Informatica
- 2024
PC Admin Certification
Informatica
- 2024
Volvo Recognition Award
Volvo
- 2024
Texpo Certificate
Tech Expo