I am an Active Learner seeking to expand my knowledge base in Natural Language Programming, Machine Learning and Artificial Intelligence.
- Phone: 3439871466
- Email: manoj2375@gmail.com
- LinkedIn: 😄
- GitHub: 🤩
- Hugging Face: 🤗
Education
- Master’s of Computer Science
- University Of Ottawa, Canada Grade - A+ September 2022 - August 2024
- Courses: Machine Learning, Natural Language Processing, Virtual Assistants, Data Management, Distributed Database & Transaction Processing, Knowledge Representation, E-Commerce.
- Bachelor of Engineering (B.E)
- Visvesvaraya Technological University, India GPA: 8.23 / 10.0 August 2016 - August 2020
- Courses: Machine Learning, Natural Language Processing, Operating Systems, Networks.
Work Experience
Machine Learning Associate
Vector Institute, Toronto, Canada
- Duration: October 2023 - January 2024
- Curated high-quality datasets and enabled data cleaning to facilitate downstream Summarization, Q&A and Information Retrieval Tasks with 60% Domain-specific and 40% open-source data.
- Created an initial LLM pipeline to experiment with parameters and model tuning process (SFT) to enhance the performance of Mistral-7B, Zephyr-7B and LLaMA-2 with the QLoRA, LoRA and DPO techniques.
- Evaluated results on benchmark datasets such as CNN/DM, XSum, SQuAlity, SQuAD, and NEWTS also employed model distillation using GPT-4 for training and evaluation.
Machine Learning Engineer Intern
BlueGuardian, London, Canada
- Duration: February 2023 - May 2023
- Preprocessed 100k large data sets of emotions using Airflow and Pyspark and used the Pegasus text augmentation technique to do data cleaning which improved the data quality and sample size by 30%.
- Proficient in analyzing and elucidating deep learning models and NLP solutions in functional and process design, prototyping, testing, and collaboration with advanced engineering teams and executive leadership.
- Optimized XGBoost, and SVM models for high-performance emotion classification in the model architecture which could detect Sarcasm emotion with 85% accuracy.
Assistant System Engineer
Tata Consultancy Services, Bengaluru, India
- Duration: November 2020 - August 2022
- Product Design Interactions with business stakeholders to build new features using Java & Azure DB.
- Programmed the workflow for creating Rest-APIs for the web services in XML using Postman & Swagger.
- Followed Agile Methodology which improved the code quality and decreased the software bugs by 15%.
Skills and Abilities
Programming Languages
- Python, SQL (MySQL, OracleDB), NoSQL (MongoDB), C++, Java, JavaScript
Machine Learning Frameworks
- PyTorch, TensorFlow, Scikit-Learn, MLOps, LLMOps
Deep Learning
- Neural Networks, Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Transfer Learning
Natural Language Processing (NLP)
- BERT, GPT-3.5, GPT-4, Llama2, Mistral, Zephyr, Phi2, Mixtral 8x7B
- Named Entity Recognition (NER), Sentiment Analysis, Text Classification, Summarization, Code Synthesis
Data Science Libraries
- NumPy, Pandas, Matplotlib, Seaborn, Plotly
Feature Engineering
- Feature Extraction, Handling Missing Data, Outlier Detection
Machine Learning Algorithms
- Support Vector Machines (SVM), Decision Trees, Random Forests, Clustering (K-means), XG Boost, Ensemble Methods
Model Deployment
- Docker, AWS (S3, EC2, Lambda, Fargate, Bedrock, Runner, DynamoDB, RedShift), FastAPI, Flask, Kubernetes
Cloud Services
- AWS Cloud Services (S3, EC2, Lambda, SageMaker)
- Azure, Google Cloud (optional)
MLOps
- MLFlow
- Weights & Biases
- DagsHub
REST API Development
- FAST API
- Postman & Swagger for API documentation
Big Data Technologies
Version Control
Projects
Summarization of Customer Feedback
- Web Scraped data from shopping sites, preprocessed and cleaned 80k samples.
- Applied K-means Clustering and fine-tuned FLAN-T5 model for abstractive summaries.
- Finetuned FLAN-T5 model to generate abstractive summaries of feedback and evaluated with ROUGE & BLEU.
- Developed an ETL pipeline using Apache Airflow, Pyspark, and AWS Cloud to extract tweets from Twitter.
StackOverflow Tag Prediction
- Web Scraped data from StackOverflow, preprocessed and cleaned 80k samples.
- Fine-tuned BERT Model, improving F1-Score efficiency by 5%.
Certifications
- Supervised Machine Learning: Regression and Classification, DeepLearning.AI, Stanford University, Coursera.
- Unsupervised Learning, Recommenders, Reinforcement Learning, DeepLearning.AI, Stanford University, Coursera.
- Generative AI with Large Language Models, DeepLearning.AI, Stanford University, Coursera.
Feel free to explore my projects, and don’t hesitate to reach out for any questions or collaboration opportunities.
Thank you for visiting!