Files
LLM_Engineering_OLD/week5/community-contributions/Personal Knowledge Worker/sameer-db/profile/Profile.md
2025-06-09 13:36:47 +05:30

7.2 KiB
Raw Blame History

Sameer Raju Khadatkar

Quant AI/ML @ Wells Fargo | M.Tech. (CDS) @ IISc, Bangalore | B.Tech. (Mechanical) @ GCOE, Amravati
📍 Hyderabad, Telangana, India
📧 sameer123khadatkar@gmail.com
🔗 LinkedIn


Summary

I currently serve as a Quantitative Analytics Specialist within Wells Fargo's Model Risk Management (MRM) team at India and Philippines. My primary responsibility involves validating AI/ML models, with a focus on fraud detection, as well as models used in marketing, credit scoring, and natural language processing (NLP). In this role, I ensure the conceptual soundness of models, conduct performance testing, conduct explainability analysis and rigorously challenge models by developing challenger models to detect weaknesses.

Additionally, I ensure compliance with regulatory standards set by Wells Fargo, in alignment with guidelines from the Federal Reserve and the OCC. I work closely with model development and risk management teams, providing validation feedback and recommending improvements. I also contribute to documentation and reporting, preparing validation reports, and ensuring the ongoing monitoring of model performance.

With a strong foundation in Machine Learning, Deep Learning, and High-Performance Computing gained during my graduate studies at the Indian Institute of Science, Bangalore, and a Bachelor's degree in Mechanical Engineering, I bring a unique blend of skills at the intersection of advanced technology and engineering. My expertise allows me to tackle complex challenges, drive innovation, and contribute to cutting-edge solutions in diverse industries.


Professional Experience

Wells Fargo International Solutions Private Ltd

Quantitative Analytics Specialist AVP
📍 Hyderabad, Telangana, India
📅 August 2022 September 2023

  • Collaborating with a team overseeing an inventory of 300 models focused on Fraud Detection, primarily utilizing Logistic Regression, Extreme Gradient Boosting (XGBoost), and Neural Network models.
  • Conduct validation of AI/ML models by ensuring conceptual soundness, performing performance testing, carrying out explainability analysis, and developing surrogate, challenger, and offset models to uncover potential weaknesses.
  • Joined the team during its expansion in India, playing a key role in building trust with US stakeholders. Recognized with the Managers Spotlight Award for outstanding dedication and contributions.
  • Developing a module to assist Validators in benchmarking anomaly detection models (Isolation Forest, Extended Isolation Forest, Autoencoders, Histogram-Based Outlier Score (HBOS), etc.) and assessing them using clustering performance metrics.
  • Created a validation playbook for fraud detection vendor models and developed an Excel-based policy library to facilitate quick reference for team members.

Highlighted Projects at Wells Fargo

Check Authorization Model | Validation

  • Validated a high-impact machine learning model for check authorization, ensuring compliance with regulatory and bank's MRM standards.
  • Reviewed model objectives, assumptions, architecture, and data pipeline.
  • Assessed performance using AUC, recall, KS statistic, and PSI across time.
  • Performed explainability analysis using multicollinearity checks, surrogate models (overall and segment level), SHAP, PDP, H-Statistic, 2D-PDPs, and sensitivity analysis.
  • Identified local weaknesses through segmentation and built offset models to detect missed signals.
  • Developed challenger models using YOLOv5, SigNet, TrOCR (Transformer-based OCR), XGBoost model, and pixel-based feature engineering.

🧠 Word Embedding Explainability Research

  • Collaborated with the Banks Chief Model Risk Officer on a research project focused on the explainability of word embeddings using clustering techniques such as Spectral Clustering, HDBSCAN, and analysis of ReLU neural network activation patterns.
  • Utilized Sentence Transformer embeddings (SBERT) and applied dimensionality reduction methods including PCA, UMAP, and t-SNE for cluster interpretation and visualization.
  • Extended the research by developing a Mixture of Experts model leveraging XGBoost.

Education

Indian Institute of Science (IISc), Bangalore
📅 2020 2022
🎓 Master of Technology (M.Tech.), Computational and Data Sciences
📍 Bengaluru, Karnataka
CGPA: 9.1 / 10.0

Government College of Engineering, Amravati (GCoEA)
📅 2015 2019
🎓 Bachelor of Technology (B.Tech.), Mechanical Engineering
📍 Amravati, Maharashtra
CGPA: 8.29 / 10.0


Certifications

  • Advanced Data Science with IBM (Coursera)
  • HYPERMESH (SHELL MESH AND SOLID MESH)
  • Introduction to Big Data (Coursera)
  • MASTERCAM (Design, Turning and Milling)
  • CREO PARAMETRIC

Research Publication

Subspace Recursive Fermi-Operator Expansion Strategies for Large-Scale DFT Eigenvalue Problems on HPC Architectures
📝 Sameer Khadatkar, Phani Motamarri (MATRIX Lab) 📅 July 20, 2023
📚 Journal of Chemical Physics, 159, 031102 (2023)
🔗 Publication Link

  • Implemented recursive Fermi-operator expansion methods on multi-node CPU (PARAM Pravega) and GPU (ORNL Summit) systems for large-scale DFT problems.
  • Applied mixed-precision strategies achieving 2× to 4× speedup over diagonalization.
  • Benchmarked using MPI and SLATE for distributed dense linear algebra.

Academic, Independent and Other Projects

  • LLM-Powered Multimodal Airline Chatbot: Built a chatbot with GPT-4o-mini, supporting both text and voice, generating pop-art city images. Stack: Python, Gradio, custom tools.
  • Future Stock Price Prediction for MAANG: Used yfinance, Stateful LSTM vs XGBoost. LSTM outperformed with ~0.02 MAE.
  • Duplicate Question Detection: LSTM Siamese Network with Word2Vec and GloVe. GloVe performed better.
  • Music Genre Classification: Used MFCCs and spectral features. Best result: 76% ± 3% accuracy with SVM.
  • Algorithm Implementation from Scratch: PCA, LDA, GMM, TF-IDF, and backpropagation for DNNs.

Skills

Knowledge Areas:
Model Risk Management, Machine Learning, Deep Learning, High-Performance Computing

Programming Languages:
Python, C, C++ (OpenMP, MPI, CUDA), SQL

Python Libraries & Tools:
Numpy, Pandas, Scikit-Learn, PyTorch, TensorFlow (Keras), PySpark, Matplotlib


Relevant Courses

  • Machine Learning for Signal Processing (IISc)
  • Advanced Data Science with IBM (Coursera)
  • Deep Learning (NPTEL)
  • Pattern Recognition and Neural Networks (NPTEL)
  • Numerical Linear Algebra (IISc)
  • Data Analysis and Visualization (IISc)
  • Numerical Solution of Differential Equations (IISc)
  • Parallel Programming (IISc)
  • Introduction to Big Data (Coursera)
  • LLM Engineering: Master AI, Large Language Models & Agents (Udemy)

Extracurricular Activities

  • Project Associate at MATRIX Lab, CDS Department, IISc.
  • Teaching Assistant for “DS284: Numerical Linear Algebra” at IISc.
  • Led suspension operations for SAE BAJA Team at GCoE Amravati.
  • Organized Annual Social Gathering as Joint Secretary at GCoE Amravati.

Top Skills

  • Data Reporting
  • SQL
  • Microsoft Excel