# Sameer Raju Khadatkar **Quant AI/ML @ Wells Fargo | M.Tech. (CDS) @ IISc, Bangalore | B.Tech. (Mechanical) @ GCOE, Amravati** πŸ“ Hyderabad, Telangana, India πŸ“§ sameer123khadatkar@gmail.com πŸ”— [LinkedIn](https://www.linkedin.com/in/sameer-khadatkar/) --- ## Summary I currently serve as a Quantitative Analytics Specialist within Wells Fargo's Model Risk Management (MRM) team at India and Philippines. My primary responsibility involves validating AI/ML models, with a focus on fraud detection, as well as models used in marketing, credit scoring, and natural language processing (NLP). In this role, I ensure the conceptual soundness of models, conduct performance testing, conduct explainability analysis and rigorously challenge models by developing challenger models to detect weaknesses. Additionally, I ensure compliance with regulatory standards set by Wells Fargo, in alignment with guidelines from the Federal Reserve and the OCC. I work closely with model development and risk management teams, providing validation feedback and recommending improvements. I also contribute to documentation and reporting, preparing validation reports, and ensuring the ongoing monitoring of model performance. With a strong foundation in Machine Learning, Deep Learning, and High-Performance Computing gained during my graduate studies at the Indian Institute of Science, Bangalore, and a Bachelor's degree in Mechanical Engineering, I bring a unique blend of skills at the intersection of advanced technology and engineering. My expertise allows me to tackle complex challenges, drive innovation, and contribute to cutting-edge solutions in diverse industries. --- ## Professional Experience ### Wells Fargo International Solutions Private Ltd **Quantitative Analytics Specialist – AVP** πŸ“ Hyderabad, Telangana, India πŸ“… August 2022 – September 2023 - Collaborating with a team overseeing an inventory of ∼300 models focused on Fraud Detection, primarily utilizing Logistic Regression, Extreme Gradient Boosting (XGBoost), and Neural Network models. - Conduct validation of AI/ML models by ensuring conceptual soundness, performing performance testing, carrying out explainability analysis, and developing surrogate, challenger, and offset models to uncover potential weaknesses. - Joined the team during its expansion in India, playing a key role in building trust with US stakeholders. Recognized with the **Manager’s Spotlight Award** for outstanding dedication and contributions. - Developing a module to assist Validators in benchmarking anomaly detection models (Isolation Forest, Extended Isolation Forest, Autoencoders, Histogram-Based Outlier Score (HBOS), etc.) and assessing them using clustering performance metrics. - Created a validation playbook for fraud detection vendor models and developed an Excel-based policy library to facilitate quick reference for team members. --- ## Highlighted Projects at Wells Fargo ### βœ… Check Authorization Model | Validation - Validated a high-impact machine learning model for check authorization, ensuring compliance with regulatory and bank's MRM standards. - Reviewed model objectives, assumptions, architecture, and data pipeline. - Assessed performance using AUC, recall, KS statistic, and PSI across time. - Performed explainability analysis using multicollinearity checks, surrogate models (overall and segment level), SHAP, PDP, H-Statistic, 2D-PDPs, and sensitivity analysis. - Identified local weaknesses through segmentation and built offset models to detect missed signals. - Developed challenger models using YOLOv5, SigNet, TrOCR (Transformer-based OCR), XGBoost model, and pixel-based feature engineering. ### 🧠 Word Embedding Explainability Research - Collaborated with the Bank’s Chief Model Risk Officer on a research project focused on the explainability of word embeddings using clustering techniques such as Spectral Clustering, HDBSCAN, and analysis of ReLU neural network activation patterns. - Utilized Sentence Transformer embeddings (SBERT) and applied dimensionality reduction methods including PCA, UMAP, and t-SNE for cluster interpretation and visualization. - Extended the research by developing a Mixture of Experts model leveraging XGBoost. --- ## Education **Indian Institute of Science (IISc), Bangalore** πŸ“… 2020 – 2022 πŸŽ“ Master of Technology (M.Tech.), Computational and Data Sciences πŸ“ Bengaluru, Karnataka **CGPA:** 9.1 / 10.0 **Government College of Engineering, Amravati (GCoEA)** πŸ“… 2015 – 2019 πŸŽ“ Bachelor of Technology (B.Tech.), Mechanical Engineering πŸ“ Amravati, Maharashtra **CGPA:** 8.29 / 10.0 --- ## Certifications - Advanced Data Science with IBM (Coursera) - HYPERMESH (SHELL MESH AND SOLID MESH) - Introduction to Big Data (Coursera) - MASTERCAM (Design, Turning and Milling) - CREO PARAMETRIC --- ## Research Publication **Subspace Recursive Fermi-Operator Expansion Strategies for Large-Scale DFT Eigenvalue Problems on HPC Architectures** πŸ“ Sameer Khadatkar, Phani Motamarri (MATRIX Lab) πŸ“… July 20, 2023 πŸ“š *Journal of Chemical Physics, 159, 031102 (2023)* πŸ”— [Publication Link](https://pubs.aip.org/aip/jcp/article/159/3/031102/2903241/Subspace-recursive-Fermi-operator-expansion) - Implemented recursive Fermi-operator expansion methods on multi-node CPU (PARAM Pravega) and GPU (ORNL Summit) systems for large-scale DFT problems. - Applied mixed-precision strategies achieving 2Γ— to 4Γ— speedup over diagonalization. - Benchmarked using MPI and SLATE for distributed dense linear algebra. --- ## Academic, Independent and Other Projects - **LLM-Powered Multimodal Airline Chatbot**: Built a chatbot with GPT-4o-mini, supporting both text and voice, generating pop-art city images. Stack: Python, Gradio, custom tools. - **Future Stock Price Prediction for MAANG**: Used yfinance, Stateful LSTM vs XGBoost. LSTM outperformed with ~0.02 MAE. - **Duplicate Question Detection**: LSTM Siamese Network with Word2Vec and GloVe. GloVe performed better. - **Music Genre Classification**: Used MFCCs and spectral features. Best result: 76% Β± 3% accuracy with SVM. - **Algorithm Implementation from Scratch**: PCA, LDA, GMM, TF-IDF, and backpropagation for DNNs. --- ## Skills **Knowledge Areas:** Model Risk Management, Machine Learning, Deep Learning, High-Performance Computing **Programming Languages:** Python, C, C++ (OpenMP, MPI, CUDA), SQL **Python Libraries & Tools:** Numpy, Pandas, Scikit-Learn, PyTorch, TensorFlow (Keras), PySpark, Matplotlib --- ## Relevant Courses - Machine Learning for Signal Processing (IISc) - Advanced Data Science with IBM (Coursera) - Deep Learning (NPTEL) - Pattern Recognition and Neural Networks (NPTEL) - Numerical Linear Algebra (IISc) - Data Analysis and Visualization (IISc) - Numerical Solution of Differential Equations (IISc) - Parallel Programming (IISc) - Introduction to Big Data (Coursera) - LLM Engineering: Master AI, Large Language Models & Agents (Udemy) --- ## Extracurricular Activities - **Project Associate** at MATRIX Lab, CDS Department, IISc. - **Teaching Assistant** for β€œDS284: Numerical Linear Algebra” at IISc. - Led suspension operations for SAE BAJA Team at GCoE Amravati. - Organized Annual Social Gathering as Joint Secretary at GCoE Amravati. --- ## Top Skills - Data Reporting - SQL - Microsoft Excel