About Me
Full-stack Data Scientist with 2 years + of professional experience in ML, AI, Deep Learning and Big Data techniques to develop solutions for NLP, Computer Vision, Risk Modelling and Data Mining. Experienced in end-to-end development from conceptualization, data wrangling and analysis, building models, developing APIs and hosting the service on cloud platforms. Highly organized, motivated, diligent and a quick learner with keen interest in exploring & solving real-world problems using tech.
E-mail: anish.sanka@gmail.com
Mobile: +919561430389
Linkedin: https://www.linkedin.com/in/anish-sanka-12510a65
Github: https://github.com/anish-sanka/
Twitter: https://twitter.com/anishsanka
Work Experience
Senior Data Scientist @ HDFC Life Insurance Ltd (July 2020 - Present)
Data Scientist @ HDFC Life Insurance Ltd (July 2019 - June 2020)
- Built an OCR engine for KYC documents like Voter ID, Passport, PAN, Aadhaar using OpenCV, Tesseract, YOLO, Flask, Docker, Jenkins etc to transform digital journey for customer on- boarding. The API is built around deep learning based models for Image ID Classification, Text Extraction, Orientation& Skew correction and Image Quality enhancement.
- Developed a multi-lingual conversational NLP chatbot for customer queries using RoBERTa, Siamese Neural Network, Universal Sentence Encoder, Spacy, FastText, Google Translate API, FastApi for intent detection and Named Entity Recognition(NER) of user queries. Successful in reducing the calls to helpline by a whooping 70%. Also, implemented a sentiment analysis using gensim, tf-idf vectorize and Random Forest classifier to classify and monitor customer satisfaction of chatbot interactions.
- Developed an Age, Gender and BMI Predicting model from a selfie using Transfer Learning on VGG-19 CNN model. The idea is to spur millennials to engage with Life Insurance, successfully resulting in 20% rise in policies sold to the targeted demographics.
- Created a heart Arrhythmia detection model from ECG to reduce the time taken in Insurance underwriting. Trained using a 34 layer CNN on arbitrary length ECG time series.
Data Science Intern @ GEP (June 2018 - December 2018)
Global Leader in providing consulting and tech solutioons for procurement and supply chain management.
- Supplier De-duplication & Classification Engine for spend analytics for both real-time and batch based processes.
- Technologies used: Neo4j Graph DB, Cypher query language, Elastic Search, Pandas, Multi-class Text Classification, LibShortText.
- Enrich & Automate Master Client Data using BeautifulSoup scraper, NLP and Apache Spark.
Education
Bachelor of Engineering (Hons) in Computer Science
Birla Institute of Technology & Science, Pilani - Pilani, India (2019)
- Machine Learning based Prediction on vitamin interacting residues using PSI-BLAST to compute Position-specific scoring matrix and VitaPrad, an SVM based vitamin interacting prediction model under Prof. Sukanta Mondal.
- Creating a TIC TAC TOE Al, that should never lose to a human, using MIN-MAX algorithm and Alpha-Beta pruning in Python.
- Crime Classification prediction in San Francisco given the Time and Geo-location and historic data using Ensemble learning on Random Forest, Light GBM , Neural Nets models.
- Mining data to predict success of new deposits from customers based on C4.5 Algorithm, K-MAP Algorithm on Hadoop forming a multi-node cluster.
PROGRAMMING SKILLS
Python 3.0+, Deep Learning, NLP, Computer Vision, AI, Risk Modelling, Rule Engines, Tensorflow, Keras, PyTorch, CNN, LSTM, Flask, Django, Fastapi, SpaCy, BERT, huggingface,SQL, NoSQL DBs like Mongo, Elastic Search, Neo4j, Docker, Jnekins, AWS, Sagemaker, Hadoop.
CERTIFICATES & TRAININGS
Deep Learning Specialization from DeepLearning.ai on Coursera, Link to the Certificate (October 2020)
NLP Specialization from DeepLearning.ai on Coursera, Link to the Certificate (November 2020)
Architecting on AWS by Amazon AWS Training & Certification (November 2010)
Certification in Network Management by Nettech, Licence ID: 9140152 (October 2014)
INTERESTS
Badminton, Chess, Swimming, Travelling, Photography