Shakleen Ishfar

Rochester, New York, United States ยท shakleenishfar@gmail.com
I am
  1. An ex-Samsung Software Engineer with 2+ years of experience.
  2. A Data Science Masters student at University of Rochester, graduating in December 2024.
  3. An active Kaggler, competing in Natural Language Processing focused data science competitions. (Got a Bronze Medal on my first competition!)

Here's a sneak peak at my GitHub activity though out the year as a heatmap

Loading the data just for you.

Feel free to contact me using any of my socials below! Let's dive into data patterns and shape the future together!

Work Experience

University of Rochester

Masters Capstone

๐Ÿง  Recommended and ranked misconceptions associated with incorrect answers to mathematical multiple choice questions.

  • Finetuned several large language models (QWEN-2.5, LLaMa-3.1, Phi-3.5) using LoRA and BitsAndBytes to recommend relevant misconceptions for incorrect answer to multiple choice math questions.
  • Engineered a 2-stage pipeline to first generate candidate misconceptions using small LLMs and then use large LLMs to rank them, resulting in 0.64 mAP (mean average precision).
September 2024 - December 2024

University of Rochester Medical Center

Data Scientist Intern

๐Ÿง  Explored the impact of pharmacological agents on drug-induced migraines by analyzing pulsatility within the glymphatic system.

  • Published technical paper detailing an automated computational pipeline for analyzing two-photon micro- scope recordings of mice brain to study drug-induced migraines.
  • Leveraged computer vision (ResNET) to mask brain vessels from which diameter was calculated.
  • Visualized diameter changes over time using Plotly to identify drug-induced migraine patterns.
May 2024 - August 2024

University of Rochester

Graduate Research Assistant

๐Ÿ“ฐ Presented research work which contrasted news from local and international outlets to detect press freedom.

  • Finetuned large language model (LLaMa2 7B) to score news articles on the basis of sentiment and stance.
  • Utilized BERTopic to cluster news articles into relevant topics for topic wise comparison.
  • Designed robust statistical tests to find significant differences between local and international news.
January 2024 - April 2024

Samsung Research and Development Institute

Software Engineer

โŒš Designed smartwatch features to enhance end user experience.

  • Implemented three new notification features for the Galaxy Watch 6 series for notification module.
  • Improved reliability by increasing test coverage from 30% to 95% and resolving 30+ user-reported bugs.
August 2022 - March 2023

Intelligent Machines Limited

AI/ML Engineer

๐Ÿ‹๏ธ Trained, tested, depolyed, and maintained machine learning models for AI-powered products.

  • Trained and deployed object detection model (YOLOv3) to detect store banners for a retail client.
  • Engineered social media post analyzer using BERT to gather consumer feedback on marketing campaign.
AI/ML Engineer Intern

๐Ÿงน Performed data cleaning, processing, and ETL workloads to supply data scientists with clean data.

  • Developed image processing pipeline for boosting optical character recognition using OpenCV, Pillow, and NumPy.
  • Collected social media data using Scrappy and Selenium.
February 2020 - July 2022

Education

University of Rochester

Master of Science, Data Science
August 2023 - December 2024
  • CGPA: 4.00 / 4.00
  • Dean's Scholarship for tuition waiver
  • Research Experience
    • Computer Science RA: Developing foundational model for DNA and genomic data.
    • Neuro-science RA: Identifying distinctions in vessel pulsatility to detect drug induced migranes.
  • Teaching Experience
    • Data Structures and Algorithms (Summer 2024)
    • Introduction to Computational Statistics (Fall 2024)
  • Course work
    • Natural Language Processing
    • Data Mining
    • Data Science at Scale
    • Statistical Machine Learning
  • Extra-curricular
    • Data Science Graduate Study Group Leader (Fall 2024)

Islamic University of Technology

Bachelor of Science, Computer Science
January 2017 - March 2021
  • CGPA: 3.92 / 4.00
  • OIC Scholarship for tuition waiver
  • Research Experience: Abstractive text summarization using Transformers for Bangla language.
  • Competitions
    • 2nd among 150 teams: Intra IUT Programming Contest
    • 2nd among 50 teams: National Robotech Festival Start Expo
    • 29th among 200 teams: Dhaka ACM ICPC Regional Programming Contest
  • Course work
    • Data Structures and Algorithms
    • Object Oriented Programming in C++
    • Advanced Algorithms
    • Pattern Recognition
  • Extra-curricular
    • Workshop for Game Development in Unreal Engine 4 (2018)
    • Workshop for app development in Android Studio (2019)

Skills

Programming Languages

Adept in Python, SQL, and C++ programming languages, as evidenced by the collection of HackerRank badges.

Data Science Toolkit
Certifications

Projects

Kaggle Automated Essay Scoring 2.0

Developed a system of ensemble regression models coupled with carefully engineered features to score essays written by students.

April 2024 - Present

Kaggle PII Detection

Developed a system using ensemble of DeBERTa models to detect personally identifiable information (PII) tokens from long text sequences. Ranked 173rd out of 2048 participants achieving bronze medal.

January 2024 - April 2024

End-to-end Tweet Sentiment Analysis

Developed an end-to-end system to ingest and process tweet data and store them in delta lake storage system. Then used MLFLow to predict sentiment using a HuggingFace BERT model.

April 2024 - May 2024

News Outlet Freedom Detection

Used LLaMa-2 to analyze sentiment and stance of news articles. Afterwards, I statistically showed significant distinctions between local and international media coverage of the same news topic.

October 2023 - January 2024

Asteroid Mining Feasibility

Utilized statistical models and hypothesis testing to analyze the feasibility of asteroid mining on NASA's JPL asteroid dataset.

November 2023 - January 2024