Skip to content
View thtjs0076's full-sized avatar

Block or report thtjs0076

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
thtjs0076/README.md

Hi, I'm Sosun Yim๐Ÿ‘‹

๐ŸŽ“ Data Science @ UC Berkeley (expected May 2027) ๐Ÿ”ฌ Interested in Machine learning,Data analysis,and building data-driven projects ๐ŸŒฑ Currently studying SQL, statistics, and machine learning ๐Ÿ“ซ Open to Data Science / Data Analyst internships


๐Ÿ”ญ Featured Project

Spam Classifier โ€” Text classification on the Enron email dataset (~33K emails)

  • Compared Baseline LR, TF-IDF + LR, and fine-tuned DistilBERT
  • Best F1: 0.9927 (DistilBERT), with a latency vs. accuracy trade-off analysis
  • Investigated 9.6% duplicate data as a potential leakage source and re-validated results

๐Ÿ› ๏ธ Tech Stack

Python PyTorch scikit-learn Hugging Face Pandas NumPy SQL Java


๐Ÿ“Š GitHub Stats

Primula's GitHub stats

Top Languages

Popular repositories Loading

  1. spam-classifier spam-classifier Public

    Email spam classification: baseline to transformer

    Jupyter Notebook

  2. multilingual-tts-router multilingual-tts-router Public

    A language ID model from a class project, rebuilt into a real TTS routing pipeline for 5 languages

    Python

  3. ab-test-analysis ab-test-analysis Public

    An honest end-to-end look at one A/B test โ€” is the lift real, and should we ship it?

    Jupyter Notebook

  4. rl-gridworld-qlearning rl-gridworld-qlearning Public

    Value iteration and Q-learning built from scratch on my own gridworld โ€” and an honest look at where function approximation stops working

    Python

  5. thtjs0076 thtjs0076 Public