← Back to projects

Reddit Confessions Through a Temporal Lens

Temporal NLP analysis of 50k Reddit posts with interactive visualizations.

A personal research project analyzing 50,000 Reddit posts through time-of-day and weekday/weekend lenses. The work focuses on rigorous timestamp normalization, feature engineering, and interpretable visualization.

Highlights

  • DST-aware UTC to US/Eastern conversion and time feature engineering.
  • NLP pipeline with TF-IDF 1-3 grams, Word2Vec bias axes, and t-SNE.
  • Trend analysis and interactive exploration using Bokeh.

Tech Stack

  • Python (pandas, NumPy, scikit-learn, gensim)
  • matplotlib
  • Bokeh