Reddit Confessions Through a Temporal Lens
Temporal NLP analysis of 50k Reddit posts with interactive visualizations.
A personal research project analyzing 50,000 Reddit posts through time-of-day and weekday/weekend lenses. The work focuses on rigorous timestamp normalization, feature engineering, and interpretable visualization.
Highlights
- DST-aware UTC to US/Eastern conversion and time feature engineering.
- NLP pipeline with TF-IDF 1-3 grams, Word2Vec bias axes, and t-SNE.
- Trend analysis and interactive exploration using Bokeh.
Tech Stack
- Python (pandas, NumPy, scikit-learn, gensim)
- matplotlib
- Bokeh