Overview

This report summarizes patterns found in the Date Me Directory dataset, combining quantitative summaries with visualizations and short interpretive writeups.

For all curious what a DateMe Doc is: imagine any dating profile, but instead of being on a dating app, it's a public document on the internet. On Google Docs, Notions, websites, etc, people have unlimited words and space to discuss their identity, relationships goals, interests, and values. The DateMe Directory dataset is the collection of those documents, and by analyzing them, we can uncover interesting trends about how people present themselves in the dating world.

I started this analysis as a project for my internship at Valyria Studios. As a data science major, I hoped to understand more about the usage of data science on fields I never considered, such as DateMe documents. This project helped me learn about data scraping, cleaning, and visualization. I hope you learn more about how our language is shaped by more factors than we may have considered.

Featured graphs

Graph directory

Loading graphs…

If this never changes, your manifest isn’t loading.

Appendix / Notes

  • First, this is a static analysis of a sample of profiles. DateMe Docs are constantly being added and deleted. The data for this project is the 323 accessible profiles that were available in December 2025.
    Second, it does not represent all users in the dataset. This project analyzes a non-random sample of public DateMeDirectory profiles, so findings shouldn't be generalized for the entire dating landscape.
    Third, while this data is public, the users were unaware a college freshman would be analyzing them.