NYC Newborn Analysis 2011-2019

This article is a preprint and has not been peer-reviewed.

For citation:

Liu, H. et al. "NYC Newborn Analysis 2011-2019." GitData Archive, vol. 2024, no. 09, Sep. 2024, https://archive.gd.edu.kg/20240914190729/

Abstract:

Baby names come and go, and some evolve over time. We wanted to see whether sex could be predicted from ethnicity and name characteristics, and how these factors influence the sex associated with a name. We used a publicly available dataset from the City of New York containing the top 75 most popular names for each sex and ethnic category for each year from 2011 to 2019. Our analysis found a strong association between sex and whether a name ends with a vowel: female names were 10 times more likely to end with a vowel. Other factors, namely ethnicity, the number of syllables in the name, the length of the name, and whether the name starts with a vowel, had statistically significant but small correlations with sex.
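As a rough illustration of the name characteristics mentioned in the abstract, the sketch below derives a few spelling-based features and compares how often names in two groups end with a vowel. It is a minimal sketch, not the analysis code behind the reported results: the function names are hypothetical, the two name lists are made-up toy examples rather than the NYC data, "y" is assumed not to count as a vowel, and syllable counting is left out because it would need a pronunciation resource.

```python
# Assumption: only a, e, i, o, u count as vowels ("y" is excluded).
VOWELS = set("aeiou")


def name_features(name):
    """Return simple spelling-based features for one first name (hypothetical helper)."""
    lowered = name.strip().lower()
    return {
        "ends_with_vowel": lowered[-1] in VOWELS,
        "starts_with_vowel": lowered[0] in VOWELS,
        "length": len(lowered),
    }


def vowel_ending_rate(names):
    """Fraction of names in the list whose last letter is a vowel."""
    flags = [name_features(n)["ends_with_vowel"] for n in names]
    return sum(flags) / len(flags)


# Made-up toy lists for illustration only; these are not the NYC data.
toy_female = ["Olivia", "Emma", "Sophia", "Madison"]
toy_male = ["Liam", "Noah", "Ethan", "Joshua"]

female_rate = vowel_ending_rate(toy_female)
male_rate = vowel_ending_rate(toy_male)

# Ratio of the two proportions; this is undefined if male_rate is zero.
ratio = female_rate / male_rate
print(female_rate, male_rate, ratio)
```

A ratio of proportions is only one reading of "10 times more likely"; an odds ratio would be another, and the two can differ substantially when the proportions are large.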

License:

This work is licensed under CC BY 4.0.