The dplyr verbs youve learned are useful for exploring data.
R baby names dataset. 2015 US Baby Names For each year of birth YYYY after 1879 the Social Security Administration created a dataset which has the format namesexnumber where name is 2 to 15 characters sex is M male or F female and number is the number of occurrences of the name. Filter for only the year 1990. R-Data is a small web-based statistical application framework based on Drupal 9 and ℝ.
If you want to recreate it yourself run the files 1-downloadr 2-parserb and 3-cleanr in order. US Social Security applications are a great way to track trends in how babies born in the US are named. Contribute to hadleydata-baby-names development by creating an account on GitHub.
Datagov releases two datasets that are helplful for this. Using a dataset provided by the Social Security Administration I created functions with R to visualize and compare the popularity of names over time. You will need both R and ruby.
The comparison of two names over time and the comparison of a name against a birth year over time. This data was aggregated from the data made available from the social security administration. Note that only names with at least 5 babies born in the same year state are included in this dataset for privacy.
There are two functions. US baby names provided by the SSA. Diameter Height and Volume for Black Cherry Trees.
The package babynames contains a dataset babynames with a row for each year baby name and the assigned gender at birth which is incorrectly labeled sex. For each year from 1880 to 2017 the number of children of each sex given each name. Each dataset is sorted first on sex and then on number of occurrences in descending order.