From 7c6c2e09e3ad1d41f26869cb7b9f9882175c8a6e Mon Sep 17 00:00:00 2001 From: Gertjan van den Burg Date: Tue, 10 Mar 2020 12:27:53 +0000 Subject: Initial commit --- datasets/us_population/README.md | 23 +++++++++++++++++++++++ 1 file changed, 23 insertions(+) create mode 100644 datasets/us_population/README.md (limited to 'datasets/us_population/README.md') diff --git a/datasets/us_population/README.md b/datasets/us_population/README.md new file mode 100644 index 0000000..106f4e0 --- /dev/null +++ b/datasets/us_population/README.md @@ -0,0 +1,23 @@ +# US Population + +This time series are the population numbers in the US. A potential change +point occurs around index 459 (1990s). + +Data obtained from +[Kaggle](https://www.kaggle.com/census/population-time-series-data#POP.csv). + +The original source of the data is the US Census Bureau. According to [this +page](https://web.archive.org/web/20191120160410/https://ask.census.gov/prweb/PRServletCustom/YACFBFye-rFIz_FoGtyvDRUGg1Uzu5Mn*/!STANDARD?pyActivity=pyMobileSnapStart&ArticleID=KCP-4726) +on the US Census website, we are allowed to redistribute the data as part of +this repository. + +Source: United States Census Bureau, URL: https://www.census.gov, Retrieved: +2019-08-28. + +To obtain ``./us_population.json`` from ``POP.csv``, simply run: + +``` +$ python convert.py POP.csv us_population.json +``` + +![Plot of us_population dataset](./us_population.png) -- cgit v1.2.3