"a relational database of over 400,000 public and private bills introduced in the U.S. House and Senate since 1947...organized in a format that facilitates quantitative studies, and also includes detailed information about each bill's sponsor."
"The Correlates of State Policy Project includes more than seven-hundred variables, with observations across the U.S. 50 states and time (1900 – 2016). These variables represent policy outputs or political, social, or economic factors that may influence policy differences across the states."
"Data collections include sociological surveys, election studies, longitudinal studies, opinion polls, and census data. Among the materials are international and European data such as the European Social Survey, the Eurobarometers, and the International Social Survey Programme."
links to several hundred "Internet sites of numeric Social Science statistical data, data catalogs, data libraries, social science gateways, addresses and more"
Most of the data sets listed are free. If an R! appears after a source it means that the data are already in R format or there exist R commands for directly importing the data from R.
Categories of Data: Data Science -- Economics -- Finance -- Government -- Health Care -- Machine Learning -- Networks -- Public Domain Collections -- Science -- Social Sciences -- Time Series -- Universities -- Microsoft R Server .XDF Datasets
"an open source application to publish, share, reference, extract and analyze research data. It facilitates making data available to others, and allows to replicate others work."
"contains a standard 'core' of demographic, behavioral, and attitudinal questions, plus topics of special interest. Many of the core questions have remained unchanged since 1972 to facilitate time-trend studies as well as replication of earlier findings."
"include surveys of health care professionals and organizations, investigations of access to medical care, surveys on substance abuse, and evaluations of innovative programs for the delivery of health care"
"provides interested researchers with access to the original data sets generated by PD&R-sponsored data collection efforts, including the American Housing Survey, HUD median family income limits, as well as microdata from research initiatives on topics such as housing discrimination, the HUD-insured multifamily housing stock, and the public housing population."
"an integrated set of data from 50 years (1962-2011) of the March Current Population Survey (CPS). The CPS is a monthly U.S. household survey conducted jointly by the U.S. Census Bureau and the Bureau of Labor Statistics."
"consists of more than fifty high-precision samples of the American population drawn from fifteen federal censuses and from the American Community Surveys"
"Inside Kaggle you’ll find all the code & data you need to do your data science work. Use over 19,000 public datasets and 200,000 public notebooks to conquer any analysis in no time."
"a static research collection of district level election results for approximately 350 national legislative elections in 26 countries that was maintained through 2003."
Dataset of the winners and judges of prizes for prose, poetry, or unspecified genre between 1918 and 2020 with a purse of $10,000 and over. The dataset includes details about the winners of fifty-two unique prizes awarded by twenty-two institutions. For a subset of thirty-nine prizes, it includes details about judges; not every prize has complete judge data. It does not include prizes awarded specifically for children’s literature, nonfiction, drama, or translation.
"acquires and preserves data relevant to gerontological research, processing as needed to promote effective research use, disseminates them to researchers, and facilitates their use"
"Publications: The Bibliography of Data-related Literature is a continuously-updated database of thousands of citations of works using data held in the ICPSR and NACDA archive. The works include journal articles, books, book chapters, government and agency reports, working papers, dissertations, conference papers, meeting presentations, unpublished manuscripts, magazine and newspaper articles, and audiovisual materials."
"The mission of the National Archive of Criminal Justice Data (NACJD) is to facilitate research in criminal justice and criminology, through the preservation, enhancement, and sharing of computerized data resources; through the production of original research based on archived data; and through specialized training workshops in quantitative analysis of crime and justice data."
Publications - Online Bibliography of Data-related Literature: "contains over 37,000 citations of published and unpublished works resulting from analyses of data held in the ICPSR archive. Publications include journal articles, books, book chapters, government and agency reports, working papers, dissertations, conference papers, meeting presentations, and unpublished manuscripts."
"collects and organizes data from various archived sources to trace changes in the national policy agenda and public policy outcomes since the Second World War."
"promotes the access and use of SAMHSA's substance abuse and mental health data by providing public-use data files and documentation for download and online analysis tools to support a better understanding of this critical area of public health"
"the most comprehensive compilation of information on the status of women in the world. The Project facilitates understanding the linkage between the situation of women and the security of nation-states. We comb the extant literature and conduct expert interviews to find qualitative and quantitative information on over 310 indicators of women's status in 174 countries. Our Database expands daily, and access to it is free of charge."