Licensed Data Sources
This is an alphabetical list of data sources licensed by the U.Va. Library. Most are available online as databases but some are hosted on a server and can be downloaded and installed on a Windows computer. These datasets are also installed on a computer in Brown Library, Room i-043. If you have questions about how to access any of these resources, please contact our data librarian at firstname.lastname@example.org.
– A –
Alliance for Audited Media
The Media Intelligence Center contains circulation statistics for over 3,000 U.S. newspapers and magazines. Audited Reports and Publisher’s Statements provide information on total paid circulation for annual subscriptions and single copy sales, average prices, and circulation by issue and geographic region. Compare print and digital readership rates and find data on website usage, mobile app downloads, and social media interactions.
– B –
Burney Collection Newspapers: 17th and 18th Century
The newspapers, pamphlets, and books gathered by the Reverend Charles Burney (1757-1817) represent the largest and most comprehensive collection of early English news media. The present digital collection, that helps chart the development of the concept of ‘news’ and ‘newspapers’ and the “free press”, totals almost 1 million pages and contains approximately 1,270 titles.
– C –
CAA Database of Battles 1990
The U.S. Army Concepts Analysis Agency (CAA) Database of Battles contains information on more than 600 historical land combat battles and engagements that took place between 1600 AD and 1973 AD. Each record contains descriptive data, such as battle name, date, and location, the strengths and losses on each side, identification of the victor, temporal duration of the battle, and selected environmental and tactical environment descriptors.
China Data Online
This database provides access to various statistical yearbooks published by the National Bureau of Statistics of China.
China 2000 & 2010 County Population Census Data with GIS Maps
This collection GIS shapefiles from the China Data Center contains a county boundary map with more than 1000 comparable variables from the 2000 and 2010 population censuses and map layers for highways, national roads, provincial truck roads, railways, rivers, and coast lines.
CPS Utilities is an integrated set of tools to help researchers find variables of interests and extract their values from the Current Population Survey data files.
Cross-National Time-Series Data Archive
The Cross-National Time-Series Data Archive provides more than 200 years of annual data from 1815 onward for over 200 countries. Consists of 196 data variables used by academia, government, finance and media.
– D –
dataZoa gives you instant access to over 3 billion data series on a wide array of topics: economics, banking, finance, demographics, health, child well-being, environment, agriculture, energy and more. With one click, users can store reports in a personal sandbox, and the results update automatically from the primary source. Users can also upload their own data to their dataZoa account to combine it in displays or dashboards with data accessed from other sources. Create a dataZoa account using your university email address.
Data-Planet Statistical Datasets
Provides easy access to an extensive repository of standardized and structured statistical data. The repository contains more than 18.9 billion data points from over 70 source organizations, including the U.S. Census Bureau, Statistics Canada, and Zillow Real Estate Metrics. Search for data by keyword or browse 16 subject categories. Users can create charts, graphs, and maps and download data in a variety of file formats (e.g., Excel, SAS, XML, Shapefile) for use with statistical or GIS software.
Dave Leip Election Data
We have election data spreadsheets from Dave Leip. These highly-detailed spreadsheets include the total electoral vote, total popular vote, party share, and candidate vote by state, county, and town. Each file contains state, county, and town FIPS codes. We currently have: President (1996-2012), U.S. Senate (2008-2012), U.S. House (2000-2012), and Governor (2008-2012).
This website contains an extensive collection of economic times series data from U.S. government sources. These datasets can be manipulated to produce forecasts, regression analyses and charts or downloaded to Excel spreadsheets.
ECRI Lending to Households in Europe
The ECRI Statistical Package on Lending to Households in Europe is a collection of data on lending to non-financial corporations and households, including consumer credit, housing and other loans, in Europe, covering 40 countries: the 28 EU member states, two EU candidate countries (Turkey and the Former Yugoslav Republic of Macedonia), the EFTA countries (Iceland, Liechtenstein, Norway and Switzerland), four additional key global economies (the United States, Australia, Canada and Japan) and, for the first time, two emerging economies (India and Russia).
Esri Data for Education Programs
Esri Data for Education Programs contains demographic, lifestyle segmentation, and consumer spending data for a variety of geographic levels in the United States. They are in File Geodatabase Format for use with Esri ArcGIS software.
Euratlas Georeferenced Historical Vector Data
This is a collection of GIS data layers from the Euratlas Historical Atlas of Europe that together can be composed to make maps depicting the detailed political situation of Europe at the first year of each century. Each map is composed of two kinds of layers: physical features layers, such as seas and rivers, and political features layers, such as states and cities. We currently have the centuries: 1000, 1200, 1400, 1600, and 1800.
This database contains data and reports on more than 350 consumer product markets, including market size and market share. It also includes data on consumer trends and lifestyles, income and expenditures, and population for many countries.
Fairfax Countywide Property & Topography GIS Data
This data set includes over 40 layers of attributed vector data for Fairfax County in Virginia.
Gallup Analytics offers economic, well-being, and political polling data collected daily in the US since 2008, and the data can be broken down by state or MSA. Gallup also provides the World Poll, which captures economic, social, and well-being data for 160+ countries. Gallup Brain presents historical public opinion polling going back to 1935.
GeoLytics Time Series Research Package
Comparing data from different census years can be a difficult task due to changing geographic boundaries. This product assists researchers with comparisons of data across time by adjusting and weighting the census data to account for changes in geographies. It contains 1980, 1990, 2000, and 2010 census data in 2010 boundaries.
Historical Statistics of the United States
This is the standard source for quantitative indicators of American history. HSUS contains time series tables that cover the economic, social, political, demographic, and institutional history of the U.S. The introductory essays provide guidance on sources and reliability and interpretation. The five volume print set is located in the reference room.
Hoover’s contains profiles and key financial data for more than 80 million public and private companies in North America. Build a list of companies based on location, industry, number of employees, or annual sales.
Inter-university Consortium for Political and Social Research (ICPSR) is one of the world’s oldest and largest social science data repositories and contains many large-scale studies that are of interest to researchers. ICPSR also manages access to more than 1,150 restricted-use datasets.
This database contains time series data on lending, exchange rates, trade, and other economic and financial indicators for more than 200 countries. Our subscription includes access to all four datasets—International Financial Statistics (IFS), Direction of Trade (DOT), Balance of Payments (BOP), and Government Finance Statistics (GFS).
Infogroup U.S. Historical Business
Infogroup U.S. Historical Business collection contains establishment-level data for more than 24 million businesses, including the business name, location, industry classification code, number of employees, and sales volume.
Institutional Shareholder Services (ISS) Data
Shareholding voting results for all Russell 3000 companies from 2003-2015 and data for institutional votes for 2003-2014.
International Historical Statistics
IHS is a collection of statistics covering a wide range of socio-economic topics. It is a collection of datasets taken from hundreds of disparate primary sources, including both official national and international abstracts, back to 1750. Content is divided into three geographical areas: (1) Africa, Asia, Oceania, (2) Americas, and (3) Europe. Tables can be downloaded in Excel format.
Searchable database of US private and public businesses, Canadian private businesses, global businesses, industry segments, employers and jobs, as well as residents and demographics. Find company information, financial information, corporate “family tree” details, executive and people information, and more. Search results can be exported in CSV and XLS files.
Searchable database of over 35,000 public companies (active and inactive) worldwide. It also contains the D&B 20M private company database. Find company information, financial statements, annual reports, EDGAR filings, stock prices, and more. Users can compare and export company details and financial data to Excel.
Music Industry Data
Music Industry Data is a growing repository of historical and current data from Billboard, Official Chart Company, GfK Entertainment and many more reporting agencies from over 30 countries around the world. The arc of sales is presented in Relative Pitch Graphs™ which tell the story of the impact of music on society and cultures.
Nielsen Marketing Data
The Consumer Panel data includes information about product purchases made by a panel of consumer households across all retail outlets in all U.S. markets. The Retail Scanner Data consists of weekly purchase and pricing data generated from participating retail store point-of-sale systems in all U.S. markets. Access is limited to tenure-line faculty and the Ph.D. students whom they advise. Please contact email@example.com for a registration code.
Online access to statistical content produced by the Organization for Economic Cooperation and Development. OECD.stat enables users to search for and extract data from across many databases under different themes such as agriculture and fisheries, development, economic projections, education and training, energy, environment, finance, globalisation, health, national accounts, productivity, regional statistics, social and welfare, and more.
PolicyMap is an online data and mapping tool that enables researchers to access data about communities and markets across the United States. It contains over 15,000 continuously updated datasets related to demographics, mortgages and home sales, health, education, jobs and employment, and more. The 3-Layer Map tool enables users to find locations that match one or up to three criteria of data on a map. Users can also upload their own point-level data, such as a list of addresses.
Polling the Nations
Polling the Nations is an online database of public opinion polls containing the full text of 600,000+ questions and responses, from 18,000+ surveys and 1,700+ polling organizations, conducted from 1986 through the present in the United States and more than 100 other countries around the world.
ReferenceUSA is a database that contains directory information on more than 24 million businesses in the United States. It can be used to create lists of establishments, such as department stores or Starbucks in Charlottesville. Users can search and refine results based on company name, industry (NAICS or SIC), location, and a variety of business characteristics. Each record includes the company name, address, and phone number as well as information on number of employees, annual sales, facility size, and more.
Roper Center for Public Opinion Research
A vast archive of public opinion data from leading survey organizations like Gallup, Pew Research, and the Associated Press. The iPOLL databank contains over 20,000 datasets on topics such as congressional approval, healthcare, and gun control dating back to 1935.
This online resource provides quick and easy access to current and historical US census data. Users can create custom data reports at all geographic levels and download data in a variety of file formats for use with statistical software. It also provides US data of other topics such as crime, election, health and religion. Data of some other countries are also available, e.g., World Development Indicators, European statistics, census of Canada and the UK, and Ireland population and religion data.
Statistical Abstract of the United States
Convenient one-volume compendium of economic, social, political, and demographic data. The source notes can be used to find more detailed tables and machine-readable datasets.
Times Literary Supplement Archive
The Times Literary Supplement (TLS) is a weekly literary review published in London. This Archive contains downloadable image files of the TLS copies published in 1902 to 2011.
Thomas Rex Beverly Sound Files
Sound files include audio and metadata for “High Desert Ambiences” and “High Desert Chainsaw.”
UNIDO Industrial Statistics
Statistics on major indicators of industrial performance by country. We have the 2013 edition of all three databases–INDSTAT4, INDSTAT2, and IDSB.
USA Trade Online
The official source of current and cumulative U.S. export and import data for over 18,000 export commodities and 24,000 import commodities. UTO is now free to all users.
The Wisconsin Advertising Project (“WiscAds”) is a research project that analyzes how political candidates, political parties, and special interest groups communicate with voters via advertising.
Hosted Data Sources
These are data sources which are hosted by the U.Va. Library. Individual faculty, departments, and centers sometimes purchase access to resources for their own use, and are able to share that access with the rest of the University.
Access to the CMIE resources is courtesy of Associate Professor Sonal S. Pandya of the Department of Politics, Associate Professor Sheetal Sekhri of the Department of Economics, and the UVa Quantitative Collaborative.
You will need to create an account at CMIE to access these resources. There is no fee to do so.
CMIE CapExdx CapEx is a database of the progress in implementation of new capacity building projects in India. This unique database tracks projects from their announcement through their implementation and final closure. The closure could be completion or the abandonment of the projects. The CapEx database contains the history of implementation of projects since 1995. About 46,000 projects have been tracked.
CMIE Prowessdx Prowess is a database of the financial performance of companies. Annual Reports of companies, stock exchanges and regulators are the principal sources of the data. It delivers data for about 35,000 Indian companies. This includes listed companies, unlisted public companies and private companies of all sizes and ownership groups. It contains time-series data since 1990.