Skip to main content

Biological Sciences: Data Sources

Data Sources, selected list

NCBI: The National Center for Biotechnology Information
access biomedical and genomic information

Atmospheric Radiation Monitoring (ARM) Data Archive
preserves data collected through operations and scientific field experiments of the ARM Climate Research Facility.

Carbon Dioxide Information Analysis Center (CDIAC)
Primary climate-change data and information analysis center of the U.S. Department of Energy (DOE). CDIAC's data include records of the concentrations of carbon dioxide and other radiatively active gases in the atmosphere; the role of the terrestrial biosphere and the oceans in the biogeochemical cycles of greenhouse gases; emissions of carbon dioxide to the atmosphere; long-term climate trends; the effects of elevated carbon dioxide on vegetation; and the vulnerability of coastal areas to rising sea level.

Chesapeake Bay Environmental Observatory (CBEO)
Available for registering datasets of different types, and searching for CBEO-registered data or for data registered in all projects within the GEON family of federated portals.

Computational and Information Systems Laboratory (CISL) Research Data Archive
contains meteorological and oceanographic observations, operational and reanalysis model outputs, and remote sensing datasets to support atmospheric and geosciences research, along with topography/bathymetry, vegetation, and land use.

Dryad
International repository of data underlying peer-reviewed articles in the basic and applied biosciences. Dryad is governed by a consortium of journals that collaboratively promote data archiving and ensure the sustainability of the repository.

Geo.Data.gov
Geographic information system (GIS) portal, also known as the Geospatial One-Stop (GOS), contains thousands of geospatial metadata records, links to live maps, features, catalog services, downloadable data sets, images, clearinghouses, map files, and more.

Global Biodiversity Information Facility (GBIF)
Allows researchers to publish and discover biodiversity data—taxon primary occurrence data, taxonomic checklists and resource metadata—as part of a distributed global network.

Global Biotic Interactions:  http://www.globalbioticinteractions.org/about.html.
GloBI provides an infrastructure and data service that aggregates or combines existing biotic interaction datasets to provide easy access to biotic interaction data.

Ecological Society of America's Ecological Archives
Publishes materials supplemental to articles that appear in the ESA print journals (Ecology, Ecological Applications, and Ecological Monographs), and peer-reviewed Data Papers.

Knowledge Network for Biocomplexity (KNB)
National network designed to facilitate the discovery and analysis of distributed ecological and environmental datasets.

National Ecological Observatory Network (NEON)
Continental-scale research platform for discovering and understanding the impacts of climate change, land-use change, and invasive species on ecology. It will consist of distributed sensor networks and experiments to record and archive ecological data for at least 30 years using standardized protocols and an open data policy.

Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC)
Seeks to assemble, distribute, and archive data for research, education, and policy formulation in terrestrial biogeochemistry and the ecosystem dynamics of global environmental change. The ORNL DAAC archives data generated by NASA's Terrestrial Ecology Program.

Ocean Biogeographic Information System (OBIS)
Established by the Census of Marine Life (CoML). It is an evolving strategic alliance of people and organizations sharing a vision to make marine biogeographic data, from all over the world, freely available over the World Wide Web.

Paleobiology Database
Provides global, collection-based occurrence and taxonomic data for marine and terrestrial animals and plants of any geological age, as well as web-based software for statistical analysis of the data.

PANGAEA® (Publishing Network for Geoscientific and Environmental Data)
Open Access library aimed at archiving, publishing and distributing data from earth system research. The system guarantees reference and long-term availability of its content through data set citations using international standard formats and persistent identifiers (DOI).

Smithsonian Tropical Research Institute's (STRI) Center for Tropical Forest Science (CTFS)
Comprises a global network of large-scale and long-term studies that together monitor more than three million individual tropical trees, representing more than 6,000 tree species — nearly 10% of the world’s entire tropical tree flora.

TreeBASE
Relational database of phylogenetic information hosted by the Yale Peabody Museum. TreeBASE stores phylogenetic trees and the data matrices used to generate them from published research papers. TreeBASE accepts all types of phylogenetic data (e.g., trees of species, trees of populations, trees of genes) representing all biotic taxa.

USA National Phenology Network (USA-NPN)
Developing a list of registered phenology data sets to make available to the research community and the general public.

VegBank
Vegetation plot database of the Ecological Society of America's Panel on Vegetation Classification. Vegetation records, community types and plant taxa may be submitted to VegBank and may be subsequently searched, viewed, annotated, revised, interpreted, downloaded, and cited.

VertNet
Global museum database of vertebrate natural history collections. Over 84.3 million vertebrate records are shared online through four distributed database networks organized by biological discipline: MaNIS (mammalogy), HerpNET (herpetology), ORNIS (ornithology) and FishNet (ichthyology).

World Data Center for Human Interactions in the Environment
Archives and distributes global data sets related to population, sustainability, poverty, health, hazards, conservation, governance and climate. It is hosted by Columbia University Earth Institute's Center for International Earth Science Information Network (CIESIN).

Data repositories and more

To find additional data repositories:
Open Data: Open Access Directory of Data Repositories
[Simmons University]. List by subject, and current.
DataBib [Institute of Museum and Library Services]
Repositories [DataCite]

USEFUL sites:
Data Conservancy Organization - (NSF) collect, organize, & preserve data.
Many Eyes: data visualization tools from IBM.
Data Management & Publishing Checklist, MIT
NSF Division of Institution & Award Support

Others:
US Naval Observatory (USNO)  - Oceanography Portal: includes a range of astronomical data and products, & serves as the official source of time for the U.S. Department of Defense & a standard of time for the entire United States.

Copyright © 2014-2016 The Regents of the University of California. All rights reserved. Except where otherwise noted, this work is subject to a Creative Commons Attribution-Noncommercial 4.0 License.