Antarctic Penguin Biogeography Project: Database of abundance and distribution for the Adélie, chinstrap, gentoo, emperor, macaroni, and king penguin south of 60 S

Sampling event data
Latest version published by SCAR - AntOBIS on Apr 17, 2023
Publication date: 17 April 2023
Published by: SCAR - AntOBIS
License: CC-BY 4.0

Download the latest version of this resource as a Darwin Core Archive (DwC-A), or the resource metadata in EML or RTF format:

Data as a DwC-A file (zip): download 4,055 records in English (390 KB) - Update frequency: annually
Metadata as an EML file: download in English (29 KB)
Metadata as an RTF file: download in English (22 KB)


The Antarctic Penguin Biogeography Project is an effort to collate all known information about the distribution and abundance of Antarctic penguins through time and to make such data available to the scientific and management community. The core data product involves a series of structured tables with information on known breeding sites and surveys conducted at those sites from the earliest days of Antarctic exploration through to the present. This database, which is continuously updated as new information becomes available, provides a unified and comprehensive repository of information on Antarctic penguin biogeography that contributes to a growing suite of applications of value to the Antarctic community. Two such applications are the Mapping Application for Penguin Populations and Projected Dynamics (MAPPPD), a browser-based search and visualization tool designed primarily for policymakers and other non-specialists (Humphries et al., 2017), and ‘mapppdr’, an R package developed to assist the Antarctic science community. The Antarctic Penguin Biogeography Project has been funded by the National Aeronautics and Space Administration (NASA), the Pew Fellowship for Marine Conservation, and the Institute for Advanced Computational Science at Stony Brook University.

Antarctic Penguin Biogeography Project: Database of abundance and distribution for the Adélie, chinstrap, gentoo, emperor, macaroni, and king penguin south of 60 S is an occurrence and sampling event type dataset published by SCAR-AntOBIS.

This dataset contains records of Pygoscelis adeliae, Pygoscelis antarctica, Pygoscelis papua, Eudyptes chrysolophus, Aptenodytes patagonicus, and Aptenodytes forsteri annual nest, adult, and/or chick counts conducted during field expeditions or collected using remote sensing imagery, that were subsequently gathered by the Antarctic Penguin Biogeography Project from published and unpublished sources, at all known Antarctic penguin breeding colonies south of 60 S from 1892-11-01 to 2022-02-12.

The data are published as a standardized Darwin Core Archive that includes an event core with occurrence and eMoF extensions. This dataset is published by SCAR-AntOBIS under the license CC-BY 4.0. Please follow the guidelines from the SCAR Data Policy (SCAR, 2023) when using the data. If you have any questions regarding this dataset, please contact us via the contact information provided in the metadata.

This dataset is part of the Antarctic Penguin Biogeography Project, funded by the National Aeronautics and Space Administration (NASA), the Pew Fellowship for Marine Conservation, and the Institute for Advanced Computational Science at Stony Brook University.

Data Records

The data in this sampling event resource have been published as a Darwin Core Archive (DwC-A), the standard format for sharing biodiversity data as a set of one or more data tables. The core data table contains 4,055 records.

2 extension data tables also exist. An extension record supplies additional information about a core record. The number of records in each extension data table is illustrated below.

Event (core): 4,055 records
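
As an illustration of how an event core and its extension tables fit together, the following minimal sketch reads tab-delimited tables from an archive and attaches each extension row to its core record. The file names (`event.txt`, `occurrence.txt`, `extendedmeasurementorfact.txt`) and the link column `id` are assumptions, not taken from this resource; the `meta.xml` file inside the real archive is the authoritative description of its files and columns. The demo archive is built in memory from made-up rows and contains no real survey data.

```python
import csv
import io
import zipfile
from collections import defaultdict

# Assumed file names and link column; check meta.xml in the real archive.
CORE_FILE = "event.txt"
EXTENSION_FILES = ("occurrence.txt", "extendedmeasurementorfact.txt")
LINK_COLUMN = "id"


def read_table(archive, name):
    """Read one tab-delimited table from an open archive into a list of dicts."""
    with archive.open(name) as handle:
        text = io.TextIOWrapper(handle, encoding="utf-8", newline="")
        return list(csv.DictReader(text, delimiter="\t"))


def load_star(source):
    """Map each core id to its event row and the extension rows that point at it."""
    with zipfile.ZipFile(source) as archive:
        star = {
            row[LINK_COLUMN]: {"event": row, "extensions": defaultdict(list)}
            for row in read_table(archive, CORE_FILE)
        }
        for name in EXTENSION_FILES:
            for row in read_table(archive, name):
                # Raises KeyError if an extension row points at an unknown core id.
                star[row[LINK_COLUMN]]["extensions"][name].append(row)
    return star


def build_demo_archive():
    """Build a tiny in-memory archive with made-up rows (not real survey data)."""
    tables = {
        "event.txt": "id\teventDate\nEV1\t2000-01-01\nEV2\t2001-01-01\n",
        "occurrence.txt": (
            "id\toccurrenceID\tscientificName\n"
            "EV1\tOC1\tPygoscelis adeliae\n"
        ),
        "extendedmeasurementorfact.txt": (
            "id\toccurrenceID\tmeasurementType\tmeasurementValue\n"
            "EV1\tOC1\tnest count\t100\n"
        ),
    }
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        for name, content in tables.items():
            archive.writestr(name, content)
    buffer.seek(0)
    return buffer


star = load_star(build_demo_archive())
for core_id, record in star.items():
    counts = {name: len(rows) for name, rows in record["extensions"].items()}
    print(core_id, record["event"]["eventDate"], counts)
```

To use the sketch on a downloaded archive, pass its path to `load_star` in place of the demo buffer, after adjusting the assumed names to match `meta.xml`.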

This IPT archives the data and thus serves as the data repository. The data and resource metadata are available for download in the downloads section. The versions table lists the other versions of the resource that have been made publicly available and allows tracking changes made to the resource over time.



How to cite

Researchers should cite this work as follows:

Che-Castaldo C, Humphries G, Lynch H, Van de Putte A (2023). Antarctic Penguin Biogeography Project: Database of abundance and distribution for the Adélie, chinstrap, gentoo, emperor, macaroni, and king penguin south of 60 S. Version 2.3. SCAR - AntOBIS. Samplingevent dataset.


Researchers should respect the following rights statement:

The publisher and rights holder of this work is SCAR - AntOBIS. This work is licensed under a Creative Commons Attribution (CC-BY) 4.0 License.

GBIF Registration

This resource has been registered with GBIF and assigned the following GBIF UUID: f7c30fac-cf80-471f-8343-4ec5d8594661. SCAR - AntOBIS publishes this resource and is itself registered in GBIF as a data publisher endorsed by the Ocean Biodiversity Information System.
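
The GBIF UUID can serve as a dataset key when querying GBIF programmatically. The sketch below only builds request URLs for GBIF's public REST API (`https://api.gbif.org/v1`), which is not mentioned on this page and is included here as an assumption about how a reader might look the dataset up; the helper names are hypothetical. It makes no network requests.

```python
import uuid
from urllib.parse import urlencode

GBIF_API = "https://api.gbif.org/v1"  # GBIF public API base URL (not from this page)
DATASET_KEY = "f7c30fac-cf80-471f-8343-4ec5d8594661"  # UUID stated above


def dataset_url(key):
    """URL of the dataset's metadata record."""
    uuid.UUID(key)  # raises ValueError if the key is not a well-formed UUID
    return f"{GBIF_API}/dataset/{key}"


def occurrence_search_url(key, limit=20, offset=0):
    """URL of one page of occurrence records belonging to the dataset."""
    uuid.UUID(key)
    query = urlencode({"datasetKey": key, "limit": limit, "offset": offset})
    return f"{GBIF_API}/occurrence/search?{query}"


print(dataset_url(DATASET_KEY))
print(occurrence_search_url(DATASET_KEY, limit=5))
```

Either URL can then be fetched with any HTTP client; paging through results is done by increasing `offset`.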




Contacts

Christian Che-Castaldo
  • Metadata Provider
  • Originator
  • Point Of Contact
Stony Brook University
Grant Humphries
Heather Lynch
  • Metadata Provider
  • Originator
Stony Brook University
Anton Van de Putte
  • Metadata Provider
  • Point Of Contact
Royal Belgian Institute of Natural Sciences

Geographic Coverage

Penguin breeding colonies located at or very near sea level distributed around the Antarctic continent and along the Antarctic Peninsula as well as on outlying islands in the Southern Ocean.

Bounding Coordinates South West [-77.71, -157.7], North East [-60.55, 171.17]
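
A bounding box like this one can be used as a coarse sanity check on record coordinates. The sketch below assumes each corner is given as [latitude, longitude] in decimal degrees and that the box does not cross the antimeridian; the class name and the two test points are illustrative and are not records from the dataset.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoundingBox:
    south: float
    west: float
    north: float
    east: float

    def contains(self, latitude, longitude):
        """True if the point lies inside the box (edges included)."""
        return (self.south <= latitude <= self.north
                and self.west <= longitude <= self.east)


# Corner values from the geographic coverage above, read as [latitude, longitude].
COVERAGE = BoundingBox(south=-77.71, west=-157.7, north=-60.55, east=171.17)

print(COVERAGE.contains(-64.8, -64.1))  # illustrative point near the Antarctic Peninsula
print(COVERAGE.contains(-54.0, -37.0))  # illustrative point north of 60 S
```

A point that falls outside the box is not necessarily wrong, but it is worth checking against its locality before use.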

Taxonomic Coverage

No description available

Kingdom Animalia
Phylum Chordata
Class Aves
Order Sphenisciformes
Family Spheniscidae
Genus Pygoscelis, Eudyptes, Aptenodytes
Species Pygoscelis adeliae, Pygoscelis antarctica, Pygoscelis papua, Eudyptes chrysolophus, Aptenodytes patagonicus, Aptenodytes forsteri

Temporal Coverage

Start Date / End Date 1892-11-01 / 2022-02-12

Project Data

The objective of this project is to compile all available data on the distribution and abundance of Antarctic penguins, and to make that data available for the scientific and management communities.

Title Antarctic Penguin Biogeography Project
Identifier NASA Awards: NNX14AC32G and 80NSSC21K1027
Funding This project was initially funded by NASA Award NNX14AC32G under the NASA Ecosystem Forecasting program. Continued funding for database expansion and updating was provided by NASA Award 80NSSC21K1027 under the Biodiversity program, a 2022 Pew Fellowship for Marine Conservation, and the Institute for Advanced Computational Science at Stony Brook University.

The personnel involved in the project:

Christian Che-Castaldo
Heather Lynch
Grant Humphries

Sampling Methods

This database includes all known records of penguin breeding abundance and distribution south of 60 S. Data sources include peer reviewed scientific manuscripts, expedition reports and other public datasets outside the scientific literature, management and policy documents, and private communications. Abundance estimates are derived primarily from direct ground counting, imagery collected by remotely-piloted aircraft systems, and satellite imagery. Additional data types include counts from aerial, ground, or vessel-based photographs.

Study Extent This dataset describes the abundance and distribution of six species of Antarctic breeding penguins (Adélie, gentoo, chinstrap, emperor, macaroni, king) at all sites south of 60 S. Data include all known historical data starting in 1979 and additional records as available prior to 1979.
Quality Control
- All records were validated.
- Coordinates were verified and plotted on a map to confirm that each geographical location corresponds to its locality.
- All scientific names were checked for typos and matched to the taxonomic backbone of the World Register of Marine Species (WoRMS), and an LSID was assigned to each taxon as scientificNameID.
- Event dates and times were validated to be in ISO 8601 format.
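
Checks of this kind are straightforward to repeat on a downloaded copy. The sketch below is not the publisher's validation code; it is a minimal example that assumes event dates are single calendar dates written as YYYY-MM-DD (date ranges or year-only values, which ISO 8601 also allows, would be flagged here) and that coordinates are decimal degrees. The function names are hypothetical; the date limits come from the temporal coverage stated for this dataset, and the latitude limit from its 60 S boundary.

```python
from datetime import date

COVERAGE_START = date(1892, 11, 1)  # from the temporal coverage
COVERAGE_END = date(2022, 2, 12)
NORTHERN_LIMIT = -60.0              # dataset covers sites south of 60 S


def check_event_date(value):
    """Return a list of problems found in a YYYY-MM-DD event date string."""
    try:
        parsed = date.fromisoformat(value)
    except ValueError:
        return ["not an ISO 8601 calendar date"]
    if not COVERAGE_START <= parsed <= COVERAGE_END:
        return ["outside the stated temporal coverage"]
    return []


def check_coordinates(latitude, longitude):
    """Return a list of problems found in a decimal-degree coordinate pair."""
    problems = []
    if not -90.0 <= latitude <= 90.0:
        problems.append("latitude out of range")
    elif latitude > NORTHERN_LIMIT:
        problems.append("north of 60 S")
    if not -180.0 <= longitude <= 180.0:
        problems.append("longitude out of range")
    return problems


print(check_event_date("2010-12-25"))
print(check_event_date("2010-13-25"))
print(check_coordinates(-64.8, -64.1))
print(check_coordinates(-54.0, 200.0))
```

Returning a list of problems rather than raising on the first one makes it easy to report every issue for a record in a single pass.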

Method step description:

  1. Data contained within the core database include information collated from peer reviewed scientific manuscripts, expedition reports and other public data outside the scientific literature, management and policy documents, and private communications. These data contain information on breeding pairs of penguins only, and do not contain information on non-breeding distributions or sightings-at-sea. The fundamental unit of this database is the breeding “site”, which represents a population breeding on a single island or, in some cases, a discrete area of a larger landmass. In some cases, a geographically distributed population may be divided into multiple “sites” depending on the logistics of ground surveys, where each “site” can be accessed from a single landing location along the coastline. Rarely, several smaller nesting areas will be aggregated into a single “site” following historical precedent. Each site is associated with a unique four-digit alphanumeric code, a name, and a specific geographic location. Note that several species may inhabit a single “site”. Data are ingested into the database manually and extensively checked against existing maps and records to ensure consistency. Because of the complexity of the geography and the different naming systems, site names may be changed between a published record and the database.
  2. The database includes data on the number of nests (equivalently, breeding pairs), the number of chicks, or the number of total adults. If multiple measures are available (for example, a count of nests and a count of chicks), the database will include all data points as separate entries. Each data point is associated with information on the survey date, the survey method, an estimate of the accuracy of the data point, and a reference. For accuracy, we follow the five-point scale initially used by Croxall and Kirkwood (1979) and described in detail in Supplementary Materials 1 to Che-Castaldo et al. (2017). If the original data source followed the Croxall and Kirkwood scale or reported an uncertainty that could be translated into that scale, our database includes that precision value. Where no information on uncertainty is provided, we estimate the precision based on information provided in the original account.
  3. Some data are not eligible to be included in this database. Data that are collected at a spatial scale other than the “site” cannot be included. All data must be “site-wide” census estimates. In keeping with our commitment to open-source data, data that are not in the public domain or remain in private collections unavailable to the public are also not included in this database. Data that cannot be verified and/or cannot be unambiguously assigned to a “site” are not included. Data on the presence of breeding pairs absent a population estimate are included only when no survey data exist.
  4. While traditional methods of surveying penguin colonies rely on the direct enumeration of penguin nests by ground-based researchers, more recent surveys have relied on various remote sensing methods. Remotely-piloted aircraft systems (RPAS; also known as drones, quadcopters, or unmanned aerial vehicles) take photographs from a low-altitude flight over the colony and yield photographs sufficient for the individual enumeration of nests, chicks, or adults. While the precision of such counts depends on the timing of the survey and the quality of the imagery, RPAS are capable of providing exceptionally precise counts, which often fall in the high-accuracy category N1. Satellite imagery can identify the areal extent of breeding colonies through the spectral properties of penguin guano, and this extent can be used to estimate the number of breeding pairs at a “site”. Because the precision of such estimates remains an active research question, all such estimates are assigned the lowest accuracy category of N5 (broadly defined as “order-of-magnitude”).
  5. In some cases, data from a single location in a given year are published in multiple venues. However, it is difficult to know whether these multiple outlets represent one actual sampling event because the same event may be associated with different occurrence or abundance records between sources, either because earlier mistakes were corrected or because data were re-analyzed. It also can happen that a single site is surveyed multiple times by separate parties on the same day, arriving at different counts due to observation error. Our approach is to classify an event as a survey conducted at a specific site and time, using a specific sampling protocol, and reported in a specific publication (peer-reviewed article, policy document, thesis, or other type of report), dataset, or personal communication. From each event, there can be one or more occurrence or measurement records, depending on the number of species or life stages reported. In cases where we have information that data from a unique sampling event are published in multiple venues, we retain all records reported in publications and datasets, and drop duplicate records if the duplicate event is based on personal communication. We allow this potential duplication because 1) it is not clear which source is ultimately correct if counts differ between sources and 2) withholding records creates confusion with our end-users as to whether we have overlooked sources of data.
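
The event definition and duplicate-handling rule in step 5 can be sketched in code. This is an illustration, not the project's implementation: the `Record` fields, the source-type labels, and the sample rows are hypothetical, and grouping by site, date, and protocol is only a stand-in for the curators' knowledge that two sources describe the same survey, since separate parties can survey the same site on the same day.

```python
from collections import defaultdict
from dataclasses import dataclass

PERSONAL = "personal communication"


@dataclass(frozen=True)
class Record:
    site: str         # site code (made-up values below)
    date: str         # survey date
    protocol: str     # sampling protocol
    source: str       # where the count was reported
    source_type: str  # "publication", "dataset", or "personal communication"
    species: str
    count_type: str   # nests, chicks, or adults
    count: int


def event_key(record):
    """An event: one survey at a site and time, with one protocol, in one source."""
    return (record.site, record.date, record.protocol, record.source)


def drop_redundant_personal_communications(records):
    """Keep publication and dataset records; drop personal communications
    that duplicate a survey already reported in a publication or dataset."""
    by_survey = defaultdict(list)
    for record in records:
        # Assumption: same site, date, and protocol means the same survey.
        by_survey[(record.site, record.date, record.protocol)].append(record)
    kept = []
    for group in by_survey.values():
        has_formal_source = any(r.source_type != PERSONAL for r in group)
        for r in group:
            if r.source_type == PERSONAL and has_formal_source:
                continue
            kept.append(r)
    return kept


# Made-up rows for illustration only.
records = [
    Record("AAAA", "2015-12-01", "ground count", "Article X", "publication",
           "Pygoscelis adeliae", "nests", 1200),
    Record("AAAA", "2015-12-01", "ground count", "Dataset Y", "dataset",
           "Pygoscelis adeliae", "nests", 1180),
    Record("AAAA", "2015-12-01", "ground count", "pers. comm. Z", PERSONAL,
           "Pygoscelis adeliae", "nests", 1200),
    Record("BBBB", "2016-01-10", "ground count", "pers. comm. Z", PERSONAL,
           "Pygoscelis papua", "chicks", 300),
]

kept = drop_redundant_personal_communications(records)
for r in kept:
    print(event_key(r), r.count)
```

In this example the article and the dataset are both retained even though their counts differ, matching the stated rationale that it is unclear which source is correct, while the personal communication is kept only where it is the sole source for a survey.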

Bibliographic Citations

  1. Che-Castaldo, C., S. Jenouvrier, C. Youngflesh, K.T. Shoemaker, G. Humphries, P. McDowall, L. Landrum, M.M. Holland, Y. Li, R. Ji, and H.J. Lynch, “Pan-Antarctic analysis aggregating spatial estimates of Adélie penguin abundance reveals robust dynamics despite stochastic noise,” Nature Communications 8:832 (2017).
  2. Croxall, J.P., and E. D. Kirkwood. The distribution of penguins on the Antarctic Peninsula and islands of the Scotia Sea. Cambridge: British Antarctic Survey, 1979.
  3. Humphries, G., R. Naveen, M. Schwaller, C. Che-Castaldo, P. McDowall, M. Schrimpf, and H.J. Lynch, “Mapping Application for Penguin Populations and Projected Dynamics (MAPPPD): Data and tools for dynamic management and decision support,” Polar Record 53, 2 (2017): 160–166.
  4. The Scientific Committee on Antarctic Research. (2023). SCAR Report 42 - September 2022 - SCAR Data Policy (2022). Zenodo.

Additional Metadata

Purpose To provide open access penguin population census data to the general public.
Update frequency description Website will be updated annually or semi-annually.
Alternative Identifiers 10.48361/zftxkr