History of the Constituent Databases#
Global Pollen Database#
In an early effort, the Cooperative Holocene Mapping Project 12 assembled pollen data in the 1970s and 1980s to test climate models. Although data-model comparison was the principal objective of the COHMAP project, the synoptic analyses of the pollen data, particularly maps showing the constantly shifting ranges of species in response to climate change, were revelatory and led to much ecological insight 345.
The COHMAP pollen "database" was a set of flat files with a fixed file format for data and for chronologies. FORTRAN programs were written to read these files and to assemble data for particular analyses. Thompson Webb III managed the COHMAP pollen database at Brown University, but as the quantity of data increased, data management became increasingly cumbersome; the data needed to be migrated to a relational database management system. Discussions with E. C. Grimm led to the initiation of the North American Pollen Database (NAPD) in 1990.
At the same time in , the International Geological Correlation Project IGCP 158 6 was conducting a major collaborative synthesis of paleoecological data, primarily of pollen, making the need for a pollen database painfully obvious. In the forward to the book resulting from this project 7, J.L. de Beaulieu describes the role that the IGCP 158 project had in launching the European Pollen Database 8. A workshop to develop a European Pollen Database (EPD) was held in in 1. North American representatives also attended, and the organizers of NAPD and EPD commenced a long-standing collaboration to develop compatible databases. NAPD and EPD held several joint workshops and developed the same data structure. Nevertheless, the two databases were independently established, partly because Internet capabilities were not yet sufficient to easily manage a merged database. The pollen databases were developed in Paradox, which at the time was the most powerful RDBMS readily available for the PC platform. NAPD and EPD established two important protocols:
- the databases were relational and queryable
- they were publicly available.
As the success the NAPD-EPD partnership escalated, working groups initiated pollen databases for other regions, including the Latin American Pollen Database (LAPD) in 1994 9, the Pollen Database for and the Russian Far East (PDSRFE) in 1995, and the African Pollen Database (APD) in 1996 10. At its initial organizational workshop, LAPD opted to merge with NAPD, rather than develop a standalone database, and the Global Pollen Database was born. PDSRFE also followed this model. APD developed independently, but uses the table structure of GPD and EPD. Pollen database projects have also been initiated in other regions, and the GPD contains some of these data, including the Indo-Pacific Pollen Database 11 and the Japanese Pollen Database.
SELECT ROUND(age::bigint, -2) AS age, COUNT(*) AS n
FROM ndb.sampleages
WHERE age IS NOT NULL
GROUP BY ROUND(age::bigint, -2);
ggplot(sampleroundcount, aes(x = age, y = n)) +
geom_histogram(stat="identity", width=500) +
scale_x_continuous() +
theme_bw() +
xlab('Years Before Present') +
ylab('Age Count')
The pollen databases contain data from the Holocene, Pleistocene, and Pliocene, although most data are from the last 20,000 years (Fig. \@ref(fig:agedistributions)). Included are fossil data, mainly from cores and sections, and modern surface samples, which are essential for calibrating fossil data. NAPD data are not separate from the GPD, but rather the NAPD is the North American subset of GPD. EPD has both public and restricted data -- a concession that had to be made early on to assuage some contributors.
North American Plant Macrofossil Database#
Plant macrofossils include plant organs generally visible to the naked eye, including seeds, fruits, leaves, needles, wood, bud scales, and megaspores. Synoptic-scale mapping of plant macrofossils from modern assemblages 12 and fossil assemblages 121314 have shown the utility of plant macrofossils in providing spatially and taxonomically precise reconstructions of past species ranges. Although plant macrofossil records are spatially precise, synoptic networks of high-quality sites can scale up to yield aggregate views of past distributions 12. In addition, macrofossils, with their greater taxonomic resolution, augment the pollen data by providing information on which species might have been present, and can resolve issues of long-distance transport 15.
The North American Plant Macrofossil Database (NAPMD) has been directed by S.T. Jackson at the . Highest priority has been placed on data from the last 30,000 years, although some earlier Pleistocene and late Pliocene data are included. The database originated as a research database for selected taxa from Late Quaternary sediments of eastern North America 12. In 1994, an effort was initiated with NOAA funding to build on this foundation to develop a cooperative, relational database comprising a longer time span, and all plant taxa.
The structure of NAPMD was adapted from the pollen database and was also stored in a Paradox file format. Although the plant macrofossil database was well served by the data model structure, modifications were made to accommodate different organs from the same species and to deal with the various quantitative measures of abundance. The Plant Macrofossil database also included surface samples, which were not part of the pollen databases at the time, but are useful for the interpretation of fossil data.
FAUNMAP#
R.W. Graham, E.L. Lundelius, Jr., and a group of Regional Collaborators organized a project to develop a database for late Quaternary faunal data from the , which the U.S. NSF funded in 1990. This project had a research agenda, and its seminal paper focused on the individualistic behavior displayed by animal species 16.
Two FAUNMAP databases exist, FAUNMAP I and FAUNMAP II. Both databases were coordinated by R. W. Graham and E. L. Lundelius, Jr. and funded by NSF. Both are relational databases for fossil mammal sites. The data were extracted from peer-reviewed literature, selected theses and dissertations, and selected contract reports for both paleontology and archaeology (all data is currently contained within the Neotoma publications tables). Unpublished collections were not included. Data were originally captured in Paradox but were later migrated to Microsoft Access.
FAUNMAP I contains data from sites in the continental United States (the lower 48 States) that date between 500 BP and \~40,000 BP. Funding for FAUNMAP I ended in 1994, with the production of two major publications by the FAUNMAP Working Group 1617, along with publications from individual members and many others who accessed the database on-line. Graham and Lundelius continued the FAUNMAP project, developing FAUNMAP II with funding from NSF beginning in 1998. FAUNMAP II 18 shares the same structure as FAUNMAP I but expands the spatial coverage to include and and extends the temporal coverage to the Pliocene (5 Ma). In addition, sites published since 1994, when FAUNMAP I was completed, have been added for the contiguous 48 States. In all, FAUNMAP I and II contain more than 5000 fossil-mammal sites with more than 600 mammal species for all of North America north of Mexico that range in age from 0.5 ka to 5 Ma.
The detailed structure of the FAUNMAP database is described in FAUNMAP Working Group 16. Sites identified by name and location were subdivided into Analysis Units (AU's), which varied from site to site depending upon the definitions used in the original publications (e.g., stratigraphic horizons, cultural horizons, excavation levels, biostratigraphic zones). All data (i.e., taxa identified, and counts of individual specimens) and metadata (sediment types, depositional environments, facies, radiometric and other geochronological dates, modifications of bone) were associated with the individual AUs. This structure -- analysis units within sites -- allows for information to be extracts at the site level, or at the smallest sample subdivision. The analysis unit permits fine-scale temporal resolution and analysis. Similar to the GPD and NAPMD, FAUNMAP contains archival and research tables. Similar to the plant macrofossil database, FAUNMAP contains a variety of quantitative measures of abundance (e.g., MNI, NISP), and presence data are more commonly used for analysis.
BEETLE#
Many beetles have highly specific ecological and climatic requirements and are valuable indicators of past environments 192021. Coleoptera is one of the most diverse groups of organisms on Earth, and of the insects, perhaps the most commonly preserved as fossils. Allan Ashworth has assembled a database of fossil beetles from . The data, which were recorded in Excel, contain 5523 individual records of 2567 taxa from 199 sites and 165 publications. Metadata include site name, latitude and longitude, lithology of sediment, absolute age, and geological age. The basic data are similar to plant and mammal databases -- lists of taxa from sites. The metadata have not been recorded to the extent of the other databases, especially chronological data, but Ashworth has resolved the taxonomic issues and has assembled the publications, so that the additional metadata can be easily pulled together.
-
COHMAP Members. Climatic changes of the last 18,000 years: observations and model simulations. Science, 24:1043–1052, 1988. doi:10.1126/science.241.4869.1043. ↩
-
Jr. Wright, H. E., J. E. Kutzbach, III Webb, T., W. F. Ruddiman, F. A. Street-Perrott, and P. J Bartlein. Global Climates since the Last Glacial Maximum. University of Minnesota Press, Minneapolis, 1993. doi:10.5860/choice.32-0947. ↩
-
III Webb, T. The past 11,000 years of vegetational change in eastern north america. Bioscience, 31:501–506, 1981. doi:10.2307/1308492. ↩
-
III Webb, T. The appearance and disappearance of major vegetational assemblages: long-term vegetational dynamics in eastern north america. Vegetatio, 69:177–187, 1987. doi:10.1007/978-94-009-4061-1\_18. ↩
-
III Webb, T. Glacial and Holocene vegetation history: Eastern North America, pages 385–414. Kluwer Academic, Dordrecht, 1988. doi:10.1111/j.1467-8470.1991.tb00724.x. ↩
-
Michael Church. Igcp project 158: paleohydrology of the temperate zone during the last 15 000 years: introduction. Canadian Journal of Earth Sciences, 26:1809–1810, 1989. URL: https://api.semanticscholar.org/CorpusID:130751808, doi:10.1139/E89-153. ↩
-
H.J.B. Birks and \Herbert E.\ Wright. Palaeoecological events during the last 15 000 years: regional synthesis of palaeoecological studies of lakes and mires in Europe. Wiley, 1996. ISBN 0471958409. Copyright: Copyright 2004 Elsevier Science B.V., Amsterdam. All rights reserved. ↩
-
B. Huntley and I. C. Prentice. Holocene vegetation and climates of Europe, pages 136–168. University of Minnesota Press, Minneapolis, 1993. doi:10.1007/978-1-4020-4411-3\_106. ↩
-
Suzette G A Flantua, Henry Hooghiemstra, Eric C Grimm, Hermann Behling, Mark B Bush, Catalina González-Arango, William D Gosling, Marie-Pierre Ledru, Socorro Lozano-García, Antonio Maldonado, Aldo R Prieto, Valentí Rull, and John H Van Boxel. Updated site compilation of the latin american pollen database. Rev. Palaeobot. Palynol., 223:104–115, December 2015. doi:10.1016/j.revpalbo.2015.09.008. ↩
-
Anne-Marie Lézine, Sarah J Ivory, William D Gosling, and Louis Scott. The african pollen database (APD) and tracing environmental change: state of the art. In Quaternary Vegetation Dynamics – The African Pollen Database, pages 5–12. CRC Press, London, November 2021. doi:10.1201/9781003162766-2. ↩
-
Annika V Herbert, Simon G Haberle, Suzette G A Flantua, Ondrej Mottl, Jessica L Blois, John W Williams, Adrian George, and Geoff S Hope. The Indo–Pacific pollen database – a neotoma constituent database. Clim. Past, 20(11):2473–2485, November 2024. doi:10.5194/cp-20-2473-2024. ↩
-
S. T. Jackson, J. T. Overpeck, III Webb, T., S. E. Keattch, and K. H. Anderson. Mapped plant-macrofossil and pollen records of late quaternary vegetation change in eastern north america. Quaternary Science Reviews, 16:1–70, 1997. doi:10.1016/s0277-3791(96)00047-9. ↩↩↩↩
-
S. T. Jackson, R. S. Webb, K. H. Anderson, J. T. Overpeck, III Webb, T., J. W. Williams, and B. C. S. Hansen. Vegetation and environment in eastern north america during the last glacial maximum. Quaternary Science Reviews, 19:489–508, 2000. doi:10.1016/s0277-3791(99)00093-1. ↩
-
S. T. Jackson and R. K. Booth. The role of late holocene climate variability in the expansion of yellow birch in the western great lakes region. Diversity and Distributions, 8(5):275–284, 2002. doi:10.1046/j.1472-4642.2002.00152.x. ↩
-
H. H. Birks. The importance of plant macrofossils in the reconstruction of lateglacial vegetation and climate: examples from scotland, western norway, and minnesota, usa. Quaternary Science Reviews, 22:453–473, 2003. doi:10.1016/s0277-3791(02)00248-2. ↩
-
FAUNMAP Working Group. FAUNMAP: A Database Documenting Late Quaternary Distributions of Mammal Species in the United States. Volume XXV, No. 1 of Scientific Papers. Illinois State Museum, Springfield, IL, 1994. doi:10.5749/j.ctttt09b. ↩↩↩
-
FAUNMAP Working Group. Spatial response of mammals to late quaternary environmental fluctuations. Science, 272:1601–1606, 1996. doi:10.1126/science.272.5268.1601. ↩
-
R.W. Graham and Jr. E.L. Lundelius. Faunmap ii: new data for north america with a temporal extension for the blancan, irvingtonian and early rancholabrean. 2010. URL: https://ucmp.berkeley.edu/faunmap, doi:10.7312/wood13040-009. ↩
-
A V Morgan, A Morgan, A C Ashworth, and J V Matthews, Jr. Late wisconsin fossil beetles in north america. In S C Porter, editor, Late Quaternary Environments of the United States, volume 1, pages 354–363. University of Minnesota Press, 1983. doi:10.1016/b978-0-323-99931-1.00064-7. ↩
-
Allan C Ashworth. Geological Perspectives of Global Climate Change, chapter Perspectives on Quaternary Beetles and Climate Change. Volume 47 of AAPG Studies in Geology. American Association of Petrolium Geologists, 2001. doi:10.1306/st47737. ↩
-
A. C. Ashworth. The Quaternary Period in the United States, chapter Quaternary Coleoptera of the United States and Canada. Volume 1 of Developments in Quaternary Science. Elsevier, Amsterdam, The Netherlands, 2004. doi:10.1002/gea.20072. ↩