Progress in Geography, Volume. 39, Issue 7, 1140(2020)
A collection system for paleoclimate research documents (CSPD) was developed in this study using Python (V3.6) and MySQL (V5.7) on the Linux platform. Meanwhile, 1450 research papers of paleoclimate from the National Climate Data Center (NCDC) were manually selected. The keywords from these papers were classified and, then, a keyword list for the research paper collection was prepared. Using the CSPD with the keyword list, we collected 32493 paleoclimate research papers from ResearchGate. To verify the validity of CSPD with the keyword list, we counted the frequencies of four categories of keywords from the 32493 paleoclimate research papers from ResearchGate and from the 1450 papers from NCDC, respectively. Then, the frequencies from the two document datasets were compared. The four categories of keywords refer to the dimensions of temporal scale, type of proxy data, meteorology factors, and study area. We found that the frequencies of the four categories of keywords match well for the two document datasets. This result suggests that the CSPD together with the keyword list is a valid method and the resulting document dataset represents the status of paleoclimate research. A large number of paleoclimate research documents from ResearchGate would work as a great source of paleoclimate reconstruction results, which have not been fully included by NCDC. The CSPD reached the design objective.
Get Citation
Copy Citation Text
Xuezhen ZHANG, Jun YIN, Mengxin BAI, Yanbo LI, Jingyun ZHENG.
Category:
Received: May. 23, 2019
Accepted: --
Published Online: Jan. 28, 2021
The Author Email: ZHENG Jingyun (zhengjy@igsnrr.ac.cn)