1 Introduction
1.1 Altmetric studies related to policy documents
1.2 Overton as an emerging data source for policy metrics
Figure 1. Screenshots of the Overton web interface: (a) an example of search results and (b) a detail page of a specific policy document. |
1.3 Objectives of the study
2 Data and methods
2.1 Dataset of policy documents
Table 1. Bibliographic metadata elements from Overton-indexed policy documents. |
| Metadata type | Definition |
|---|---|
| Publication date | The date when the policy documents were published. |
| Policy source | The organizations or entities from which policy documents are collected. |
| Source type | The categories of policy sources. Overton classifies policy sources into three main types: “government”, “think tank” and “intergovernmental organization (IGO)” (Overton, 2024h). Additionally, Overton tracks policy documents from “other” sources, including open repositories and non-governmental organizations (NGOs). |
| Source country | The countries associated with the policy sources. Notably, IGOs and the European Union (EU) are listed separately. |
| Document language | The languages in which the PDF files of policy documents are written. A single policy document may be released in multiple PDF versions across different languages. ① |
| Subject area | The subject areas of policy documents. For documents written in certain languages (e.g., English, French, Spanish), Overton assigns subject areas by matching phrases and entities extracted from the full text with examples from each category in the International Press Telecommunications Council (IPTC)’s Media Topics controlled vocabulary (https://iptc.org/standards/media-topics/) ② (Overton, 2024a). |
| Topics | The main themes of policy documents. Overton assigns topics to documents written in certain languages (e.g., English, French, Spanish) by matching phrases and entities extracted from the full text with the titles of Wikipedia pages (Overton, 2024a). |
Table 2. Reference metadata elements from Overton-indexed policy documents. |
| Metadata type | Definition |
|---|---|
| Scholarly references | Citations from policy documents to academic literature (i.e., scholarly papers referenced within policy documents). Overton identifies and formats these references by extracting elements from potential reference strings - such as sources, titles, and publication years - from the full text of policy documents, then searching Crossref to retrieve the DOIs of the cited scholarly works (Overton, 2024d, 2024c). |
| Policy references | Citations from one policy documents to another (i.e., policy documents referenced within other policy documents). Overton identifies and formats these references using a method similar to that employed for scholarly references. Policy references are obtained by matching elements of potential reference strings extracted from the full text with indexed policy documents in the Overton database (Overton, 2024d, 2024c). |
2.2 Indicators and analytic approaches
Table 3. Descriptive statistics of scholarly and policy references in policy documents. |
| Indicator | Scholarly references | Policy references |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 14,633 | 1,352 |
| 25th percentile | 0 | 0 |
| 50th percentile (median) | 0 | 0 |
| 75th percentile | 0 | 0 |
| 90th percentile | 0 | 1 |
| 99th percentile | 32 | 9 |
| Arithmetic mean | 1.36 | 0.45 |
| Standard deviation | 17.47 | 3.22 |
| Skewness | 154.16 | 46.51 |
| Kurtosis | 67,556.53 | 8,123.57 |
Table 4. Total, coverage, and average of references for the overall dataset. |
| Indicator | Scholarly references | Policy references |
|---|---|---|
| Total | 23,745,211 | 7,806,381 |
| Coverage | 7.7% | 10.6% |
| Average | 0.17 | 0.14 |
3 Results
3.1 Presence of references over publication years
Figure 2. Presence of references in policy documents over publication years, presented through three perspectives: (a) total, (b) coverage, and (c) average. |
3.2 Presence of references across policy sources
Figure 3. Distribution of policy documents across the four source types, and the top 10 policy sources contributing the highest number of documents within each source type. For each of the top 10 sources, the proportion of policy documents relative to the total for that source type is also presented. |
Figure 4. Presence of references in policy documents across policy sources, presented through three perspectives: (a) total, (b) coverage, and (c) average. |
3.3 Presence of references across countries and languages
Figure 5. Presence of references in policy documents from the top 10 countries/regions with the highest number of policy documents, presented through three perspectives: (a) total, (b) coverage, and (c) average. |
Figure 6. Presence of references in policy documents across the top 10 most used languages, presented through three perspectives: (a) total, (b) coverage, and (c) average. |
3.4 Presence of references across subject areas
Figure 7. Presence of references in policy documents across subject areas, presented through three perspectives: (a) total, (b) coverage, and (c) average. |
3.5 Presence of references across policy topics
Figure 8. (a) Co-word network of topics in Overton-indexed policy documents; overlay visualizations of topics colored based on scores reflecting (b) the coverage of scholarly references and (c) the coverage of policy references. In the overlay visualizations, scores of topics are normalized using the “divide by mean” function in VOSviewer. The redder a topic node, the higher the coverage of scholarly or policy references in the policy documents related to that topic, relative to all topics in the map. |
Figure 9. Distribution of the coverage of scholarly and policy references across topics within the six clusters. |


