Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Definition of biological data

Sensitive biological data comes from two specific areas:

Nature conservation and biodiversity data

Conservation and biodiversity data could be related to endangered species, protection regulations, international environmental agreements, time and location sensitive disturbances such as during breeding periods.

These organisms often appear on a sensitive species lists such as the National Biodiversity Network Atlas sensitive species list for the UK that has been approved by the UK’s statutory bodies and relevant experts. There could also be species identified on Biodiversity Action Plans (BAP) or red lists of endangered species. For example the Butterfly Red list of Great Britain or the International Union for Conservation of Nature’s Red List of Threatened Species (IUCN Red list).

Being on these lists does not necessarily mean that data about these species cannot be shared openly as some organisms may be less harmed if more information is shared about them. An individual risk assessment must be done in each case.

However, some examples of threats that could result from sharing biological data and therefore this data needing to be restricted are:

Biosafety

Certain organisms can pose a direct threat to humans, animals, and plants such as diseases or pests. This can also include newly developed organisms such as genetically-modified plants. The whereabouts and the methods of development of these organisms may be considered sensitive. At the same time, it must be considered that it may be important to share this data openly to prevent harm.

Some biosafety data may fall under the confidential data category due to a need to protect the commercial interests of these developed organisms. However, this unnecessarily limits transparency and public peer review of these datasets, which are then submitted to regulatory authorities Nielsen (2013).

Benefits of opening up sensitive biological data

There are clear benefits to making biological data freely available, including for not-for-profit decision making, education, research and other public benefit. Therefore data should only be restricted when harm to the environment, harm to people, or harm to sustainability is possible.

Some types of biological data, particularly related to biodiversity, should not be shared openly. These data should be kept partially or completely private. However, attempts should be made to provide access to researchers or individuals who can prove a need for these data such as reuse for research purposes or to peer review research. Researchers should conduct an assessment of the risks versus benefits of publishing biological data to decide whether (part of the) data can be shared.

Each country will have specific legal restrictions for rare and endangered species, as well as biosafety regulations, that have to be taken into account for each dataset.

The majority of the biodiversity community feel there are more benefits publishing open datasets as its future reuse could lead to greater conservation opportunities, promote community engagement and reduce duplication of survey efforts Tulloch et al. (2018).

Opening up sensitive biological data

The types of data that need to be considered sensitive include occurrence data surrounding the exact location of rare, endangered or commercially valuable organisms.

In order to protect sensitive biological data, several measures can be taken:

Current best practice is discussed by Chapman (2020) pdf, who sets out 10 principles for generalising sensitive species occurrence data to enable data sharing.

Examples of projects and issues with biological data

References
  1. Nielsen, K. M. (2013). Biosafety Data as Confidential Business Information. PLOS Biology, 11(3), 1–6. 10.1371/journal.pbio.1001499
  2. Tulloch, A. I. T., Auerbach, N., Avery-Gomm, S., Bayraktarov, E., Butt, N., Dickman, C. R., Ehmke, G., Fisher, D. O., Grantham, H., Holden, M. H., Lavery, T. H., Leseberg, N. P., Nicholls, M., O’Connor, J., Roberson, L., Smyth, A. K., Stone, Z., Tulloch, V., Turak, E., … Watson, J. E. M. (2018). A decision tree for assessing the risks and benefits of publishing biodiversity data. Nature Ecology & Evolution, 2(8), 1209–1217. 10.1038/s41559-018-0608-1
  3. Thorpe, D. E., van den Berk, M., Flohr, P., Grootveld, M., Hesam, A., Van Der Meer, L., & Kalamkar, M. (2026). Guidebook: Hard-to-Share Data in the Social Sciences and Humanities. 10.5281/ZENODO.17588795
  4. Caspari, G., dos Santos Manuel, J., Gago-Silva, A., & Jendryke, M. (2024). Employing discrete global grid systems for reproducible data obfuscation. Scientific Data, 11(1). 10.1038/s41597-024-03354-5
  5. Chapman, A. (2020). Current Best Practices for Generalizing Sensitive Species Occurrence Data. 10.15468/DOC-5JP4-5G10