This tool stores functions to automate the construction of Species Distribution Models (SDM). These functions utilize an automated and versatile SDM general routine that comprises several steps. Firstly, they format occurrence data as required and create geographic areas for training and projecting models if desired. Secondly, they crop and mask environmental variables for both current and future conditions. Thirdly, the functions train SDMs using one or more algorithms and assess their performance quantitatively. Finally, the best models are combined using an ensemble approach, and projections can be made for various scenarios as desired.
Before installing and using biomodelos-sdm tool, please ensure that your computer meets the following minimum system requirements:
Operating System: Windows 7 or later (Windows 8, Windows 10, etc.). Please note that biomodelos-sdm tool is not compatible (yet) with Mac or Linux operating systems.
Memory (RAM): Your computer should have a minimum of 8 GB of RAM. However, for optimal performance, we recommend having 16 GB or more.
User Permissions: To install and run biomodelos-sdm tool, you need to have administrator user permissions on your computer. This is necessary to ensure that the software can access all the required system resources and functions seamlessly.
Optional, GIS Software: QGIS (version 3.0 or higher) or ArcGIS (ArcMap or ArcGIS Pro).
Ensuring that your computer meets these system requirements will guarantee a smooth installation and usage experience with biomodelos-sdm tool. If you have any questions or encounter any issues during the setup process, please contact our support team for assistance.
Dependencies to install, choose the version depending on your operating system and version. For example, a windows 10 terminal with more than 4 gigabytes on memory RAM almost always has a 64 bit version of windows. Surf on the web in case of more information.
A guide to install R and R Studio (in spanish) can be found here
Libraries required and their versions
"plyr" version 1.8.6
"dplyr" version 1.0.5
"automap" version 1.0.14
"PresenceAbsence" version 1.1.9
"devtools" version 2.3.2
"CoordinateCleaner" version 2.0.18
"sf" version 0.9.8
"spThin" version 0.2.0
"raster" version 3.4.10
"dismo" version 1.3.3
"biomod2" version 3.4.6
"ENMeval" version 2.0.3
"rgdal" version 1.5.23
"rJava" version 0.9.13
"kuenm" version 1.1.6
Download and uncompress the content of this repository (biomodelos-sdm) and “maxent.jar” file (downloaded previously) there. For better results choose a root directory like “C” or “D” in windows to move the folder (working directory).
Open RStudio and open the project: biomodelos-sdm.Rproj inside
the modelling folder of the repository. It can be achieving doing the
next. First, click on tool bar “File” (upper left of the RStudio
window). Second, “Open Project”. In the opened window, browse into the
computer folder structure until reach the folder uncompressed before in
the step 1 and go to the folder “biomodelos-sdm/modelling/”. Note:
For a bit more experimented users, this step is comparative to setup a
working directory with setwd()
at the cited
folder.
Create in RStudio a new script. It can be achieve going to “File” tool bar, “New File” and then “R Script”. It may well be used the icon “New file” right under the tool bar “File” or using the keyboard shortcut “Ctrl+Shift+N” in windows.
Load the setup function store in the biomodelos-sdm/modelling/. Type in the script editor:
source("setup.R")
do.install(vector.packages)
You only need to install the packages once, so, it is better to block
this command line typing a ‘#’ character in the forefront of the line
just before of the first run, like this
# do.install(vector.packages)
or even erase the line.
do.check(vector.packages)
## package successfully_installed
## 1 plyr TRUE
## 2 dplyr TRUE
## 3 automap TRUE
## 4 PresenceAbsence TRUE
## 5 devtools TRUE
## 6 CoordinateCleaner TRUE
## 7 sf TRUE
## 8 spThin TRUE
## 9 raster TRUE
## 10 dismo TRUE
## 11 biomod2 TRUE
## 12 ENMeval TRUE
## 13 rgdal TRUE
## 14 rJava TRUE
## 15 kuenm TRUE
## 16 terra TRUE
##
## ENMeval version 2.0.3 is TRUE
A message showing a table with column names “package” and “successfully_completed” will be shown in the console (corner left of the RStudio window), as well as the ENMeval version installed. For example,
package successfully_installed
1 plyr TRUE
2 dplyr TRUE
3 automap TRUE
4 PresenceAbsence TRUE
5 devtools TRUE
6 CoordinateCleaner TRUE
7 sf TRUE
8 spThin TRUE
9 raster TRUE
10 dismo TRUE
11 biomod2 TRUE
12 ENMeval TRUE
13 rgdal TRUE
14 rJava TRUE
15 kuenm TRUE
ENMeval version 2.0.3 is TRUE
In case of receiving a FALSE statement on the table or having an
ENMeval version different to 2.0.3, you need to troubleshoot before to
continue. Please refer to the vignettes and installation manual of each
problematic package: kuenm, enmeval,
etc. Then re-run do.check(vector.packages)
do.load(vector.packages)
## Warning: package 'raster' was built under R version 4.2.2
## [1] "ok"
do.folder.structure(clim.datasets = "worldclim")
The use of the character “worldclim”” inside the function does not refer to retrieve the data from the repository. It is only a way to create an organized a framework inside the working directory in which you may store the environmental variables and occurrence data of species downloaded manually or using automatized tools.
After run the function you will have in your working directory 3 new folders with subfolders:
source("R/fit_biomodelos.R")
Your folder structure must look like this:
Your RStudio window must look like this:
Now you are ready to customize the biomodelos-sdm tool and run SDM models. As mentioned earlier, you will require two additional fundamental components: environmental variables and georeferenced occurrence data for one or multiple species. We strongly recommend following the subsequent section, which outlines the structure and attributes of these elements, along with essential information to guide you through the execution and understanding of this application. Throughout various practical examples, we will model the distribution of five endemic bird species in Colombia, utilizing both the standard methods and selected advanced features of this tool to enhance your proficiency. Also, you will learn about the theory and logic shown by this software step by step.
Kramer-Schadt, S., Niedballa, J., Pilgrim, J.D., Schröder, B., Lindenborn, J., Reinfelder, V., Stillfried, M., Heckmann, I., Scharf, A.K., Augeri, D.M., Cheyne, S.M., Hearn, A.J., Ross, J., Macdonald, D.W., Mathai, J., Eaton, J., Marshall, A.J., Semiadi, G., Rustam, R., Bernard, H., Alfred, R., Samejima, H., Duckworth, J.W., Breitenmoser-Wuersten, C., Belant, J.L., Hofer, H. and Wilting, A. (2013), The importance of correcting for sampling bias in MaxEnt species distribution models. Diversity Distrib., 19: 1366-1379. https://doi.org/10.1111/ddi.12096
Peterson, A., Soberón, J., G. Pearson, R., Anderson, R., Martínez-Meyer, E., Nakamura,M., y Araújo, M. (2011) Ecological Niches and Geographic Distributions, tomo 49. 360 pp.
Velásquez-Tibata, J. I., Olaya-Rodríguez, M. H., López-Lozano, D. F., Gutierrez, C., Gonzales, I., & Londoño-Murcia, M. C. 2019. Biomodelos: a collaborative online system to map species distributions. Plos One, 14(3), e0214522. https://doi.org/10.1371/journal.pone.0214522
Aiello-Lammens, M. E., Boria, R. A., Radosavljevic, A. , Vilela, B. and Anderson, R. P. (2015). spThin: an R package for spatial thinning of species occurrence records for use in ecological niche models. Ecography, 38: 541-545. URL https://onlinelibrary.wiley.com/doi/10.1111/ecog.01132.
Cobos ME, Peterson AT, Barve N, Osorio-Olvera L. (2019) kuenm: an R package for detailed development of ecological niche models using Maxent PeerJ, 7:e6281 URL http://doi.org/10.7717/peerj.6281
Freeman, E. A. and Moisen, G. (2008). PresenceAbsence: An R Package for Presence-Absence Model Analysis. Journal of Statistical Software, 23(11):1-31. http://www.jstatsoft.org/v23/i11
Hadley Wickham (2011). The Split-Apply-Combine Strategy for Data Analysis. Journal of Statistical Software, 40(1), 1-29. URL http://www.jstatsoft.org/v40/i01/
Hadley Wickham, Jim Hester and Winston Chang (2021). devtools: Tools to Make Developing R Packages Easier. R package version 2.4.2. https://CRAN.R-project.org/package=devtools
Hadley Wickham, Romain François, Lionel Henry and Kirill Müller (2021). dplyr: A Grammar of Data Manipulation. R package version 1.0.7. https://CRAN.R-project.org/package=dplyr
Hiemstra, P.H., Pebesma, E.J., Twenhofel, C.J.W. and G.B.M. Heuvelink, 2008. Real-time automatic interpolation of ambient gamma dose rates from the Dutch Radioactivity Monitoring Network. Computers & Geosciences, accepted for publication.
Muscarella, R., Galante, P.J., Soley-Guardia, M., Boria, R.A., Kass, J., Uriarte, M. and R.P. Anderson (2014). ENMeval: An R package for conducting spatially independent evaluations and estimating optimal model complexity for ecological niche models. Methods in Ecology and Evolution.
Osorio-Olvera L., Lira‐Noriega, A., Soberón, J., Townsend Peterson, A., Falconi, M., Contreras‐Díaz, R.G., Martínez‐Meyer, E., Barve, V. and Barve, N. (2020), ntbox: an R package with graphical user interface for modeling and evaluating multidimensional ecological niches. Methods Ecol Evol. 11, 1199–1206. doi:10.1111/2041-210X.13452. https://github.com/luismurao/ntbox
Pebesma, E., 2018. Simple Features for R: Standardized Support for Spatial Vector Data. The R Journal 10 (1), 439-446, https://doi.org/10.32614/RJ-2018-009
Robert J. Hijmans, Steven Phillips, John Leathwick and Jane Elith (2020). dismo: Species Distribution Modeling. R package version 1.3-3. https://CRAN.R-project.org/package=dismo
Robert J. Hijmans (2021). raster: Geographic Data Analysis and Modeling. R package version 3.4-13. https://CRAN.R-project.org/package=raster
Roger Bivand, Tim Keitt and Barry Rowlingson (2021). rgdal: Bindings for the ‘Geospatial’ Data Abstraction Library. R package version 1.5-23. https://CRAN.R-project.org/package=rgdal
Simon Urbanek (2021). rJava: Low-Level R to Java Interface. R package version 1.0-4. https://CRAN.R-project.org/package=rJava
Thuiller Wilfried ; Georges Damien; Gueguen Maya; Engler Robin and Breiner Frank (2021). biomod2: Ensemble Platform for Species Distribution Modeling. R package version 3.5.1. https://CRAN.R-project.org/package=biomod2
Zizka A, Silvestro D, Andermann T, Azevedo J, Duarte Ritter C, Edler D, Farooq H, Herdean A, Ariza M, Scharn R, Svanteson S, Wengstrom N, Zizka V, Antonelli A (2019). “CoordinateCleaner: standardized cleaning of occurrence records from biological collection databases.” Methods in Ecology and Evolution, -7. doi: 10.1111/2041-210X.13152 (URL: https://doi.org/10.1111/2041-210X.13152), R package version 2.0-18, <URL: https://github.com/ropensci/CoordinateCleaner>.
María Helena Olaya, corporate mail, personal mail
Gabriel Alejandro Perilla Suarez, corporate mail, personal mail
Héctor Manuel Arango Martínez, corporate mail, personal mail
Cristian Alexander Cruz Rodriguez, corporate mail, personal mail
Luis Hernando Romero Jiménez, corporate mail, personal mail
Andrés Felipe Suárez Castro, personal mail
*This development is supported by National Geographic Society Grant number NGS-86896T-21, project “Developing an integrated species distribution modelling system to identify complementary conservation areas in Colombia”. Project information** * Initial phases of the project were supported by Natural map.
This project is licensed under the MIT License - see the License file for details.