OneDrive folder structure
.
├── code/
│   └── Splitter/
│       ├── ...
│       └── split_script.R
├── data/
│   ├── ...
│   ├── all_sources_to_extract/
│   ├── backups/
│   ├── new_sources/
│   ├── old_files/
│   ├── status_mapping_sources/
│   └── tetrapods_extracted_sources/
├── docs/
├── raw_data/
└── README.docx
Folders (data/)
all_sources_to_extract | all_sources_to_extract
This directory contains all the source files (PDFs, etc.), named following the naming convention of the first metadata effort (<ISO3_taxa_year.pdf>). Be aware that these names do not match any current field in the data extraction template.
new_sources | new_sources
This directory stores the new sources that people send us or that we find during the extraction process.
old_files | old_files
This directory is an archive of old files.
status_mapping_sources | status_mapping_sources
This directory stores the IUCN mapping sources (PDF + table, named after statusMappingSourceID).
tetrapods_extracted_sources | tetrapods_extracted_sources
This directory contains all the extracted tetrapod sources (named after sourceID).
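Because the legacy file names in all_sources_to_extract do not correspond to any field in the extraction template, it can help to decompose them explicitly when cross-referencing sources. The following is a minimal sketch only. It assumes that <ISO3_taxa_year.pdf> means a three-letter uppercase country code, a taxa label, and a four-digit year separated by underscores. The function name, the regular expression, and the example file names are illustrative and not part of the project.

```python
import re
from typing import NamedTuple, Optional

# Assumed reading of <ISO3_taxa_year.pdf>: three uppercase letters,
# a taxa label (which may itself contain underscores), a four-digit year.
LEGACY_NAME = re.compile(r"^(?P<iso3>[A-Z]{3})_(?P<taxa>.+)_(?P<year>\d{4})\.pdf$")


class LegacyName(NamedTuple):
    iso3: str
    taxa: str
    year: int


def parse_legacy_name(filename: str) -> Optional[LegacyName]:
    """Return the parts of a legacy file name, or None if it does not fit the assumed pattern."""
    match = LEGACY_NAME.match(filename)
    if match is None:
        return None
    return LegacyName(match["iso3"], match["taxa"], int(match["year"]))


if __name__ == "__main__":
    # Hypothetical file names, used only to illustrate the pattern.
    for name in ("BRA_birds_2014.pdf", "notes.docx"):
        print(name, "->", parse_legacy_name(name))
```

Files that do not fit the assumed pattern return None, so they can be listed and checked by hand rather than silently skipped.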
Files (most important)
tetrapods_sources_to_extract | data/regred_tetrapods_sources_to_extract
This table lists all the tetrapod sources; data extractors use it as the reference for the extraction process. It was generated from our metadata table (Kadlec et al., 2025) by keeping only the sources that concern tetrapods. The content of this table should never be edited, except for the fields: dataExtractor, progress, redlistTypeOfContent, extractionType, numberOfRedlists, dateStarted, fullyExtracted, extractionRemarks.
data_extraction_template | data/regred_data_extraction_template
This is the read-only Excel file that data extractors use as the template for data extraction.
geographic_entities | data/regred_geographic_entities (https://regred-project.github.io/geographic_entities/)
This is the local copy of the standard geographic locations that data extractors use as a reference for filling in the location table. Only the web version should be used.
data_codebooks | data/regred_data_codebooks
This read-only Excel file stores all the controlled vocabulary of the database.
mapping_sources_template_xlsx | data/status_mapping_sources/regred_mapping_sources_template
This is the read-only Excel file that data extractors use as the template for generating mapping sources for conservation status.
mapping_sources_template_docx | data/status_mapping_sources/regred_mapping_sources_template
This is the read-only Word file that data extractors use as the template for generating mapping sources for conservation status. Extractors should save the resulting document as a PDF.
mapping_sources_used | data/status_mapping_sources/regred_mapping_sources_used
This is the Excel file that stores all the mapping sources created to translate conservation statuses.
new_sources | data/new_sources/new_sources
This is the Excel file in which data extractors record information about any new source (one not found in tetrapods_sources_to_extract).
split_script | code/Splitter/split_script
This is an R script that automatically divides Excel files containing extracted sources of mixed classes into multiple class-specific files.
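The idea behind the splitter can be illustrated with a short sketch. This is not the project's split_script.R, which is written in R and works on Excel files. It is a plain Python illustration of the grouping step, assuming each extracted row carries its taxonomic class in a column called "class" (a hypothetical column name) and that one output file would be written per group.

```python
from collections import defaultdict
from typing import Dict, Iterable, List

Row = Dict[str, str]


def split_by_class(rows: Iterable[Row], class_field: str = "class") -> Dict[str, List[Row]]:
    """Group rows by the value of class_field, keeping the original row order within each group.

    The column name "class" is an assumption made for this sketch.
    """
    groups: Dict[str, List[Row]] = defaultdict(list)
    for row in rows:
        groups[row[class_field]].append(row)
    return dict(groups)


if __name__ == "__main__":
    # Illustrative rows only; a real extraction table has many more fields.
    mixed = [
        {"species": "Panthera onca", "class": "Mammalia"},
        {"species": "Harpia harpyja", "class": "Aves"},
        {"species": "Tapirus terrestris", "class": "Mammalia"},
    ]
    for class_name, class_rows in split_by_class(mixed).items():
        print(class_name, len(class_rows))
```

Each resulting group would then be written to its own file, so that every output contains a single class.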
Others
- RegRed project on GitHub: https://github.com/RegRed-project
- RegRed Zotero library.