Requires python >= 3.10
- After cloning the repo, run
git submodule initandgit submodule updateto fetch the data from https://github.com/oceans-1876/challenger-data. - Install gnfinder v0.19.
- Install gnverifier v1.0.
- Install Poetry on your system, either globally or in your virtual environments (the former is preferred).
- Run
poetry installto install the project dependencies. - If you just need to run the extractor and do not need the dev dependencies, you can run
poetry install --no-dev. - If you are going to do development work, run
pre-commit installin the project root.
All scripts must be run from project root and as modules, i.e.,
python -m <module>.
| Command | Description |
|---|---|
| create_test_Data.py | Saves a subset of actual data, which can be used with the test database in the API |
| process_stations.py | Updates stations text and species |
| process_summary_report_species_index.py | Extracts the species mentioned in the summary report index |
| update_data_sources.py | Updates Global Names data source info |