split_occurrence_data - CLI interface

Group and split occurrence data from one or more sources so that like-records (ex. species) can be processed together.

split_occurrence_data [-h] [--config_file CONFIG_FILE] [--log_filename LOG_FILENAME]
                      [--log_console] [-r REPORT_FILENAME] [-m [1 - 500]] [-k KEY_FIELD]
                      [-of OUT_FIELD] [--dwca DWCA DWCA] [--csv CSV CSV CSV CSV CSV]
                      [--species_list_filename SPECIES_LIST_FILENAME]
                      out_dir

split_occurrence_data positional arguments

  • out_dir - Directory where the output data should be written. (default: None)

split_occurrence_data optional arguments

  • -h, --help - show this help message and exit

  • --config_file CONFIG_FILE - Configuration file containing script arguments. (default: None)

  • --log_filename LOG_FILENAME, -l LOG_FILENAME - A file location to write logging data. (default: None)

  • --log_console - If provided, write log to console.

  • -r REPORT_FILENAME, --report_filename REPORT_FILENAME - File location to write the wrangler report. (default: None)

  • -m [1 - 500], --max_open_writers [1 - 500] - The maximum number of data writers to have open at once. Too many open files can cause errors. Default: 100. (default: 100)

  • -k KEY_FIELD, --key_field KEY_FIELD - A field to use to determine writer key. Multiple values can be used to utilize multiple fields. (default: None)

  • -of OUT_FIELD, --out_field OUT_FIELD - Include this field in the outputs. If not provided, all fields from the first point ready for output will be used. (default: None)

  • --dwca DWCA - A Darwin-Core Archive to process and associated wrangler configuration. (default: None)

  • --csv CSV - A CSV file to process, an associated wrangler configuration file, a species header key, an x header key, and a y header key. (default: None)

  • --species_list_filename SPECIES_LIST_FILENAME - File location to write list of species seen (after wrangling). (default: None)