2. Running
This chapter is split into two sections, one covering the practical aspects of running the model and the other covering the more conceptual aspects.
2.1. Practical
The following sections outline the steps you need to take, and the commands you will need, to run the software and the associated pre- and post-processing tools.
2.1.1. Getting Started
Download
Download the source code from GitHub: https://github.com/uaf-arctic-eco-modeling/dvm-dos-tem.git or clone the code using git:
git clone git@github.com:uaf-arctic-eco-modeling/dvm-dos-tem.git
It is also possible to download just the code for a specific release from the Releases page on GitHub.
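For example, to clone only the code at a specific release tag (the tag name here is a placeholder):
$ git clone --depth 1 --branch <release-tag> git@github.com:uaf-arctic-eco-modeling/dvm-dos-tem.git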
Dependencies
To see the list of libraries that must be installed for dvmdostem to compile and run, look at the install commands in the project’s Dockerfiles.
While the supporting scripts don’t need compilation, they require a number of supporting libraries and software packages to be installed. Again, the install commands and comments in the Dockerfiles show the dependencies.
Compile
The dvmdostem program must be compiled from C++ source code. If you have the dependencies installed and accessible on your PATH, then you can compile using make, executed from the root of the project. There is also an SCons file, and the command scons can be used to compile.
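For example, from the root of the project (either command should produce the dvmdostem binary):
$ make
or, using SCons:
$ scons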
The majority, if not all, of the supporting scripts and tools are written in interpreted languages and don’t need compilation.
Install
There is not an explicit installation step for dvmdostem. You can add the directory where you keep the code (named dvm-dos-tem by default when you clone the repository) to your $PATH variable, or you can reference the scripts and other programs by their absolute path.
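For example, assuming you cloned the repository into your home directory, you might add something like the following to your shell profile (paths are illustrative):
$ export PATH="$HOME/dvm-dos-tem:$HOME/dvm-dos-tem/scripts:$PATH"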
While the $PATH variable is set up in the development docker container:
develop@d146768bfaac:/data/workflows$ runmask.py -h
usage: runmask.py [-h] [--verbose] [--reset] [--all-on] ...
[FILE]
many of the examples still use the absolute path:
develop@d146768bfaac:/data/workflows$ /work/scripts/runmask.py --help
usage: runmask.py [-h] [--verbose] [--reset] [--all-on] ...
[FILE]
2.1.2. Control Options
There are generally three ways you can control how dvmdostem runs:
Command Line Options
Configuration Files
Parameter Files
If a setting is present in both the configuration file and the command line options, precedence is given to the command line.
Command line
The best way to see the command line options for dvmdostem is by using the --help flag.
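For example (assuming dvmdostem is on your PATH or you are running from the directory containing the binary):
$ dvmdostem --help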
Configuration
The dvmdostem program will look for a file, config/config.js, when starting. The file is .json formatted, and should have the following top level keys:
{
  "general": { },
  "IO": { },
  "calibration-IO": { },
  "stage_settings": { },
  "model_settings": { }
}
Of note is that under the “stage_settings” entry it is now possible to control which modules of the model are enabled at different stages. Previously this could not be modified after compilation unless you were using the calibration mode and the calibration directives file.
Parameters
dvmdostem is designed to be highly configurable with regard to parameters. Many parameters have been factored out so that their values can be set in text files, which allows the operation of the model to be changed without re-compiling. But this flexibility results in lots of parameters, and managing them can be cumbersome.
dvmdostem ingests parameters that are stored in a custom, space delimited, fixed width text format. The format is a compromise that allows:
Storing parameters for multiple Community Types (CMTs) in one file.
Human readable text format (space delimited columns).
Easily editable on a small scale (adjusting a couple parameters).
at the expense of:
Easy editing on a large scale.
Standardization/portability to other tools (e.g. spreadsheet or database).
Easy handling of metadata such as CMT names, PFT names, parameter names, units, comments, and references.
The util/param.py script has many functions to help manipulate dvmdostem parameter files. Included in util/param.py are functions that can help convert from the custom fixed width text (FWT) format to comma separated value (CSV) and back. Certain edits (such as adding and updating metadata) are much easier to accomplish in a spreadsheet program. The metadata in the existing FWT files is incomplete, and the assumption is that this will improve over time as users convert to CSV, work on the files, update values (i.e. through calibration, new observations, or further literature review), update metadata, and then convert the files back to FWT before committing to the repository. This is described in the following diagram.
More information about the CSV format can be found with the param.py --csv-v1-spec command line option. Rather than circulating a template file, the intention is that the user creates template files from the FWT files using the tools in param.py.
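A sketch of the round trip might look like the following; the --csv-v1-spec option is documented above, but the conversion flag names below are assumptions, so check ./util/param.py --help for the actual interface:
$ ./util/param.py --csv-v1-spec
$ ./util/param.py --to-csv parameters/cmt_bgcsoil.txt > cmt_bgcsoil.csv
# ... edit cmt_bgcsoil.csv in a spreadsheet program: update values and metadata ...
$ ./util/param.py --from-csv cmt_bgcsoil.csv > parameters/cmt_bgcsoil.txt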
util/param.py also has facilities for converting from FWT to JSON and back. These functions have thus far been most useful for integrating dvmdostem with other software such as PEcAn.
Example parameter files can be found in the parameters/ directory. The general structural constraints are enumerated here:
The parameters are grouped into different files by rough theme.
Each file can have 1 or more “blocks” of CMT data, (CMT blocks).
Within one file the CMT blocks must contain identical lists of parameters (by name).
Comments in the file are accomplished with //.
Each CMT block starts with a line containing the CMT code, an alphanumeric code consisting of the letters CMT followed by two digits, for example CMT05.
There may be any number of comment lines (beginning with //) present between the beginning of the block and the data, as long as they do not contain the string CMT.
Each parameter is stored on its own line. The value of the parameter is followed by a comment containing the parameter name and optionally units, description, comments, and references, formatted like so: 1.0 // param_name: units // description // comment // refs. The parameter name (followed by :) is required; all other fields are optional.
For PFT specific data, the data block will have space delimited columns, with one column for each PFT.
For PFT specific data, the last comment line before the data begins will hold the PFT names, e.g. “BlackSpruce” or “Moss”.
For CMTs that don’t define all 10 PFTs, the undefined PFTs will have a name like ‘Misc’ or ‘PFT’ or ‘pft’.
The CMT and PFT names are not used in the C++ code, but many of the pre- and post-processing Python tools expect the CMT and PFT names to be present.
An abbreviated example of non-PFT data from cmt_bgcsoil.txt:
//===========================================================
// CMT04 // Shrub Tundra // Calibrated for Toolik area.
2.0     // rhq10: ....
0.2     // propftos:
0.0     // fnloss: fraction N leaching (0 - 1) when drainage occurs
.....
3.93    // initavln: was 0.68
An abbreviated demonstration example of PFT specific data from cmt_envcanopy.txt:
//===========================================================
// CMT89 // Demo Example // more comments...
// extra comment line...
//Spruce  Decid   PFT2   ...  PFT9     // names: comments
0.10      0.10    0.10   ...  0.10     // albvisnir: canopy albedo ...
0.003     0.003   0.003  ...  0.003    // glmax: m/s // max. canopy conductance ...
0.0       0.0     0.0    ...  0.0      // initvegsnow: initial intercepted snow water in canopy
2.1.3. Running the Model
Setting up a run
Each run should take place in its own working directory. This directory should have a variety of configuration files, parameter files, and an output directory, and optionally calibration and input files. See the script setup_working_directory.py, which is a helper tool for initializing a run directory.
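For example, a sketch of initializing a new run directory (the --input-data-path option and all paths here are assumptions; see setup_working_directory.py --help for the actual interface):
$ setup_working_directory.py --input-data-path /data/input-catalog/my-site /data/workflows/my-run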
Spatial Size
The size of your run is controlled by the dimensions of your input set and the contents of the run-mask.nc file. If your input set is larger than 1x1, simply mark the cells you’re interested in as active in the run-mask.nc file and the model will run them, by default sequentially.
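For example, to deactivate every cell and then enable a single pixel (the --yx option is an assumption, since the usage text shown earlier is elided; see runmask.py --help for the full option list):
$ runmask.py --reset run-mask.nc
$ runmask.py --yx 0 0 run-mask.nc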
Single Site or Small Regions
Smaller regions, perhaps no more than a couple dozen cells, can reasonably be run on a standard workstation. Running sequentially is easiest, although such a run could be done in parallel using MPI if the NetCDF output file accesses were modified to be per-cell rather than parallel.
Large Regions (HPC)
For larger regional areas we utilize an HPC cluster and OpenMPI.
Due to space limitations, we break regional runs into ‘batches’, which are subsets of the region with a specified number of cells marked in the run mask. There are a few example scripts in /scripts/chinook to help with splitting, running, and merging these regional runs. They will need to be adapted to run in another environment.
Each batch outputs in parallel to a set of output files shared by all cells in that batch. This means that currently running with MPI requires a file system that supports parallel file access.
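If your file system does support parallel access, a hand-launched MPI run might look something like the following sketch; the process count and stage year counts are placeholders (the stage flags are described in Section 2.1.4), and on the cluster this step is wrapped by the generated slurm_runner.sh scripts described below:
$ mpirun -np 25 ./dvmdostem -p 100 -e 1000 -s 250 -t 115 -n 85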
Requirements
Boost 1.55 built with mpi
jsoncpp 0.5.0
lapack 3.8.0
OpenMPI 4.1.0 built with slurm
HDF5 1.8.19 with parallel enabled
NetCDF4 4.4.1.1 linked with hdf5
Python 2.7 (for examples in /scripts/chinook)
Python 3 (for general dvmdostem scripts)
Splitting
The batch splitting script pulls from the config/config.js file, so make sure that is set up before running. Active cells per batch is set in the splitting script itself; for a smaller regional run (50x50), we usually do 25 cells per batch. The scriptlet near the bottom of the script holds a few SLURM settings (sketched after this list), including:
Which email notifications to send
The email address to send those notifications to
A timeout limit, after which SLURM will kill the job
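In a SLURM batch script, these settings typically appear as #SBATCH directives along these lines (values are placeholders):
#SBATCH --mail-type=END,FAIL
#SBATCH --mail-user=you@example.com
#SBATCH --time=04:00:00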
When you run the script it will create a subdirectory for each batch in the specified output directory. Each subdirectory will have a copy of the config.js file and customized run-mask.nc and slurm_runner.sh files.
Starting a run
By default the batch running script runs a set of batches from an inclusive range defined by values provided in the call. Example call to run batches 0-9:
$ ./scripts/chinook/batch_run_on_chinook.sh [path to batch dir] 0 9
It can instead run a set of batches with indexes manually specified in an array if needed.
Manually set the number of concurrent batches (in this example ‘3’) to submit to the SLURM queue in the following statement:
while [ $(squeue | grep -c [username]) -ge 3 ];
Merging
The merging script will attempt to merge all files matching the output file name format (e.g. GPP_monthly_sc.nc) for each variable in the output_spec file, from the specified directory and several subdirectory levels below.
There are a few values at the beginning of the batch merging file that you will need to set:
OUTPUT_DIR_PREFIX - The parent directory of the batch-run subdirectory
OUTPUT_SPEC_PATH - The output_spec.csv file used for the run
FINAL_DIR="${OUTPUT_DIR_PREFIX}/[subdirectory name for merged files]"
mkdir -p "${OUTPUT_DIR_PREFIX}/[subdirectory name for merged files]"
This script will take quite a long time to run - several hours for a large regional run. If it produces an incomplete merged file for a variable or two, it can be re-run for a single variable at a time.
2.1.4. Running from Restart Files
dvmdostem can be stopped at and restarted from any inter-stage pause. The most useful point to do so will be after either EQ or SP, so the bulk of the computing does not need to be repeated and experimental TR+SC runs can be completed quickly.
The files needed to do this are automatically created and named after the stage that they hold data from: restart-[stage].nc.
Set up
Complete an initial run through to the point you wish to restart from. If you want the outputs from later stages for comparison purposes, running those as well will not disrupt the process.
If you produced output files in your initial run that you want to retain, you will need to manually move them elsewhere. Leave the restart files in the output directory.
Restart
Two flags are necessary in order to restart: --no-output-cleanup and --restart-run. The first keeps dvmdostem from re-creating the output directory (and therefore deleting its contents), and the second prevents it from creating new restart-[stage].nc files that would overwrite the ones needed to restart.
Where to restart from is controlled by how many years are specified per stage.
If 0, a stage is skipped and dvmdostem attempts to continue from the next stage.
For example, to restart after spinup and only run transient and scenario, the
year counts would be something like this: -p 0 -e 0 -s 0 -t 115 -n 85
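Putting it all together, the restart invocation would look something like this (flags and year counts are the examples from above; add any other options your run requires):
$ dvmdostem --restart-run --no-output-cleanup -p 0 -e 0 -s 0 -t 115 -n 85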
2.1.5. Running a Sensitivity Analysis
WRITE THIS…
2.1.6. Parallel Options
WRITE THIS…
2.1.7. Processing Outputs
WRITE THIS…
2.1.8. Processing Inputs
WRITE THIS…
From IEM/SNAP data
WRITE THIS…
From ERA5
WRITE THIS…
2.2. Conceptual Modeling Method and Workflow
WRITE THIS…
The modeling process is multifaceted and can be approached in a number of ways and for different reasons. A schematic overview of the entire process is given in the following image:
2.2.1. Develop goals
WRITE THIS…
2.2.2. Conceptualization
WRITE THIS…
2.2.3. Formulation
WRITE THIS…
2.2.4. Implementation
WRITE THIS…
2.2.5. Parameterization
WRITE THIS…
2.2.6. Model Testing
WRITE THIS…
2.2.7. Model Analysis
WRITE THIS…
2.2.8. Pre- and Post- Processing
WRITE THIS…