SuppMat 2: Looking at the ‘traits’ of the different model families
Author
Tanya Strydom
Published
December 6, 2024
Abstract
The purpose of this document is to provide a rationalisation but also ‘formalisation’ of the different model families of sorts. This starts of by trying to construct a ‘traits’ table that focuses on what the resulting network is (is it a prediction of structure or is it a prediction of interactions), how the resulting network is encoded (i.e., its anatomy) as well as trying to identify what type of data are needed to ‘parametrise’ the model. Finally we take this traits table and feed it into a hierarchical clustering model to sort of see how our broader a priori ideas of models being either topology generators or interaction predictors.
Understanding the rationale of the traits table is perhaps the most crucial thing though so I will try and elaborate on some of the ‘categories’ that I have selected and provide a very high-level glossary of terms. Many of these categories are the product of discussions we’ve hard and this is an attempt to articulate those discussions in a more categorical manner.
First. What the model is predicting. Is it the structure of a network, interactions between species, or their co-occurrence patterns.
Second. What does the network represent. Is it a conceptual network (i.e., cannot be assigned to specific community/geographic location), metaweb (network of potential interactions), realised network, or a ‘metacommunity’ (collection of co-occurring species).
Third. What is the underlying mechanism that determines the interactions between species. This is an attempt at trying to code what is discussed in Box 1 of the main text but a bit more related to the ‘computational’ mechanism and not the ‘ecological’ mechanism. Note here that the use of mathematical is used to refer to a framework that uses some sort of mathematical formalisation to determine interactions/structure, unlike ‘statistical’ which is attempting to find and apply a pattern to determine interactions.
Fourth. What do the links represent - this is much more aligned with Box 1 in the main text
Fifth. What do the nodes represent. Note here that ‘agnostic’ means that there is no node-level metadata for the species/the information at the species level is not ecologically meaningful.
Sixth. The ecological theory as to what constrains links (again Box 1 in the main text)
Seventh. How the interactions quantified
Eight. What are the data sources needed for each model.
In [1]:
suppressPackageStartupMessages(library(tidyverse))library(gt)library(ggdendro)options(warn =-1)# this is the clustering codesource("scripts/hierarchy_cluster.R")