This article provides a detailed guide for researchers and biomedical professionals on using RSiena software to analyze dynamic animal social networks.
This article provides a detailed guide for researchers and biomedical professionals on using RSiena software to analyze dynamic animal social networks. It covers foundational concepts of social network analysis (SNA) in animal models, explores RSiena's methodological framework for longitudinal data, offers troubleshooting and optimization strategies for complex models, and validates findings through comparative analysis. The guide connects these techniques to applications in behavioral neuroscience, disease transmission modeling, and preclinical drug development, offering a robust resource for integrating sophisticated network dynamics into biomedical research.
Animal social network analysis (SNA) moves beyond individual-focused studies to quantify the structure, dynamics, and consequences of social relationships within animal groups. In biomedical research, this approach provides critical insights into disease transmission, mental health, neurobiology, and therapeutic efficacy. The RSiena software (Simulation Investigation for Empirical Network Analysis) is a gold-standard tool for longitudinal SNA, enabling researchers to model the co-evolution of social networks and individual behaviors or traits over time.
Key Applications:
Quantitative Data Summary:
Table 1: Key Metrics in Animal Social Network Analysis and Their Biomedical Relevance
| Network Metric | Definition | Biomedical Relevance |
|---|---|---|
| Degree Centrality | Number of direct connections an individual has. | Identifies highly connected individuals (potential super-spreaders for pathogens/information). |
| Betweenness Centrality | Number of shortest paths that pass through an individual. | Identifies potential "brokers" in a network critical for information flow or pathogen containment. |
| Clustering Coefficient | Measure of how interconnected an individual's connections are. | Indicates subgroup (clique) formation; relevant for subpopulation-level disease dynamics. |
| Network Density | Proportion of possible connections that are realized. | Overall group cohesion; can correlate with disease transmission rate or collective resilience. |
| Assortativity | Tendency for individuals to connect with similar others (e.g., by health status). | Drives disease spread; healthy-sick disassortativity can slow epidemics. |
Table 2: RSiena Model Parameters for Co-evolution Analysis
| Effect Type (Parameter) | Interpretation in Biomedical Context | |
|---|---|---|
| Outdegree (Density) | Baseline propensity to form ties. Can be influenced by general health or drug treatment. | |
| Reciprocity | Tendency to reciprocate social ties. May be impaired in social disorders. | |
| Transitivity | Friends of friends become friends. Measures closure & subgroup stability. | |
| Behavior Similarity | Tendency to connect with others who have similar behavioral scores (e.g., anxiety level). | Can model homophily based on disease state or symptom severity. |
| Behavior Influence | Tendency to adjust one's own behavior to match that of social connections. | Models behavioral contagion (e.g., depressive-like behaviors). |
| Network-Behavior Selection | How an individual's behavior affects their attractiveness as a social partner. | e.g., How sickness behavior alters social integration. |
Objective: To collect repeated, structured observations of social interactions suitable for dynamic network analysis with RSiena.
Materials: See "Research Reagent Solutions" below. Animals: Group-housed rodents (e.g., mice, rats). Minimum 3-4 time points (T1, T2, T3...). Procedure:
.txt or .csv).Objective: To model how pre-existing social network structure predicts disease outcomes.
Materials: See "Research Reagent Solutions." Includes pathogen (e.g., influenza virus, Mycoplasma pulmonis). Procedure:
Objective: To evaluate if a drug alters social network structure or dynamics.
Materials: Includes candidate drug and vehicle control. Procedure:
Title: RSiena Analysis Workflow for Pharmaco-social Research
Title: Social Stress-Neurobiology-Network Feedback Pathway
Table 3: Essential Materials for Animal Social Network Research
| Item | Function & Relevance |
|---|---|
| RFID Tracking System | Provides automated, high-resolution spatiotemporal proximity data for network edge definition. Superior to manual observation for frequency/duration. |
| Machine Learning Ethology Software | Tools like DeepLabCut or SLEAP automate pose estimation and behavior classification from video, standardizing interaction detection. |
| RSiena Software (R package) | The core analytical tool for statistically testing hypotheses about longitudinal network and behavior co-evolution. |
| SocialBox or Custom Arena | Controlled, standardized environment for group observations with clear sightlines for recording. |
| Non-invasive Biomarker Kits | For measuring fecal glucocorticoid metabolites or urinary cytokines as stress/immune covariates linked to network position. |
| Pathogen Challenge Stocks | Defined microbial agents (virus, bacteria) to investigate network-driven disease transmission and susceptibility. |
| Pharmacological Agents | Anxiolytics, antidepressants, or novel compounds to test modulation of social network dynamics. |
| Data Formatting Scripts (Python/R) | Custom code to convert raw observation or tracking data into RSiena-ready adjacency matrices and covariate files. |
RSiena (Simulation Investigation for Empirical Network Analysis) is a statistical software package for the analysis of longitudinal network data, implemented as an R package. Within animal social networks research, it provides the methodological rigor to model the co-evolution of social structure and individual behaviors or traits over time. This is critical for understanding phenomena like hierarchy formation, information diffusion, and the impact of pharmacological interventions on group dynamics.
Key Quantitative Findings from Recent Studies (2022-2024):
Table 1: Summary of RSiena Application in Recent Animal Social Network Studies
| Study Focus (Species) | Key Network Effect Modeled | Rate Function (Avg. Change Opp./Wave) | Behavioral Influence (Avg. Parameter Est.) | Primary Finding |
|---|---|---|---|---|
| Pharmacological Intervention (Mice) | Density & Transitive Triads | 4.2 | Peer influence on activity: 0.68* | Drug X reduced social selectivity (density ↑). |
| Hierarchy Stability (Primates) | Preferential Attachment & Reciprocity | 3.8 | Status on friendship: 0.52* | Dominance networks show high structural inertia. |
| Information Diffusion (Birds) | Behavior Diffusion & Network Closure | 5.1 | Adoption via contacts: 1.21* | Novel foraging technique spread via strong ties. |
(p<0.05, **p<0.001)
Objective: To collect repeated measures of social interactions suitable for RSiena's discrete-time wave structure.
network (an array of adjacency matrices), attributes (a matrix of covariates), and behavior (a matrix of time-varying behavioral scores).Objective: To analyze the dynamics of the collected network and behavior data.
ans$theta for final parameter estimates and significance.sienaGOF to perform goodness-of-fit tests for structural parameters.RSiena Dynamic Network Analysis Workflow
Drug Effects on Network & Behavior Coevolution
Table 2: Essential Research Tools for RSiena-Based Animal Social Network Studies
| Item / Solution | Function in Research |
|---|---|
| RSiena R Package | Core software for statistical modeling of network dynamics. |
| RStudio IDE | Integrated development environment for scripting and analysis. |
| Automated Tracking System (e.g., EthoVision) | High-resolution, continuous collection of animal proximity & movement data. |
| RFID Tag & Reader System | Automated logging of dyadic interactions at feeders, nests, etc. |
| Behavioral Coding Software (e.g., BORIS) | For precise manual annotation of complex social interactions from video. |
| Statistical Covariates Database | Curated records of individual animal attributes (health, genotype, treatment history). |
| High-Performance Computing (HPC) Access | For running large, complex RSiena models with many parameters. |
1. Core Conceptual Framework In RSiena (Simulation Investigation for Empirical Network Analysis) applied to animal social networks, the fundamental units are:
2. Quantitative Data Structure Summary
Table 1: Essential Data Structure for RSiena Analysis of Animal Social Networks
| Data Component | Format | Description | Example in Animal Research |
|---|---|---|---|
| Network Waves | Array of adjacency matrices (n x n) | Square matrices for each observation time point. Cell ij records the tie from actor i to actor j. | Matrix of grooming bouts observed in Time 1, Time 2, Time 3. |
| Actor Covariates | Matrix or vector per wave | Time-constant (e.g., sex) or time-varying (e.g., dominance rank, hormone level) attributes for each actor i. | A vector of dominance index scores for each individual at each time point. |
| Dyadic Covariates | Constant matrix or array | Pair-level invariant characteristics (e.g., genetic relatedness, home range overlap). | A matrix of kinship coefficients for each actor pair. |
Table 2: Common RSiena Effect Parameters in Animal Social Networks
| Effect Type | Parameter Name | Interpretation | Biological/Social Process Tested |
|---|---|---|---|
| Structural | Outdegree (Density) | Baseline propensity to form ties. | General sociability. |
| Structural | Reciprocity | Tendency to reciprocate ties. | Mutual cooperation, bond maintenance. |
| Structural | Transitivity (GWESP) | Preference for forming triangles. | Closure in alliances or coalitions. |
| Covariate-related | Ego, Alter, Same | Influence of an actor's attribute on sending/receiving ties, or homophily. | Do high-ranking individuals (ego) groom more? Do individuals groom others of similar age (same)? |
| Behavioral | Behavioral Rate & Shape | Dynamics of an ordinal behavior covariate. | How does a new foraging skill diffuse? |
| Co-evolution | Average Similarity | Network-behavior feedback. | Do animals become more similar in behavior to their social partners? |
Protocol 1: Longitudinal Data Collection for Wild Animal Networks
Protocol 2: RSiena Model Specification and Estimation
sienaDependent, coCovar, varCovar).getEffects() function. Include structural effects (reciprocity, transitivity) and covariate effects (sex, dominance).siena07() function to run the Method of Moments estimation algorithm. Check convergence (all t-ratios for convergence < |0.1|) and overall maximum convergence ratio (< 0.25).sienaGOF. Iteratively refine the model based on GOF.RSiena Longitudinal Analysis Workflow (76 characters)
Network Tie Dynamics Between Two Waves (75 characters)
Table 3: Key Research Reagent Solutions for Animal Social Network Analysis
| Item | Function/Application |
|---|---|
| RSiena Software Package (R) | The core statistical software for modeling longitudinal network data via stochastic actor-oriented models (SAOM). |
| RStan/brms | Alternative Bayesian framework for fitting SAOMs, useful for complex random effects or prior information integration. |
| SOCPROG or BANTA | Software for deriving association matrices from raw observation data, which can serve as input for RSiena. |
| GPS/Proximity Loggers | Automated data collection devices to record spatial proximity ties between tagged individuals at high resolution. |
| Cytoscape with EvolvingNetwork Plugin | Network visualization software capable of animating longitudinal network changes inferred from RSiena models. |
| Behavioral Coding Software (e.g., BORIS, Observer XT) | For systematic coding and quantification of social interactions from video/audio recordings into adjacency matrices. |
| Genetic Relatedness Matrix | A dyadic covariate (e.g., from microsatellite or SNP data) to test for kin selection effects on tie formation. |
| Hormone Assay Kits (Corticosterone, Testosterone) | To generate time-varying actor covariates measuring physiological stress or androgen status for co-evolution models. |
Effective social network analysis using RSiena for animal behavior studies hinges on the quality of the input longitudinal data. The following principles are paramount.
| Parameter | Poor Practice | Best Practice | Rationale for RSiena Analysis |
|---|---|---|---|
| Observation Duration | Single short snapshot per wave | Multiple, extended sessions per wave to capture representative behavior | Reduces stochastic error in tie variables; stabilizes wave-to-wave network metrics. |
| Sampling Interval (Waves) | Irregular, biologically arbitrary intervals | Regular intervals based on species' social tempo (e.g., daily for mice, weekly for primates) | Ensures the rate function models realistic opportunities for network change between observations. |
| Group Coverage | < 95% of group observed per wave | > 99% of known group members recorded per wave | Missing nodes create unobserved ties, biasing structural effect estimates (e.g., transitivity). |
| Tie Definition | Subjective, continuous measures | Binary or ordinal counts with clear thresholds (e.g., ≥3 grooming bouts = 1) | Provides clear input for the objective function; facilitates model convergence. |
Objective: To collect repeated, quantitative data on social interactions for constructing directed, weighted adjacency matrices for RSiena input.
Materials: See "The Scientist's Toolkit" below.
Procedure:
Objective: To collect continuous, high-resolution spatial data for constructing undirected, weighted proximity networks.
Procedure:
| Behavior Category | Operational Definition | Typical Coding for RSiena Input |
|---|---|---|
| Allogrooming | One animal uses its mouth or paws to groom another's fur or body. | Directed, weighted count of bouts. |
| Aggression | Biting, chasing, or forceful pinning of one animal by another. | Directed, binary (presence/absence per wave). |
| Social Proximity | Noses of two animals within < 2 cm without aggressive interaction. | Undirected, weighted by duration. |
| Huddling | Two or more animals in sustained body contact, resting. | Undirected, binary or weighted by duration. |
| Item | Function & Relevance to RSiena Studies |
|---|---|
| Passive Integrated Transponder (PIT) Tags | Subcutaneous microchips for permanent, unambiguous individual identification. Essential for long-term longitudinal studies tracking the same individuals. |
| Machine-Readable Color/Pattern Markers | Unique visual codes (e.g., ArUco markers) applied to fur or collars to enable automated, high-accuracy tracking of multiple individuals in a group. |
| Ultra-Wideband (UWB) RFID System | Provides real-time, high-resolution spatial positioning within a defined arena. Ideal for constructing precise proximity networks with timestamps. |
| Structured Ethology Software (e.g., BORIS, EthoFlow) | Enables systematic coding of live or video observations against a custom ethogram. Exports time-stamped data for aggregation into network matrices. |
| Automated Behavior Recognition AI (e.g., DeepLabCut, SLEAP) | Markerless pose estimation tools that can classify complex social behaviors from video, scaling up data collection for rich network variables. |
| Data Formatting Scripts (R/Python) | Custom scripts to transform raw observation logs or tracking coordinates into RSiena's required .csv or .txt matrix formats, ensuring reproducibility. |
This conceptual overview is framed within a thesis utilizing RSiena software to analyze dynamic animal social networks, with applications for understanding social behavior in pharmacological and toxicological studies.
The Stochastic Actor-Oriented Model (SAOM) is a statistical model for analyzing longitudinal network data. It models how networks evolve over time as a result of choices made by actors (e.g., animals) within the network. The "actor-oriented" perspective assumes that changes in network ties are driven by actors optimizing an objective function.
The model is defined by:
The objective function is specified as a linear combination of effects:
f_i(β, x) = Σ_k β_k s_{ki}(x)
where β_k are statistical parameters and s_{ki} are network effects.
Table 1: Standard network effects modeled in SAOM for animal social networks and their interpretation.
| Effect Name | Parameter (β) | Interpretation in Animal Context | Typical Research Question |
|---|---|---|---|
| Outdegree (Density) | β₁ | Baseline propensity to form ties. | What is the general sociability? |
| Reciprocity | β₂ | Preference for mutual relationships. | Is there a tendency for reciprocal grooming/alliance? |
| Transitivity | β₃ | Preference for ties to friends of friends. | Does the network exhibit triadic closure (e.g., "friend of a friend is a friend")? |
| Popularity (Indegree) | β₄ | Preference to attach to popular actors. | Are some individuals consistently more attractive as social partners? |
| Activity (Outdegree) | β₅ | Preference to attach from active senders. | Do more socially active individuals attract more ties? |
| Covariate (e.g., Dominance) Alter | β₆ | Effect of a covariate on attracting ties. | Do higher-ranking individuals receive more affiliative ties? |
| Covariate (e.g., Treated) Ego | β₇ | Effect of a covariate on sending ties. | Do pharmacologically treated animals initiate more/less contact? |
| Covariate Similarity | β₈ | Preference for ties to similar others. | Do animals of similar age, rank, or treatment status associate more? |
Protocol 1: Longitudinal Animal Social Network Data Collection for SAOM
X(t) where Xij = 1 if actor i has a tie to actor j. Compile actor covariates (treatment, rank) into a separate matrix.siena07() algorithm to obtain parameter estimates (β) and standard errors. Assess convergence (t-ratios < |0.1|).Protocol 2: Testing Pharmacological Impact on Social Dynamics
SAOM Analysis Workflow in RSiena
SAOM Actor Decision Process
Table 2: Essential materials and tools for SAOM-based animal social network research.
| Item / Reagent | Function in SAOM Research Context |
|---|---|
| RSiena Software (R Package) | Core statistical software for specifying, estimating, and diagnosing SAOMs. |
| R or RStudio | Programming environment for running RSiena and managing data analysis pipelines. |
| Behavioral Observation Software (e.g., BORIS, EthoVision) | For accurate recording and coding of social interactions into structured network data. |
| Pharmacological Agents (e.g., Oxytocin, Clozapine) | Tool compounds to experimentally manipulate social motivation and cognition, creating dynamic covariates. |
| Vehicle Solutions (Saline, DMSO/PEG mixes) | Critical control for administering pharmacological agents. |
| RFID Tracking System | Automated, high-resolution data collection for proximity-based network ties over time. |
| Video Recording System | Essential for permanent record of behavior, allowing for reliability coding and re-analysis. |
| Animal Housing Enrichment | Standardized, complex environments that permit the expression of naturalistic social dynamics. |
| Statistical Text (e.g., 'Social Network Analysis for Ego-Nets') | Foundational resources for understanding SNA concepts underlying SAOM. |
To conduct a longitudinal social network analysis using RSiena, data must be structured as a sequence of networks observed at discrete time points. For animal interaction research, this requires converting raw behavioral observations into formatted adjacency matrices or edge lists.
Table 1: Core Data Input Formats for RSiena Animal Network Studies
| Format | Description | RSiena Suitability | Key Consideration for Animal Data |
|---|---|---|---|
| Adjacency Matrix (Square) | N x N matrix where cell ij indicates a tie from actor i to actor j. | Primary input format. | Values can be binary (0/1), counts of interactions, or proximity durations. Diagonal is ignored (self-ties). |
| Edge List (Long Format) | A 2 or 3-column list: sender (ego), receiver (alter), weight. | Must be converted to matrix format for RSiena. | Efficient for sparse networks. Must include all possible nodes in each wave, even if isolated. |
| Node Attribute File | Comma-separated file linking node ID to time-varying (e.g., dominance rank) or constant (e.g., sex) covariates. | Required for covariate analysis (ego, alter, similarity effects). | Attributes must be aligned with network waves. Missing values (NA) are permitted but must be consistent. |
| Behavioral Variable File | A matrix where rows are actors and columns are waves, containing a continuous or binary behavioral score. | Used as dependent variable for co-evolution analysis. | Must be normalized/standardized if scale varies. |
Protocol 2.1: Constructing Longitudinal Adjacency Matrices from Focal Sampling Objective: To create a time-series of directed, weighted adjacency matrices representing grooming interactions in a primate group across three observation periods.
0 to cells ij where i was observed but initiated no grooming towards j.nodefile to identify structural zeros (e.g., individuals not present in a specific wave). In the main matrix, represent these as NA or exclude the node for that wave only if using sienaDataCreate() with nodeSet=.Protocol 2.2: Pre-processing Weighted Interaction Data Objective: To transform raw interaction counts/durations into analyzable tie variables, handling variation in individual observation effort.
standardized_weight_ijt = (raw_duration_ijt) / (observation_time_it)
Multiply by a constant (e.g., 3600) to interpret as "seconds per hour."weight > 0) or a meaningful biological cutoff (e.g., rate >= 5 sec/hr) to create a binary matrix.final_weight_ijt = max(weight_ijt, weight_jit) or mean(weight_ijt, weight_jit).Title: RSiena Analysis Workflow for Animal Social Dynamics
Table 2: Key Research Reagent Solutions for Animal Network Construction & Analysis
| Item / Software | Category | Primary Function in Network Construction |
|---|---|---|
| BORIS | Behavioral Coding Software | Open-source event logging software for timestamped recording of interactions from video or live observation. Outputs structured data for matrix creation. |
R (with statnet/igraph) |
Programming Environment | Core platform for data manipulation, network metric calculation (igraph), and RSiena model execution. Enables reproducible analysis pipelines. |
| RSiena Package | Statistical Software | Specialized R package for Stochastic Actor-Oriented Models (SAOMs) to analyze network and behavior co-evolution over time. |
| SOCPROG | Dedicated Analysis Tool | MATLAB-based software suite for calculating association indices, network metrics, and permutation tests from animal observation data. |
| GPS/Proximity Loggers | Data Collection Hardware | Automated collection of spatial proximity data at high resolution. Raw data requires processing to define edges (e.g., within X meters for Y seconds). |
asnipe R Package |
Analysis Package | Calculates gambit-of-the-group association indices and generates networks from observation or logger data. Integrates with igraph. |
ANIWAVE (Custom Scripts) |
Data Processing | Custom R/Python scripts to pre-process weighted adjacency matrices, standardize for effort, and format for RSiena input. |
Graphviz (DOT language) |
Visualization Tool | Used for generating standardized, publication-quality diagrams of workflows and conceptual models (as in this document). |
Defining and Coding Dependent Networks and Covariates
This protocol details the definition and coding of dynamic social networks (dependent networks) and covariates within the RSiena software framework for longitudinal animal social network analysis. Accurate coding is critical for modeling social dynamics and selection pressures.
1. Core Concepts and Data Structures
A typical RSiena analysis requires two primary data structures: the dependent network data and the covariate data.
Table 1: Primary Data Structures for RSiena Analysis
| Data Structure | Description | R Object Type (siena) | Time Points (Waves) |
|---|---|---|---|
| Dependent Network | The evolving social tie variable (e.g., grooming, proximity) under study. | sienaNet (array) |
T (≥2) |
| Constant Covariate | Individual attribute that does not change over the observation period (e.g., sex, birth cohort). | coCovar (vector) |
1 |
| Changing Covariate | Individual attribute that may change over time (e.g., dominance rank, hormone level). | varCovar (matrix) or sienaNet |
T |
| Dyadic Covariate | A predictor defined for each pair of individuals (e.g., kinship, spatial overlap). | coDyadCovar or varDyadCovar (matrix) |
1 or T |
2. Experimental Protocol: Data Preparation and Coding
Protocol 2.1: Defining Dependent Social Networks from Observational Data
Objective: Transform sequential observation matrices into a sienaDependent network array.
Materials: Focal/scan sampling adjacency matrices per observation period, aggregated per defined time wave.
Procedure:
type='oneMode' for directed/undirected, or type='bipartite'.Protocol 2.2: Coding Individual Covariates Objective: Incorporate time-constant and time-varying attributes as covariates. Procedure for Constant Covariates:
NA) for truly missing data.Protocol 2.3: Coding Dyadic and Network Covariates Objective: Incorporate pair-level predictors, such as kinship or prior stable associations. Procedure:
exogAttrib in sienaDataCreate.3. Visualization of RSiena Data Structure and Workflow
Title: RSiena Data Preparation Workflow
Title: Structure of a Three-Wave sienaDependent Array
4. The Scientist's Toolkit: Key Research Reagent Solutions
Table 2: Essential Tools for RSiena Data Preparation
| Item/Software | Function in Analysis | Specification/Notes |
|---|---|---|
| R Statistical Environment | Core platform for all data manipulation and analysis. | Version ≥4.0.0. |
| RSiena Package | Implements the Stochastic Actor-Oriented Model (SAOM). | Install via install.packages("RSiena") or install.packages("RSiena", repos="https://www.stats.ox.ac.uk/pub/RWin/"). |
| asnipe / aniDom R Packages | For generating networks from raw observation data and calculating dominance ranks. | asnipe::get_network(); aniDom::elo_scores(). |
| SOCPROG Software | Alternative for descriptive network metrics and data preparation. | Validates network construction. |
| Structured Data Sheets | Essential for aligning covariate data with node IDs across waves. | Use unique, persistent animal IDs as the primary key. |
| Scripted Data Pipeline | Reproducible R script for raw data → RSiena object transformation. | Must handle missing data codes and actor alignment explicitly. |
This protocol provides a structured approach to specifying, evaluating, and interpreting Stochastic Actor-Oriented Models (SAOMs) using RSiena within animal social networks research. The SAOM framework is essential for disentangling social network dynamics—how ties change over time—from behavioral dynamics—how individual traits or states co-evolve with the network. A core challenge addressed here is endogeneity, where the network influences behavior and behavior influences the network, creating a feedback loop.
Model specification involves selecting appropriate "effects" that represent hypothesized social processes. The table below categorizes essential effects for animal social network studies.
Table 1: Core RSiena Effect Specifications for Animal Social Networks
| Effect Category | Effect Name (RSiena Term) | Mathematical Interpretation | Biological/Social Hypothesis | Typical Parameter Sign (Expected) |
|---|---|---|---|---|
| Network Dynamics | outdegree (density) | Baseline propensity to form ties. | General sociability or interaction rate. | Negative (costly) |
| reciprocity | Tendency to form mutual ties. | Dyadic bonding, cooperation, or tit-for-tat. | Positive | |
| transitivity (gwespFF) | Preference for ties that create triangles. | Preferential association with friends of friends; triadic closure. | Positive | |
| indegree - popularity (inPop) | Attraction to already popular individuals. | "Rich-get-richer" or status-based attraction. | Positive or Negative | |
| outdegree - activity (outAct) | More active actors create more new ties. | Variation in individual gregariousness. | Positive | |
| Behavior Dynamics | linear shape | Tendency towards high or low behavior values. | Baseline propensity for a behavior (e.g., proximity to waterhole). | Depends on behavior |
| quadratic shape | Tendency towards extreme or moderate values. | Stabilizing or polarizing forces on behavior. | Negative (bell curve) | |
| Selection & Influence (Endogeneity) | average alter (avAlt) | Influence: Behavior adapts to neighbors' average. | Social contagion, conformity, or local learning. | Positive |
| egoX | Selection: Effect of ego's behavior on sending ties. | Behavioral homophily or heterophily based on ego's state. | Positive (homophily) or Negative | |
| alterX | Selection: Effect of alter's behavior on receiving ties. | Attraction to alters with specific behavioral traits. | Positive (homophily) or Negative | |
| sameX (simX) | Selection: Effect of similarity in behavior on tie formation. | Direct homophily: preference for others with similar traits. | Positive |
Objective: Format longitudinal network and behavior data for RSiena analysis.
Data Requirements:
R Script:
Objective: Define the SAOM by adding effects based on Table 1 hypotheses.
print01Report(myData, modelname) to check available effects.
Objective: Fit the model and ensure reliable parameter estimates.
Create Estimation Algorithm Object:
Run Model Estimation:
Mandatory Convergence Diagnostics:
t conv values for parameters in ans$tconv must be < |0.1|.ans$tconv.max must be < 0.25.n3 in sienaAlgorithmCreate or simplify the model.Objective: Assess model fit and interpret parameter estimates.
Goodness-of-Fit (GOF) Test:
Interpretation of Estimates:
ans$theta for parameter estimates.simX effect indicates homophily-based selection. A positive, significant avAlt effect indicates social influence.Diagram Title: SAOM Endogenous Feedback Loop
Table 2: Essential Research Toolkit for RSiena Analysis
| Tool/Reagent | Type | Primary Function in Analysis |
|---|---|---|
| RSiena Software | R Package | Core engine for estimating SAOMs. |
| sienaGOF() Function | Diagnostic Tool | Assesses model fit by comparing simulated to observed network statistics. |
| RSienaTest Version | Software Branch | Use for latest features and bug fixes; install from R-Forge. |
| Maximum Convergence Ratio (tconv.max) | Diagnostic Metric | Critical criterion (<0.25) to confirm model estimation has converged. |
| RSiena Effects List (getEffects) | Specification Guide | Catalog of all model effects; basis for hypothesis testing. |
| Meta-analysis of Parameters | Synthesis Method | Compares effect estimates across multiple studies/populations to identify general social rules. |
| RSiena Script | Protocol Document | Reproducible R code documenting data prep, model spec, estimation, and diagnostics. |
| Goodness-of-Fit Plots (GOF) | Visual Diagnostic | Reveals specific areas (e.g., high outdegree) where model fails to capture network structure. |
Within the broader thesis on RSiena software analysis of animal social networks, the siena07 function is the core engine for parameter estimation. Its proper application is critical for drawing valid inferences about social dynamics, such as the spread of behavioral phenotypes or infection states in animal colonies, with direct methodological parallels to clinical trial network analysis in drug development.
Table 1: Convergence Diagnostics and Meta-Parameter Impact
| Diagnostic Metric | Typical Target Value | Effect of Increased n3 (Phase 3 iterations) |
Effect of Increased nsub (sub-processes) |
Role in Animal Network Studies |
|---|---|---|---|---|
| Overall Maximum Convergence Ratio (t-conv) | < 0.25 | Improves precision of estimates | Minimal direct effect | Primary indicator of successful model fit for social selection & influence. |
| Individual Parameter Convergence Ratio | < 0.10 | Directly improves ratio | Reduces stochastic noise | Ensures specific effects (e.g., dominance, kinship) are reliably estimated. |
| Derivative Standard Errors | Stable across runs | Reduces variability | Averages out run-to-run variation | Critical for p-values assessing significance of social contagion effects. |
siena07 Runtime (minutes) |
- | Increases linearly | Increases linearly, enables parallelism | Practical constraint for large longitudinal animal observation datasets. |
Table 2: Common siena07 Call Arguments and Recommendations
| Argument | Default | Recommended Setting for Complex Animal Networks | Rationale |
|---|---|---|---|
n3 |
3000 | 5000 - 10000 | Provides sufficient iterations for complex hierarchical or proximity networks. |
nsub |
4 | 4 (or higher if computational resources allow) | Balances noise reduction with computational feasibility. |
maxlike |
FALSE | Consider TRUE for small, dense networks | Alternative estimation for precise, small-group observed interactions (e.g., captive primates). |
dolby |
TRUE | Keep as TRUE | Stabilizes estimation by scaling scores. |
diagg |
FALSE | Set to TRUE if convergence problems persist | Provides additional diagnostic information. |
Objective: To fit a Stochastic Actor-Oriented Model (SAOM) to longitudinal animal interaction data and assess convergence.
Materials: RSiena software (R environment), longitudinal adjacency matrices (e.g., grooming, aggression), covariate data (e.g., age, rank, health status).
Procedure:
matrix or network objects and covariates as vectors for each wave. Store in a sienaData object using sienaDataCreate().sienaAlgorithm object using sienaAlgorithmCreate(). Key arguments: projname, n3=4000, nsub=4, seed=123. This defines the meta-parameters for the estimation algorithm.siena07().
myresult object.
Overall maximum convergence ratio from print(myresult).t-ratios via summary(myresult).plot(myresult).siena07 iterations using the previous result as input.
Objective: To troubleshoot and resolve common convergence failures when modeling the spread of behaviors (e.g., foraging techniques) or health states.
Procedure:
n3 in steps (e.g., to 5000, then 10000) using the prevAns argument, as in Protocol 1 Step 5.myeff) and re-fit.sienaTimeTest() to assess if effect parameters are constant over time. Significant tests may indicate misspecification.seed argument, without using prevAns.profileLikelihood() to verify the likelihood maximum is found.Title: SAOM Fitting and Convergence Workflow
Title: Key siena07 Arguments and Their Effects
Table 3: Essential Components for RSiena-based Animal Network Analysis
| Item/Software | Function in Analysis | Specification/Notes for Research |
|---|---|---|
| RSiena R Package | Core software environment for specifying and fitting SAOMs. | Current version >= 1.4.0. Must be installed from CRAN or GitHub. |
| Longitudinal Interaction Data | Primary input: records who interacts with whom at multiple time points. | Must be formatted as adjacency matrices (R objects). Critical for defining network dependent variables. |
| Node-Level Covariates | Predictors of network structure or behavioral dynamics (e.g., influence models). | Examples: dominance rank, age, sex, infection status. Stored as vectors or matrices. |
sienaAlgorithm Object |
Container for meta-parameters (n3, nsub, seed) controlling the estimation engine. |
Defined once and reused for iterative fitting. The "protocol" for the estimation experiment. |
| High-Performance Computing (HPC) Cluster | Computational resource for running models with large n3/nsub or many models. |
Essential for bootstrapping, sensitivity analyses, or large groups (e.g., > 50 individuals). |
| Convergence Diagnostics (t-ratios) | Quality control metrics determining the validity of parameter estimates. | The primary output of siena07 to assess before interpretation. Target: < 0.1. |
RSiena analysis of animal social networks decomposes social dynamics into interpretable statistical parameters. These outputs quantify how social structure and individual traits co-evolve, offering insights critical for behavioral neuroscience and pharmacological research.
1. Core Parameter Interpretation
2. Key Output Tables for Analysis
Table 1: Summary of RSiena Model Output for Rat Social Network Dynamics
| Parameter Group | Effect Name (RSiena Terminology) | Estimate | S.E. | p-value | Interpretation in Animal Context |
|---|---|---|---|---|---|
| Rate Function | Rate Period 1 | 4.52 | 1.10 | <0.001 | Mean opportunity for network change between t1 & t2. |
| Rate Period 2 | 3.89 | 0.95 | <0.001 | Mean opportunity for network change between t2 & t3. | |
| Network Dynamics | Outdegree (Density) | -1.85 | 0.25 | <0.001 | Baseline propensity to form ties (usually negative). |
| Reciprocity | 0.75 | 0.15 | <0.001 | Tendency to reciprocate social affiliations. | |
| Transitive Triplets | 0.32 | 0.08 | <0.001 | Tendency for "friend of a friend" closure (clustering). | |
| Behavior Dynamics | Behavioral Rate (Drinking) | 2.10 | 0.60 | <0.001 | Opportunity for behavioral change per period. |
| Linear Shape | 0.55 | 0.20 | 0.006 | Tendency to increase behavior score (e.g., alcohol intake). | |
| Quadratic Shape | -0.45 | 0.12 | <0.001 | Tendency towards moderate behavior levels (curvilinear effect). | |
| Average Similarity | 0.90 | 0.30 | 0.003 | Tendency to adapt behavior to match network alters (social influence). |
Table 2: Implications for Pharmacological Research
| RSiena Finding | Potential Neurobiological Correlate | Drug Development Question |
|---|---|---|
| High "Transitivity" effect | Oxytocinergic systems, social reward. | Would an oxytocin receptor modulator alter social clustering? |
| Significant "Average Similarity" on substance intake | Dopaminergic reward pathways, social learning. | Does a D3 antagonist block socially facilitated consumption? |
| Change in Rate Parameters post-treatment | General social motivation, anxiety. | Does the compound affect the overall frequency of social interaction? |
Protocol 1: Longitudinal Data Collection for RSiena Analysis in Rodents Objective: To collect the repeated, time-structured social network and behavioral data required for stochastic actor-oriented modeling (SAOM). Materials: See "The Scientist's Toolkit" below. Procedure:
Protocol 2: RSiena Model Specification & Interpretation Workflow Objective: To analyze longitudinal data and test specific hypotheses about social influence and selection. Procedure:
myNetwork).drinking).getEffects().egoX, altX, or simX effects to test if behavior predicts tie formation.siena07() function. Monitor convergence (tconv.max < 0.25) and overall maximum convergence ratio (< 0.25). Revise model if necessary.p < 0.05) effects in the context of the hypothesis.drinking indicates social influence. A positive, significant simX effect indicates homophilous selection.sienaGOF() to test if the model adequately reproduces key network features (e.g., geodesic distances, triad census).Title: RSiena Analysis Experimental Workflow
Title: Core RSiena Parameter Groups & Questions
Table 3: Essential Research Reagents & Materials for Animal Social Network Studies
| Item | Function in Protocol | Example/Specification |
|---|---|---|
| Automated Video Tracking System | Enables continuous, undisturbed recording and positional data extraction for network construction. | EthoVision XT, ANY-maze, or BORIS (open-source). |
| Social Proximity Loggers | Automatically records close-range interactions in free-moving groups. | RFID proximity tags (e.g., SparkFun) or ultra-wideband (UWB) sensors. |
| RSiena Software Suite | The statistical environment for model specification, estimation, and simulation. | R packages RSiena, RSienaTest. |
| Behavioral Test Apparatus | Quantifies individual phenotypes that may co-evolve with the network. | Elevated Plus Maze, Sucrose/Ethanol 2-Bottle Choice setup, Open Field. |
| Pharmacological Agents | Probes neurobiological mechanisms of observed network effects. | Selective receptor agonists/antagonists (e.g., for oxytocin, dopamine, opioids). |
| Data Structuring Scripts | Converts raw observation/logger data into RSiena-ready adjacency matrices and attribute files. | Custom R or Python scripts for data preprocessing. |
1. Introduction This application note details the use of Stochastic Actor-Oriented Models (SAOMs), implemented via the RSiena software, to analyze the co-evolution of animal social networks and states (e.g., infection, behavioral trait). This work supports a broader thesis investigating the drivers of social dynamics in animal populations and the implications for pathogen spread and information transmission.
2. Core Quantitative Findings from Recent Studies Table 1: Summary of Key SAOM Model Estimates from Select Animal Network Studies
| Study Subject | Network Effect | Parameter Estimate | Standard Error | Significance (p < 0.05) | Interpretation |
|---|---|---|---|---|---|
| Bighorn Sheep (Pneumonia) | Outdegree (Density) | -1.52 | 0.21 | Yes | Networks are sparse; low tendency to form new contacts. |
| Reciprocity | 1.88 | 0.33 | Yes | Strong tendency for mutual contact formation. | |
| Disease → Sociality (Activity) | -0.45 | 0.18 | Yes | Infected individuals reduce social effort. | |
| House Finch (Mycoplasma) | Simple Concurrency | 2.10 | 0.41 | Yes | Individuals actively maintain multiple contacts. |
| Behavior (Feeder Use) Alters Network (Selection) | 0.67 | 0.15 | Yes | Individuals with similar feeder use associate more. | |
| Infection Influence (Average Similarity) | 1.25 | 0.29 | Yes | Tendency for infected individuals to associate with each other. | |
| Great Tit (Exploratory Behavior) | Social Network → Behavior (Contagion) | 0.31 | 0.09 | Yes | An individual's exploratory behavior is influenced by its network partners' behavior. |
| Behavior → Network (Selection) | 0.42 | 0.11 | Yes | Individuals with similar exploratory behavior form ties. |
3. Detailed Experimental Protocols
Protocol 1: Longitudinal Animal Social Network Data Collection for RSiena Analysis Objective: To collect the repeated network and attribute data required for SAOMs. Materials: GPS proximity loggers, RFID feeder systems, or direct observation equipment. Procedure:
Protocol 2: Executing a Basic SAOM for Social Contagion Analysis Objective: To model the joint evolution of a social network and a binary behavioral or disease state. Software: R with RSiena package installed. Procedure:
sienaDataCreate() to create a Siena data object, specifying the dependent network(s) and covariates.getEffects() to view available effects. Create an effects object and include key parameters:
avSim (average similarity for contagion), egoX (behavior effect on activity), altX (behavior effect on popularity).siena07() to estimate parameters. Use multiple sub-processes (nbrNodes) for efficiency and robustness.sienaGOF() on auxiliary statistics (e.g., indegree, outdegree, triad census distributions).avSim effect indicates social contagion. Significant egoX or altX indicates the state influences social tie formation.4. Visualization of SAOM Analytical Workflow
Title: RSiena SAOM Analysis Iterative Workflow
5. Signaling Pathways in Social Transmission
Title: Neuro-Behavioral Pathway for Social Transmission
6. The Scientist's Toolkit: Research Reagent Solutions
Table 2: Essential Materials for Social Transmission Field Studies
| Item / Solution | Function | Example Use Case |
|---|---|---|
| Miniaturized GPS/Proximity Loggers | Logs continuous spatiotemporal location or pairwise encounters between individuals. | Quantifying dynamic contact networks for disease transmission models in wildlife. |
| Automated RFID Feeder Systems | Records individual visits to resource points, providing data on association and foraging behavior. | Studying how shared resource use drives network structure and trait transmission. |
| Non-Invasive Biological Samplers (e.g., Buccal Swabs, Fecal Collectors) | Enables collection of DNA, hormones, or pathogen samples without disturbing the subject. | Genotyping for relatedness, stress hormone analysis, or pathogen surveillance. |
| RT-PCR Pathogen Detection Kits | Sensitive and specific detection of pathogen RNA/DNA from field-collected samples. | Determining infection status as a binary or continuous nodal covariate in SAOMs. |
| Behavioral Scoring Software (e.g., BORIS, EthoVision) | Facilitates systematic coding and analysis of observed behaviors from video footage. | Quantifying behavioral traits (e.g., boldness, exploration) for network-behavior co-evolution analysis. |
| RSiena R Package | Statistical software for the analysis of longitudinal network data using Stochastic Actor-Oriented Models. | Modeling the co-evolution of social networks and individual states (infection, behavior). |
Within RSiena software analysis of animal social networks, convergence failures indicated by t-ratios with absolute values greater than 0.1 signify that parameter estimates are insufficiently stable for reliable inference. This Application Note provides a structured protocol for diagnosing root causes and implementing solutions, critical for robust longitudinal network analysis in behavioral neuroscience and drug development research.
Table 1: Convergence Assessment Metrics in RSiena
| Metric | Target Value | Acceptable Range | Indication of Failure |
|---|---|---|---|
| Overall maximum t-ratio | < 0.1 | 0.1 - 0.2 (warning) | > 0.2 |
| Individual parameter t-ratio | < 0.1 | 0.11 - 0.15 | > 0.15 |
| Score test ratio | ~1.0 | 0.8 - 1.2 | < 0.8 or > 1.2 |
| Convergence ratio (Phase 3) | > 0.3 | 0.2 - 0.3 (borderline) | < 0.2 |
Table 2: Common Causes and Prevalence in Animal Network Studies
| Root Cause | Estimated Frequency (%) in Failed Models | Primary Diagnostic Check |
|---|---|---|
| Insufficient iterations (Phase 3) | 45% | Check n3 value & convergence ratio |
| Model misspecification | 25% | Score-type tests, goodness-of-fit (GOF) |
| Sparse/volatile network data | 15% | Examine Jaccard indices & period stability |
| Multicollinearity (rate/effects) | 10% | Correlation between parameter estimates |
| Algorithmic issues | 5% | Check standard deviations, eigenvalues |
siena07() with default parameters (n3=1000, seed=123).sienaFit object, record:
tconv: Maximum absolute t-ratio for convergence.theta).n3=3000 in sienaAlgorithmCreate() and re-estimate.n3 progressively to 5000, then 10000.n3 > 3000, use prevAns argument in siena07() to initialize from previous fit, aiding stability.siena07() with multiple random seeds (e.g., seed=c(123, 456, 789)). Consistent estimates across runs indicate robustness.sienaGOF() for out-degree, in-degree, and triad census distributions. Significant p-values (<0.05) indicate misspecification.(stable ties)/((ties at t1)+(ties at t2)-(stable ties)). Values <0.3 indicate excessive change, complicating estimation.fix argument in sienaAlgorithmCreate().sienaAlgorithmCreate(), set diagonalize=0.2 to reduce correlation between parameters.firstg=0.01 to prevent early iteration instability.sienaFit object. Very small eigenvalues (< 1e-6) suggest collinearity.siena07() with n3=2000, nsub=4 (multiple subphases) for final polishing.Title: RSiena Convergence Diagnosis Workflow
Table 3: Essential Computational Tools for RSiena Analysis
| Tool/Reagent | Function/Benefit | Example/Note |
|---|---|---|
| RSiena (v. 1.4-xx) | Core software for SAOM estimation. | Use latest version from CRAN for bug fixes. |
sienaAlgorithmCreate() |
Configures estimation parameters (n3, diagonalize). | Key function for implementing protocols. |
siena07() with prevAns |
Uses prior estimates for initialization, improving stability. | Critical for high-iteration runs. |
sienaGOF() |
Performs goodness-of-fit tests to diagnose model misspecification. | Uses Monte Carlo simulation. |
| Jaccard Index Script | Custom R code to calculate network stability between waves. | Identifies overly volatile data. |
| Multiple Random Seeds | Set of integers (e.g., 123, 456, 789) for seed argument. |
Tests estimation robustness. |
| High-Performance Computing (HPC) Cluster | Enables long runs (n3>5000) and multiple simulations. | Essential for large animal networks. |
| RStudio Projects | Organizes scripts, data, and outputs for reproducible workflow. | Maintains version control. |
1. Introduction Within the context of RSiena analysis of animal social networks, achieving adequate model fit is paramount for valid inference. Goodness-of-Fit (GoF) tests diagnose discrepancies between simulated networks from the estimated model and the observed longitudinal network data. This protocol details strategies to improve GoF, integral to a thesis on robust dynamic network analysis in behavioral ecology and translational drug development research, where network dynamics may reflect treatment effects.
2. Quantitative Summary of Common GoF Statistics Table 1: Key GoF Statistics in RSiena and Target Improvement Strategies
| GoF Statistic | Description | Target Range | Primary Improvement Strategy |
|---|---|---|---|
| Outdegree Distribution | Frequency of nodes with k outgoing ties. | Simulated distribution envelope contains observed. | Add outdegree activity (outTrunc) or outdegree popularity (outOutAss). |
| Indegree Distribution | Frequency of nodes with k incoming ties. | Simulated distribution envelope contains observed. | Add indegree activity (inTrunc) or indegree popularity (inInAss). |
| Geodesic Distance Distribution | Frequency of shortest path lengths. | Simulated mean/quantiles match observed. | Include triadic effects (transTrip, cycle3) or gravity effects. |
| Triad Census | Counts of all 16 possible directed triadic motifs. | Simulated counts match observed for key motifs. | Include specific triadic effects (transTrip, transRecTrip, cycle3). |
| Edge-wise Shared Partners | Distribution of ties by number of common neighbors. | Simulated distribution matches observed. | Add GWESP (geometrically weighted edgewise shared partners) effect. |
3. Protocol: Iterative GoF Diagnosis and Model Improvement
3.1. Materials & Research Reagent Solutions Table 2: Essential Toolkit for RSiena GoF Analysis
| Item | Function in Analysis |
|---|---|
| RSiena Software Suite (v.1.3-14 or later) | Core environment for SAOM estimation and simulation. |
sienaGOF() Function |
Primary routine for calculating GoF test statistics. |
Parallel Computing Cluster/foreach Package |
Enables computationally intensive GoF simulations. |
| Diagnostic Plot Scripts (ggplot2) | Visualizes distributions of observed vs. simulated statistics. |
| Theoretical Effect Glossary | Guide to candidate effects (e.g., GWESP, outTrunc) for model expansion. |
3.2. Step-by-Step Experimental Protocol
Step 1: Baseline Model Estimation
siena07() with adequate phase 2 and 3 iterations for convergence (max. ratio < 0.25, all t-convergence ratios < |0.1|).Step 2: Initial Goodness-of-Fit Assessment
gof <- sienaGOF(estimatedModel, varName, behaviorName = NULL, GOF, parallel = TRUE, nsim = 1000)GOF = ~ indegree + outdegree + geodesic.distance + triad.census for comprehensive diagnosis.plot(gof)Step 3: Diagnose Specific Misfit
Step 4. Model Expansion & Re-Estimation
outTrunc or outSqrt).Step 5. Iterative Re-testing
sienaGOF() on the new model.sienaGOF() tests (e.g., GOF = ~ idegree + odegree) for focused assessment post-modification.Step 6. Final Validation
sienaGOF output) for an overall fit measure (p > 0.05 indicates no significant misfit).4. Visualization of the GoF Improvement Workflow
Goodness-of-Fit Diagnostic & Improvement Cycle
Handling Sparse Networks and Missing Data in Animal Observations
In the broader thesis context of applying RSiena (Simulation Investigation for Empirical Network Analysis) to animal social networks, data quality is paramount. RSiena models the co-evolution of social networks and behaviors through longitudinal stochastic actor-oriented models (SAOM). A core challenge in ecological fieldwork is the inherent sparsity of observed interactions and frequent missing observation periods (e.g., due to animal absence, equipment failure). Sparse or gappy data can bias SAOM parameter estimates, leading to incorrect inferences about social dynamics, influence, and selection processes. These methodological insights are also critical for professionals in behavioral pharmacology and drug development, where accurate social network models may serve as biomarkers for treatment efficacy or social side effects.
Table 1: Common Causes and Impacts of Sparse/Missing Data in Animal Observation Studies
| Cause Category | Specific Example | Typical Incidence Range | Primary Impact on RSiena Analysis |
|---|---|---|---|
| Observation Gaps | Failed GPS/tag battery | 10-40% of scheduled fixes | Creates structurally missing waves, impeding longitudinal tie change assessment. |
| Individual Absence | Animal foraging out of range | 5-30% of individuals per wave | Leads to reduced node set, affecting density & degree estimates. |
| Sparse Interactions | Rare dominance encounters in cryptic species | Network Density < 0.05 | Challenges rate function estimation; increases standard errors. |
| Partial Observation | Limited field of view from camera traps | 15-50% of group unobserved | Results in censored, non-random missing ties, risking bias. |
Table 2: Comparison of Data Handling Methods for RSiena
| Method | Core Principle | RSiena Implementation | Advantages | Limitations |
|---|---|---|---|---|
| Multiple Imputation | Generates multiple plausible network versions. | sienaGOF with imputed data sets. |
Propagates uncertainty, provides robust CI. | Computationally intensive; assumes Missing at Random (MAR). |
| Structural Zeros | Defines impossible ties (e.g., across pens). | sienaDependent with structZero matrix. |
Realistically constrains solution space. | Does not address random or informative missingness. |
| Hypermodel (Bayesian) | Integrates uncertainty into estimation. | RSiena with Bayes option in sienaAlgorithm. |
Fully Bayesian handling of missing data. | High complexity, long run times. |
| Listwise Deletion | Removes nodes/periods with excessive missingness. | Manual preprocessing before creating sienaDependent. |
Simple, straightforward. | Loss of power, potential for severe bias. |
Protocol A: Pre-Collection Study Design to Minimize Missingness
RSimulate in RSiena) to generate networks under expected missing data regimes (e.g., 20% node loss). Determine the minimum observation intensity required for reliable parameter recovery.Protocol B: Post-Hoc Data Processing and Imputation for RSiena
btergm or ergm package in R to fit a TERGM on the observed panel of networks.siena data sets (sienaDataCreate) from the imputed networks and analyze using siena07 in parallel.mitools package).Protocol C: RSiena Analysis with Informed Priors for Sparse Networks
density (outdegree) and reciprocity in the objective function. This stabilizes the estimation of more complex effects.sienaAlgorithm with bayes=TRUE.transTrip), specify informative priors (e.g., N(0, 0.5)) derived from meta-analyses of similar species or pilot studies.sienaGOF to simulate networks from the fitted model. Compare distributions of geodesic distances, triad censuses, or edgewise shared partners between simulated and observed (with imputations) networks to validate the model's fit despite initial data sparsity.Diagram: Workflow for Handling Missing Data in RSiena Analysis
Title: RSiena Missing Data Workflow
Diagram: SAOM for Animal Networks with Missing Data Mechanisms
Title: Latent Network Model with Missing Data
Table 3: Essential Tools for Robust Animal Social Network Analysis
| Item / Solution | Function in Context of Sparse/Missing Data | Example Product / R Package |
|---|---|---|
| Programmable GPS Loggers | High-frequency spatial data allows proximity-based network inference, filling gaps in direct interaction logs. | OrniTrack-50, Movebank API. |
| Automated Camera Array | Provides continuous, multi-angle coverage to reduce individual absence artifacts. | Reconyx HyperFire 2, camtrapR package. |
| Passive Integrated Transponder (PIT) Systems | Logs all visits of tagged individuals to key nodes (feeders, burrows), generating complete association matrices. | Oregon RFID, feedr package. |
R aniSNA Suite |
Curated packages for diagnostics (asnipe), imputation (btergm), and RSiena wrapper (RSiena) workflows. |
asnipe, btergm, RSiena. |
| Network Simulation Framework | Generates synthetic data under known missingness regimes to test analysis robustness. | RSimulate (in RSiena), igraph functions. |
| High-Performance Computing (HPC) Access | Enables multiple imputation, Bayesian RSiena, and large simulation studies, which are computationally demanding. | Cloud platforms (AWS), university HPC clusters. |
Within the broader thesis on RSiena software for the analysis of animal social networks, managing model complexity is paramount for deriving biologically interpretable and statistically robust conclusions. The core principle is parsimony: selecting the simplest adequate model to explain observed social dynamics. Overly complex models risk overfitting, reduced generalizability, and obscured key drivers of behavior.
siena06() procedure to detect and mitigate correlation between effect statistics.sienaGOF() to test whether the simplified model adequately reproduces key network features (e.g., triad census, degree distribution).Table 1: Comparison of Complex vs. Parsimonious RSiena Model for Baboon Grooming Network
| Model Feature | Complex Model (All Effects Tested) | Parsimonious Final Model | Interpretation | ||||
|---|---|---|---|---|---|---|---|
| Rate Parameter (Period 1-2) | 8.45 (p=0.12) | 7.21 (p=0.08) | Stable estimated change opportunities. | ||||
| Outdegree (Density) | -2.10 | -2.05 | Consistent low baseline tie probability. | ||||
| Reciprocity | 1.85 | 1.80 | Strong, stable mutual grooming preference. | ||||
| Transitivity | 0.65 | 0.68 | Strong closure in grooming triangles. | ||||
| Age Homophily | 0.55* | 0.52* | Significant preference for similar-age partners. | ||||
| Dominance (Sender) | 0.30 (p=0.25) | -- | Effect excluded; not significant. | ||||
| Kinship | 1.15 | 1.20 | Strong preference for grooming relatives. | ||||
| Convergence t-ratios | All < | 0.1 | All < | 0.1 | Both models converged adequately. | ||
| Overall Maximum Convergence Ratio | 0.15 | 0.09 | Parsimonious model shows better convergence. | ||||
| Bayesian Information Criterion (BIC) | 1245.7 | 1232.3 | Lower BIC supports the parsimonious model. |
Note: * p < 0.05; * p < 0.01; -- effect not included.*
Objective: To build a parsimonious stochastic actor-oriented model (SAOM) explaining network dynamics.
Materials: RSiena software (v.1.3-14 or later), R environment, network data in matrix form, attribute data for covariates.
Procedure:
sna or igraph to inform plausible effects.sienaDependent() to specify the network panels. Use coCovar() or varCovar() for covariates. Combine with sienaDataCreate().getEffects().outdegree), reciprocity (recip), and transitivity (transTrip or transRecTrip).siena07(myAlgorithm, data=myData, effects=myEff).effFrom for a covariate sender effect, simX for homophily).siena06().Objective: To validate that a parsimonious RSiena model adequately captures the observed network's structure.
Procedure:
ans), select auxiliary network statistics. Common choices include indegree distribution, outdegree distribution, geodesic distance distribution, and triad census.sienaGOF(ans, auxiliaryFunction=triadCensus, verbose=TRUE) to simulate many networks based on the fitted model and calculate the chosen statistic for each.plot(gof.object)) and examine the p-value from the Monte Carlo test. A p-value > 0.05 (or > 0.10) suggests the model does not significantly misfit on that statistic.RSiena Model Simplification Workflow
Table 2: Essential Components for RSiena-based Animal Social Network Analysis
| Item | Function in Research | Example/Specification |
|---|---|---|
| Focal/Ad Libitum Behavioral Data | Raw input for constructing social networks. Defines ties (e.g., grooming, proximity). | Ethogram-defined interaction events, collected via focal animal sampling or automated tracking. |
| Individual Animal Covariates | Explanatory variables for network dynamics. | Age, sex, dominance rank index, genetic relatedness coefficient, hormone levels. |
| RSiena Software Suite | Core statistical environment for SAOM estimation and simulation. | R package RSiena (v.1.3-14+). Requires R (v.4.0.0+). |
| High-Performance Computing (HPC) Cluster | Facilitates model estimation, which is computationally intensive for large networks or many waves. | Access to multi-core nodes for parallelizing siena07 runs via siena07RunToConvergence. |
| Network Preprocessing Scripts | Converts raw observation data into adjacency matrices and covariate files for RSiena. | Custom R/Python scripts for data aggregation, matrix construction, and Jaccard index calculation. |
| Goodness-of-Fit Diagnostic Scripts | Validates the simplified model's ability to reproduce network features. | R scripts utilizing sienaGOF with custom auxiliary statistics (e.g., specific triad types). |
| Visualization Tools | Communicates network structure, change, and model results. | R packages igraph (for static plots) and networkDynamic (for dynamic visualization). |
Abstract: This Application Note provides a structured guide for managing and analyzing large-scale animal network data within the RSiena framework, a primary tool for longitudinal social network analysis. The protocols focus on computational efficiency, data integrity, and scalability, contextualized within a broader thesis on advancing animal social networks research for applications in behavioral ecology and translational drug development.
Efficient pre-processing is critical for handling large datasets. The following protocol standardizes data ingestion for RSiena.
Protocol 1.1: Data Compression and Formatting for RSiena
.csv files for networks (adjacency matrices) and covariates. Use non-numeric delimiters (e.g., "NA") for missing data.Matrix package in R before RSiena import. This reduces memory footprint.sienaDataCreate() function with the sparse = TRUE argument for network data. Store the returned siena object as an .RData file for rapid subsequent loading.Table 1: Performance Impact of Sparse Matrix Format
| Network Size (n) | Dense Matrix Memory (MB) | Sparse Matrix Memory (MB) | RSiena Model Creation Time (s) |
|---|---|---|---|
| 200 | 0.32 | 0.12 | 15.2 |
| 500 | 2.00 | 0.45 | 48.7 |
| 1000 | 8.00 | 1.20 | 185.3 |
| 2000 | 32.00 | 3.10 | 721.5 |
Note: Memory values are for one binary network matrix. Sparse matrix savings increase with higher sparsity.
RSiena estimation is computationally intensive. These protocols leverage parallel computing and algorithmic settings.
Protocol 2.1: Parallelized Parameter Estimation
siena07 backend. Example:
sienaAlgorithmCreate() function. Use projname to define unique output file names for parallel runs.siena07() with the cl = mycluster and useCluster = TRUE arguments. Always include returnDeps = TRUE for simulation-based diagnostics.sienaGroup() to aggregate results from multiple independent runs for robustness.Protocol 2.2: Intelligent Phase Specification
maxlike = TRUE (Method of Moments) to quickly approximate parameters.maxlike = FALSE (Maximum Likelihood Estimation) for accurate standard errors. This two-phase approach reduces total computation time.Title: RSiena Large-Scale Data Analysis Workflow
Robust diagnostics prevent model degeneracy and ensure reliability in large-scale inference.
Protocol 3.1: Simulation-Based Goodness-of-Fit (GoF)
sienaGOF() with the parallel = TRUE argument to simulate networks based on the fitted model.Table 2: Key Diagnostic Statistics and Acceptable Ranges
| Diagnostic Metric | Target Range | Interpretation | ||
|---|---|---|---|---|
| Overall Maximum Convergence Ratio (t-conv) | ( t < 0.25 ) | Indicates successful convergence of all parameters. | ||
| Individual t-ratios | ( | t | < 0.10 ) | Convergence for specific parameters. |
| Goodness-of-Fit p-value | ( p > 0.05 ) | Model adequately reproduces network structure. | ||
| Monte Carlo Standard Error | ( SE < | estimate | /2 ) | Parameter estimate is precise relative to its magnitude. |
Table 3: Essential Computational Tools for Large-Scale Animal Network Analysis
| Item/Software | Function & Application in Analysis |
|---|---|
| RSiena (v. 1.3-12 or later) | Core software for stochastic actor-oriented modeling (SAOM) of longitudinal network data. |
R Matrix Package |
Provides sparse matrix data structures to efficiently store and manipulate large, sparse networks. |
| Parallel (R core package) | Enables multi-core processing for RSiena estimation and GoF simulations, drastically reducing time. |
| snowFT Package | Provides fault-tolerant parallel computing, essential for long-running jobs on HPC clusters. |
| Graphviz | Used for visualizing complex model structures and derived network pathways (as in the diagram above). |
| RStudio Server Pro | Web-based IDE allowing researchers to run analyses on remote, powerful servers from any location. |
| Custom R Script Repository | Version-controlled collection of scripts for data formatting, batch RSiena runs, and result extraction. |
Title: Diagnosing and Solving RSiena Model Degeneracy
Application Notes and Protocols
This document outlines advanced protocols for investigating the coevolution of multiple social networks (e.g., proximity, aggression, grooming) and behavioral traits (e.g., boldness, hormone levels) within the framework of RSiena (Simulation Investigation for Empirical Network Analysis). This work forms part of a broader thesis on extending RSiena, a stochastic actor-oriented model (SAOM) framework designed for human networks, to the complexities of longitudinal animal social network data. These methods allow researchers and drug development professionals to model how social connections influence, and are influenced by, multiple individual phenotypes, a dynamic critical for understanding social behavior's biological underpinnings and potential pharmacological modulation.
1. Protocol: Longitudinal Multi-Network Data Collection for RSiena
Objective: To collect the time-structured, network and covariate data required for RSiena coevolution analysis. Materials: See "Research Reagent Solutions" table. Procedure: 1. Subject Identification & Baseline: Anesthetize and PIT-tag all individuals (N>50 recommended for power) in the study population. Record baseline covariates (e.g., body mass, sex, baseline GC from hair/fecal sample). 2. Observation Sessions: Conduct standardized behavioral observations (e.g., 10-minute scan samples every 30 minutes for 5 days) to construct networks for each type of social tie (e.g., proximity <1m, grooming, aggression). Use automated tracking (RFID, GPS) where possible for continuous proximity. 3. Temporal Waves: Repeat step 2 at defined intervals (e.g., 3-4 time points, t1, t2, t3), ensuring the interval is appropriate for the behavioral and physiological scales of interest (e.g., weeks for hormones, seasons for dominance). 4. Covariate Measurement: At each wave, collect individual-level state variables. For physiological measures like glucocorticoids (GC), conduct a controlled stress series (capture, restraint, blood sample at 0, 15, 30 min post-stress) on a subset or use non-invasive fecal sampling. Score behavioral traits via standardized assays (e.g., open field test for boldness). 5. Network Construction: For each wave and network type, create N x N adjacency matrices. For proximity, use a threshold (e.g., individuals recorded within 1m in >30% of scans per wave have a tie). Weighted or binary ties are acceptable for RSiena.
2. Protocol: RSiena Model Specification for Coevolution
Objective: To specify an SAOM that jointly models the coevolution of two networks and multiple behaviors.
Software: RSiena (version 1.4-4 or later) in R.
Procedure:
1. Data Preparation: Create dependent network objects (sienaNet) for each network (e.g., network.prox, network.groom). Create covariate objects (coCovar, varCovar, sienaCompositionChange) for actor attributes.
2. Model Specification:
- Define the basic rate functions for each network's change between waves.
- Specify structural effects for each network (e.g., outdegree, reciprocity, transitivity).
- Add cross-network effects (e.g., eval term: X -> Y models how ties in network X influence the formation of ties in network Y).
- Specify behavioral evolution using the sienaBehavior function. Include shape (tendency), linear/quadratic tendencies, and social influence effects (e.g., avSim for average similarity to alters).
- Include network-behavior coevolution effects:
* Selection: How a behavior influences tie formation (e.g., egoX, altX, simX).
* Influence: How network position influences behavioral change (e.g., totSim for total similarity).
3. Model Estimation: Use siena07 with Phase 2 and 3 iterations. Check convergence (t-ratios < |0.1|) and overall maximum convergence ratio (< 0.25).
4. Goodness-of-Fit: Use sienaGOF to test if the model adequately reproduces key network statistics (e.g., geodesic distances, triad census).
3. Protocol: Experimental Pharmacological Intervention within Coevolution Framework
Objective: To test the causal effect of a drug (e.g., a glucocorticoid receptor antagonist like Mifepristone) on social network-behavior coevolution. Materials: See "Research Reagent Solutions" table. Procedure: 1. Baseline Period (t1): Complete Protocol 1, Wave 1. 2. Randomized Assignment: Randomly assign subjects to Treatment (drug) and Vehicle control groups, matched for baseline network centrality and GC levels. 3. Treatment Administration: Administer drug or vehicle via appropriate route (e.g., oral, injection) at a defined dose and schedule that maintains target engagement over the observation period. 4. Post-Treatment Observation (t2): Repeat Protocol 1, Wave 2, during the treatment period. 5. Washout & Follow-up (t3): Cease treatment, allow for washout, and repeat Wave 3 to assess persistence of effects. 6. RSiena Analysis: Include a time-varying covariate for treatment group. Test interaction effects between treatment and key parameters (e.g., does treatment alter the strength of social influence on GC dynamics?).
Visualizations
Title: Workflow for Multi-Network Coevolution Study
Title: Drug Action on GC-Social Feedback Loop
Research Reagent Solutions
| Item | Function & Relevance to Research |
|---|---|
| Passive Integrated Transponder (PIT) Tags & Readers | Enables unique identification and automated, continuous recording of proximity/interactions for dynamic network construction. |
| Miniaturized Biologgers (GPS, Accelerometers) | Provides high-resolution spatial and activity data to infer behavioral states and social encounters beyond direct observation. |
| Glucocorticoid Receptor Antagonist (e.g., Mifepristone) | Pharmacological tool to block GR signaling, allowing causal tests of the HPA axis role in network-behavior coevolution. |
| Enzyme Immunoassay (EIA) Kits for Glucocorticoids | For quantifying fecal, salivary, or plasma GC concentrations as a key physiological covariate in coevolution models. |
| Automated Video Tracking Software (e.g., EthoVision, DeepLabCut) | Extracts precise locomotor and social interaction data from video, feeding into network matrices and behavioral scores. |
| RSiena Software (R package) | The core analytical platform for estimating stochastic actor-oriented models of network-behavior coevolution. |
| Standardized Behavioral Arenas (Open Field, Novel Object) | Used for consistent, repeated assays of personality traits (boldness, exploration) as evolving covariates in the model. |
Quantitative Data Summary: Key RSiena Effects in Coevolution Studies
| Effect Type | Parameter Name (RSiena) | Typical Estimate Range* | Interpretation in Animal Studies |
|---|---|---|---|
| Network Structure | Outdegree (density) | -1.5 to -3.0 | Baseline tendency to form ties (usually negative due to sparsity). |
| Reciprocity | 0.5 to 2.5 | Tendency for mutual ties. Strong in grooming/alliance networks. | |
| Transitive Triads | 0.1 to 0.5 | Tendency for friend-of-a-friend ties (closure). | |
| Cross-Network | Proximity -> Grooming | 0.2 to 1.0 | Proximity ties increase the likelihood of grooming ties forming. |
| Selection | Similarity on GC (simX) | 0.1 to 0.8 | Tendency to associate with others having similar GC levels. |
| Influence | Average Alter GC (avSim) | 0.05 to 0.4 | An individual's GC level changes toward the average of its alters. |
| Behavior Shape | GC Linear Tendency | -0.3 to 0.3 | Directional trend in GC level (negative = decrease over time). |
| GC Quadratic Tendency | -0.1 to -0.5 | Curvilinear trend (often negative, indicating reversion to mean). |
*Ranges are illustrative, derived from simulated and empirical examples in literature. Actual values depend on species, network definition, and timescale.
Within a doctoral thesis employing RSiena (Simulation Investigation for Empirical Network Analysis) for longitudinal animal social network research, internal validation is paramount. This analysis ensures the robustness, reliability, and reproducibility of the statistical models used to understand social dynamics, such as the evolution of dominance hierarchies or cooperative alliances. Two critical components of this validation are the use of Jaccard indices to assess model stability and comprehensive sensitivity analysis to probe the influence of modeling choices on inferred social parameters. These methods guard against overinterpretation of stochastic actor-oriented models (SAOMs) fitted with RSiena.
Jaccard Index in RSiena: The Jaccard index measures the stability of model estimation. For RSiena's Method of Moments (MoM) estimation, it quantifies the overlap between the observed network data and the networks simulated during the estimation process. A low Jaccard index for a given wave indicates poor convergence, suggesting the model cannot adequately simulate networks similar to the observed data, thus casting doubt on the parameter estimates.
Sensitivity Analysis: This involves systematically varying components of the analytical pipeline—such as network definition criteria (e.g., proximity interaction duration thresholds), model specification (inclusion/exclusion of specific effects), or data handling (treatment of missing data)—and observing the impact on the final parameter estimates and significance. In animal social network studies, this is crucial as biological conclusions (e.g., "this population shows strong social learning") must be resilient to methodological decisions.
Table 1: Interpretation of Jaccard Indices in RSiena
| Jaccard Index Range | Interpretation | Recommended Action |
|---|---|---|
| < 0.20 | Critical Instability. Estimation has failed. | Re-specify model, check data, increase simulation iterations (n3). |
| 0.20 - 0.30 | Poor Stability. Results are highly unreliable. | Increase n2 and n3 parameters substantially; consider model simplification. |
| 0.30 - 0.50 | Moderate Stability. Results are acceptable but interpret with caution. | Report indices. Minor model adjustments may improve stability. |
| 0.50 - 0.60 | Good Stability. Model convergence is satisfactory. | Standard for reliable inference. |
| > 0.60 | Excellent Stability. Very reliable convergence. | Ideal outcome. |
Table 2: Sensitivity Analysis Results Template (Hypothetical Study on Grooming Networks)
| Altered Condition | Key Parameter (e.g., "Transitivity") | Estimate (Original) | Estimate (Altered) | Change in Significance? | Conclusion on Robustness |
|---|---|---|---|---|---|
| Baseline Model | Transitivity | 0.65 (p<0.01) | — | — | Reference. |
| Threshold: 5 sec -> 10 sec | Transitivity | 0.65 (p<0.01) | 0.58 (p<0.05) | Yes (weakened) | Moderately robust; effect persists but is sensitive to definition. |
| Excluding juveniles | Transitivity | 0.65 (p<0.01) | 0.11 (p=0.62) | Yes (lost) | Not robust; effect driven by juvenile interactions. |
Alternative Convergence (n2=2000) |
Transitivity | 0.65 (p<0.01) | 0.63 (p<0.01) | No | Highly robust to estimation settings. |
Protocol 1: Calculating and Interpreting Jaccard Indices in RSiena
siena07() in RSiena. Ensure sufficient phase 2 and 3 iterations (e.g., n2=1000, n3=3000).Jaccard index for each wave is contained in the sf element of the sienaFit object. Access via fit$sf$Jaccard.n2=2000, n3=5000.Protocol 2: Conducting a Sensitivity Analysis for an RSiena-Based Animal Network Study
n2, n3, or the algorithm (MoM vs. Maximum Likelihood).RSiena Jaccard Validation Workflow
Sensitivity Analysis Protocol for RSiena
Table 3: Essential Materials and Tools for RSiena Internal Validation
| Item | Category | Function in Validation |
|---|---|---|
| RSiena Software (v.1.3-xx) | Core Software | The R package used to specify, estimate, and simulate Stochastic Actor-Oriented Models for longitudinal network data. |
| R/RStudio | Computing Environment | Provides the platform for running RSiena, managing data, and executing validation scripts. |
| High-Performance Computing (HPC) Cluster Access | Computational Resource | Sensitivity analysis requires numerous model re-fits; HPC drastically reduces computation time. |
| Structured Observation Data (e.g., HMM, GPS) | Primary Data | Raw behavioral or spatial data from which networks are constructed for RSiena analysis. |
| Custom R Scripts for Batch Processing | Software Tool | Automates the running of multiple RSiena models across sensitivity conditions, ensuring consistency. |
| Jaccard Index Extraction Function | Diagnostic Tool | A script (fit$sf$Jaccard) to retrieve stability metrics from fitted RSiena objects. |
| Results Aggregation Template (e.g., Table 2) | Data Management | A pre-formatted spreadsheet or R data frame to systematically record outcomes from all sensitivity runs. |
| Statistical Comparison Function | Analysis Tool | Scripts (e.g., using dplyr or base R) to calculate differences in estimates between baseline and sensitivity models. |
This document provides application notes and protocols for social network analysis (SNA) within animal social networks research, framed by a broader thesis investigating the utility of RSiena for modeling the dynamics of animal societies. While traditional static SNA tools (igraph, UCINET) describe network structure at a single point in time, RSiena enables the statistical modeling of network change and behavioral co-evolution, crucial for understanding the temporal dynamics underlying social structure, disease transmission, and information flow in animal populations. This comparative analysis is foundational for research into social determinants of health and behavior, with implications for epidemiological modeling and intervention strategies.
Table 1: Fundamental Comparison of RSiena and Static SNA Tools
| Feature | RSiena (Dynamic) | igraph / UCINET (Static) |
|---|---|---|
| Primary Purpose | Statistical modeling of network & behavior change over time. | Description, visualization, and hypothesis-testing of single or multiple static networks. |
| Data Requirement | Minimum of two (ideally more) network observations at discrete time points. | One or more cross-sectional network snapshots. |
| Key Output | Model parameters (rates, objective function) estimating selection & influence effects. | Network metrics (centrality, density, clustering), visualizations, statistical tests. |
| Underlying Framework | Stochastic actor-oriented models (SAOMs) based on utility maximization. | Graph theory & algebraic sociometry. |
| Temporal Inference | Direct, models the process of tie change. | Indirect, via comparison of separate snapshots. |
| Handling Missing Data | Explicit models for missing network ties. | Typically requires complete data or ad-hoc imputation. |
Objective: To collect robust longitudinal social interaction data suitable for both static and dynamic (RSiena) network analysis.
Materials:
Procedure:
t, create an adjacency matrix $X^t$, where $x_{ij}^t$ represents the strength or frequency of interaction between individuals i and j. Apply a consistent threshold or weighting scheme across waves.Objective: To characterize network structure at each observation wave and test for differences between groups or time points using static methods.
Workflow:
Static SNA Analysis Workflow (92 chars)
Procedure (using igraph in R):
Objective: To model the co-evolution of the social network and individual traits, testing specific hypotheses about social dynamics.
Workflow:
RSiena Dynamic Modeling Workflow (64 chars)
Procedure:
Table 2: Example RSiena Model Output Interpretation
| Parameter | Effect | Estimate (θ) | S.E. | p-value | Interpretation in Animal Context |
|---|---|---|---|---|---|
| Rate | Network rate (period 1) | 3.50 | 0.21 | <.001 | Animals evaluate/change ties ~3.5 times per period. |
| Structural | Transitive Triplets | 0.28 | 0.05 | <.001 | Preference for forming ties that close triangles (friends of friends). |
| Covariate | Health Similarity | 0.65 | 0.15 | <.001 | Animals associate with others of similar health status (homophily). |
| Behavior | Average Alter (Health) | 0.42 | 0.10 | <.001 | Social influence: an individual's health score tends toward its associates' average. |
Table 3: Essential Tools for Animal Social Network Analysis
| Item | Function & Relevance | Example/Detail |
|---|---|---|
| Automated Tracking | Enables high-resolution, continuous collection of spatial proximity data, foundational for network construction. | GPS collars, RFID stations, automated video tracking software (e.g., DeepLabCut, idTracker). |
| Ethogram Software | Standardizes behavioral coding to ensure reliable, replicable definition of social ties. | Noldus Observer XT, BORIS, or custom database solutions. |
| R Environment | Primary platform for analysis, integrating data management, static (igraph) and dynamic (RSiena) analysis. | RStudio with tidyverse, igraph, RSiena, statnet packages. |
| UCINET | User-friendly GUI for foundational static SNA, descriptive metrics, and permutation tests (MRQAP). | Includes NetDraw for visualization. Useful for initial exploratory analysis. |
| Permutation Test Frameworks | Provides non-parametric statistical tests for network hypotheses, controlling for non-independence. | igraph functions, sna::qaptest, or Netlogo simulation models. |
| High-Performance Computing (HPC) | Critical for computationally intensive RSiena models (large groups, many waves) and simulation-based GOF tests. | Cloud computing clusters or local servers with parallel processing capabilities. |
Within a thesis on RSiena software for animal social network analysis, a comparative evaluation of available longitudinal network models is essential. RSiena (Stochastic Actor-Oriented Model, SAOM) and the Temporal Exponential Random Graph Model (TERGM) with its Separable temporal ERGM (STERGM) extension represent two dominant paradigms for modeling network dynamics. This analysis details their theoretical distinctions, application notes, and protocols for researchers in behavioral ecology and translational sciences, where dynamic social interactions can inform models of disease transmission or treatment effects in animal cohorts.
Table 1: Core Theoretical & Practical Comparison
| Feature | RSiena (SAOM) | TERGM/STERGM |
|---|---|---|
| Philosophy | Actor-oriented; agents drive change via myopic stochastic optimization. | Network-oriented; discrete-time Markov process for the whole network. |
| Time Handling | Continuous-time; models unobserved micro-steps between observations. | Discrete-time; models the probability of a network at time t given time t-1. |
| Dependent Unit | Network change as sequences of tie additions/deletions by actors. | The state of the entire network at a discrete time point. |
| Parameter Interpretation | Rate (speed of change), Objective Function (preferences), Evaluation (uncertainty). | Network statistics (e.g., edges, triangles) predicting tie existence in a given time step. |
| Separation of Dynamics | Integrated in the micro-step process. | Explicitly separable in STERGM: Formation and dissolution parameters. |
| Primary Software | RSiena (R) |
btergm, ergm, networkDynamic (R) |
| Ideal Use Case | Testing hypotheses about individual social drivers (e.g., preference for transitive closure) with finely spaced observations. | Forecasting network evolution or testing structural hypotheses with clear discrete time intervals (e.g., annual censuses). |
Table 2: Illustrative Simulation Results (Hypothetical Data)
| Model | Simulated Effect (Edge) | Parameter Estimate (Mean) | Std. Error | p-value |
|---|---|---|---|---|
| RSiena | Density | -1.50 | 0.15 | <0.001 |
| Transitivity | 0.60 | 0.08 | <0.001 | |
| Rate Period 1 | 6.20 | 0.85 | -- | |
| STERGM | Formation: Edges | -2.10 | 0.22 | <0.001 |
| Formation: Triangles | 0.45 | 0.10 | <0.001 | |
| Dissolution: Edges | 1.80 | 0.20 | <0.001 |
Protocol A: RSiena Model Fitting for Longitudinal Animal Association Data
sienaDataCreate() to specify networks, covariates (e.g., age, dominance rank), and constant/ changing actor variables.sienaAlgorithmCreate(). Include structural effects (e.g., transTrip, cycle3, inPop) and covariate effects (egoX, altX, simX).siena07() with appropriate Monte Carlo iterations. Check convergence (t-ratios < |0.1|) and overall goodness-of-fit (sienaGOF).Protocol B: TERGM/STERGM Application for Seasonal Network Dynamics
network objects and a list of matrices.ergm package, specify separate formation and dissolution formulae (e.g., formation = ~edges + triangles, dissolution = ~edges + offset(edges)).stergm() or the more recent tergm estimation functions (GMM or MCMC). For TERGM, use btergm() with bootstrap confidence intervals.simulate.stergm to simulate future networks. Compare forecasted networks to held-out data for validation.Diagram 1: Model Selection Workflow
Table 3: Key Research Reagents & Computational Tools
| Item | Function in Analysis | Example/Note |
|---|---|---|
| High-Resolution Tracking System (e.g., RFID, GPS) | Generates raw proximity/association data for network edge definition. | Used to create adjacency matrices for each observation wave. |
| Behavioral Coding Ethogram | Standardized catalog of social behaviors (aggression, affiliation). | Provides covariates (actor attributes) or alternative network definitions. |
| R Statistical Environment | Primary computational platform for all analyzed models. | Versions 4.3.0+. Essential for reproducibility. |
RSiena Package (v. 1.4.0.1+) |
Implements the Stochastic Actor-Oriented Model (SAOM). | Core tool for RSiena analysis; requires sna, network packages. |
statnet Suite (ergm, tergm, etc.) |
Provides tools for TERGM/STERGM estimation, simulation, and diagnostics. | btergm package also commonly used for TERGM estimation. |
| High-Performance Computing (HPC) Cluster Access | Facilitates bootstrapping and MCMC estimation for large/complex models. | Critical for model convergence with large animal cohorts or dense networks. |
Network Visualization Software (e.g., Gephi, ndtv) |
Enables graphical representation of dynamic networks for hypothesis generation and result presentation. | Aids in communicating temporal patterns to interdisciplinary teams. |
1. Application Notes & Data Synthesis
The application of RSiena (Simulation Investigation for Empirical Network Analysis) in primatology and rodent behavioral research provides a robust framework for analyzing the co-evolution of social networks and individual traits. The following tables summarize quantitative findings from key validation studies.
Table 1: Validated RSiena Model Effects in Primate Studies
| Study Subject (Reference) | Key Network Dynamics Effect (Rate, Evaluation, Selection) | Parameter Estimate (θ) | Convergent t-ratio | Key Covariate (Behavior) | Interpretation |
|---|---|---|---|---|---|
| Rhesus Macaques (2023) | Friendship Formation (Density) | -1.75 | 0.12 | Grooming (Behavior) | Baseline propensity for tie formation is negative, requiring other effects to drive connections. |
| Transitivity (transTrip) | 0.58 | 0.05 | -- | Strong tendency for triadic closure ("friend of a friend becomes a friend"). | |
| Selection by Dominance Rank (egoX, altX, simX) | 0.45 (simX) | -0.10 | Dominance Rank | Individuals prefer ties with others of similar dominance rank (homophily). | |
| Chimpanzees (2022) | Rate of Network Change (period 1-2) | 4.32 | 0.08 | Cooperative Hunting | Network structure is highly dynamic between observation periods. |
| Reciprocity (recip) | 0.87 | -0.15 | -- | Strong mutual tie formation. | |
| Influence on Hunting Participation (fromX) | 0.39 | 0.11 | Hunting Participation | Social partners influence an individual's likelihood to participate in hunts. |
Table 2: Validated RSiena Model Effects in Rodent (e.g., Rat) Pharmacological Studies
| Study Design (Reference) | Key Network Dynamics Effect | Parameter Estimate (θ) | Convergent t-ratio | Pharmacological Covariate | Key Finding |
|---|---|---|---|---|---|
| Social Defeat Model (2024) | Rate of Interaction Change (post-stress) | 5.21 | 0.09 | Fluoxetine (SSRI) Treatment | Stress increases network volatility. |
| Outdegree (Density) Post-Stress | -2.15 | 0.14 | -- | Stressed animals exhibit fewer social ties overall. | |
| Behavioral Alteration: Social Withdrawal (avgAlt) | -0.62 | -0.08 | Drug vs. Vehicle | Treated animals showed less influence from withdrawn partners. | |
| Novel Drug Screening (2023) | Pro-social Effect (egoX) | 0.78 | -0.12 | Candidate Drug 'X' (vs. Placebo) | Drug administration directly increased individual pro-social activity. |
| Homophily by Activity State (simX) | 0.51 | 0.06 | Locomotor Activity | Active individuals preferentially associate. |
2. Experimental Protocols
Protocol 1: Longitudinal Primate Social Network Data Collection for RSiena Objective: To collect structured longitudinal data suitable for RSiena analysis of wild or captive primate groups.
Protocol 2: Rodent Social Interaction Network Assay for Pharmacological Intervention Objective: To generate dynamic network data to test drug effects on social behavior using RSiena.
myModel <- sienaDependent(arrayOfNetworks). Include drug condition as a constant covariate and activity level as a changing covariate. Test for selection (drug influences network structure) and influence (network position affects behavioral change) effects.3. Mandatory Visualizations
Primate RSiena Longitudinal Analysis Workflow
RSiena Drug Effect Modeling Pathways
4. The Scientist's Toolkit: Research Reagent Solutions
| Item | Function in RSiena Animal Research |
|---|---|
| RFID Tracking System | Enables automated, continuous recording of social proximity (a network tie definition) with high temporal resolution for multiple subjects. |
| EthoVision XT or similar | Video tracking software for automated quantification of locomotor activity and zone occupancy, used to create behavioral covariates. |
| BORIS (Behavioral Observation Research Interactive Software) | Free, open-source tool for manual coding of complex social interactions from video to validate/refine automated network data. |
R Statistical Environment with RSiena & RSienaTest packages |
Core software for model specification, estimation, and goodness-of-fit testing of longitudinal network models. |
sna or igraph R packages |
For preliminary network visualization, descriptive statistic calculation (e.g., centrality, density) before RSiena analysis. |
| Standardized Housing (Visible Burrow/Complex Arena) | Creates a semi-naturalistic environment that allows expression of a full repertoire of social behaviors critical for valid network construction. |
| Salivary/Plasma Hormone Assay Kits (CORT, OT) | To measure physiological covariates (stress, bonding) that can act as selection or influence variables in the social network. |
| Automated Data Pipeline Scripts (Python/R) | Custom scripts to transform raw RFID/tracking logs into formatted adjacency matrices and covariate files required by RSiena. |
Application Notes
The investigation of animal social networks using RSiena (Simulation Investigation for Empirical Network Analysis) is pivotal for understanding behavioral contagion, social support phenotypes, and intervention outcomes in biomedical research. Assessing the robustness and replicability of findings from stochastic actor-oriented models (SAOMs) is critical before translating insights into, for example, pharmacokinetic models of socially modulated drug efficacy. These notes provide a framework for such assessment within a thesis on RSiena-based animal social network research.
Table 1: Key SAOM Parameter Diagnostics for Robustness Assessment
| Diagnostic Metric | Target Value/Range | Indication of Robustness | Implication for Replicability |
|---|---|---|---|
| Convergence t-ratio (all parameters) | Absolute value < 0.1 | Good model convergence. | Estimated parameters are reliable and less dependent on initial simulation values. |
| Overall Maximum Convergence Ratio | < 0.25 | Acceptable convergence of the entire model. | The simulation algorithm has properly explored the parameter space. |
| Score Test (Overall) | p > 0.10 | No significant omitted structural effects. | Model specification is adequate, reducing specification error risk in replication. |
| Period-specific sienaTimeTest p-value | p > 0.05 (Bonferroni-corrected) | Parameter estimates are stable across time. | Findings are not driven by a single observational period, aiding temporal replicability. |
Table 2: Quantitative Replicability Framework for Network Dynamics
| Replication Tier | Data Requirement | Analytical Requirement | Success Criterion (Example) |
|---|---|---|---|
| Direct Replication | Same species, strain, housing, observation protocol. | Identical RSiena model specification & estimation settings. | Parameter signs match, and estimates fall within 95% confidence intervals of original study. |
| Conceptual Replication | Different cohort or lab of same species; similar ethogram. | Test same behavioral theories (e.g., transitivity, homophily) in SAOM. | Significant effects for the same network mechanisms (e.g., transitive triplets) are reproduced. |
| Generalization | Different species or major ecological context. | RSiena models adapted to new network structures/dependent behaviors. | Core social process (e.g., status-driven hierarchy formation) is conserved in dynamics. |
Experimental Protocols
Protocol 1: Robustness Check Suite for a Single RSiena Analysis
siena07 with default settings and at least 4,000 iterations. Store the resulting sienaFit object (fit1).siena07 output or summary(fit1). Flag parameters with |t-ratio| > 0.1.fit1 using sienaGOF.
b. Calculate auxiliary statistics (e.g., geodesic distances, triad census) for observed and simulated networks.
c. Perform a Monte Carlo test (plot.sienaGOF) to check for significant discrepancies (p < 0.05).fit1. Use this to generate confidence intervals for robustness.Protocol 2: Multi-Cohort Replication Analysis Workflow
fit_A, fit_B, fit_C.Visualizations
Title: SAOM Robustness Assessment Protocol Workflow
Title: Multi-Cohort Replicability Analysis Framework
The Scientist's Toolkit: Research Reagent Solutions
Table 3: Essential Materials for RSiena-Based Animal Social Network Research
| Item | Function in Research |
|---|---|
| Automated Behavioral Tracking System (e.g., RFID, video-based pose estimation) | Generates high-resolution, longitudinal interaction data with minimal observer interference, forming the raw input for network matrices. |
| Ethogram & Behavioral Coding Protocol | Standardized operational definitions of social interactions (edges) to ensure construct validity and cross-study comparability. |
R Statistical Environment with RSiena Package |
Core software for specifying, estimating, diagnosing, and simulating Stochastic Actor-Oriented Models (SAOMs). |
| Pre-registration Template (e.g., on OSF) | Document for specifying hypotheses, network measures, SAOM effects, and analysis plans prior to data collection to confirmatory replication. |
| High-Performance Computing (HPC) Cluster Access | Facilitates running large numbers of RSiena simulations (for GoF, bootstrapping) and complex multi-group models in a feasible timeframe. |
Standardized Data Exchange Format (e.g., RDS files for network arrays) |
Ensures exact replication of data input across research teams and simplifies sharing for secondary reproducibility analyses. |
Integrating RSiena Results with Other Omics or Environmental Data
Application Notes
Social network dynamics analyzed via RSiena (Simulation Investigation for Empirical Network Analysis) provide a powerful quantitative framework for understanding social influence, selection processes, and behavioral contagion in animal models. These dynamics are a critical phenotype, often acting as both a driver and an outcome of underlying biological states. Integrating RSiena-derived parameters with other omics layers or environmental variables enables researchers to construct mechanistic models linking social behavior to molecular pathways, environmental stressors, and, ultimately, translational outcomes in drug development.
Key RSiena output parameters suitable for integration include:
This multi-modal integration faces challenges: temporal alignment of longitudinal network snapshots with omics sampling, distinguishing correlation from causation, and the high-dimensional nature of omics data. The protocols below outline structured approaches to overcome these hurdles.
Table 1: Key RSiena Output Parameters for Multi-Omics Integration
| Parameter Type | Example | Biological/Environmental Integration Hypothesis |
|---|---|---|
| Network Rate | Change frequency per period | Correlates with glucocorticoid levels or environmental volatility. |
| Network Effect | Strength of transitivity | Associated with gene expression profiles related to social bonding (e.g., oxytocin pathway). |
| Behavior Effect | Average similarity effect | May moderate the effect of a pharmacologic agent on behavioral spread. |
| Covariate Effect | Sex alter effect | Interaction with sex-hormone related metabolomic signatures. |
Protocol 1: Correlating RSiena Parameters with Bulk Transcriptomic Data
Objective: To identify gene expression pathways associated with individual-level social network tendencies.
Materials & Reagents:
limma, WGCNA, sienaRI.Procedure:
limma), regressing gene expression against the score while controlling for technical batches and biological covariates.The Scientist's Toolkit: Key Research Reagent Solutions
| Item | Function in Integration Research |
|---|---|
| Passive Integrated Transponder (PIT) Tags | Enables automated, longitudinal recording of social proximity or associations in free-moving animals. |
| sienaRI R Package | Calculates residual indices of individual positioning in networks, providing quantitative traits for correlation. |
| Nucleic Acid Crosslinking Reagents (e.g., formaldehyde) | For stabilizing protein-RNA complexes in subsequent CLIP-seq protocols when investigating molecular mediators of social behavior. |
| Single-Cell RNA-Seq Kit (e.g., 10x Genomics) | To dissect cell-type-specific transcriptional responses linked to network position. |
| LC-MS/MS Metabolomics Platform | For profiling global metabolomic shifts associated with social stress or dominance derived from RSiena models. |
Protocol 2: Testing Environmental Moderation of Social Influence Processes
Objective: To test if an environmental variable (e.g., drug treatment, resource scarcity) moderates the strength of behavioral contagion in the network.
Materials & Reagents:
Procedure:
Drug) or control (Vehicle) groups within a single social group or across replicated groups.sienaTimeTest function to check if the effect of treatment varies over time.Diagram 1: RSiena-Omics Integration Workflow
Diagram 2: Testing Environmental Moderation in RSiena
RSiena provides an unparalleled statistical framework for uncovering the complex, time-dependent social processes that govern animal societies, offering critical insights for biomedical research. By mastering its foundational logic, methodological workflow, troubleshooting techniques, and validation practices, researchers can robustly model phenomena such as social contagion of stress, transmission of infectious agents, or the spread of learned behaviors in preclinical models. The future lies in integrating these dynamic network models with neurobiological, pharmacological, and genomic data, paving the way for a more holistic understanding of how social dynamics influence health, disease progression, and treatment efficacy. This approach promises to refine animal models, enhance translational relevance, and ultimately inform interventions targeting socially modulated pathways in human health.