Dynamic Animal Social Networks: A Comprehensive Guide to RSiena Analysis for Biomedical Researchers

Penelope Butler Feb 02, 2026 498

This article provides a detailed guide for researchers and biomedical professionals on using RSiena software to analyze dynamic animal social networks.

Dynamic Animal Social Networks: A Comprehensive Guide to RSiena Analysis for Biomedical Researchers

Abstract

This article provides a detailed guide for researchers and biomedical professionals on using RSiena software to analyze dynamic animal social networks. It covers foundational concepts of social network analysis (SNA) in animal models, explores RSiena's methodological framework for longitudinal data, offers troubleshooting and optimization strategies for complex models, and validates findings through comparative analysis. The guide connects these techniques to applications in behavioral neuroscience, disease transmission modeling, and preclinical drug development, offering a robust resource for integrating sophisticated network dynamics into biomedical research.

From Animal Societies to Networks: Foundational Concepts for RSiena Analysis

Why Animal Social Networks Matter in Biomedical Research

Application Notes

Animal social network analysis (SNA) moves beyond individual-focused studies to quantify the structure, dynamics, and consequences of social relationships within animal groups. In biomedical research, this approach provides critical insights into disease transmission, mental health, neurobiology, and therapeutic efficacy. The RSiena software (Simulation Investigation for Empirical Network Analysis) is a gold-standard tool for longitudinal SNA, enabling researchers to model the co-evolution of social networks and individual behaviors or traits over time.

Key Applications:

  • Infectious Disease Modeling: SNA quantifies contact heterogeneity, identifying super-spreaders and predicting outbreak trajectories more accurately than traditional models.
  • Neuropsychiatric & Neurodegenerative Research: Social behavior deficits are core symptoms in disorders like autism, depression, and Alzheimer's. SNA provides quantifiable, ethologically relevant endpoints for testing interventions in animal models.
  • Social Stress & Resilience: Network position (e.g., isolation, subordination) modulates stress physiology. RSiena can analyze how stress drives network changes and vice versa.
  • Pharmaco-ethology: Evaluates how candidate drugs or compounds alter social structure and collective behavior, offering a systems-level view of drug effects.

Quantitative Data Summary:

Table 1: Key Metrics in Animal Social Network Analysis and Their Biomedical Relevance

Network Metric Definition Biomedical Relevance
Degree Centrality Number of direct connections an individual has. Identifies highly connected individuals (potential super-spreaders for pathogens/information).
Betweenness Centrality Number of shortest paths that pass through an individual. Identifies potential "brokers" in a network critical for information flow or pathogen containment.
Clustering Coefficient Measure of how interconnected an individual's connections are. Indicates subgroup (clique) formation; relevant for subpopulation-level disease dynamics.
Network Density Proportion of possible connections that are realized. Overall group cohesion; can correlate with disease transmission rate or collective resilience.
Assortativity Tendency for individuals to connect with similar others (e.g., by health status). Drives disease spread; healthy-sick disassortativity can slow epidemics.

Table 2: RSiena Model Parameters for Co-evolution Analysis

Effect Type (Parameter) Interpretation in Biomedical Context
Outdegree (Density) Baseline propensity to form ties. Can be influenced by general health or drug treatment.
Reciprocity Tendency to reciprocate social ties. May be impaired in social disorders.
Transitivity Friends of friends become friends. Measures closure & subgroup stability.
Behavior Similarity Tendency to connect with others who have similar behavioral scores (e.g., anxiety level). Can model homophily based on disease state or symptom severity.
Behavior Influence Tendency to adjust one's own behavior to match that of social connections. Models behavioral contagion (e.g., depressive-like behaviors).
Network-Behavior Selection How an individual's behavior affects their attractiveness as a social partner. e.g., How sickness behavior alters social integration.

Experimental Protocols

Protocol 1: Longitudinal Social Network Data Collection for RSiena Analysis in Rodents

Objective: To collect repeated, structured observations of social interactions suitable for dynamic network analysis with RSiena.

Materials: See "Research Reagent Solutions" below. Animals: Group-housed rodents (e.g., mice, rats). Minimum 3-4 time points (T1, T2, T3...). Procedure:

  • Habituation: Acclimate animals to the observation arena/environment and human presence for 5-7 days.
  • Unique Identification: Mark all animals with a unique, non-invasive code (e.g., ear punch, fur dye, RFID tag).
  • Behavioral Ethogram Definition: Define and validate interaction types (e.g., allogrooming, huddling, nose-to-nose contact, aggressive bout).
  • Observation Sessions: a. At each time point (e.g., weekly), record group interactions using overhead cameras. b. Conduct continuous focal sampling or structured scan sampling (e.g., every 30 seconds) for 60-90 minutes per session. c. Record initiator, receiver, and behavior type for each interaction.
  • Data Matrix Creation: For each behavior and time point, create an N x N adjacency matrix, where N is group size. Cell ij contains the frequency or duration of interaction from individual i to j.
  • Covariate Data: Collect individual-level covariates (e.g., body weight, tumor size, behavioral test scores, glucocorticoid levels) at each time point.
  • Data Formatting for RSiena: Format matrices and covariates into RSiena-compatible files (.txt or .csv).
Protocol 2: Integrating SNA with a Disease Challenge Paradigm

Objective: To model how pre-existing social network structure predicts disease outcomes.

Materials: See "Research Reagent Solutions." Includes pathogen (e.g., influenza virus, Mycoplasma pulmonis). Procedure:

  • Baseline Network Mapping: Perform Protocol 1 (T1) to establish the pre-infection social network.
  • Targeted Infection: Infect a random subset (e.g., 30%) of animals with a defined pathogen dose.
  • Longitudinal Monitoring: a. Continue network observations (T2, T3...) post-infection to track network changes. b. Record individual disease severity (e.g., weight loss, symptom score, viral load) at each time point.
  • RSiena Analysis: a. Model the network dynamics (using effects from Table 2). b. Include "disease status/severity" as a behavioral or covariate variable. c. Test for effects like "disease similarity" (do sick animals associate more?) or "disease influence" (does proximity to sick animals increase severity?). d. Statistically test if baseline network centrality (e.g., T1 degree) predicts subsequent disease severity.
Protocol 3: Testing Pharmacological Intervention in a Social Network Context

Objective: To evaluate if a drug alters social network structure or dynamics.

Materials: Includes candidate drug and vehicle control. Procedure:

  • Pre-treatment Baseline: Perform Protocol 1 (T1) with all animals.
  • Treatment Regime: Administer drug or vehicle to all subjects (or a targeted subgroup) chronically or acutely.
  • Post-treatment Network Assessment: Perform network observations at defined intervals post-treatment (T2, T3...).
  • RSiena Analysis: a. Model network co-evolution across T1->T2->T3. b. Include a static or changing covariate for "treatment group." c. Test for interactions between treatment and network parameters (e.g., does the drug increase overall network density? Does it reduce the effect of anxiety on social isolation?). d. Compare model fit between a model where treatment affects social selection/influence and one where it does not.

Diagrams

Title: RSiena Analysis Workflow for Pharmaco-social Research

Title: Social Stress-Neurobiology-Network Feedback Pathway

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for Animal Social Network Research

Item Function & Relevance
RFID Tracking System Provides automated, high-resolution spatiotemporal proximity data for network edge definition. Superior to manual observation for frequency/duration.
Machine Learning Ethology Software Tools like DeepLabCut or SLEAP automate pose estimation and behavior classification from video, standardizing interaction detection.
RSiena Software (R package) The core analytical tool for statistically testing hypotheses about longitudinal network and behavior co-evolution.
SocialBox or Custom Arena Controlled, standardized environment for group observations with clear sightlines for recording.
Non-invasive Biomarker Kits For measuring fecal glucocorticoid metabolites or urinary cytokines as stress/immune covariates linked to network position.
Pathogen Challenge Stocks Defined microbial agents (virus, bacteria) to investigate network-driven disease transmission and susceptibility.
Pharmacological Agents Anxiolytics, antidepressants, or novel compounds to test modulation of social network dynamics.
Data Formatting Scripts (Python/R) Custom code to convert raw observation or tracking data into RSiena-ready adjacency matrices and covariate files.

Application Notes

RSiena (Simulation Investigation for Empirical Network Analysis) is a statistical software package for the analysis of longitudinal network data, implemented as an R package. Within animal social networks research, it provides the methodological rigor to model the co-evolution of social structure and individual behaviors or traits over time. This is critical for understanding phenomena like hierarchy formation, information diffusion, and the impact of pharmacological interventions on group dynamics.

Key Quantitative Findings from Recent Studies (2022-2024):

Table 1: Summary of RSiena Application in Recent Animal Social Network Studies

Study Focus (Species) Key Network Effect Modeled Rate Function (Avg. Change Opp./Wave) Behavioral Influence (Avg. Parameter Est.) Primary Finding
Pharmacological Intervention (Mice) Density & Transitive Triads 4.2 Peer influence on activity: 0.68* Drug X reduced social selectivity (density ↑).
Hierarchy Stability (Primates) Preferential Attachment & Reciprocity 3.8 Status on friendship: 0.52* Dominance networks show high structural inertia.
Information Diffusion (Birds) Behavior Diffusion & Network Closure 5.1 Adoption via contacts: 1.21* Novel foraging technique spread via strong ties.

(p<0.05, **p<0.001)

Experimental Protocols

Protocol 1: Longitudinal Animal Social Network Data Collection for RSiena

Objective: To collect repeated measures of social interactions suitable for RSiena's discrete-time wave structure.

  • Subject & Grouping: House study cohort (e.g., 30 subjects) in a stable, controlled environment. Allow for a 7-day acclimatization period.
  • Interaction Definition: Precisely define the relational tie (e.g., "proximity within 1m," "allogrooming," "aggressive contact").
  • Observation Waves: Conduct structured observations (e.g., focal sampling or continuous monitoring) at discrete time points (T1, T2, T3). Waves should be spaced to allow for potential network change (e.g., 1 week apart).
  • Adjacency Matrix Creation: For each wave, construct a 30x30 matrix where cell Xij = 1 if actor i directed a tie to actor j during that observation period, else 0.
  • Covariate Data: Simultaneously record individual attributes (e.g., rank, weight, treatment group, hormone levels) for each wave.
  • Data Formatting: Prepare three R data frames: network (an array of adjacency matrices), attributes (a matrix of covariates), and behavior (a matrix of time-varying behavioral scores).

Protocol 2: RSiena Model Specification, Estimation, and Interpretation

Objective: To analyze the dynamics of the collected network and behavior data.

  • Data Import & Siena Object Creation:

  • Model Specification:

  • Model Estimation:

  • Diagnostics & Interpretation:
    • Check convergence: All t-ratios for convergence should be < |0.1|.
    • Examine ans$theta for final parameter estimates and significance.
    • Use sienaGOF to perform goodness-of-fit tests for structural parameters.

Visualizations

RSiena Dynamic Network Analysis Workflow

Drug Effects on Network & Behavior Coevolution

The Scientist's Toolkit: Key Reagents & Materials

Table 2: Essential Research Tools for RSiena-Based Animal Social Network Studies

Item / Solution Function in Research
RSiena R Package Core software for statistical modeling of network dynamics.
RStudio IDE Integrated development environment for scripting and analysis.
Automated Tracking System (e.g., EthoVision) High-resolution, continuous collection of animal proximity & movement data.
RFID Tag & Reader System Automated logging of dyadic interactions at feeders, nests, etc.
Behavioral Coding Software (e.g., BORIS) For precise manual annotation of complex social interactions from video.
Statistical Covariates Database Curated records of individual animal attributes (health, genotype, treatment history).
High-Performance Computing (HPC) Access For running large, complex RSiena models with many parameters.

Application Notes: RSiena for Animal Social Networks

1. Core Conceptual Framework In RSiena (Simulation Investigation for Empirical Network Analysis) applied to animal social networks, the fundamental units are:

  • Actors: Individual animals within a study population (e.g., a chimpanzee, a bird, a fish).
  • Network Ties: The directed or undirected social connections between actors. These are the dependent variables modeled over time. Examples include proximity, grooming, aggression, or alliance partnerships.
  • Longitudinal Data Structure: Requires a minimum of two (ideally more) waves of identically measured network and attribute data across the same set of actors. This enables modeling the co-evolution of network structure and individual behavioral or trait covariates.

2. Quantitative Data Structure Summary

Table 1: Essential Data Structure for RSiena Analysis of Animal Social Networks

Data Component Format Description Example in Animal Research
Network Waves Array of adjacency matrices (n x n) Square matrices for each observation time point. Cell ij records the tie from actor i to actor j. Matrix of grooming bouts observed in Time 1, Time 2, Time 3.
Actor Covariates Matrix or vector per wave Time-constant (e.g., sex) or time-varying (e.g., dominance rank, hormone level) attributes for each actor i. A vector of dominance index scores for each individual at each time point.
Dyadic Covariates Constant matrix or array Pair-level invariant characteristics (e.g., genetic relatedness, home range overlap). A matrix of kinship coefficients for each actor pair.

Table 2: Common RSiena Effect Parameters in Animal Social Networks

Effect Type Parameter Name Interpretation Biological/Social Process Tested
Structural Outdegree (Density) Baseline propensity to form ties. General sociability.
Structural Reciprocity Tendency to reciprocate ties. Mutual cooperation, bond maintenance.
Structural Transitivity (GWESP) Preference for forming triangles. Closure in alliances or coalitions.
Covariate-related Ego, Alter, Same Influence of an actor's attribute on sending/receiving ties, or homophily. Do high-ranking individuals (ego) groom more? Do individuals groom others of similar age (same)?
Behavioral Behavioral Rate & Shape Dynamics of an ordinal behavior covariate. How does a new foraging skill diffuse?
Co-evolution Average Similarity Network-behavior feedback. Do animals become more similar in behavior to their social partners?

Experimental Protocols

Protocol 1: Longitudinal Data Collection for Wild Animal Networks

  • Objective: Collect multi-wave social interaction data suitable for RSiena analysis.
  • Methodology:
    • Actor Identification: Uniquely identify all individuals in the study population via tags, markings, or natural markings. Maintain a consistent ID list across all waves.
    • Tie Definition & Operationalization: Precisely define the network tie (e.g., "grooming tie exists if individual A grooms B for ≥ 30 seconds in a 10-minute focal sample").
    • Sampling Design: Implement structured observation protocols (e.g., focal animal sampling, group scans) at regular, defined intervals (e.g., monthly sampling blocks).
    • Data Structuring: For each wave (e.g., each annual season), construct an n x n adjacency matrix. Populate cells with frequency, duration, or binary presence/absence of the tie.
    • Covariate Collection: Simultaneously record time-varying actor attributes (health status, rank, reproductive state) for the same waves.

Protocol 2: RSiena Model Specification and Estimation

  • Objective: Analyze the dynamics of an animal social network and its co-evolution with behaviors.
  • Methodology:
    • Data Preparation: Format network matrices and covariate data into RSiena-compatible R objects (e.g., sienaDependent, coCovar, varCovar).
    • Model Specification: Define the effects to be included in the model via the getEffects() function. Include structural effects (reciprocity, transitivity) and covariate effects (sex, dominance).
    • Model Estimation: Use the siena07() function to run the Method of Moments estimation algorithm. Check convergence (all t-ratios for convergence < |0.1|) and overall maximum convergence ratio (< 0.25).
    • Goodness-of-Fit (GOF) Assessment: Simulate networks from the estimated model and compare structural features (e.g., geodesic distances, triad census) with the observed data using sienaGOF. Iteratively refine the model based on GOF.
    • Interpretation: Evaluate significance (p-values) and sign of parameter estimates. A positive reciprocity parameter indicates significant mutual tie formation, for instance.

Visualizations

RSiena Longitudinal Analysis Workflow (76 characters)

Network Tie Dynamics Between Two Waves (75 characters)

The Scientist's Toolkit

Table 3: Key Research Reagent Solutions for Animal Social Network Analysis

Item Function/Application
RSiena Software Package (R) The core statistical software for modeling longitudinal network data via stochastic actor-oriented models (SAOM).
RStan/brms Alternative Bayesian framework for fitting SAOMs, useful for complex random effects or prior information integration.
SOCPROG or BANTA Software for deriving association matrices from raw observation data, which can serve as input for RSiena.
GPS/Proximity Loggers Automated data collection devices to record spatial proximity ties between tagged individuals at high resolution.
Cytoscape with EvolvingNetwork Plugin Network visualization software capable of animating longitudinal network changes inferred from RSiena models.
Behavioral Coding Software (e.g., BORIS, Observer XT) For systematic coding and quantification of social interactions from video/audio recordings into adjacency matrices.
Genetic Relatedness Matrix A dyadic covariate (e.g., from microsatellite or SNP data) to test for kin selection effects on tie formation.
Hormone Assay Kits (Corticosterone, Testosterone) To generate time-varying actor covariates measuring physiological stress or androgen status for co-evolution models.

Core Principles and Data Integrity Framework

Effective social network analysis using RSiena for animal behavior studies hinges on the quality of the input longitudinal data. The following principles are paramount.

  • Ethological Validity: Observational protocols must reflect the species' natural repertoire. Invasive data collection must be justified and minimized to avoid altering social structures.
  • Temporal Resolution: Sampling frequency must align with the research question and the species' interaction dynamics to accurately capture network evolution for RSiena's rate function.
  • Individual Identification: Unambiguous, permanent identification of all individuals in the study population is non-negotiable for constructing adjacency matrices.
  • Definition of Ties: Operational definitions of social ties (e.g., grooming, proximity, aggression) must be explicit, measurable, and consistent across observation periods.
  • Missing Data Protocol: Proactive strategies for handling missing individuals or observations (e.g., death, equipment failure) must be defined prior to data collection to meet RSiena's requirements for structured longitudinal data.

Table 1: Impact of Data Collection Parameters on RSiena Model Accuracy

Parameter Poor Practice Best Practice Rationale for RSiena Analysis
Observation Duration Single short snapshot per wave Multiple, extended sessions per wave to capture representative behavior Reduces stochastic error in tie variables; stabilizes wave-to-wave network metrics.
Sampling Interval (Waves) Irregular, biologically arbitrary intervals Regular intervals based on species' social tempo (e.g., daily for mice, weekly for primates) Ensures the rate function models realistic opportunities for network change between observations.
Group Coverage < 95% of group observed per wave > 99% of known group members recorded per wave Missing nodes create unobserved ties, biasing structural effect estimates (e.g., transitivity).
Tie Definition Subjective, continuous measures Binary or ordinal counts with clear thresholds (e.g., ≥3 grooming bouts = 1) Provides clear input for the objective function; facilitates model convergence.

Detailed Experimental Protocols

Protocol 2.1: Longitudinal Focal Animal Sampling for Dyadic Interaction Data

Objective: To collect repeated, quantitative data on social interactions for constructing directed, weighted adjacency matrices for RSiena input.

Materials: See "The Scientist's Toolkit" below.

Procedure:

  • Pre-Observation:
    • Generate a randomized observation schedule ensuring each individual in the closed study population is observed an equal number of times per observation wave.
    • Acclimate animals to any human observers or static recording equipment for a minimum period (e.g., 72 hrs) prior to first data collection wave.
  • Observation Session:
    • Initiate a continuous timer for the predefined focal period (e.g., 10 minutes).
    • Record all occurrences of pre-defined social behaviors (see Table 2) initiated by the focal animal. Note Actor (Focal), Receiver, and Behavior.
    • Simultaneously, record the identity of all individuals within a pre-defined proximity radius (e.g., one body length) of the focal animal at instantaneous scan samples at fixed intervals (e.g., every 2 minutes).
  • Data Curation:
    • Per wave, aggregate interaction counts across all focal samples for each possible directed dyad (i → j).
    • Apply pre-defined thresholds to convert weighted matrices to binary or categorical formats if required (e.g., tie = 1 if total grooming bouts ≥ 5).
    • Format data into RSiena-compatible files: one adjacency matrix per network per wave, plus attribute files (e.g., sex, treatment).

Protocol 2.2: Automated Tracking for Association/Proximity Networks

Objective: To collect continuous, high-resolution spatial data for constructing undirected, weighted proximity networks.

Procedure:

  • System Setup & Calibration:
    • Position overhead cameras to cover the entire enclosure. Ensure unique color or pattern markers for individual identification.
    • Calibrate the system using a checkerboard grid to convert pixels to real-world distances.
    • Validate tracking accuracy (>98% ID accuracy, <2 cm positional error) against manual scoring for a subset of video.
  • Data Collection & Processing:
    • Record video for the duration of each observation wave (e.g., 48-hour continuous period per wave).
    • Use software (e.g., EthoVision, idTracker) to extract trajectories: individual identity, X-coordinate, Y-coordinate, time stamp.
  • Network Construction:
    • Define a proximity threshold (e.g., 20 cm) based on species-specific social behavior.
    • For each wave, calculate the proportional time each dyad (i-j) spends within the threshold. This creates a symmetric, weighted association matrix suitable for RSiena's non-directed network analysis.

Table 2: Common Behavioral Ethogram for Rodent Social Network Studies

Behavior Category Operational Definition Typical Coding for RSiena Input
Allogrooming One animal uses its mouth or paws to groom another's fur or body. Directed, weighted count of bouts.
Aggression Biting, chasing, or forceful pinning of one animal by another. Directed, binary (presence/absence per wave).
Social Proximity Noses of two animals within < 2 cm without aggressive interaction. Undirected, weighted by duration.
Huddling Two or more animals in sustained body contact, resting. Undirected, binary or weighted by duration.

Visualization of Methodological Workflows

Data Pipeline for RSiena Animal Social Networks

Critical Pathways in Data Quality Assurance

The Scientist's Toolkit: Key Research Reagent Solutions

Item Function & Relevance to RSiena Studies
Passive Integrated Transponder (PIT) Tags Subcutaneous microchips for permanent, unambiguous individual identification. Essential for long-term longitudinal studies tracking the same individuals.
Machine-Readable Color/Pattern Markers Unique visual codes (e.g., ArUco markers) applied to fur or collars to enable automated, high-accuracy tracking of multiple individuals in a group.
Ultra-Wideband (UWB) RFID System Provides real-time, high-resolution spatial positioning within a defined arena. Ideal for constructing precise proximity networks with timestamps.
Structured Ethology Software (e.g., BORIS, EthoFlow) Enables systematic coding of live or video observations against a custom ethogram. Exports time-stamped data for aggregation into network matrices.
Automated Behavior Recognition AI (e.g., DeepLabCut, SLEAP) Markerless pose estimation tools that can classify complex social behaviors from video, scaling up data collection for rich network variables.
Data Formatting Scripts (R/Python) Custom scripts to transform raw observation logs or tracking coordinates into RSiena's required .csv or .txt matrix formats, ensuring reproducibility.

This conceptual overview is framed within a thesis utilizing RSiena software to analyze dynamic animal social networks, with applications for understanding social behavior in pharmacological and toxicological studies.

Core Conceptual Framework

The Stochastic Actor-Oriented Model (SAOM) is a statistical model for analyzing longitudinal network data. It models how networks evolve over time as a result of choices made by actors (e.g., animals) within the network. The "actor-oriented" perspective assumes that changes in network ties are driven by actors optimizing an objective function.

Key Mathematical Components

The model is defined by:

  • Rate Function (λ): Determines the frequency of opportunities for an actor to change its outgoing ties.
  • Objective Function (f): Evaluates the attractiveness of a potential network state for an actor, guiding its choice.
  • Choice Probability (p): Given by a multinomial logit model where the probability of creating a specific tie change is proportional to the exponential transform of the objective function.

The objective function is specified as a linear combination of effects: f_i(β, x) = Σ_k β_k s_{ki}(x) where β_k are statistical parameters and s_{ki} are network effects.

Application Notes for Animal Social Network Research

Table 1: Standard network effects modeled in SAOM for animal social networks and their interpretation.

Effect Name Parameter (β) Interpretation in Animal Context Typical Research Question
Outdegree (Density) β₁ Baseline propensity to form ties. What is the general sociability?
Reciprocity β₂ Preference for mutual relationships. Is there a tendency for reciprocal grooming/alliance?
Transitivity β₃ Preference for ties to friends of friends. Does the network exhibit triadic closure (e.g., "friend of a friend is a friend")?
Popularity (Indegree) β₄ Preference to attach to popular actors. Are some individuals consistently more attractive as social partners?
Activity (Outdegree) β₅ Preference to attach from active senders. Do more socially active individuals attract more ties?
Covariate (e.g., Dominance) Alter β₆ Effect of a covariate on attracting ties. Do higher-ranking individuals receive more affiliative ties?
Covariate (e.g., Treated) Ego β₇ Effect of a covariate on sending ties. Do pharmacologically treated animals initiate more/less contact?
Covariate Similarity β₈ Preference for ties to similar others. Do animals of similar age, rank, or treatment status associate more?

Experimental Protocols for SAOM-Based Animal Studies

Protocol 1: Longitudinal Animal Social Network Data Collection for SAOM

  • Subject & Housing: House study population (e.g., 20-30 rodents, primates, or birds) in a stable, enriched enclosure allowing social interaction. Record species, sex, age, and dominance rank.
  • Experimental Manipulation (Optional): Administer a pharmacological agent (e.g., anxiolytic, neuropeptide) to a randomly selected subset. Include vehicle control group.
  • Behavioral Observation: Conduct focal or scan sampling across multiple discrete observation periods (waves: t1, t2, t3).
  • Network Definition: Define a directed network tie (e.g., "grooms," "proximity < 1m," "aggressive contact") with clear operational criteria.
  • Data Structuring: For each wave, construct an N x N adjacency matrix X(t) where Xij = 1 if actor i has a tie to actor j. Compile actor covariates (treatment, rank) into a separate matrix.
  • Model Specification: Using RSiena, specify the model by selecting effects from Table 1 relevant to hypotheses (e.g., treatment ego, transitivity).
  • Model Estimation: Run the RSiena siena07() algorithm to obtain parameter estimates (β) and standard errors. Assess convergence (t-ratios < |0.1|).
  • Goodness-of-Fit (GOF): Simulate networks from the fitted model and compare key statistics (geodesic distances, triad census) to the observed data.

Protocol 2: Testing Pharmacological Impact on Social Dynamics

  • Establish a stable baseline social network (Wave 1) pre-treatment.
  • Randomly assign subjects to Treatment (Drug A) and Control (Vehicle) groups.
  • Administer treatments daily for one week.
  • Record post-treatment social networks at Wave 2 and Wave 3 (e.g., 1-week and 2-weeks post-initiation).
  • In the SAOM, include the treatment covariate as both an ego effect (to test if treated animals change their outgoing ties) and an alter effect (to test if treated animals become more/less attractive as partners).
  • Include a treatment similarity effect to test for homophily (preference to associate with similarly treated individuals).
  • Statistically interpret the significant parameters to describe the drug's effect on social network dynamics.

Visualizations

SAOM Analysis Workflow in RSiena

SAOM Actor Decision Process

The Scientist's Toolkit: Research Reagent Solutions

Table 2: Essential materials and tools for SAOM-based animal social network research.

Item / Reagent Function in SAOM Research Context
RSiena Software (R Package) Core statistical software for specifying, estimating, and diagnosing SAOMs.
R or RStudio Programming environment for running RSiena and managing data analysis pipelines.
Behavioral Observation Software (e.g., BORIS, EthoVision) For accurate recording and coding of social interactions into structured network data.
Pharmacological Agents (e.g., Oxytocin, Clozapine) Tool compounds to experimentally manipulate social motivation and cognition, creating dynamic covariates.
Vehicle Solutions (Saline, DMSO/PEG mixes) Critical control for administering pharmacological agents.
RFID Tracking System Automated, high-resolution data collection for proximity-based network ties over time.
Video Recording System Essential for permanent record of behavior, allowing for reliability coding and re-analysis.
Animal Housing Enrichment Standardized, complex environments that permit the expression of naturalistic social dynamics.
Statistical Text (e.g., 'Social Network Analysis for Ego-Nets') Foundational resources for understanding SNA concepts underlying SAOM.

Step-by-Step RSiena Workflow: From Data Prep to Model Interpretation

To conduct a longitudinal social network analysis using RSiena, data must be structured as a sequence of networks observed at discrete time points. For animal interaction research, this requires converting raw behavioral observations into formatted adjacency matrices or edge lists.

Table 1: Core Data Input Formats for RSiena Animal Network Studies

Format Description RSiena Suitability Key Consideration for Animal Data
Adjacency Matrix (Square) N x N matrix where cell ij indicates a tie from actor i to actor j. Primary input format. Values can be binary (0/1), counts of interactions, or proximity durations. Diagonal is ignored (self-ties).
Edge List (Long Format) A 2 or 3-column list: sender (ego), receiver (alter), weight. Must be converted to matrix format for RSiena. Efficient for sparse networks. Must include all possible nodes in each wave, even if isolated.
Node Attribute File Comma-separated file linking node ID to time-varying (e.g., dominance rank) or constant (e.g., sex) covariates. Required for covariate analysis (ego, alter, similarity effects). Attributes must be aligned with network waves. Missing values (NA) are permitted but must be consistent.
Behavioral Variable File A matrix where rows are actors and columns are waves, containing a continuous or binary behavioral score. Used as dependent variable for co-evolution analysis. Must be normalized/standardized if scale varies.

Protocol: From Field Observation to RSiena-Ready Data

Protocol 2.1: Constructing Longitudinal Adjacency Matrices from Focal Sampling Objective: To create a time-series of directed, weighted adjacency matrices representing grooming interactions in a primate group across three observation periods.

  • Data Collection: Conduct 10-minute focal animal samples for all N individuals in the group per observation period (wave). Record all grooming events (initiator → recipient) and their duration (seconds).
  • Aggregation per Wave: For each wave t, create an N x N matrix W_t. For each cell ij, sum the total duration (seconds) of grooming initiated by i towards j across all focal samples within that wave.
  • Null Value Definition: Assign 0 to cells ij where i was observed but initiated no grooming towards j.
  • Structural Zero Definition: Create a supplementary nodefile to identify structural zeros (e.g., individuals not present in a specific wave). In the main matrix, represent these as NA or exclude the node for that wave only if using sienaDataCreate() with nodeSet=.
  • Formatting for RSiena: Save each matrix W_t as a comma-separated (.csv) or tab-delimited text file without row names. Ensure column and row orders are identical across all waves. A separate plain text file should list all node IDs.

Protocol 2.2: Pre-processing Weighted Interaction Data Objective: To transform raw interaction counts/durations into analyzable tie variables, handling variation in individual observation effort.

  • Calculate Observation Effort: For each individual i in wave t, sum the total observation time (e.g., seconds).
  • Standardize Weights: To control for differential effort, calculate a rate: standardized_weight_ijt = (raw_duration_ijt) / (observation_time_it) Multiply by a constant (e.g., 3600) to interpret as "seconds per hour."
  • Dichotomization (Optional): For binary RSiena models, apply a threshold (e.g., weight > 0) or a meaningful biological cutoff (e.g., rate >= 5 sec/hr) to create a binary matrix.
  • Symmetrization (Optional): For undirected network analysis, symmetrize the matrix: final_weight_ijt = max(weight_ijt, weight_jit) or mean(weight_ijt, weight_jit).

Visualization of the RSiena Co-Evolution Modeling Workflow

Title: RSiena Analysis Workflow for Animal Social Dynamics

The Scientist's Toolkit: Essential Reagents & Software

Table 2: Key Research Reagent Solutions for Animal Network Construction & Analysis

Item / Software Category Primary Function in Network Construction
BORIS Behavioral Coding Software Open-source event logging software for timestamped recording of interactions from video or live observation. Outputs structured data for matrix creation.
R (with statnet/igraph) Programming Environment Core platform for data manipulation, network metric calculation (igraph), and RSiena model execution. Enables reproducible analysis pipelines.
RSiena Package Statistical Software Specialized R package for Stochastic Actor-Oriented Models (SAOMs) to analyze network and behavior co-evolution over time.
SOCPROG Dedicated Analysis Tool MATLAB-based software suite for calculating association indices, network metrics, and permutation tests from animal observation data.
GPS/Proximity Loggers Data Collection Hardware Automated collection of spatial proximity data at high resolution. Raw data requires processing to define edges (e.g., within X meters for Y seconds).
asnipe R Package Analysis Package Calculates gambit-of-the-group association indices and generates networks from observation or logger data. Integrates with igraph.
ANIWAVE (Custom Scripts) Data Processing Custom R/Python scripts to pre-process weighted adjacency matrices, standardize for effort, and format for RSiena input.
Graphviz (DOT language) Visualization Tool Used for generating standardized, publication-quality diagrams of workflows and conceptual models (as in this document).

Defining and Coding Dependent Networks and Covariates

This protocol details the definition and coding of dynamic social networks (dependent networks) and covariates within the RSiena software framework for longitudinal animal social network analysis. Accurate coding is critical for modeling social dynamics and selection pressures.

1. Core Concepts and Data Structures

A typical RSiena analysis requires two primary data structures: the dependent network data and the covariate data.

Table 1: Primary Data Structures for RSiena Analysis

Data Structure Description R Object Type (siena) Time Points (Waves)
Dependent Network The evolving social tie variable (e.g., grooming, proximity) under study. sienaNet (array) T (≥2)
Constant Covariate Individual attribute that does not change over the observation period (e.g., sex, birth cohort). coCovar (vector) 1
Changing Covariate Individual attribute that may change over time (e.g., dominance rank, hormone level). varCovar (matrix) or sienaNet T
Dyadic Covariate A predictor defined for each pair of individuals (e.g., kinship, spatial overlap). coDyadCovar or varDyadCovar (matrix) 1 or T

2. Experimental Protocol: Data Preparation and Coding

Protocol 2.1: Defining Dependent Social Networks from Observational Data Objective: Transform sequential observation matrices into a sienaDependent network array. Materials: Focal/scan sampling adjacency matrices per observation period, aggregated per defined time wave. Procedure:

  • Aggregation: For each distinct wave (e.g., year, season), aggregate interaction frequencies or binary associations across observations within that wave.
  • Binarization/Weighting: Decide if the network will be analyzed as binary (tie present/absent) or valued. Use type='oneMode' for directed/undirected, or type='bipartite'.
  • Missing Data Code: Define a standard code (e.g., 0, NA, or 255) for structurally missing ties (e.g., individuals not yet born or already deceased).
  • R Coding:

Protocol 2.2: Coding Individual Covariates Objective: Incorporate time-constant and time-varying attributes as covariates. Procedure for Constant Covariates:

  • Alignment: Ensure the covariate vector is in the same order as the node list in the network matrices.
  • R Coding:

    Procedure for Changing Covariates:
  • Matrix Creation: Create an n × T matrix, where each column represents the attribute values for a wave.
  • Handling Missingness: Use real values where available and a placeholder (e.g., NA) for truly missing data.
  • R Coding:

Protocol 2.3: Coding Dyadic and Network Covariates Objective: Incorporate pair-level predictors, such as kinship or prior stable associations. Procedure:

  • Create Matrix: Generate an n × n matrix of the pairwise values.
  • R Coding for Constant Dyadic Covariate:

  • Network Covariate: Use a network from a previous wave or a different context as a predictor by defining it as an exogAttrib in sienaDataCreate.

3. Visualization of RSiena Data Structure and Workflow

Title: RSiena Data Preparation Workflow

Title: Structure of a Three-Wave sienaDependent Array

4. The Scientist's Toolkit: Key Research Reagent Solutions

Table 2: Essential Tools for RSiena Data Preparation

Item/Software Function in Analysis Specification/Notes
R Statistical Environment Core platform for all data manipulation and analysis. Version ≥4.0.0.
RSiena Package Implements the Stochastic Actor-Oriented Model (SAOM). Install via install.packages("RSiena") or install.packages("RSiena", repos="https://www.stats.ox.ac.uk/pub/RWin/").
asnipe / aniDom R Packages For generating networks from raw observation data and calculating dominance ranks. asnipe::get_network(); aniDom::elo_scores().
SOCPROG Software Alternative for descriptive network metrics and data preparation. Validates network construction.
Structured Data Sheets Essential for aligning covariate data with node IDs across waves. Use unique, persistent animal IDs as the primary key.
Scripted Data Pipeline Reproducible R script for raw data → RSiena object transformation. Must handle missing data codes and actor alignment explicitly.

This protocol provides a structured approach to specifying, evaluating, and interpreting Stochastic Actor-Oriented Models (SAOMs) using RSiena within animal social networks research. The SAOM framework is essential for disentangling social network dynamics—how ties change over time—from behavioral dynamics—how individual traits or states co-evolve with the network. A core challenge addressed here is endogeneity, where the network influences behavior and behavior influences the network, creating a feedback loop.

Model Specification Protocol: Core Effects & Selection Mechanisms

Model specification involves selecting appropriate "effects" that represent hypothesized social processes. The table below categorizes essential effects for animal social network studies.

Table 1: Core RSiena Effect Specifications for Animal Social Networks

Effect Category Effect Name (RSiena Term) Mathematical Interpretation Biological/Social Hypothesis Typical Parameter Sign (Expected)
Network Dynamics outdegree (density) Baseline propensity to form ties. General sociability or interaction rate. Negative (costly)
reciprocity Tendency to form mutual ties. Dyadic bonding, cooperation, or tit-for-tat. Positive
transitivity (gwespFF) Preference for ties that create triangles. Preferential association with friends of friends; triadic closure. Positive
indegree - popularity (inPop) Attraction to already popular individuals. "Rich-get-richer" or status-based attraction. Positive or Negative
outdegree - activity (outAct) More active actors create more new ties. Variation in individual gregariousness. Positive
Behavior Dynamics linear shape Tendency towards high or low behavior values. Baseline propensity for a behavior (e.g., proximity to waterhole). Depends on behavior
quadratic shape Tendency towards extreme or moderate values. Stabilizing or polarizing forces on behavior. Negative (bell curve)
Selection & Influence (Endogeneity) average alter (avAlt) Influence: Behavior adapts to neighbors' average. Social contagion, conformity, or local learning. Positive
egoX Selection: Effect of ego's behavior on sending ties. Behavioral homophily or heterophily based on ego's state. Positive (homophily) or Negative
alterX Selection: Effect of alter's behavior on receiving ties. Attraction to alters with specific behavioral traits. Positive (homophily) or Negative
sameX (simX) Selection: Effect of similarity in behavior on tie formation. Direct homophily: preference for others with similar traits. Positive

Step-by-Step Experimental & Analytical Protocol

Protocol 3.1: Data Preparation and RSiena Object Creation

Objective: Format longitudinal network and behavior data for RSiena analysis.

  • Data Requirements:

    • Networks: Adjacency matrices for ≥2 waves (time points). Codes: 0=no tie, 1=tie. Must be same dimension.
    • Covariates: Matrices or vectors for constant or changing covariates (e.g., dominance rank, hormone levels).
    • Behavior: A matrix where rows=actors, columns=waves, with values for an ordinal or continuous variable (e.g., foraging innovation rate).
  • R Script:

Protocol 3.2: Model Specification and Effect Selection

Objective: Define the SAOM by adding effects based on Table 1 hypotheses.

  • GetEffects Object:

  • Inspect and Add Effects: Use print01Report(myData, modelname) to check available effects.

Protocol 3.3: Model Estimation and Convergence Check

Objective: Fit the model and ensure reliable parameter estimates.

  • Create Estimation Algorithm Object:

  • Run Model Estimation:

  • Mandatory Convergence Diagnostics:

    • t-ratios: All t conv values for parameters in ans$tconv must be < |0.1|.
    • Overall Maximum Convergence Ratio: ans$tconv.max must be < 0.25.
    • If convergence is poor: Increase n3 in sienaAlgorithmCreate or simplify the model.

Protocol 3.4: Model Evaluation and Interpretation

Objective: Assess model fit and interpret parameter estimates.

  • Goodness-of-Fit (GOF) Test:

  • Interpretation of Estimates:

    • Refer to ans$theta for parameter estimates.
    • Significance: Parameter estimate / standard error > |1.96| (approx. p<0.05).
    • Key: A positive, significant simX effect indicates homophily-based selection. A positive, significant avAlt effect indicates social influence.

Visualization of the SAOM Process & Endogeneity

Diagram Title: SAOM Endogenous Feedback Loop

The Scientist's Toolkit: RSiena Research Reagent Solutions

Table 2: Essential Research Toolkit for RSiena Analysis

Tool/Reagent Type Primary Function in Analysis
RSiena Software R Package Core engine for estimating SAOMs.
sienaGOF() Function Diagnostic Tool Assesses model fit by comparing simulated to observed network statistics.
RSienaTest Version Software Branch Use for latest features and bug fixes; install from R-Forge.
Maximum Convergence Ratio (tconv.max) Diagnostic Metric Critical criterion (<0.25) to confirm model estimation has converged.
RSiena Effects List (getEffects) Specification Guide Catalog of all model effects; basis for hypothesis testing.
Meta-analysis of Parameters Synthesis Method Compares effect estimates across multiple studies/populations to identify general social rules.
RSiena Script Protocol Document Reproducible R code documenting data prep, model spec, estimation, and diagnostics.
Goodness-of-Fit Plots (GOF) Visual Diagnostic Reveals specific areas (e.g., high outdegree) where model fails to capture network structure.

Application Notes

Within the broader thesis on RSiena software analysis of animal social networks, the siena07 function is the core engine for parameter estimation. Its proper application is critical for drawing valid inferences about social dynamics, such as the spread of behavioral phenotypes or infection states in animal colonies, with direct methodological parallels to clinical trial network analysis in drug development.

Key Quantitative Findings on Algorithm Performance

Table 1: Convergence Diagnostics and Meta-Parameter Impact

Diagnostic Metric Typical Target Value Effect of Increased n3 (Phase 3 iterations) Effect of Increased nsub (sub-processes) Role in Animal Network Studies
Overall Maximum Convergence Ratio (t-conv) < 0.25 Improves precision of estimates Minimal direct effect Primary indicator of successful model fit for social selection & influence.
Individual Parameter Convergence Ratio < 0.10 Directly improves ratio Reduces stochastic noise Ensures specific effects (e.g., dominance, kinship) are reliably estimated.
Derivative Standard Errors Stable across runs Reduces variability Averages out run-to-run variation Critical for p-values assessing significance of social contagion effects.
siena07 Runtime (minutes) - Increases linearly Increases linearly, enables parallelism Practical constraint for large longitudinal animal observation datasets.

Table 2: Common siena07 Call Arguments and Recommendations

Argument Default Recommended Setting for Complex Animal Networks Rationale
n3 3000 5000 - 10000 Provides sufficient iterations for complex hierarchical or proximity networks.
nsub 4 4 (or higher if computational resources allow) Balances noise reduction with computational feasibility.
maxlike FALSE Consider TRUE for small, dense networks Alternative estimation for precise, small-group observed interactions (e.g., captive primates).
dolby TRUE Keep as TRUE Stabilizes estimation by scaling scores.
diagg FALSE Set to TRUE if convergence problems persist Provides additional diagnostic information.

Experimental Protocols

Protocol 1: Standard Model Fitting and Assessment Workflow for Animal Social Networks

Objective: To fit a Stochastic Actor-Oriented Model (SAOM) to longitudinal animal interaction data and assess convergence.

Materials: RSiena software (R environment), longitudinal adjacency matrices (e.g., grooming, aggression), covariate data (e.g., age, rank, health status).

Procedure:

  • Data Preparation: Format network data as R matrix or network objects and covariates as vectors for each wave. Store in a sienaData object using sienaDataCreate().
  • Model Specification: Create a sienaAlgorithm object using sienaAlgorithmCreate(). Key arguments: projname, n3=4000, nsub=4, seed=123. This defines the meta-parameters for the estimation algorithm.
  • Initial Model Fit: Run the first estimation using siena07().

  • Convergence Diagnosis: Examine the myresult object.
    • Check Overall maximum convergence ratio from print(myresult).
    • Inspect individual t-ratios via summary(myresult).
    • Visually assess convergence plots: plot(myresult).
  • Iterative Refinement: If maximum t-ratio > 0.25, run additional siena07 iterations using the previous result as input.

  • Final Assessment: Repeat step 4. Proceed to interpretation only after all t-ratios are < 0.1 (good) or at least < 0.25 (acceptable).

Protocol 2: Addressing Non-Convergence in Behavioral Contagion Models

Objective: To troubleshoot and resolve common convergence failures when modeling the spread of behaviors (e.g., foraging techniques) or health states.

Procedure:

  • Increase Iterations: Sequentially increase n3 in steps (e.g., to 5000, then 10000) using the prevAns argument, as in Protocol 1 Step 5.
  • Simplify Model: If convergence fails, temporarily remove statistically insignificant or complex effects (e.g., triple interactions) from the effect object (myeff) and re-fit.
  • Check Time Dependencies: Use sienaTimeTest() to assess if effect parameters are constant over time. Significant tests may indicate misspecification.
  • Alternative Initialization: Refit the model from different random seeds by creating a new algorithm object with a different seed argument, without using prevAns.
  • Profile Likelihood Check: For a problematic parameter, use profileLikelihood() to verify the likelihood maximum is found.

Mandatory Visualizations

Title: SAOM Fitting and Convergence Workflow

Title: Key siena07 Arguments and Their Effects

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Components for RSiena-based Animal Network Analysis

Item/Software Function in Analysis Specification/Notes for Research
RSiena R Package Core software environment for specifying and fitting SAOMs. Current version >= 1.4.0. Must be installed from CRAN or GitHub.
Longitudinal Interaction Data Primary input: records who interacts with whom at multiple time points. Must be formatted as adjacency matrices (R objects). Critical for defining network dependent variables.
Node-Level Covariates Predictors of network structure or behavioral dynamics (e.g., influence models). Examples: dominance rank, age, sex, infection status. Stored as vectors or matrices.
sienaAlgorithm Object Container for meta-parameters (n3, nsub, seed) controlling the estimation engine. Defined once and reused for iterative fitting. The "protocol" for the estimation experiment.
High-Performance Computing (HPC) Cluster Computational resource for running models with large n3/nsub or many models. Essential for bootstrapping, sensitivity analyses, or large groups (e.g., > 50 individuals).
Convergence Diagnostics (t-ratios) Quality control metrics determining the validity of parameter estimates. The primary output of siena07 to assess before interpretation. Target: < 0.1.

Application Notes

RSiena analysis of animal social networks decomposes social dynamics into interpretable statistical parameters. These outputs quantify how social structure and individual traits co-evolve, offering insights critical for behavioral neuroscience and pharmacological research.

1. Core Parameter Interpretation

  • Rate Parameters: Model the pace of network or behavioral change between observation waves. They represent opportunity for change.
  • Network Effects: Explain the type of social selection shaping the network (e.g., transitivity, popularity).
  • Behavioral Dynamics: Explain the type of social influence shaping behavior (e.g., average similarity to alters) and individual tendencies (linear & quadratic shape effects).

2. Key Output Tables for Analysis

Table 1: Summary of RSiena Model Output for Rat Social Network Dynamics

Parameter Group Effect Name (RSiena Terminology) Estimate S.E. p-value Interpretation in Animal Context
Rate Function Rate Period 1 4.52 1.10 <0.001 Mean opportunity for network change between t1 & t2.
Rate Period 2 3.89 0.95 <0.001 Mean opportunity for network change between t2 & t3.
Network Dynamics Outdegree (Density) -1.85 0.25 <0.001 Baseline propensity to form ties (usually negative).
Reciprocity 0.75 0.15 <0.001 Tendency to reciprocate social affiliations.
Transitive Triplets 0.32 0.08 <0.001 Tendency for "friend of a friend" closure (clustering).
Behavior Dynamics Behavioral Rate (Drinking) 2.10 0.60 <0.001 Opportunity for behavioral change per period.
Linear Shape 0.55 0.20 0.006 Tendency to increase behavior score (e.g., alcohol intake).
Quadratic Shape -0.45 0.12 <0.001 Tendency towards moderate behavior levels (curvilinear effect).
Average Similarity 0.90 0.30 0.003 Tendency to adapt behavior to match network alters (social influence).

Table 2: Implications for Pharmacological Research

RSiena Finding Potential Neurobiological Correlate Drug Development Question
High "Transitivity" effect Oxytocinergic systems, social reward. Would an oxytocin receptor modulator alter social clustering?
Significant "Average Similarity" on substance intake Dopaminergic reward pathways, social learning. Does a D3 antagonist block socially facilitated consumption?
Change in Rate Parameters post-treatment General social motivation, anxiety. Does the compound affect the overall frequency of social interaction?

Experimental Protocols

Protocol 1: Longitudinal Data Collection for RSiena Analysis in Rodents Objective: To collect the repeated, time-structured social network and behavioral data required for stochastic actor-oriented modeling (SAOM). Materials: See "The Scientist's Toolkit" below. Procedure:

  • Cohort Design: House experimental rodents (e.g., rats) in a stable, mixed-sex or single-sex group (n=12-24) in a large, enriched enclosure.
  • Behavioral Phenotyping (Baseline - t1):
    • Record baseline behavioral scores (e.g., ethanol preference using a 2-bottle choice test, anxiety in an elevated plus maze).
    • Perform continuous video recording of the group for 72 hours.
    • Using video, construct a social network matrix for t1. Code a tie (1) between two individuals if non-aggressive physical contact or close proximity (< 1 body length) occurs above a defined threshold (e.g., > 5% of scan samples).
  • Intervention/Control Period: Administer the investigational compound or vehicle control via designated route (e.g., daily i.p. injection or oral gavage) for 14 days.
  • Post-Treatment Assessment (t2):
    • Repeat behavioral phenotyping (Step 2) 24 hours after the final dose.
    • Repeat 72-hour video recording and social network construction for t2 under identical conditions.
  • Washout/Follow-up Assessment (t3): After a suitable washout period (e.g., 7 days), repeat Step 4 to establish a third wave (t3) of network and behavioral data.
  • Data Structuring: Format data into three adjacency matrices (network) and three matching behavior vectors (co-varying attribute), ensuring row/order consistency across waves.

Protocol 2: RSiena Model Specification & Interpretation Workflow Objective: To analyze longitudinal data and test specific hypotheses about social influence and selection. Procedure:

  • Data Preparation: Use RSiena R scripts to read adjacency matrices and attribute vectors. Check and report basic descriptives (densities, behavioral distributions).
  • Model Specification:
    • Define the dependent network variable (e.g., myNetwork).
    • Define the co-evolving behavioral variable (e.g., drinking).
    • Specify the model effects using getEffects().
    • Essential Effects to Include:
      • Network Dynamics: outdegree (density), reciprocity, transitive triplets, indegree - popularity.
      • Behavior Dynamics: linear & quadratic shape, average similarity effect (key for influence).
      • Selection Effects: Include egoX, altX, or simX effects to test if behavior predicts tie formation.
  • Model Estimation: Run the siena07() function. Monitor convergence (tconv.max < 0.25) and overall maximum convergence ratio (< 0.25). Revise model if necessary.
  • Output Interpretation:
    • Extract and summarize the parameter estimates, standard errors, and p-values into a table (see Table 1).
    • Interpret significant (p < 0.05) effects in the context of the hypothesis.
    • Key: A positive, significant average similarity effect for drinking indicates social influence. A positive, significant simX effect indicates homophilous selection.
  • Goodness-of-Fit (GoF) Check: Use sienaGOF() to test if the model adequately reproduces key network features (e.g., geodesic distances, triad census).

Mandatory Visualizations

Title: RSiena Analysis Experimental Workflow

Title: Core RSiena Parameter Groups & Questions

The Scientist's Toolkit

Table 3: Essential Research Reagents & Materials for Animal Social Network Studies

Item Function in Protocol Example/Specification
Automated Video Tracking System Enables continuous, undisturbed recording and positional data extraction for network construction. EthoVision XT, ANY-maze, or BORIS (open-source).
Social Proximity Loggers Automatically records close-range interactions in free-moving groups. RFID proximity tags (e.g., SparkFun) or ultra-wideband (UWB) sensors.
RSiena Software Suite The statistical environment for model specification, estimation, and simulation. R packages RSiena, RSienaTest.
Behavioral Test Apparatus Quantifies individual phenotypes that may co-evolve with the network. Elevated Plus Maze, Sucrose/Ethanol 2-Bottle Choice setup, Open Field.
Pharmacological Agents Probes neurobiological mechanisms of observed network effects. Selective receptor agonists/antagonists (e.g., for oxytocin, dopamine, opioids).
Data Structuring Scripts Converts raw observation/logger data into RSiena-ready adjacency matrices and attribute files. Custom R or Python scripts for data preprocessing.

1. Introduction This application note details the use of Stochastic Actor-Oriented Models (SAOMs), implemented via the RSiena software, to analyze the co-evolution of animal social networks and states (e.g., infection, behavioral trait). This work supports a broader thesis investigating the drivers of social dynamics in animal populations and the implications for pathogen spread and information transmission.

2. Core Quantitative Findings from Recent Studies Table 1: Summary of Key SAOM Model Estimates from Select Animal Network Studies

Study Subject Network Effect Parameter Estimate Standard Error Significance (p < 0.05) Interpretation
Bighorn Sheep (Pneumonia) Outdegree (Density) -1.52 0.21 Yes Networks are sparse; low tendency to form new contacts.
Reciprocity 1.88 0.33 Yes Strong tendency for mutual contact formation.
Disease → Sociality (Activity) -0.45 0.18 Yes Infected individuals reduce social effort.
House Finch (Mycoplasma) Simple Concurrency 2.10 0.41 Yes Individuals actively maintain multiple contacts.
Behavior (Feeder Use) Alters Network (Selection) 0.67 0.15 Yes Individuals with similar feeder use associate more.
Infection Influence (Average Similarity) 1.25 0.29 Yes Tendency for infected individuals to associate with each other.
Great Tit (Exploratory Behavior) Social Network → Behavior (Contagion) 0.31 0.09 Yes An individual's exploratory behavior is influenced by its network partners' behavior.
Behavior → Network (Selection) 0.42 0.11 Yes Individuals with similar exploratory behavior form ties.

3. Detailed Experimental Protocols

Protocol 1: Longitudinal Animal Social Network Data Collection for RSiena Analysis Objective: To collect the repeated network and attribute data required for SAOMs. Materials: GPS proximity loggers, RFID feeder systems, or direct observation equipment. Procedure:

  • Define Nodes & Ties: Define the study population (nodes) and a operational definition for a social tie (e.g., spatial proximity within 5m, grooming event).
  • Determine Wave Interval: Set observation periods (waves) to capture meaningful social change relative to the state dynamics (e.g., weekly for disease, monthly for behavior).
  • Collect Wave Data: For each wave:
    • Record the adjacency matrix (who is connected to whom) using the defined tie metric.
    • Record nodal covariates (e.g., infection status via PCR/swab, behavioral score, sex, age).
  • Ensure Consistency: Maintain identical node sets across waves. Note any permanent departures (death).
  • Format Data: Structure data into three components: an array of adjacency matrices (network), a matrix of nodal covariates (attributes), and a composition change file for node entry/exit.

Protocol 2: Executing a Basic SAOM for Social Contagion Analysis Objective: To model the joint evolution of a social network and a binary behavioral or disease state. Software: R with RSiena package installed. Procedure:

  • Data Preparation: Load network and attribute data into R. Use sienaDataCreate() to create a Siena data object, specifying the dependent network(s) and covariates.
  • Model Specification: Use getEffects() to view available effects. Create an effects object and include key parameters:
    • Network Dynamics: Outdegree (density), reciprocity, transitivity.
    • Behavior Dynamics: Linear shape, quadratic shape.
    • Network-Behavior Interplay: avSim (average similarity for contagion), egoX (behavior effect on activity), altX (behavior effect on popularity).
  • Model Estimation: Run siena07() to estimate parameters. Use multiple sub-processes (nbrNodes) for efficiency and robustness.
  • Goodness-of-Fit (GOF): Assess model fit using sienaGOF() on auxiliary statistics (e.g., indegree, outdegree, triad census distributions).
  • Interpretation: Significant positive avSim effect indicates social contagion. Significant egoX or altX indicates the state influences social tie formation.

4. Visualization of SAOM Analytical Workflow

Title: RSiena SAOM Analysis Iterative Workflow

5. Signaling Pathways in Social Transmission

Title: Neuro-Behavioral Pathway for Social Transmission

6. The Scientist's Toolkit: Research Reagent Solutions

Table 2: Essential Materials for Social Transmission Field Studies

Item / Solution Function Example Use Case
Miniaturized GPS/Proximity Loggers Logs continuous spatiotemporal location or pairwise encounters between individuals. Quantifying dynamic contact networks for disease transmission models in wildlife.
Automated RFID Feeder Systems Records individual visits to resource points, providing data on association and foraging behavior. Studying how shared resource use drives network structure and trait transmission.
Non-Invasive Biological Samplers (e.g., Buccal Swabs, Fecal Collectors) Enables collection of DNA, hormones, or pathogen samples without disturbing the subject. Genotyping for relatedness, stress hormone analysis, or pathogen surveillance.
RT-PCR Pathogen Detection Kits Sensitive and specific detection of pathogen RNA/DNA from field-collected samples. Determining infection status as a binary or continuous nodal covariate in SAOMs.
Behavioral Scoring Software (e.g., BORIS, EthoVision) Facilitates systematic coding and analysis of observed behaviors from video footage. Quantifying behavioral traits (e.g., boldness, exploration) for network-behavior co-evolution analysis.
RSiena R Package Statistical software for the analysis of longitudinal network data using Stochastic Actor-Oriented Models. Modeling the co-evolution of social networks and individual states (infection, behavior).

Solving Common RSiena Challenges: Convergence, Model Fit, and Complexity

Diagnosing and Resolving Convergence Failures (t-ratios > 0.1)

Within RSiena software analysis of animal social networks, convergence failures indicated by t-ratios with absolute values greater than 0.1 signify that parameter estimates are insufficiently stable for reliable inference. This Application Note provides a structured protocol for diagnosing root causes and implementing solutions, critical for robust longitudinal network analysis in behavioral neuroscience and drug development research.

Key Diagnostic Metrics and Benchmarks

Table 1: Convergence Assessment Metrics in RSiena

Metric Target Value Acceptable Range Indication of Failure
Overall maximum t-ratio < 0.1 0.1 - 0.2 (warning) > 0.2
Individual parameter t-ratio < 0.1 0.11 - 0.15 > 0.15
Score test ratio ~1.0 0.8 - 1.2 < 0.8 or > 1.2
Convergence ratio (Phase 3) > 0.3 0.2 - 0.3 (borderline) < 0.2

Table 2: Common Causes and Prevalence in Animal Network Studies

Root Cause Estimated Frequency (%) in Failed Models Primary Diagnostic Check
Insufficient iterations (Phase 3) 45% Check n3 value & convergence ratio
Model misspecification 25% Score-type tests, goodness-of-fit (GOF)
Sparse/volatile network data 15% Examine Jaccard indices & period stability
Multicollinearity (rate/effects) 10% Correlation between parameter estimates
Algorithmic issues 5% Check standard deviations, eigenvalues

Experimental Protocols

Protocol 1: Baseline Convergence Assessment
  • Run Standard Estimation: Execute siena07() with default parameters (n3=1000, seed=123).
  • Extract Diagnostics: From the sienaFit object, record:
    • tconv: Maximum absolute t-ratio for convergence.
    • Individual t-ratios for each parameter (theta).
    • Convergence ratio (fraction of iterations with decreasing deviations).
  • Initial Assessment: If maximum t-ratio > 0.1, proceed to Protocol 2.
Protocol 2: Iterative Model Stabilization
  • Increase Iterations: Set n3=3000 in sienaAlgorithmCreate() and re-estimate.
  • Re-assess: Check new t-ratios. If improved but >0.1, increase n3 progressively to 5000, then 10000.
  • Use Previous Estimates: For n3 > 3000, use prevAns argument in siena07() to initialize from previous fit, aiding stability.
  • Check Multi-Runs: Execute siena07() with multiple random seeds (e.g., seed=c(123, 456, 789)). Consistent estimates across runs indicate robustness.
Protocol 3: Model Specification & Data Diagnostics
  • Goodness-of-Fit Test: Run sienaGOF() for out-degree, in-degree, and triad census distributions. Significant p-values (<0.05) indicate misspecification.
  • Jaccard Index Calculation: For each wave, compute Jaccard stability: (stable ties)/((ties at t1)+(ties at t2)-(stable ties)). Values <0.3 indicate excessive change, complicating estimation.
  • Effect Diagnostics: Temporarily remove statistically insignificant (high s.e.) or complex effects (e.g., higher-order triads). Re-run to see if convergence improves.
  • Rate Parameter Check: Examine dependence between rate and structural parameters. High correlation (>0.5) may require fixing rate parameters using the fix argument in sienaAlgorithmCreate().
Protocol 4: Advanced Numerical Solutions
  • Diagonalize: In sienaAlgorithmCreate(), set diagonalize=0.2 to reduce correlation between parameters.
  • Adjust Gain Factor: Reduce the initial gain factor via firstg=0.01 to prevent early iteration instability.
  • Eigenvalue Analysis: Check the covariance matrix eigenvalues from the sienaFit object. Very small eigenvalues (< 1e-6) suggest collinearity.
  • Final Refinement: With a near-converged model, use siena07() with n3=2000, nsub=4 (multiple subphases) for final polishing.

Visualization of Diagnostic Workflow

Title: RSiena Convergence Diagnosis Workflow

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Computational Tools for RSiena Analysis

Tool/Reagent Function/Benefit Example/Note
RSiena (v. 1.4-xx) Core software for SAOM estimation. Use latest version from CRAN for bug fixes.
sienaAlgorithmCreate() Configures estimation parameters (n3, diagonalize). Key function for implementing protocols.
siena07() with prevAns Uses prior estimates for initialization, improving stability. Critical for high-iteration runs.
sienaGOF() Performs goodness-of-fit tests to diagnose model misspecification. Uses Monte Carlo simulation.
Jaccard Index Script Custom R code to calculate network stability between waves. Identifies overly volatile data.
Multiple Random Seeds Set of integers (e.g., 123, 456, 789) for seed argument. Tests estimation robustness.
High-Performance Computing (HPC) Cluster Enables long runs (n3>5000) and multiple simulations. Essential for large animal networks.
RStudio Projects Organizes scripts, data, and outputs for reproducible workflow. Maintains version control.

1. Introduction Within the context of RSiena analysis of animal social networks, achieving adequate model fit is paramount for valid inference. Goodness-of-Fit (GoF) tests diagnose discrepancies between simulated networks from the estimated model and the observed longitudinal network data. This protocol details strategies to improve GoF, integral to a thesis on robust dynamic network analysis in behavioral ecology and translational drug development research, where network dynamics may reflect treatment effects.

2. Quantitative Summary of Common GoF Statistics Table 1: Key GoF Statistics in RSiena and Target Improvement Strategies

GoF Statistic Description Target Range Primary Improvement Strategy
Outdegree Distribution Frequency of nodes with k outgoing ties. Simulated distribution envelope contains observed. Add outdegree activity (outTrunc) or outdegree popularity (outOutAss).
Indegree Distribution Frequency of nodes with k incoming ties. Simulated distribution envelope contains observed. Add indegree activity (inTrunc) or indegree popularity (inInAss).
Geodesic Distance Distribution Frequency of shortest path lengths. Simulated mean/quantiles match observed. Include triadic effects (transTrip, cycle3) or gravity effects.
Triad Census Counts of all 16 possible directed triadic motifs. Simulated counts match observed for key motifs. Include specific triadic effects (transTrip, transRecTrip, cycle3).
Edge-wise Shared Partners Distribution of ties by number of common neighbors. Simulated distribution matches observed. Add GWESP (geometrically weighted edgewise shared partners) effect.

3. Protocol: Iterative GoF Diagnosis and Model Improvement

3.1. Materials & Research Reagent Solutions Table 2: Essential Toolkit for RSiena GoF Analysis

Item Function in Analysis
RSiena Software Suite (v.1.3-14 or later) Core environment for SAOM estimation and simulation.
sienaGOF() Function Primary routine for calculating GoF test statistics.
Parallel Computing Cluster/foreach Package Enables computationally intensive GoF simulations.
Diagnostic Plot Scripts (ggplot2) Visualizes distributions of observed vs. simulated statistics.
Theoretical Effect Glossary Guide to candidate effects (e.g., GWESP, outTrunc) for model expansion.

3.2. Step-by-Step Experimental Protocol

Step 1: Baseline Model Estimation

  • Estimate a theoretically grounded Stochastic Actor-Oriented Model (SAOM) with core effects (density, reciprocity, transitive triplets).
  • Use siena07() with adequate phase 2 and 3 iterations for convergence (max. ratio < 0.25, all t-convergence ratios < |0.1|).

Step 2: Initial Goodness-of-Fit Assessment

  • Execute: gof <- sienaGOF(estimatedModel, varName, behaviorName = NULL, GOF, parallel = TRUE, nsim = 1000)
  • Specify GOF = ~ indegree + outdegree + geodesic.distance + triad.census for comprehensive diagnosis.
  • Plot results: plot(gof)

Step 3: Diagnose Specific Misfit

  • Identify the specific statistic(s) where the observed line (solid) falls outside the 95% envelope of simulated values (gray band).
  • Misfit Example: Observed outdegree distribution has a heavier tail (many high-degree nodes) than simulations.
  • Diagnosis: Model lacks heterogeneity in outdegree activity.

Step 4. Model Expansion & Re-Estimation

  • Incorporate relevant effects to address misfit (refer to Table 1).
  • For outdegree tail: Add outdegree-activity effect (outTrunc or outSqrt).
  • For lack of clustering: Add GWESP effect.
  • Re-estimate the expanded model.

Step 5. Iterative Re-testing

  • Rerun sienaGOF() on the new model.
  • Compare new diagnostic plots. Iterate Steps 3-5 until key statistics fall within simulation envelopes.
  • Use conditional sienaGOF() tests (e.g., GOF = ~ idegree + odegree) for focused assessment post-modification.

Step 6. Final Validation

  • Perform a Mahalanobis distance test (included in sienaGOF output) for an overall fit measure (p > 0.05 indicates no significant misfit).
  • Document all steps, added effects, and improvements in fit for thesis methodology.

4. Visualization of the GoF Improvement Workflow

Goodness-of-Fit Diagnostic & Improvement Cycle

Handling Sparse Networks and Missing Data in Animal Observations

In the broader thesis context of applying RSiena (Simulation Investigation for Empirical Network Analysis) to animal social networks, data quality is paramount. RSiena models the co-evolution of social networks and behaviors through longitudinal stochastic actor-oriented models (SAOM). A core challenge in ecological fieldwork is the inherent sparsity of observed interactions and frequent missing observation periods (e.g., due to animal absence, equipment failure). Sparse or gappy data can bias SAOM parameter estimates, leading to incorrect inferences about social dynamics, influence, and selection processes. These methodological insights are also critical for professionals in behavioral pharmacology and drug development, where accurate social network models may serve as biomarkers for treatment efficacy or social side effects.

Table 1: Common Causes and Impacts of Sparse/Missing Data in Animal Observation Studies

Cause Category Specific Example Typical Incidence Range Primary Impact on RSiena Analysis
Observation Gaps Failed GPS/tag battery 10-40% of scheduled fixes Creates structurally missing waves, impeding longitudinal tie change assessment.
Individual Absence Animal foraging out of range 5-30% of individuals per wave Leads to reduced node set, affecting density & degree estimates.
Sparse Interactions Rare dominance encounters in cryptic species Network Density < 0.05 Challenges rate function estimation; increases standard errors.
Partial Observation Limited field of view from camera traps 15-50% of group unobserved Results in censored, non-random missing ties, risking bias.

Table 2: Comparison of Data Handling Methods for RSiena

Method Core Principle RSiena Implementation Advantages Limitations
Multiple Imputation Generates multiple plausible network versions. sienaGOF with imputed data sets. Propagates uncertainty, provides robust CI. Computationally intensive; assumes Missing at Random (MAR).
Structural Zeros Defines impossible ties (e.g., across pens). sienaDependent with structZero matrix. Realistically constrains solution space. Does not address random or informative missingness.
Hypermodel (Bayesian) Integrates uncertainty into estimation. RSiena with Bayes option in sienaAlgorithm. Fully Bayesian handling of missing data. High complexity, long run times.
Listwise Deletion Removes nodes/periods with excessive missingness. Manual preprocessing before creating sienaDependent. Simple, straightforward. Loss of power, potential for severe bias.

Experimental Protocols for Mitigation and Validation

Protocol A: Pre-Collection Study Design to Minimize Missingness

  • Power Analysis: Prior to data collection, use agent-based simulations (RSimulate in RSiena) to generate networks under expected missing data regimes (e.g., 20% node loss). Determine the minimum observation intensity required for reliable parameter recovery.
  • Hardware Redundancy: Deploy overlapping observation systems (e.g., GPS collars + camera traps + direct observation) for the same subject group. Cross-validate sightings to fill gaps.
  • Structured Observation Schedule: Implement a stratified sampling schedule ensuring all individuals are targeted for focal follows equally across all observation waves, even if some data will be missing.

Protocol B: Post-Hoc Data Processing and Imputation for RSiena

  • Data Diagnostics: Calculate per-wave network density, node attendance, and the Jaccard index between successive observed networks. A Jaccard index below 0.3 indicates excessive instability, possibly due to missing data.
  • Imputation using Temporal Exponential Random Graph Models (TERGM):
    • Use the btergm or ergm package in R to fit a TERGM on the observed panel of networks.
    • Simulate 20 complete network copies for each missing wave, conditional on the observed ties and network statistics (e.g., edges, reciprocity, transitivity).
    • Create multiple siena data sets (sienaDataCreate) from the imputed networks and analyze using siena07 in parallel.
    • Pool parameter estimates and standard errors using Rubin's rules (mitools package).

Protocol C: RSiena Analysis with Informed Priors for Sparse Networks

  • Model Specification: For very sparse networks, include rate effects for density (outdegree) and reciprocity in the objective function. This stabilizes the estimation of more complex effects.
  • Bayesian RSiena with Informed Priors:
    • Set up a Bayesian SAOM using sienaAlgorithm with bayes=TRUE.
    • For parameters where estimation is uncertain due to sparsity (e.g., transTrip), specify informative priors (e.g., N(0, 0.5)) derived from meta-analyses of similar species or pilot studies.
    • Run extended MCMC chains and check Gelman-Rubin diagnostics for convergence.
  • Goodness-of-Fit (GOF) Testing: After model estimation, use sienaGOF to simulate networks from the fitted model. Compare distributions of geodesic distances, triad censuses, or edgewise shared partners between simulated and observed (with imputations) networks to validate the model's fit despite initial data sparsity.

Visualizations

Diagram: Workflow for Handling Missing Data in RSiena Analysis

Title: RSiena Missing Data Workflow

Diagram: SAOM for Animal Networks with Missing Data Mechanisms

Title: Latent Network Model with Missing Data

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Tools for Robust Animal Social Network Analysis

Item / Solution Function in Context of Sparse/Missing Data Example Product / R Package
Programmable GPS Loggers High-frequency spatial data allows proximity-based network inference, filling gaps in direct interaction logs. OrniTrack-50, Movebank API.
Automated Camera Array Provides continuous, multi-angle coverage to reduce individual absence artifacts. Reconyx HyperFire 2, camtrapR package.
Passive Integrated Transponder (PIT) Systems Logs all visits of tagged individuals to key nodes (feeders, burrows), generating complete association matrices. Oregon RFID, feedr package.
R aniSNA Suite Curated packages for diagnostics (asnipe), imputation (btergm), and RSiena wrapper (RSiena) workflows. asnipe, btergm, RSiena.
Network Simulation Framework Generates synthetic data under known missingness regimes to test analysis robustness. RSimulate (in RSiena), igraph functions.
High-Performance Computing (HPC) Access Enables multiple imputation, Bayesian RSiena, and large simulation studies, which are computationally demanding. Cloud platforms (AWS), university HPC clusters.

Application Notes for RSiena in Animal Social Networks Research

Within the broader thesis on RSiena software for the analysis of animal social networks, managing model complexity is paramount for deriving biologically interpretable and statistically robust conclusions. The core principle is parsimony: selecting the simplest adequate model to explain observed social dynamics. Overly complex models risk overfitting, reduced generalizability, and obscured key drivers of behavior.

Key Principles for Effect Simplification

  • Theory-Driven Selection: Base effect inclusion on a priori hypotheses from behavioral ecology (e.g., dominance hierarchies, kinship, spatial constraints).
  • Sequential Testing: Add effects incrementally (structural > homophily > covariate-related) and assess significance.
  • Collinearity Check: Use RSiena's siena06() procedure to detect and mitigate correlation between effect statistics.
  • Goodness-of-Fit (GoF) Assessment: After model convergence, use sienaGOF() to test whether the simplified model adequately reproduces key network features (e.g., triad census, degree distribution).

Table 1: Comparison of Complex vs. Parsimonious RSiena Model for Baboon Grooming Network

Model Feature Complex Model (All Effects Tested) Parsimonious Final Model Interpretation
Rate Parameter (Period 1-2) 8.45 (p=0.12) 7.21 (p=0.08) Stable estimated change opportunities.
Outdegree (Density) -2.10 -2.05 Consistent low baseline tie probability.
Reciprocity 1.85 1.80 Strong, stable mutual grooming preference.
Transitivity 0.65 0.68 Strong closure in grooming triangles.
Age Homophily 0.55* 0.52* Significant preference for similar-age partners.
Dominance (Sender) 0.30 (p=0.25) -- Effect excluded; not significant.
Kinship 1.15 1.20 Strong preference for grooming relatives.
Convergence t-ratios All < 0.1 All < 0.1 Both models converged adequately.
Overall Maximum Convergence Ratio 0.15 0.09 Parsimonious model shows better convergence.
Bayesian Information Criterion (BIC) 1245.7 1232.3 Lower BIC supports the parsimonious model.

Note: * p < 0.05; * p < 0.01; -- effect not included.*

Experimental Protocols

Protocol 1: Sequential Model Specification and Simplification for RSiena

Objective: To build a parsimonious stochastic actor-oriented model (SAOM) explaining network dynamics.

Materials: RSiena software (v.1.3-14 or later), R environment, network data in matrix form, attribute data for covariates.

Procedure:

  • Data Preparation: Format dependent network data (e.g., grooming adjacency matrices) for each wave into R matrices or sparse matrix objects. Format covariates (e.g., age, dominance rank, kinship) as vectors or matrices.
  • Initial Descriptive Analysis: Calculate network statistics (density, reciprocity, Jaccard index for stability) using sna or igraph to inform plausible effects.
  • Create RSiena Data Object: Use sienaDependent() to specify the network panels. Use coCovar() or varCovar() for covariates. Combine with sienaDataCreate().
  • Specify Basic Structural Model:
    • Define an initial effects object using getEffects().
    • Include essential structural effects: density (outdegree), reciprocity (recip), and transitivity (transTrip or transRecTrip).
    • Estimate initial model: siena07(myAlgorithm, data=myData, effects=myEff).
    • Check convergence (all t-ratios for convergence < |0.1|).
  • Add and Test Covariate Effects Sequentially:
    • Step A: Add one behavioral or homophily effect (e.g., effFrom for a covariate sender effect, simX for homophily).
    • Re-estimate the model.
    • Evaluate significance (|t-statistic| > 2). If non-significant, consider removal.
    • Step B: If significant, check for collinearity with existing effects using siena06().
    • Step C: Repeat Steps A-B for each candidate effect derived from hypothesis.
  • Final Model Assessment:
    • Conduct a joint significance test on all parameters in the final, reduced model.
    • Perform Goodness-of-Fit (GoF) tests on auxiliary statistics not explicitly modeled.
    • Document the final effect parameters, significance, and convergence statistics.

Protocol 2: Goodness-of-Fit (GoF) Testing for a Simplified Model

Objective: To validate that a parsimonious RSiena model adequately captures the observed network's structure.

Procedure:

  • After obtaining a converged, simplified model (ans), select auxiliary network statistics. Common choices include indegree distribution, outdegree distribution, geodesic distance distribution, and triad census.
  • Simulate Networks: Use sienaGOF(ans, auxiliaryFunction=triadCensus, verbose=TRUE) to simulate many networks based on the fitted model and calculate the chosen statistic for each.
  • Calculate Discrepancies: The function compares the distribution of the statistic from the simulated networks to the observed value from the original data.
  • Visual & Statistical Evaluation: Plot the distributions (plot(gof.object)) and examine the p-value from the Monte Carlo test. A p-value > 0.05 (or > 0.10) suggests the model does not significantly misfit on that statistic.
  • Iterate: If significant misfit (p < 0.05) is found for a critical statistic, consider if the model needs a theoretically justifiable additional effect to capture this feature, while balancing parsimony.

Visualizations

RSiena Model Simplification Workflow

The Scientist's Toolkit: Research Reagent Solutions

Table 2: Essential Components for RSiena-based Animal Social Network Analysis

Item Function in Research Example/Specification
Focal/Ad Libitum Behavioral Data Raw input for constructing social networks. Defines ties (e.g., grooming, proximity). Ethogram-defined interaction events, collected via focal animal sampling or automated tracking.
Individual Animal Covariates Explanatory variables for network dynamics. Age, sex, dominance rank index, genetic relatedness coefficient, hormone levels.
RSiena Software Suite Core statistical environment for SAOM estimation and simulation. R package RSiena (v.1.3-14+). Requires R (v.4.0.0+).
High-Performance Computing (HPC) Cluster Facilitates model estimation, which is computationally intensive for large networks or many waves. Access to multi-core nodes for parallelizing siena07 runs via siena07RunToConvergence.
Network Preprocessing Scripts Converts raw observation data into adjacency matrices and covariate files for RSiena. Custom R/Python scripts for data aggregation, matrix construction, and Jaccard index calculation.
Goodness-of-Fit Diagnostic Scripts Validates the simplified model's ability to reproduce network features. R scripts utilizing sienaGOF with custom auxiliary statistics (e.g., specific triad types).
Visualization Tools Communicates network structure, change, and model results. R packages igraph (for static plots) and networkDynamic (for dynamic visualization).

Performance Tips for Large-Scale Animal Network Datasets

Abstract: This Application Note provides a structured guide for managing and analyzing large-scale animal network data within the RSiena framework, a primary tool for longitudinal social network analysis. The protocols focus on computational efficiency, data integrity, and scalability, contextualized within a broader thesis on advancing animal social networks research for applications in behavioral ecology and translational drug development.

Data Pre-processing & Management Protocols

Efficient pre-processing is critical for handling large datasets. The following protocol standardizes data ingestion for RSiena.

Protocol 1.1: Data Compression and Formatting for RSiena

  • Source Data Consolidation: Compile observation waves into individual .csv files for networks (adjacency matrices) and covariates. Use non-numeric delimiters (e.g., "NA") for missing data.
  • Matrix Sparsity Optimization: For large groups ((n > 500)), convert adjacency matrices to sparse matrix format using the Matrix package in R before RSiena import. This reduces memory footprint.
  • RSiena Data Object Creation: Use the sienaDataCreate() function with the sparse = TRUE argument for network data. Store the returned siena object as an .RData file for rapid subsequent loading.

Table 1: Performance Impact of Sparse Matrix Format

Network Size (n) Dense Matrix Memory (MB) Sparse Matrix Memory (MB) RSiena Model Creation Time (s)
200 0.32 0.12 15.2
500 2.00 0.45 48.7
1000 8.00 1.20 185.3
2000 32.00 3.10 721.5

Note: Memory values are for one binary network matrix. Sparse matrix savings increase with higher sparsity.

Computational Performance Optimization

RSiena estimation is computationally intensive. These protocols leverage parallel computing and algorithmic settings.

Protocol 2.1: Parallelized Parameter Estimation

  • Cluster Initialization: On a high-performance computing (HPC) cluster or multi-core machine, initialize a parallel cluster using the siena07 backend. Example:

  • Model Specification: Define the effects structure in the sienaAlgorithmCreate() function. Use projname to define unique output file names for parallel runs.
  • Parallel Estimation: Execute siena07() with the cl = mycluster and useCluster = TRUE arguments. Always include returnDeps = TRUE for simulation-based diagnostics.
  • Post-Erection Consolidation: Use sienaGroup() to aggregate results from multiple independent runs for robustness.

Protocol 2.2: Intelligent Phase Specification

  • Submodel Estimation: For complex models with many effects, run initial estimation phases with maxlike = TRUE (Method of Moments) to quickly approximate parameters.
  • Final Refinement: Use the output from Step 1 as starting values for a final estimation using maxlike = FALSE (Maximum Likelihood Estimation) for accurate standard errors. This two-phase approach reduces total computation time.

Title: RSiena Large-Scale Data Analysis Workflow

Validation & Diagnostic Protocols

Robust diagnostics prevent model degeneracy and ensure reliability in large-scale inference.

Protocol 3.1: Simulation-Based Goodness-of-Fit (GoF)

  • Auxiliary Statistics Selection: After model convergence, define a set of auxiliary network statistics (e.g., geodesic distances, triad census) not used in the model estimation.
  • Parallel Simulation: Use sienaGOF() with the parallel = TRUE argument to simulate networks based on the fitted model.
  • Statistical Comparison: Execute a Kolmogorov-Smirnov test comparing the distribution of observed vs. simulated auxiliary statistics. A non-significant p-value ((p > 0.05)) indicates good fit.

Table 2: Key Diagnostic Statistics and Acceptable Ranges

Diagnostic Metric Target Range Interpretation
Overall Maximum Convergence Ratio (t-conv) ( t < 0.25 ) Indicates successful convergence of all parameters.
Individual t-ratios ( t < 0.10 ) Convergence for specific parameters.
Goodness-of-Fit p-value ( p > 0.05 ) Model adequately reproduces network structure.
Monte Carlo Standard Error ( SE < estimate /2 ) Parameter estimate is precise relative to its magnitude.

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Computational Tools for Large-Scale Animal Network Analysis

Item/Software Function & Application in Analysis
RSiena (v. 1.3-12 or later) Core software for stochastic actor-oriented modeling (SAOM) of longitudinal network data.
R Matrix Package Provides sparse matrix data structures to efficiently store and manipulate large, sparse networks.
Parallel (R core package) Enables multi-core processing for RSiena estimation and GoF simulations, drastically reducing time.
snowFT Package Provides fault-tolerant parallel computing, essential for long-running jobs on HPC clusters.
Graphviz Used for visualizing complex model structures and derived network pathways (as in the diagram above).
RStudio Server Pro Web-based IDE allowing researchers to run analyses on remote, powerful servers from any location.
Custom R Script Repository Version-controlled collection of scripts for data formatting, batch RSiena runs, and result extraction.

Title: Diagnosing and Solving RSiena Model Degeneracy

Application Notes and Protocols

This document outlines advanced protocols for investigating the coevolution of multiple social networks (e.g., proximity, aggression, grooming) and behavioral traits (e.g., boldness, hormone levels) within the framework of RSiena (Simulation Investigation for Empirical Network Analysis). This work forms part of a broader thesis on extending RSiena, a stochastic actor-oriented model (SAOM) framework designed for human networks, to the complexities of longitudinal animal social network data. These methods allow researchers and drug development professionals to model how social connections influence, and are influenced by, multiple individual phenotypes, a dynamic critical for understanding social behavior's biological underpinnings and potential pharmacological modulation.

1. Protocol: Longitudinal Multi-Network Data Collection for RSiena

Objective: To collect the time-structured, network and covariate data required for RSiena coevolution analysis. Materials: See "Research Reagent Solutions" table. Procedure: 1. Subject Identification & Baseline: Anesthetize and PIT-tag all individuals (N>50 recommended for power) in the study population. Record baseline covariates (e.g., body mass, sex, baseline GC from hair/fecal sample). 2. Observation Sessions: Conduct standardized behavioral observations (e.g., 10-minute scan samples every 30 minutes for 5 days) to construct networks for each type of social tie (e.g., proximity <1m, grooming, aggression). Use automated tracking (RFID, GPS) where possible for continuous proximity. 3. Temporal Waves: Repeat step 2 at defined intervals (e.g., 3-4 time points, t1, t2, t3), ensuring the interval is appropriate for the behavioral and physiological scales of interest (e.g., weeks for hormones, seasons for dominance). 4. Covariate Measurement: At each wave, collect individual-level state variables. For physiological measures like glucocorticoids (GC), conduct a controlled stress series (capture, restraint, blood sample at 0, 15, 30 min post-stress) on a subset or use non-invasive fecal sampling. Score behavioral traits via standardized assays (e.g., open field test for boldness). 5. Network Construction: For each wave and network type, create N x N adjacency matrices. For proximity, use a threshold (e.g., individuals recorded within 1m in >30% of scans per wave have a tie). Weighted or binary ties are acceptable for RSiena.

2. Protocol: RSiena Model Specification for Coevolution

Objective: To specify an SAOM that jointly models the coevolution of two networks and multiple behaviors. Software: RSiena (version 1.4-4 or later) in R. Procedure: 1. Data Preparation: Create dependent network objects (sienaNet) for each network (e.g., network.prox, network.groom). Create covariate objects (coCovar, varCovar, sienaCompositionChange) for actor attributes. 2. Model Specification: - Define the basic rate functions for each network's change between waves. - Specify structural effects for each network (e.g., outdegree, reciprocity, transitivity). - Add cross-network effects (e.g., eval term: X -> Y models how ties in network X influence the formation of ties in network Y). - Specify behavioral evolution using the sienaBehavior function. Include shape (tendency), linear/quadratic tendencies, and social influence effects (e.g., avSim for average similarity to alters). - Include network-behavior coevolution effects: * Selection: How a behavior influences tie formation (e.g., egoX, altX, simX). * Influence: How network position influences behavioral change (e.g., totSim for total similarity). 3. Model Estimation: Use siena07 with Phase 2 and 3 iterations. Check convergence (t-ratios < |0.1|) and overall maximum convergence ratio (< 0.25). 4. Goodness-of-Fit: Use sienaGOF to test if the model adequately reproduces key network statistics (e.g., geodesic distances, triad census).

3. Protocol: Experimental Pharmacological Intervention within Coevolution Framework

Objective: To test the causal effect of a drug (e.g., a glucocorticoid receptor antagonist like Mifepristone) on social network-behavior coevolution. Materials: See "Research Reagent Solutions" table. Procedure: 1. Baseline Period (t1): Complete Protocol 1, Wave 1. 2. Randomized Assignment: Randomly assign subjects to Treatment (drug) and Vehicle control groups, matched for baseline network centrality and GC levels. 3. Treatment Administration: Administer drug or vehicle via appropriate route (e.g., oral, injection) at a defined dose and schedule that maintains target engagement over the observation period. 4. Post-Treatment Observation (t2): Repeat Protocol 1, Wave 2, during the treatment period. 5. Washout & Follow-up (t3): Cease treatment, allow for washout, and repeat Wave 3 to assess persistence of effects. 6. RSiena Analysis: Include a time-varying covariate for treatment group. Test interaction effects between treatment and key parameters (e.g., does treatment alter the strength of social influence on GC dynamics?).

Visualizations

Title: Workflow for Multi-Network Coevolution Study

Title: Drug Action on GC-Social Feedback Loop

Research Reagent Solutions

Item Function & Relevance to Research
Passive Integrated Transponder (PIT) Tags & Readers Enables unique identification and automated, continuous recording of proximity/interactions for dynamic network construction.
Miniaturized Biologgers (GPS, Accelerometers) Provides high-resolution spatial and activity data to infer behavioral states and social encounters beyond direct observation.
Glucocorticoid Receptor Antagonist (e.g., Mifepristone) Pharmacological tool to block GR signaling, allowing causal tests of the HPA axis role in network-behavior coevolution.
Enzyme Immunoassay (EIA) Kits for Glucocorticoids For quantifying fecal, salivary, or plasma GC concentrations as a key physiological covariate in coevolution models.
Automated Video Tracking Software (e.g., EthoVision, DeepLabCut) Extracts precise locomotor and social interaction data from video, feeding into network matrices and behavioral scores.
RSiena Software (R package) The core analytical platform for estimating stochastic actor-oriented models of network-behavior coevolution.
Standardized Behavioral Arenas (Open Field, Novel Object) Used for consistent, repeated assays of personality traits (boldness, exploration) as evolving covariates in the model.

Quantitative Data Summary: Key RSiena Effects in Coevolution Studies

Effect Type Parameter Name (RSiena) Typical Estimate Range* Interpretation in Animal Studies
Network Structure Outdegree (density) -1.5 to -3.0 Baseline tendency to form ties (usually negative due to sparsity).
Reciprocity 0.5 to 2.5 Tendency for mutual ties. Strong in grooming/alliance networks.
Transitive Triads 0.1 to 0.5 Tendency for friend-of-a-friend ties (closure).
Cross-Network Proximity -> Grooming 0.2 to 1.0 Proximity ties increase the likelihood of grooming ties forming.
Selection Similarity on GC (simX) 0.1 to 0.8 Tendency to associate with others having similar GC levels.
Influence Average Alter GC (avSim) 0.05 to 0.4 An individual's GC level changes toward the average of its alters.
Behavior Shape GC Linear Tendency -0.3 to 0.3 Directional trend in GC level (negative = decrease over time).
GC Quadratic Tendency -0.1 to -0.5 Curvilinear trend (often negative, indicating reversion to mean).

*Ranges are illustrative, derived from simulated and empirical examples in literature. Actual values depend on species, network definition, and timescale.

Validating Your Model and Comparing RSiena to Alternative Network Tools

Within a doctoral thesis employing RSiena (Simulation Investigation for Empirical Network Analysis) for longitudinal animal social network research, internal validation is paramount. This analysis ensures the robustness, reliability, and reproducibility of the statistical models used to understand social dynamics, such as the evolution of dominance hierarchies or cooperative alliances. Two critical components of this validation are the use of Jaccard indices to assess model stability and comprehensive sensitivity analysis to probe the influence of modeling choices on inferred social parameters. These methods guard against overinterpretation of stochastic actor-oriented models (SAOMs) fitted with RSiena.

Core Concepts and Application Notes

Jaccard Index in RSiena: The Jaccard index measures the stability of model estimation. For RSiena's Method of Moments (MoM) estimation, it quantifies the overlap between the observed network data and the networks simulated during the estimation process. A low Jaccard index for a given wave indicates poor convergence, suggesting the model cannot adequately simulate networks similar to the observed data, thus casting doubt on the parameter estimates.

Sensitivity Analysis: This involves systematically varying components of the analytical pipeline—such as network definition criteria (e.g., proximity interaction duration thresholds), model specification (inclusion/exclusion of specific effects), or data handling (treatment of missing data)—and observing the impact on the final parameter estimates and significance. In animal social network studies, this is crucial as biological conclusions (e.g., "this population shows strong social learning") must be resilient to methodological decisions.

Table 1: Interpretation of Jaccard Indices in RSiena

Jaccard Index Range Interpretation Recommended Action
< 0.20 Critical Instability. Estimation has failed. Re-specify model, check data, increase simulation iterations (n3).
0.20 - 0.30 Poor Stability. Results are highly unreliable. Increase n2 and n3 parameters substantially; consider model simplification.
0.30 - 0.50 Moderate Stability. Results are acceptable but interpret with caution. Report indices. Minor model adjustments may improve stability.
0.50 - 0.60 Good Stability. Model convergence is satisfactory. Standard for reliable inference.
> 0.60 Excellent Stability. Very reliable convergence. Ideal outcome.

Table 2: Sensitivity Analysis Results Template (Hypothetical Study on Grooming Networks)

Altered Condition Key Parameter (e.g., "Transitivity") Estimate (Original) Estimate (Altered) Change in Significance? Conclusion on Robustness
Baseline Model Transitivity 0.65 (p<0.01) Reference.
Threshold: 5 sec -> 10 sec Transitivity 0.65 (p<0.01) 0.58 (p<0.05) Yes (weakened) Moderately robust; effect persists but is sensitive to definition.
Excluding juveniles Transitivity 0.65 (p<0.01) 0.11 (p=0.62) Yes (lost) Not robust; effect driven by juvenile interactions.
Alternative Convergence (n2=2000) Transitivity 0.65 (p<0.01) 0.63 (p<0.01) No Highly robust to estimation settings.

Experimental Protocols

Protocol 1: Calculating and Interpreting Jaccard Indices in RSiena

  • Model Estimation: Fit your SAOM using siena07() in RSiena. Ensure sufficient phase 2 and 3 iterations (e.g., n2=1000, n3=3000).
  • Extract Indices: After estimation, the Jaccard index for each wave is contained in the sf element of the sienaFit object. Access via fit$sf$Jaccard.
  • Assessment: Compare the obtained indices against the benchmarks in Table 1. If any wave index is below 0.30, estimation stability is problematic.
  • Troubleshooting Low Indices:
    • Increase simulation numbers: Rerun with n2=2000, n3=5000.
    • Simplify the model: Remove statistically insignificant or complex effects (e.g., higher-order interactions).
    • Check data quality: Ensure network coding is consistent and there are no extreme outliers.

Protocol 2: Conducting a Sensitivity Analysis for an RSiena-Based Animal Network Study

  • Define Baseline: Establish your final, preferred RSiena model and record all parameter estimates, standard errors, and significance levels.
  • Identify Sensitivity Axes: List methodological variables that could plausibly alter results. Common axes include:
    • Network Definition: Vary the interaction duration threshold or observation window.
    • Sample Composition: Run models on subsets (e.g., adults only, males only).
    • Model Specification: Test alternative combinations of structural effects.
    • Missing Data: Apply different imputation methods for unobserved interactions.
    • RSiena Settings: Vary n2, n3, or the algorithm (MoM vs. Maximum Likelihood).
  • Iterative Re-Running: For each axis, alter only that condition while keeping others constant. Re-run the RSiena model.
  • Systematic Comparison: For each altered condition, compile results as in Table 2. Focus on the sign, magnitude, and significance of key theoretical parameters.
  • Synthesis: Determine the conditions under which your core findings change. A finding is robust only if it persists across a plausible range of analytical decisions.

Visualization of Workflows

RSiena Jaccard Validation Workflow

Sensitivity Analysis Protocol for RSiena

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials and Tools for RSiena Internal Validation

Item Category Function in Validation
RSiena Software (v.1.3-xx) Core Software The R package used to specify, estimate, and simulate Stochastic Actor-Oriented Models for longitudinal network data.
R/RStudio Computing Environment Provides the platform for running RSiena, managing data, and executing validation scripts.
High-Performance Computing (HPC) Cluster Access Computational Resource Sensitivity analysis requires numerous model re-fits; HPC drastically reduces computation time.
Structured Observation Data (e.g., HMM, GPS) Primary Data Raw behavioral or spatial data from which networks are constructed for RSiena analysis.
Custom R Scripts for Batch Processing Software Tool Automates the running of multiple RSiena models across sensitivity conditions, ensuring consistency.
Jaccard Index Extraction Function Diagnostic Tool A script (fit$sf$Jaccard) to retrieve stability metrics from fitted RSiena objects.
Results Aggregation Template (e.g., Table 2) Data Management A pre-formatted spreadsheet or R data frame to systematically record outcomes from all sensitivity runs.
Statistical Comparison Function Analysis Tool Scripts (e.g., using dplyr or base R) to calculate differences in estimates between baseline and sensitivity models.

This document provides application notes and protocols for social network analysis (SNA) within animal social networks research, framed by a broader thesis investigating the utility of RSiena for modeling the dynamics of animal societies. While traditional static SNA tools (igraph, UCINET) describe network structure at a single point in time, RSiena enables the statistical modeling of network change and behavioral co-evolution, crucial for understanding the temporal dynamics underlying social structure, disease transmission, and information flow in animal populations. This comparative analysis is foundational for research into social determinants of health and behavior, with implications for epidemiological modeling and intervention strategies.

Core Conceptual Comparison

Table 1: Fundamental Comparison of RSiena and Static SNA Tools

Feature RSiena (Dynamic) igraph / UCINET (Static)
Primary Purpose Statistical modeling of network & behavior change over time. Description, visualization, and hypothesis-testing of single or multiple static networks.
Data Requirement Minimum of two (ideally more) network observations at discrete time points. One or more cross-sectional network snapshots.
Key Output Model parameters (rates, objective function) estimating selection & influence effects. Network metrics (centrality, density, clustering), visualizations, statistical tests.
Underlying Framework Stochastic actor-oriented models (SAOMs) based on utility maximization. Graph theory & algebraic sociometry.
Temporal Inference Direct, models the process of tie change. Indirect, via comparison of separate snapshots.
Handling Missing Data Explicit models for missing network ties. Typically requires complete data or ad-hoc imputation.

Application Notes & Experimental Protocols

Protocol for Longitudinal Data Collection in Animal Studies

Objective: To collect robust longitudinal social interaction data suitable for both static and dynamic (RSiena) network analysis.

Materials:

  • Focal animal population with unique identifiers.
  • Data logging system (e.g., GPS proximity loggers, automated video tracking, RFID).
  • Ethogram defining interaction types (e.g., grooming, aggression, proximity within X meters).

Procedure:

  • Study Design: Define observation periods (waves) corresponding to relevant biological cycles (e.g., seasons, breeding periods). Ensure periods are spaced to allow for potential network change.
  • Data Recording: For each wave, collect interaction data over a defined sampling window to construct a reliable network snapshot.
    • For proximity loggers: Deploy devices, download continuous data. Aggregate contacts into a symmetric association matrix (e.g., proportion of time spent within proximity).
    • For focal sampling: Use standardized methods to record all interactions of focal individuals.
  • Network Construction: For each wave t, create an adjacency matrix $X^t$, where $x_{ij}^t$ represents the strength or frequency of interaction between individuals i and j. Apply a consistent threshold or weighting scheme across waves.
  • Covariate Data: Concurrently record individual attributes (e.g., age, dominance rank, health status, hormone levels) for each wave.
  • Data Formatting: Save each wave's adjacency matrix and attribute data in a format compatible with target software (e.g., .csv, .dl for UCINET; native R matrices for igraph/RSiena).

Protocol for Static Baseline Analysis (igraph/UCINET)

Objective: To characterize network structure at each observation wave and test for differences between groups or time points using static methods.

Workflow:

Static SNA Analysis Workflow (92 chars)

Procedure (using igraph in R):

Protocol for Dynamic Network Analysis (RSiena)

Objective: To model the co-evolution of the social network and individual traits, testing specific hypotheses about social dynamics.

Workflow:

RSiena Dynamic Modeling Workflow (64 chars)

Procedure:

  • Data Preparation:

  • Model Specification:

  • Model Estimation & Diagnostics:

Table 2: Example RSiena Model Output Interpretation

Parameter Effect Estimate (θ) S.E. p-value Interpretation in Animal Context
Rate Network rate (period 1) 3.50 0.21 <.001 Animals evaluate/change ties ~3.5 times per period.
Structural Transitive Triplets 0.28 0.05 <.001 Preference for forming ties that close triangles (friends of friends).
Covariate Health Similarity 0.65 0.15 <.001 Animals associate with others of similar health status (homophily).
Behavior Average Alter (Health) 0.42 0.10 <.001 Social influence: an individual's health score tends toward its associates' average.

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Tools for Animal Social Network Analysis

Item Function & Relevance Example/Detail
Automated Tracking Enables high-resolution, continuous collection of spatial proximity data, foundational for network construction. GPS collars, RFID stations, automated video tracking software (e.g., DeepLabCut, idTracker).
Ethogram Software Standardizes behavioral coding to ensure reliable, replicable definition of social ties. Noldus Observer XT, BORIS, or custom database solutions.
R Environment Primary platform for analysis, integrating data management, static (igraph) and dynamic (RSiena) analysis. RStudio with tidyverse, igraph, RSiena, statnet packages.
UCINET User-friendly GUI for foundational static SNA, descriptive metrics, and permutation tests (MRQAP). Includes NetDraw for visualization. Useful for initial exploratory analysis.
Permutation Test Frameworks Provides non-parametric statistical tests for network hypotheses, controlling for non-independence. igraph functions, sna::qaptest, or Netlogo simulation models.
High-Performance Computing (HPC) Critical for computationally intensive RSiena models (large groups, many waves) and simulation-based GOF tests. Cloud computing clusters or local servers with parallel processing capabilities.

Within a thesis on RSiena software for animal social network analysis, a comparative evaluation of available longitudinal network models is essential. RSiena (Stochastic Actor-Oriented Model, SAOM) and the Temporal Exponential Random Graph Model (TERGM) with its Separable temporal ERGM (STERGM) extension represent two dominant paradigms for modeling network dynamics. This analysis details their theoretical distinctions, application notes, and protocols for researchers in behavioral ecology and translational sciences, where dynamic social interactions can inform models of disease transmission or treatment effects in animal cohorts.

Table 1: Core Theoretical & Practical Comparison

Feature RSiena (SAOM) TERGM/STERGM
Philosophy Actor-oriented; agents drive change via myopic stochastic optimization. Network-oriented; discrete-time Markov process for the whole network.
Time Handling Continuous-time; models unobserved micro-steps between observations. Discrete-time; models the probability of a network at time t given time t-1.
Dependent Unit Network change as sequences of tie additions/deletions by actors. The state of the entire network at a discrete time point.
Parameter Interpretation Rate (speed of change), Objective Function (preferences), Evaluation (uncertainty). Network statistics (e.g., edges, triangles) predicting tie existence in a given time step.
Separation of Dynamics Integrated in the micro-step process. Explicitly separable in STERGM: Formation and dissolution parameters.
Primary Software RSiena (R) btergm, ergm, networkDynamic (R)
Ideal Use Case Testing hypotheses about individual social drivers (e.g., preference for transitive closure) with finely spaced observations. Forecasting network evolution or testing structural hypotheses with clear discrete time intervals (e.g., annual censuses).

Table 2: Illustrative Simulation Results (Hypothetical Data)

Model Simulated Effect (Edge) Parameter Estimate (Mean) Std. Error p-value
RSiena Density -1.50 0.15 <0.001
Transitivity 0.60 0.08 <0.001
Rate Period 1 6.20 0.85 --
STERGM Formation: Edges -2.10 0.22 <0.001
Formation: Triangles 0.45 0.10 <0.001
Dissolution: Edges 1.80 0.20 <0.001

Experimental Protocols for Animal Social Networks

Protocol A: RSiena Model Fitting for Longitudinal Animal Association Data

  • Data Preparation: Format association data (e.g., from GPS, RFID, or direct observation) into adjacency matrices for each observation wave (e.g., daily grouping).
  • RSiena Object Creation: Use sienaDataCreate() to specify networks, covariates (e.g., age, dominance rank), and constant/ changing actor variables.
  • Model Specification: Define the objective function in sienaAlgorithmCreate(). Include structural effects (e.g., transTrip, cycle3, inPop) and covariate effects (egoX, altX, simX).
  • Model Estimation: Run siena07() with appropriate Monte Carlo iterations. Check convergence (t-ratios < |0.1|) and overall goodness-of-fit (sienaGOF).
  • Interpretation: Positive transitivity parameter indicates individuals prefer forming ties that create closed triangles.

Protocol B: TERGM/STERGM Application for Seasonal Network Dynamics

  • Data Preparation: Create a panel of networks for discrete time points (e.g., monthly seasonal networks). Convert to network objects and a list of matrices.
  • Model Specification (STERGM): Using the ergm package, specify separate formation and dissolution formulae (e.g., formation = ~edges + triangles, dissolution = ~edges + offset(edges)).
  • Model Estimation: Use stergm() or the more recent tergm estimation functions (GMM or MCMC). For TERGM, use btergm() with bootstrap confidence intervals.
  • Simulation & Forecasting: Use simulate.stergm to simulate future networks. Compare forecasted networks to held-out data for validation.
  • Interpretation: A positive formation triangle effect indicates ties are more likely formed in triadic closure. A positive dissolution edge effect increases tie survival.

Visualization of Analytical Workflows

Diagram 1: Model Selection Workflow

The Scientist's Toolkit: Essential Research Reagents & Materials

Table 3: Key Research Reagents & Computational Tools

Item Function in Analysis Example/Note
High-Resolution Tracking System (e.g., RFID, GPS) Generates raw proximity/association data for network edge definition. Used to create adjacency matrices for each observation wave.
Behavioral Coding Ethogram Standardized catalog of social behaviors (aggression, affiliation). Provides covariates (actor attributes) or alternative network definitions.
R Statistical Environment Primary computational platform for all analyzed models. Versions 4.3.0+. Essential for reproducibility.
RSiena Package (v. 1.4.0.1+) Implements the Stochastic Actor-Oriented Model (SAOM). Core tool for RSiena analysis; requires sna, network packages.
statnet Suite (ergm, tergm, etc.) Provides tools for TERGM/STERGM estimation, simulation, and diagnostics. btergm package also commonly used for TERGM estimation.
High-Performance Computing (HPC) Cluster Access Facilitates bootstrapping and MCMC estimation for large/complex models. Critical for model convergence with large animal cohorts or dense networks.
Network Visualization Software (e.g., Gephi, ndtv) Enables graphical representation of dynamic networks for hypothesis generation and result presentation. Aids in communicating temporal patterns to interdisciplinary teams.

1. Application Notes & Data Synthesis

The application of RSiena (Simulation Investigation for Empirical Network Analysis) in primatology and rodent behavioral research provides a robust framework for analyzing the co-evolution of social networks and individual traits. The following tables summarize quantitative findings from key validation studies.

Table 1: Validated RSiena Model Effects in Primate Studies

Study Subject (Reference) Key Network Dynamics Effect (Rate, Evaluation, Selection) Parameter Estimate (θ) Convergent t-ratio Key Covariate (Behavior) Interpretation
Rhesus Macaques (2023) Friendship Formation (Density) -1.75 0.12 Grooming (Behavior) Baseline propensity for tie formation is negative, requiring other effects to drive connections.
Transitivity (transTrip) 0.58 0.05 -- Strong tendency for triadic closure ("friend of a friend becomes a friend").
Selection by Dominance Rank (egoX, altX, simX) 0.45 (simX) -0.10 Dominance Rank Individuals prefer ties with others of similar dominance rank (homophily).
Chimpanzees (2022) Rate of Network Change (period 1-2) 4.32 0.08 Cooperative Hunting Network structure is highly dynamic between observation periods.
Reciprocity (recip) 0.87 -0.15 -- Strong mutual tie formation.
Influence on Hunting Participation (fromX) 0.39 0.11 Hunting Participation Social partners influence an individual's likelihood to participate in hunts.

Table 2: Validated RSiena Model Effects in Rodent (e.g., Rat) Pharmacological Studies

Study Design (Reference) Key Network Dynamics Effect Parameter Estimate (θ) Convergent t-ratio Pharmacological Covariate Key Finding
Social Defeat Model (2024) Rate of Interaction Change (post-stress) 5.21 0.09 Fluoxetine (SSRI) Treatment Stress increases network volatility.
Outdegree (Density) Post-Stress -2.15 0.14 -- Stressed animals exhibit fewer social ties overall.
Behavioral Alteration: Social Withdrawal (avgAlt) -0.62 -0.08 Drug vs. Vehicle Treated animals showed less influence from withdrawn partners.
Novel Drug Screening (2023) Pro-social Effect (egoX) 0.78 -0.12 Candidate Drug 'X' (vs. Placebo) Drug administration directly increased individual pro-social activity.
Homophily by Activity State (simX) 0.51 0.06 Locomotor Activity Active individuals preferentially associate.

2. Experimental Protocols

Protocol 1: Longitudinal Primate Social Network Data Collection for RSiena Objective: To collect structured longitudinal data suitable for RSiena analysis of wild or captive primate groups.

  • Subject & Group Definition: Define a closed study group (e.g., all adults in a troop). Conduct individual identification.
  • Focal Behavior Ethogram: Define interaction types (e.g., grooming < 2m, proximity < 1m, agonistic encounters). Binary or frequency counts can be used.
  • Sampling Schedule: Implement structured interval sampling. Conduct daily 30-minute focal follows per individual, rotating through all subjects over a 2-week "wave." Repeat for at least 3-4 waves.
  • Network Matrix Construction: For each wave, construct a directed adjacency matrix. For grooming: a tie from A→B exists if A groomed B above a defined threshold (e.g., total duration > 5 min/wave).
  • Covariate Data Collection: Simultaneously record time-varying individual covariates (e.g., dominance rank index, hormone levels, health score) for each wave.
  • Data Formatting: Format three R data objects: an array of adjacency matrices (dependent networks), a matrix of covariates, and a node-set constant covariance matrix if needed.

Protocol 2: Rodent Social Interaction Network Assay for Pharmacological Intervention Objective: To generate dynamic network data to test drug effects on social behavior using RSiena.

  • Apparatus & Housing: House subjects in a large, enriched visible-burrow system or complex arena. Equip with RFID tracking system (e.g., RFID collar/ear tag) and overhead cameras.
  • Baseline Network Phase (Wave 1): Allow 7-day acclimation. Record spontaneous social interactions (proximity < 5 cm, duration > 2 sec) via RFID and video for 5 consecutive days. Construct Wave 1 adjacency matrix based on interaction frequency.
  • Intervention & Post-Treatment Phase (Wave 2): Randomly assign subjects to Drug or Vehicle groups. Administer treatment per protocol (e.g., oral gavage, i.p.). After a predetermined absorption period, record interactions for another 5 days to construct Wave 2 network.
  • Behavioral Annotation: Supplement RFID data with manual/video scoring of specific behaviors (social investigation, avoidance, aggression) for covariate creation.
  • RSiena Model Specification: Define basic model structure: myModel <- sienaDependent(arrayOfNetworks). Include drug condition as a constant covariate and activity level as a changing covariate. Test for selection (drug influences network structure) and influence (network position affects behavioral change) effects.

3. Mandatory Visualizations

Primate RSiena Longitudinal Analysis Workflow

RSiena Drug Effect Modeling Pathways

4. The Scientist's Toolkit: Research Reagent Solutions

Item Function in RSiena Animal Research
RFID Tracking System Enables automated, continuous recording of social proximity (a network tie definition) with high temporal resolution for multiple subjects.
EthoVision XT or similar Video tracking software for automated quantification of locomotor activity and zone occupancy, used to create behavioral covariates.
BORIS (Behavioral Observation Research Interactive Software) Free, open-source tool for manual coding of complex social interactions from video to validate/refine automated network data.
R Statistical Environment with RSiena & RSienaTest packages Core software for model specification, estimation, and goodness-of-fit testing of longitudinal network models.
sna or igraph R packages For preliminary network visualization, descriptive statistic calculation (e.g., centrality, density) before RSiena analysis.
Standardized Housing (Visible Burrow/Complex Arena) Creates a semi-naturalistic environment that allows expression of a full repertoire of social behaviors critical for valid network construction.
Salivary/Plasma Hormone Assay Kits (CORT, OT) To measure physiological covariates (stress, bonding) that can act as selection or influence variables in the social network.
Automated Data Pipeline Scripts (Python/R) Custom scripts to transform raw RFID/tracking logs into formatted adjacency matrices and covariate files required by RSiena.

Application Notes

The investigation of animal social networks using RSiena (Simulation Investigation for Empirical Network Analysis) is pivotal for understanding behavioral contagion, social support phenotypes, and intervention outcomes in biomedical research. Assessing the robustness and replicability of findings from stochastic actor-oriented models (SAOMs) is critical before translating insights into, for example, pharmacokinetic models of socially modulated drug efficacy. These notes provide a framework for such assessment within a thesis on RSiena-based animal social network research.

Table 1: Key SAOM Parameter Diagnostics for Robustness Assessment

Diagnostic Metric Target Value/Range Indication of Robustness Implication for Replicability
Convergence t-ratio (all parameters) Absolute value < 0.1 Good model convergence. Estimated parameters are reliable and less dependent on initial simulation values.
Overall Maximum Convergence Ratio < 0.25 Acceptable convergence of the entire model. The simulation algorithm has properly explored the parameter space.
Score Test (Overall) p > 0.10 No significant omitted structural effects. Model specification is adequate, reducing specification error risk in replication.
Period-specific sienaTimeTest p-value p > 0.05 (Bonferroni-corrected) Parameter estimates are stable across time. Findings are not driven by a single observational period, aiding temporal replicability.

Table 2: Quantitative Replicability Framework for Network Dynamics

Replication Tier Data Requirement Analytical Requirement Success Criterion (Example)
Direct Replication Same species, strain, housing, observation protocol. Identical RSiena model specification & estimation settings. Parameter signs match, and estimates fall within 95% confidence intervals of original study.
Conceptual Replication Different cohort or lab of same species; similar ethogram. Test same behavioral theories (e.g., transitivity, homophily) in SAOM. Significant effects for the same network mechanisms (e.g., transitive triplets) are reproduced.
Generalization Different species or major ecological context. RSiena models adapted to new network structures/dependent behaviors. Core social process (e.g., status-driven hierarchy formation) is conserved in dynamics.

Experimental Protocols

Protocol 1: Robustness Check Suite for a Single RSiena Analysis

  • Model Estimation: Run the primary SAOM using siena07 with default settings and at least 4,000 iterations. Store the resulting sienaFit object (fit1).
  • Convergence Diagnosis: Extract convergence t-ratios using siena07 output or summary(fit1). Flag parameters with |t-ratio| > 0.1.
  • Goodness-of-Fit (GoF) Assessment: a. Simulate networks from the fitted model fit1 using sienaGOF. b. Calculate auxiliary statistics (e.g., geodesic distances, triad census) for observed and simulated networks. c. Perform a Monte Carlo test (plot.sienaGOF) to check for significant discrepancies (p < 0.05).
  • Specification Robustness Test: a. Create alternative model specifications (e.g., adding/omitting a control effect like activity). b. Re-estimate models and compare key parameter estimates and significance levels. c. Substantial changes indicate fragility.
  • Data Perturbation Test: a. Create a bootstrapped or jackknifed sample by randomly removing 5-10% of observed network edges or one observation wave. b. Re-estimate the model on the perturbed dataset. c. Compare parameter estimates to fit1. Use this to generate confidence intervals for robustness.

Protocol 2: Multi-Cohort Replication Analysis Workflow

  • Standardized Data Collection: Ensure consistent node definition (individual animal), edge definition (social interaction), and observation schedule across independent cohorts (A, B, C).
  • Independent Model Fitting: Run identical, pre-registered RSiena models on each cohort's network panel data separately. Store fit objects fit_A, fit_B, fit_C.
  • Meta-Analytic Synthesis: a. Extract target parameter estimates (θ) and standard errors (s.e.) from each model. b. Perform a fixed-effects meta-analysis: Calculate pooled θ = Σ(θi / s.e.i²) / Σ(1 / s.e._i²). c. Test for heterogeneity using Q-statistic. Significant heterogeneity suggests moderators affecting replicability.
  • Cross-Cohort Prediction: Use the pooled parameters from cohorts A and B to simulate network evolution for cohort C's starting network. Compare predictions to cohort C's observed networks using Quadratic Assignment Procedure (QAP) correlation.

Visualizations

Title: SAOM Robustness Assessment Protocol Workflow

Title: Multi-Cohort Replicability Analysis Framework

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for RSiena-Based Animal Social Network Research

Item Function in Research
Automated Behavioral Tracking System (e.g., RFID, video-based pose estimation) Generates high-resolution, longitudinal interaction data with minimal observer interference, forming the raw input for network matrices.
Ethogram & Behavioral Coding Protocol Standardized operational definitions of social interactions (edges) to ensure construct validity and cross-study comparability.
R Statistical Environment with RSiena Package Core software for specifying, estimating, diagnosing, and simulating Stochastic Actor-Oriented Models (SAOMs).
Pre-registration Template (e.g., on OSF) Document for specifying hypotheses, network measures, SAOM effects, and analysis plans prior to data collection to confirmatory replication.
High-Performance Computing (HPC) Cluster Access Facilitates running large numbers of RSiena simulations (for GoF, bootstrapping) and complex multi-group models in a feasible timeframe.
Standardized Data Exchange Format (e.g., RDS files for network arrays) Ensures exact replication of data input across research teams and simplifies sharing for secondary reproducibility analyses.

Integrating RSiena Results with Other Omics or Environmental Data

Application Notes

Social network dynamics analyzed via RSiena (Simulation Investigation for Empirical Network Analysis) provide a powerful quantitative framework for understanding social influence, selection processes, and behavioral contagion in animal models. These dynamics are a critical phenotype, often acting as both a driver and an outcome of underlying biological states. Integrating RSiena-derived parameters with other omics layers or environmental variables enables researchers to construct mechanistic models linking social behavior to molecular pathways, environmental stressors, and, ultimately, translational outcomes in drug development.

Key RSiena output parameters suitable for integration include:

  • Rate Parameters: Frequency of network change, potentially correlating with stress or activity levels.
  • Evaluation Function Parameters: Strength of network effects (e.g., outdegree, reciprocity, transitivity), which can be treated as quantitative traits.
  • Behavior Function Parameters: Dynamics of co-evolving behavioral traits (e.g., smoking, cooperation), representing the social diffusion phenotype.

This multi-modal integration faces challenges: temporal alignment of longitudinal network snapshots with omics sampling, distinguishing correlation from causation, and the high-dimensional nature of omics data. The protocols below outline structured approaches to overcome these hurdles.

Table 1: Key RSiena Output Parameters for Multi-Omics Integration

Parameter Type Example Biological/Environmental Integration Hypothesis
Network Rate Change frequency per period Correlates with glucocorticoid levels or environmental volatility.
Network Effect Strength of transitivity Associated with gene expression profiles related to social bonding (e.g., oxytocin pathway).
Behavior Effect Average similarity effect May moderate the effect of a pharmacologic agent on behavioral spread.
Covariate Effect Sex alter effect Interaction with sex-hormone related metabolomic signatures.

Protocol 1: Correlating RSiena Parameters with Bulk Transcriptomic Data

Objective: To identify gene expression pathways associated with individual-level social network tendencies.

Materials & Reagents:

  • Animal Model Cohort: Longitudinal behavioral tracking data (e.g., RFID, video) for network construction.
  • RSiena Software Suite: For model fitting and parameter estimation.
  • Tissue Samples: Blood, prefrontal cortex, or relevant tissue collected at endpoint or serial biopsies.
  • RNA Extraction Kit: (e.g., Qiagen RNeasy).
  • RNA-Seq Library Prep Kit: (e.g., Illumina TruSeq Stranded mRNA).
  • Statistical Software: R with packages limma, WGCNA, sienaRI.

Procedure:

  • Network & RSiena Analysis:
    • Construct dynamic networks per observation period.
    • Fit a RSiena model including structural effects (e.g., density, reciprocity) and individual covariates (e.g., sex, weight).
    • Extract individual-specific sienaRI residuals (Ripley et al., 2022) for effects of interest (e.g., tendency to form connections). These residuals act as per-individual social phenotype scores.
  • Omics Data Generation:
    • Extract total RNA from tissue samples, ensuring RNA Integrity Number (RIN) > 8.
    • Prepare sequencing libraries. Sequence on an Illumina platform to a minimum depth of 30M paired-end reads.
    • Process raw reads: trim adapters (Trimmomatic), align to reference genome (STAR), and generate gene count matrices (featureCounts).
  • Integration Analysis:
    • Perform normalization (TMM) and log2 transformation of count data.
    • Use the social phenotype scores (sienaRI residuals) as a continuous trait in a linear model (limma), regressing gene expression against the score while controlling for technical batches and biological covariates.
    • Perform Gene Set Enrichment Analysis (GSEA) on the ranked list of genes from the regression to identify enriched pathways (e.g., immune response, synaptic signaling).

The Scientist's Toolkit: Key Research Reagent Solutions

Item Function in Integration Research
Passive Integrated Transponder (PIT) Tags Enables automated, longitudinal recording of social proximity or associations in free-moving animals.
sienaRI R Package Calculates residual indices of individual positioning in networks, providing quantitative traits for correlation.
Nucleic Acid Crosslinking Reagents (e.g., formaldehyde) For stabilizing protein-RNA complexes in subsequent CLIP-seq protocols when investigating molecular mediators of social behavior.
Single-Cell RNA-Seq Kit (e.g., 10x Genomics) To dissect cell-type-specific transcriptional responses linked to network position.
LC-MS/MS Metabolomics Platform For profiling global metabolomic shifts associated with social stress or dominance derived from RSiena models.

Protocol 2: Testing Environmental Moderation of Social Influence Processes

Objective: To test if an environmental variable (e.g., drug treatment, resource scarcity) moderates the strength of behavioral contagion in the network.

Materials & Reagents:

  • Controlled Environment System: Capable of manipulating and logging environmental variables (e.g., photoperiod, feeder access).
  • Pharmacologic Agent: The compound of interest for drug development.
  • RSiena Software Suite.

Procedure:

  • Experimental Design:
    • Randomly assign animals to treatment (Drug) or control (Vehicle) groups within a single social group or across replicated groups.
    • Administer treatment concurrently with longitudinal network data collection.
  • RSiena Model Specification:
    • Define a behavior variable (e.g., affiliative grooming rate) that co-evolves with the network.
    • Include the primary effect of interest: the average similarity effect, which quantifies behavioral contagion.
    • Specify an interaction effect between the average similarity effect and the treatment covariate. This tests if the drug alters the strength of social influence.
  • Analysis & Interpretation:
    • Fit the model in RSiena. A significant positive interaction term indicates the drug enhances behavioral contagion. A significant negative term indicates it suppresses contagion.
    • Use the sienaTimeTest function to check if the effect of treatment varies over time.

Diagram 1: RSiena-Omics Integration Workflow

Diagram 2: Testing Environmental Moderation in RSiena

Conclusion

RSiena provides an unparalleled statistical framework for uncovering the complex, time-dependent social processes that govern animal societies, offering critical insights for biomedical research. By mastering its foundational logic, methodological workflow, troubleshooting techniques, and validation practices, researchers can robustly model phenomena such as social contagion of stress, transmission of infectious agents, or the spread of learned behaviors in preclinical models. The future lies in integrating these dynamic network models with neurobiological, pharmacological, and genomic data, paving the way for a more holistic understanding of how social dynamics influence health, disease progression, and treatment efficacy. This approach promises to refine animal models, enhance translational relevance, and ultimately inform interventions targeting socially modulated pathways in human health.