Programme

Introduction

  1. Background information on R and RStudio (🔗slides)

    More info, if needed:

  2. Presentation of the package’s website: https://umr-astre.pages.mia.inra.fr/sit/.

    • Central venue for documentation, code, questions, installation instructions, tutorials, everything.
  3. Overview of the sit R package.

    • Installation of sit. Go through installation steps and make sure everyone has it running.

  1. Introduction to R and RStudio (Sow)

  2. Package Documentation.

    • Reference pages and vignettes are accessible from the web and from within R. Demo.
  3. MRR data templates

    • Overview of the legacy Excel template.

      A single point-release only. Rings as approximation to distances. No adjustment for non-uniform spatial arrangement of traps.

    • Overview of sit’s data model, to establish nomenclature. Works with GPS coordinates and computes distances. Improves or extends some of the calculations. In general, should give similar results.

Importing MRR data into sit

  1. Basic data manipulation in R (🔗slides)

    Filtering observations, selecting and renaming variables, reshaping tables, tidy format. A more realistic example.

  2. Practical session

    Objective: get your data into a sit object. Verify correctness. If no own data, reproduce the introductory example in your computer.

    If not finished, finish at home and review in the morning.

  3. Special data types: geographical coordinates; coordinate reference systems; dates and times. (🔗web)

  4. Take-home tasks (optional)

Statistical Data analysis using the sit R package

  1. The sit object. Descriptive and graphical summaries. Extracting the data. Saving and loading.

    1. Retrieving Results Section 1: Overview of the experimental set up

    2. Retrieving Results Section 2: Extraction functions.

  2. Retrieve results on competitiveness, survival and dispersion of sterile males and on density of the wild population.

  3. Practical session with real MRR data from participants or from the sit_prototype in the package.

    • Objective: make a script that loads the data, produces the sit object and saves it for back-up. Make a second script that performs some analyses.

Data analysis workflow

  1. Getting help and helping. Building a community of users. The sit website. Reporting bugs, issues and feature requests in the development platform. Asking for and providing support in the mailing list.

  2. Making a statistical report with Rmarkdown. Manage project materials using RStudio projects.

    References:

    Hadley Wickham and Garrett Grolemund (2017) R for Data Science. Chapter 27: R Markdown

    Hadley Wickham and Garrett Grolemund (2017) R for Data Science. Chapter 8: Workflow: projects

    Rafael A. Irizarry (2021)Introduction to Data Science. Chapter 40: Reproducible projects with RStudio and R markdown

    Video Tutorial: 7-3 Interactive Data Analysis - Converting R Notebooks into R Markdown Documents

  3. Guided practical session

    • Objective: set up a full project analysis template from reading data from files, to producing the analytical report in RMarkdown. Use your own data or the demonstration files provided in the package.

    • Template project produced live during the session (🡇zip)

  4. Data analysis of pilot-trial data: population suppression trials

  5. Principles of data management (🔗slides)

Conclusions

Advantages of the R-package over the Excel template:

  • Support multiple release points
  • Support areal releases
  • Adjustment for non-uniform arrangements of traps
  • Work with precise coordinates and distances rather than rings
  • Programmable and reproducible analyses
  • No need to manually tweak formulas and tables with the risk of forgetting something and getting wrong results.

Perspectives:

  • The package tries to provide reasonable defaults to facilitate retrieving typical results, while providing enough flexibility to conduct more advanced analyses.

  • Pooling vs no-pooling: partial pooling using random effects models.

  • Statistical inference: uncertainty of estimates (beyond point estimates), joint modelling of multiple parameters.

  • Integrate new methods and models in the package.