Above: Suzanne Moja, 36, from Madrid, Spain is an epidemologist and supervises the team of MSF researchers in Bokoro, Chad. Source: Charlotte Morris/MSF.
The survey templates have been developed to match the data that would be collected using MSF Ethical Review Board pre-approved surveys on:
The survey report templates are special “RMarkdown” scripts, and are included with sitrep package installation. RMarkdown scripts give you a way of writing your analysis code and report in one document by allowing you to interleave chunks of code that the computer will run in between explanatory text meant for a person to read.
For assistance using the survey templates
Read more about RMarkdown scripts and how to modify the templates in the Training Materials pages of this website.
The case study walk-though offers you a step-by-step guide on how to use the templates to fit your data and run the analysis. Althrough the walk-through uses an outbreak template, the structure and techniques used to modify the templates are similar. At the start of the walk-through you can download the case study’s data files and final template to follow along. In the walk-through, you will learn lessons for modifying the templates to fit your needs.
To access the survey templates:
Further resources
This website will not cover the detailed aspects of designing, implementing and analysing a survey in MSF contexts. There are many other training and briefing materials that address this aspect. We will cover a few crucial concepts which are required for the understanding and use of the templates for your survey analysis.
For further survey methodology resources please see:
There are different ways to sample populations for surveys. The most commonly used in MSF settings are:
The templates can accomodate analysis of data from all sampling designs.
Before implementing a survey you will need to calculate the sample size for that survey.
When calculating a sample size for your survey you take the below parameters into consideration:
You will need to use an estimate of the prevalence/coverage of the your main outcome of interest in your to-be-surveyed population (prevalence of malnutrition, measles vaccination coverage, etc.). You will use published or grey literature to determine a logical estimate of this parameter.
The largest sample size you will need will be that which is calculated using an estimate of prevalence of 50%. The closer to 0% or 100% prevalence you set your estimate, the smaller your sample size will be. Therefore, if you have no idea what your estimate of prevalence is, try and use 50% and the largest possible sample size.
The easiest way to remember the design effect (which we usually refer to as DEFF), is that it is a measure of the variability of the outcome of interest within your clusters (if you are conducting a cluster based survey) and between your clusters.
The higher the DEFF, the higher the variability between clusters and the higher the probability that subjects within a cluster are similar.
In a well designed survey, you will aim to have a calculated DEFF for your outcome of interest as close as possible to 1. This is because a DEFF of 1 means that your survey design has approached Simple Random Sampling (SRS) for your outcome of interest. SRS is the gold standard in survey design, but often not possible in MSF contexts. In general, in two-stage cluster sampling surveys, we would assume a DEFF of 2.
Mathematically DEFF is calculated as:
DEFF=1+(1-n) * rho where n=sample size of your survey and rho=intraclass coefficient
When analysing your survery data, we often recalculate the DEFF for the main outcomes of interest. This because the DEFF will give you a good estimate on whether the survey design and sampling approach made sense in your survey for your outcome of interest.
The precision around your estimate stipulates how close to the your estimate you would like the 95% confidence interval of your prevalence estimate to fall.
For example: you have calculated a 72% measles vaccination coverage in your study population. You would like to ensure that the 95% confidence interval is narrow (between 70-74%); thus you would set your precision at 2% (2% above and below 72%).
The lower your precision, the higher your sample size calculation will be.
You only use the estimate of the precision when calculating a sample size before your survey.
The templates assume that data will have been collected using mobile data collection or will have been entered into an electronic database (Excel, Epi Data, Redcap etc.).
We recommend that you try and establish generic data dictionaries for mobile data collection (through template questionnaires) or through template databases. This will improve the consistency of the naming of your variables and will facilitate the frequent and systematic use of the Survey Templates.
We have designed the templates to run through a similar series of analytical questions in the same order:
Calculation of the main outcomes of interest (including DEFF):
Calculation of secondary outcomes of interest
The sampling stratgey MUST be considered by the epidemiologist using the templates in order to ensure an appropriate weighting is applied to the analysis.