-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathSampling Assay 3D.Rmd
226 lines (169 loc) · 7.58 KB
/
Sampling Assay 3D.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
---
title: "Sampling Assay 3D"
author: "Xianbin Cheng"
date: "March 11, 2019"
output: html_document
---
# Objective
* Create a module that simulates taking analytical samples from work samples
# Method
1. Load libraries and source R code.
```{r, warning = FALSE, message = FALSE}
source(file = "Sampling_libraries.R")
source(file = "Sampling_contamination.R")
source(file = "Sampling_visualization.R")
source(file = "Sampling_assay_prep.R")
source(file = "Sampling_plan.R")
source(file = "Sampling_plan_3d.R")
source(file = "Sampling_assay_3d.R")
source(file = "Sampling_assay.R")
```
2. List important parameters.
**Sampling contamination**
* `n_contam` = the number of contamination points
* `x_lim` = the limits of the x-axis
* `y_lim` = the limits of the y-axis
* `z_lim` = the limits of the z-axis
* `x` = the horizontal coordinate of the contamination center, which follows a uniform distribution by default
* `y` = the vertical coordinate of the contamination center, which follows a uniform distribution by default
* `z` = the vertical coordinate of the contamination center, which follows a uniform distribution by default
* `cont_level` = a vector that indicates the mean contamination level (logCFU/g or logCFU/mL) and the standard deviation in a log scale, assuming contamination level follows a log normal distribution $ln(cont\_level)$~$N(\mu, \sigma^2)$.
* `dis_level` = a vector that indicates the mode (ppb) and the lower bound (ppb), assuming contamination level follows $lb+Gamma(\alpha, \theta=\frac {mode-20}{\alpha-1})$
** Mode 1: Discrete Spread**
* `n_affected` = the number of affected kernels near the contamination spot, which follows a Poisson distribution ($Pois(\lambda = 5)$)
* `covar_mat` = covariance matrix of `x` and `y`, which defines the spread of contamination. Assume the spread follows a 2D normal distribution with var(X) = 0.25, var(Y) = 0.25 and cov(X, Y) = 0
** Model 2: Continuous Spread**
We do not consider such type of spread in a 3D space.
**Sampling strategies**
* `method_sp` = sampling strategy, including `srs`, `strs`, `ss`.
**Mode 1: Continuous Spread**
* `n_sp` = the number of sampling points
* `n_strata` = the number of strata (applicable to *2D Stratified random sampling*)
* `by` = the side along which the field is divided into strata. It is either "row" or "column" (applicable to *2D Stratified random sampling*) **OR** the side along which a sample is taken every k steps (applicable to *2D Systematic sampling*).
**Mode 1: Discrete Spread**
* `d` = inner diameter of the probe (m)
* `L` = length of the probe (m). It can be 5, 6, 8, 10, 12 feet, depending on the container type. Remember to convert it to meters. We assume it's fully inserted to the corn
* `rho` = average density of a kernel (g/cm3)
* `m_kbar` = average mass of a kernel
* `container` = "truck", "barge", "hopper"
* `depth_ft` = the depth of corn inside the truck. There are two sampling patterns for trucks, depending on whether depth is higher than or lower than 4 ft.
* `compartment` and `type` = arguments for hopper cars. `compartment` can be 2 or 3, `type` can be `open_top` or `trough`.
**Sample Assay**
* `method_det` = method of detection
+ Plating: LOD = 2500 CFU/g
+ Enrichment: LOD = 1 CFU
+ ELISA: LOD = 1 ng/g (Helica Total Aflatoxins ELISA kit)
**Mode 1: Discrete Spread:**
* `Mc` = maximum concentration limit of mycotoxin (ng/g or ppb)
**Mode 2: Continuous Spread:**
* `case` = 1 ~ 15 cases that define the stringency of the sampling plan.
* Attributes plans:
+ `n` = number of analytical units (25g)
+ `c` = maximum allowable number of analytical units yielding positive results
+ `m` = microbial count or concentration above which an analytical unit is considered positive
+ `M` = microbial count or concentration, if any analytical unit is above `M`, the lot is rejected.
```{r}
## The input parameters
n_contam = 10
x_lim = c(0, 8)
y_lim = c(0, 2)
z_lim = c(0, 2)
lims = list(xlim = x_lim, ylim = y_lim, zlim = z_lim)
cont_level = c(7, 1)
dis_level = c("mode"= 40000, "lb" = 20)
spread = "discrete"
### Discrete
n_affected = 10
covar_mat = make_covar_mat(spread = spread, varx = 0.04, vary = 0.04, varz = 0.04, covxy = 0, covxz = 0, covyz = 0)
### Continuous
spread_radius = 1
LOC = 10^(-3)
fun = "exp"
method_sp = "ss"
# Continuous
n_sp = 15
n_strata = 5
by = "row"
# Discrete
# sp_radius = 0.5 (This is theoretically for SRS and STRS)
container = "truck"
#depth_ft = 5
#compartment = 2
#type = "open_top"
d = 0.2
L = get_Lprobe(container = container, lims = lims)
m_kbar = 0.3 #unit: g
rho = 1.28
# Assay
# Define parameters: e.g. S.aureus in shrimps
case = 9
m = 50
M = 500
# Aflatoxin
Mc = 20
# Detection method: plating, enrichment, ELISA aflatoxin
method_det = "ELISA aflatoxin"
```
3. Generate contamination and sample points.
```{r, eval = FALSE}
# Run this if we want reproducibility
set.seed(123)
```
```{r}
# Generate the coordinates of contamination points
contam_xy = sim_contam_new(n_contam = n_contam, lims = lims, spread = spread, covar = covar_mat, n_affected = n_affected, spread_radius = spread_radius, cont_level = cont_level, dis_level = dis_level)
```
```{r, echo = FALSE, eval = FALSE}
## Test case
contam_xy = read.csv(file = "Test_case_truck.csv", row.names = 1, header = TRUE, stringsAsFactors = FALSE)
```
```{r}
# Generate the coordinates of sample points
sp_xy = sim_plan_new(method_sp = method_sp, spread = spread, lims = lims, radius = d/2, container = container)
```
```{r}
# Pre-generate healthy kernel concentrations to save time
conc_neg = rpert(n = 10^6, min = 0, mode = 0.7, max = 19.99, shape = 80)
```
```{r}
# Generate the distance matrix
dist_contam_sp = calc_dist(df_contam = contam_xy, df_sp = sp_xy, spread = spread, method_sp = method_sp)
contam_sp_xy = gen_sim_data_new(df_contam = contam_xy, df_sp = sp_xy, dist = dist_contam_sp, spread = spread, d = d, L = L, rho = rho, m_kbar = m_kbar, conc_neg = conc_neg)
```
4. Make sure the sample size is correct. Then get the work portion and test portion.
* `tox` = mycotoxin, including aflatoxin `AF`, fumonisin `FM`, zearalenone `ZEN`, Ochratoxin A `OTA`, Deoxynivalenol `DEN`
* For aflatoxin, the minimum sample size is:
* 908 g (2 lbs) for trucks
* 1362 g (3 lbs) for railcars (hopper)
* 4540 g (10 lbs) for barges, sublots and composite samples
* 4540 g is the recommended submitted sample size
* The `get_work_portion()` can also be used to get the file sample
```{r}
sample_dis = get_sample_dis(data = contam_sp_xy, container = container, m_kbar = m_kbar, tox = "AF")
```
5. Calculate the test portion mycotoxin concentration and decide whether to reject the lot.
```{r}
deci_dis = lot_decision_new(data = sample_dis, Mc = Mc, spread = "discrete", method_det = method_det)
```
# Result
1. Show the captured kernels.
```{r}
contam_sp_xy$samples[[1]]
```
2. Visualize the contamination and sample points.
```{r}
overlay_draw(method_sp = method_sp, data = contam_sp_xy$combined, spread = spread, xlim = lims$xlim, ylim = lims$ylim)
overlay_draw_probe(data = contam_sp_xy$combined, lims = lims, L = L)
```
3. Visualize aflatoxin distribution in raw sample, work portion, and test portion
```{r, out.width="50%", echo = FALSE, fig.show = "hold", warning = FALSE}
sample_dist(data = contam_sp_xy$samples[[2]], Mc = Mc)
sample_dist(data = sample_dis$work, Mc = Mc)
sample_dist(data = sample_dis$test, Mc = Mc)
```
```{r}
assay_draw(data = sample_dis$test, Mc = Mc, method_det = method_det, spread = spread)
```
```{r}
words(x = deci_dis)
```