Signal processing and analysis of human brain potentials (EEG) [Exercise 8]
Multiple Comparison Corrections
Use `data = ccs_eeg_utils.ex8_simulateData()` to simulate a simple difference between two conditions. Instead of electrodes x time, we now have a simple rectangular matrix (or \(n_{subject}=15\) of them), but the principles of multiple comparison corrections can be applied to it as well.
T: calculate the mean of the data over subjects and plot it as an image
T: Plot some subjects individually. Can you infer anything from the “single subject” displays?
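A minimal sketch for both tasks, assuming the simulated data come back as a numpy array of shape (n_subjects, 40, 40):

```python
import matplotlib.pyplot as plt
import ccs_eeg_utils

data = ccs_eeg_utils.ex8_simulateData()

# mean over the subject axis (axis 0)
plt.imshow(data.mean(axis=0))
plt.colorbar()
plt.title("mean over subjects")
plt.show()

# a few single subjects next to each other
fig, axes = plt.subplots(1, 3)
for ix, ax in enumerate(axes):
    ax.imshow(data[ix])
    ax.set_title(f"subject {ix}")
plt.show()
```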
T-Values
The t-value weighs your effect strength (i.e. the difference between the means of two conditions) against the standard deviation (= spread) of this effect: \(t = \frac{mean(x)}{\frac{std(x)}{\sqrt{n-1}}}\). Use this formula to manually compute the t-values over the n=15 subjects. Compare them with `scipy.stats.ttest_1samp(data, popmean=0, axis=0)`.
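A sketch of the manual calculation (note that `np.std` defaults to the biased estimator, `ddof=0`, which is why dividing by \(\sqrt{n-1}\) matches scipy's result):

```python
import numpy as np
import scipy.stats

n = data.shape[0]  # n = 15 subjects
t_manual = data.mean(axis=0) / (data.std(axis=0) / np.sqrt(n - 1))

tvalues, pvalues = scipy.stats.ttest_1samp(data, popmean=0, axis=0)
print(np.allclose(t_manual, tvalues))  # should print True
```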
T: The scipy function also returns p-values. We want to plot the t-values next to the p-values. Because we will iteratively add different multiple comparison corrections, it is best to write a plotting function that lets you subsequently add more plots. I recommend plotting the log10(p-values).
Bonus: You can also use `m = np.ma.masked_where(pvalues>0.05, tvalues)` to get a nice masked view of the data.
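One way to structure such a growing comparison plot, as a minimal sketch (the `plots` list and `add_plot` helper are hypothetical names, not part of any library):

```python
import numpy as np
import matplotlib.pyplot as plt

plots = []  # accumulates (title, image) pairs over the course of the exercise

def add_plot(title, values):
    plots.append((title, values))
    fig, axes = plt.subplots(1, len(plots), squeeze=False)
    for ax, (t, v) in zip(axes[0], plots):
        im = ax.imshow(v)
        fig.colorbar(im, ax=ax)
        ax.set_title(t)
    plt.show()

add_plot("t-values", tvalues)
add_plot("log10(p) uncorrected", np.log10(pvalues))

# bonus: show t-values only where p < 0.05
m = np.ma.masked_where(pvalues > 0.05, tvalues)
add_plot("t-values, masked", m)
```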
False Discovery Rate
We will investigate the False Discovery Rate a bit. First, let's generate data without any effect, only noise. Thus all possible findings with p<0.05 have to be false positives.
T: run `data_h0 = ccs_eeg_utils.ex8_simulateData(signal_mean=0, noise_between=0)` and plot the t-values as an imshow (be sure to always add a colorbar).
T: plot a histogram of all p-values (regardless of position)
T: Also plot a histogram of the p-values of the data with the effect
T: Count how many p-values are below 0.05, for the data with and without the effect. For FDR we have to estimate how many "significant" (=> p-value < \(\alpha\), with \(\alpha=0.05\) typically) values we would get by chance (= false positives). Instead of estimating the number of false-positive p-values from one dataset (which is much more involved), we can take the significant-p-value count of our null model instead. Calculate the ratio of H0 to H1 significant p-values; this is your False Discovery Rate. Can you manually adjust \(\alpha\) so that the FDR is 0.05?
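A sketch of this counting step, assuming `data` and `data_h0` as simulated above:

```python
import numpy as np
import scipy.stats

_, p_h0 = scipy.stats.ttest_1samp(data_h0, popmean=0, axis=0)
_, p_h1 = scipy.stats.ttest_1samp(data, popmean=0, axis=0)

alpha = 0.05
n_sig_h0 = np.sum(p_h0 < alpha)  # false positives by construction
n_sig_h1 = np.sum(p_h1 < alpha)  # true and false positives mixed
print(f"H0: {n_sig_h0}, H1: {n_sig_h1}, FDR ~ {n_sig_h0 / n_sig_h1:.3f}")
```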
T: Use `mne.stats.fdr_correction` to calculate the proper FDR correction. Use the plotting function from the beginning to directly compare the p-values with and without correction.
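A sketch of the call, reusing `p_h1` and the hypothetical `add_plot` helper from the sketches above:

```python
import numpy as np
import mne.stats

reject, p_fdr = mne.stats.fdr_correction(p_h1, alpha=0.05)
add_plot("log10(p) FDR-corrected", np.log10(p_fdr))
```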
Bonus: F-Max Permutation test
In the lecture we discussed permutation tests and permutation cluster tests. What we didn't discuss is that you can easily adjust a permutation test to correct for multiple comparisons. We permute all 40x40 grid elements concurrently, but instead of saving the permuted statistic (e.g. the t-value) for each grid element, we save only the maximum over all grid elements. This biases our permutation distribution towards large t-values, and concurrently makes it harder for the observed value to "stand out" (=> be unlikely) from that distribution. You can use `mne.stats.permutation_t_test` to calculate this.
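A sketch of the call; note that `mne.stats.permutation_t_test` expects a 2D array of shape (n_samples, n_tests), so the grid is flattened first (`add_plot` is again the hypothetical helper from above):

```python
import numpy as np
import mne.stats

n = data.shape[0]
X = data.reshape(n, -1)                 # flatten the 40x40 grid per subject
t_max, p_max, H0 = mne.stats.permutation_t_test(X, n_permutations=1000)
p_max = p_max.reshape(data.shape[1:])   # back to the grid for plotting
add_plot("log10(p) t-max", np.log10(p_max))
```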
Cluster Permutation Tests
We will implement a simple cluster permutation test, before making use of the mne-implementation. For this we need the package scikit-image, to be able to use `skimage.measure.label` to get the clusters.
A cluster permutation test has the following structure (a sketch implementing it follows after this list):

1. Calculate `t_obs`, the t-values for your observed data (as before, `mne.stats.ttest_1samp_no_p`).
2. Threshold `t_obs`, using `scipy.stats.t.ppf(1-alpha/2, n-1)` as the threshold value. This converts a p-value back to a t-value. In principle you could also decide to use a t-threshold of e.g. 2. The threshold is set arbitrarily, but it is important.
3. Because our clusters are in image-space, neighbours can easily be calculated using `skimage.measure.label`.
4. Find the largest cluster and save its size to `c_obs`.
5. Permutation, repeat 1000 times:
   - Generate a `signFlip` vector of length `n` (by default n=15 subjects), consisting of random "1" and "-1", one for each subject. Assuming \(H_0\) is true (which we do inside this loop), the sign around "0" is random for each subject, so no harm should be done by flipping it (it will obviously change the resulting statistic, but doing it 1000 times should neither introduce nor hide an effect).
   - Multiply the `signFlip` vector with the data.
   - Repeat steps 1-4 on the permuted data and save the resulting largest cluster size to `c_perms`.
6. Append `c_obs` to your `c_perms` (the simplified reason is that otherwise you could get a p-value of 0; more details if of interest). `np.mean(c_perms >= c_obs)` gives you your p-value.
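A minimal sketch of the whole recipe, assuming `data` of shape (15, 40, 40); `largest_cluster_size` is a helper defined here, not a library function:

```python
import numpy as np
import scipy.stats
import skimage.measure
from mne.stats import ttest_1samp_no_p

def largest_cluster_size(x, threshold):
    # steps 1-4: t-values, threshold (two-sided via abs), label clusters,
    # return the extent of the largest one
    t = ttest_1samp_no_p(x)
    labels = skimage.measure.label(np.abs(t) > threshold)
    sizes = np.bincount(labels.ravel())[1:]  # label 0 is the background
    return sizes.max() if sizes.size > 0 else 0

rng = np.random.default_rng(42)
n = data.shape[0]
alpha = 0.05
threshold = scipy.stats.t.ppf(1 - alpha / 2, n - 1)

c_obs = largest_cluster_size(data, threshold)

c_perms = []
for _ in range(1000):
    signFlip = rng.choice([1, -1], size=n)   # one random sign per subject
    c_perms.append(largest_cluster_size(signFlip[:, None, None] * data, threshold))
c_perms = np.append(c_perms, c_obs)          # avoids a p-value of 0

p_value = np.mean(c_perms >= c_obs)
```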
Note: The test could be improved by e.g. summing the t-values of a cluster instead of merely counting the cluster extent, but that leads us a bit astray from what we want to understand here.
T: Running a similar permutation test in MNE is much easier:

```python
t_clust, clusters, p_values, H0 = mne.stats.permutation_cluster_1samp_test(
    data, threshold=threshold, adjacency=None,
    n_permutations=1000, out_type='mask')
```
- The threshold is the same threshold you used before
- Typically we would have to supply the adjacency manually, because the adjacency depends on which channels are next to each other. But in this case we can put `None` and MNE will assume it is a grid-structure.
In order to fill the clusters with their respective p-values:
```python
p_clust = np.ones(data.shape[1:])
for cl, p in zip(clusters, p_values):
    p_clust[cl] = p
```
This step is controversial, because clusters do not have any real p-value (see here, "interpretation of significant TFCE value"). But pragmatically, I think it is still useful to gauge the signal-to-noise ratio of the clusters. As long as you do not literally interpret the p-value as a probability, you should be fine.
T: Add the cluster-permutation to your comparison plot
TFCE
Last but not least, we will get rid of the initial cluster-forming threshold. TFCE integrates over all possible thresholds. We will not implement TFCE here, but simply call the mne-python function:
```python
t_tfce, _, p_tfce, H0 = mne.stats.permutation_cluster_1samp_test(
    data, threshold=dict(start=0, step=0.2), adjacency=None,
    n_permutations=1000, out_type='mask')
```
T: Add this to your comparison plot. We are done! I hope you learned the differences and underlying algorithms of several multiple-comparison corrections!