= Tutorial 21: Noise and data covariance matrices =
''Authors: Francois Tadel, ''''''Elizabeth Bock, ''John C Mosher, Richard Leahy, Sylvain Baillet''

The source estimation methods we use need some metrics computed from the recordings. The minimum norm solution requires an estimation of the noise level in the recordings ('''noise covariance matrix''') and the beamformers need additionally a prototype of the effect we are targeting ('''data covariance matrix'''). This first section of this tutorial shows how to compute a noise covariance matrix from the MEG empty room recordings. The details that follow can be skipped if you are not interested.

<<TableOfContents(2,2)>>

== Compute the noise covariance ==
Ideally, we want to represent only the noise of the sensors. In MEG, this is easy to obtain with a few minutes of empty room measurements. The only constrain is to use noise recordings that have been acquired the same day as the subject's recordings (if possible just before) and pre-processed in the same way (same sampling rate and frequency filters). In this study we have already  prepared a segment of 2min of noise recordings, we will estimate the noise covariance based on it.

Right-click on the link to the '''noise recordings''' '''> Noise covariance'''. Available menus:

 * '''Import from file''': Use a matrix that was computed previously using the MNE software.
 * '''Import from Matlab''': Import from any [nChannels x nChannels] matrix from the Matlab workspace.
 * '''Compute from recordings''': Use the selected recordings to estimate the noise covariance.
 * '''No noise modeling''': Use an identity matrix as the noise noise covariance. Useful when you don't have access to noise recordings (eg. ongoing EEG activity or simulations). <<BR>><<BR>> {{attachment:noisecov_popup.gif||height="267",width="474"}}

Select the menu '''Noise covariance > Compute from recordings'''. Available options:

 * '''Data selection''': The top part of this window shows a summary of the files that have been selected to estimate the noise: 1 file of 120s at 600Hz. Total number of time samples: 72,000. We can chose to use only a part of this file with the option "baseline". The large chunks of continuous files are split in blocks of a maximum of '''10,000 samples''', that are then processed as different files.

 * '''Remove DC offset''': All the selected blocks of data are baseline corrected and concatenated to form a large matrix "F". There are two options for the baseline correction:<<BR>>'''Block by block''': The average value is subtracted from each block before the concatenation. <<BR>>If Fi is the recordings corresponding to block #i: F = Concatenate[Fi - mean(Fi)].<<BR>>'''Global''': The average value is removed after concatenation (same correction for all blocks). <<BR>>F = Concatenate[Fi] - mean(Concatenate[Fi]).

 * The noise covariance is computed from this concatenated matrix: '''NoiseCov = F * F' / Nsamples'''

 * '''Output''': Compute either a full noise covariance (best option) or just a diagonal matrix (only the variances of the channels). The second option is only useful if you do not have enough time samples to estimate the covariance properly. Always keep the default selection unless you know exactly what you are doing, Brainstorm detects what the preferred option is  for every case. <<BR>><<BR>> {{attachment:noisecov_options.gif||height="378",width="375"}}

Keep the default options and click on ['''OK'''].

 * One new file appears in the noise dataset, next to the channel file. Description of the popup menus:

 * '''Display as image''': Shows the noise covariance matrix as an indexed image. This can be useful to quickly check the quality of the recordings: noisier channels appear in red. You can display the noise covariance for all the sensors at once, or for each sensor type separately. Double-clicking on the file displays all the sensors.
 * '''Copy to other conditions''': Copy this file to all the other folders of the same subject.
 * '''Copy to other subjects''': Copy this file to all the folders of all the subjects in the protocol.
 * You can also copy a noise covariance file to another folder  just like any other file: <<BR>>Right-click > File > Copy/Paste, or keyboard shortcuts Ctrl+C/Ctrl+V. <<BR>><<BR>> {{attachment:noisecov_file.gif||height="139",width="378"}} {{attachment:noisecov_display.gif||height="170",width="148"}}

Right-click on the the noise covariance file > '''Copy to other folders''': We need this file in the two folders where the epochs were imported, in order to estimate the sources for them.

 . {{attachment:noisecov_copy.gif||height="225",width="212"}}

<<TAG(Advanced)>>

== Other scenarios ==
Computationally speaking, this noise covariance matrix is very easy to calculate, the Brainstorm interface offers a lot of flexibility to select the files and time windows you want to use. The real difficulty is to define what "noise" means. The ideal is to use segments of recordings that contain only the noise of the sensors, or segments of recordings that do not contain any of the brain signals of interest. This section is not directly useful for the current tutorial, but can be used as a reference for selecting the appropriate method in another experiment.

==== MEG ====
'''Empty room''': The MEG case is usually easier because we have access to real noise measurements, the MEG room just has to be empty. Record a few minutes right before bringing the subject in the MEG, or after the experiment is done. This would isolate only the noise from the sensors, which is what we are interested in most cases.<<BR>>If you acquire several runs successively and your MEG system is relatively stable, you can assume that the state of the sensors doesn't change much over the time. Therefore, you can re-use the same noise recordings and noise covariance matrix for several runs and subjects acquired during the same day.

'''Resting baseline''': Alternatively, when studying evoked responses (aka event-related responses), you can use a few minutes of recordings where the subject is resting, ie. not performing the task. Record those resting segments before or after the experiment, or before/after each run. This approach considers the resting brain activity as "noise", the sources estimated for the evoked response are going to be preferentially the ones that were not activated during the resting period.

'''Pre-stimulation baseline''': It can also be a valid approach to use the pre-stimulation baseline of  the individual trials to estimate the noise covariance. But keep in mind  that in this case, everything in your pre-stimulation baseline is going  to be attenuated in the  source reconstruction, noise and brain  activity. Therefore, your stimuli have to be distant enough in time so  that the response to a stimulus is not recorded in the "baseline" of the  following one. For repetitive stimuli, randomized delays between  stimuli can help avoiding expectation effects in the baseline.

==== EEG ====
The EEG case is typically more complicated. It is not possible to estimate the noise of the sensors only. Only the two other approaches described for the MEG are still valid: <<BR>>'''resting baseline''' and '''pre-stimulation baseline'''.

The noise level of the electrodes recordings depends primarily on the quality of the connection with the skin, which varies a lot from a subject to another, or even during the acquisition of one single subject. The conductive gel or solution used on the electrodes tends to dry, and the electrode cap can move. Therefore, it is very important to use one channel file per subject, hence one noise covariance per subject. In some specific cases, if the quality of the recordings varies a lot over the time, it can be interesting to split long recordings in different runs, with different noise covariance matrices too.

==== EEG and resting state ====
When studying the resting brain, you cannot use resting recordings as a noise baseline. For MEG the best choice is to use empty room measurements. For '''EEG''', you can chose between two different approaches: using the sensors variance, or not using any noise information.<<BR>>'''Option #1''': Calculate the covariance over a long segment of the resting recordings, but save only the diagonal, ie. the variance of the sensors. To do so from the interface: just check the box "Diagonal matrix" in the options window.<<BR>>'''Option #2''': Select "No noise modeling" in the popup menu. This would use an identity matrix instead of a noise covariance matrix (equal, unit variance of noise on every sensor). In the inverse modeling, this is equivalent to the assumption that the  noise in the recordings is homoskedastic, and equivalent for all the  sensors. The problem with this approach is that an electrode with a higher level of noise is going to be interpreted as a lot of activity in its region of the brain.

==== Noise and epilepsy ====
Analyzing a single interictal spike, using either EEG and MEG data, we are faced with a similar problem in defining what is noise. The brain activity before and after the spike can even be very informative about the spike's generation, particularly if it is part of a sequence of interictal activity that precedes ictal (seizure) onset. Defining a segment of time adjacent the spike as "background" may not be practical. In practice, however, we often can find a temporal region of spontaneous brain activity in the recordings that appears adequate for declaring as background, even in the epileptic patient. As discussed above, MEG has the additional option of using empty room data as a baseline, an option not available in EEG.

We thus have the same options as above:<<BR>>'''Option #1a''': Compute  the noise covariance statistics from blocks of recordings away from the  peak of any identified interictal spike, and keep only the diagonal (the  variance of the sensors).<<BR>>'''Option #1b:''' If a large period of time is available, calculate the full noise covariance.<<BR>>'''Option #2(MEG): '''Use empty room data as the baseline.<<BR>>'''Option #3''': Select "No noise modeling" in the popup menu (identity matrix, unit variance of noise on every sensor).

<<TAG(Advanced)>>

== Recommendations ==
 * '''Long noise recordings''': In  order to get a good estimation of the noise covariance, we need a significant number of time samples, at least '''N*(N+1)/2''', where N is the number of sensors. This means about 40s for CTF275 recordings at 1000Hz, or 20s for 128-channel EEG at 500Hz. Always try to use as much data as possible for estimating this noise covariance.
 * '''Do not import averages''': For this reason, you should never compute the noise covariance matrix from averaged responses. If you want to import recordings that you have fully pre-processed with another program, we recommend you import the individual trials and use them to compute the noise covariance. If you can only import the averaged responses in the Brainstorm database, you have to be aware that you may get poor results in the source estimation.
 * '''Using one block''': If you want to use a segment of "quiet" recordings in a continuous file: right-click on the continuous file > Noise covariance > Compute from recordings, then copy the noise covariance to the other folders. This is the case described in this tutorial.

 * '''Use single trials''': If you want to use the pre-stimulation baseline of the single trials, first import the trials in the database, then select all the groups of imported trials at once, right-click on one of them > Noise covariance > Compute from recordings, and finally copy the file to the other folders.
 * '''Using multiple continuous blocks''': This is similar to the single trial case. Import in the database all the blocks you consider as quiet resting baselines, then select all the imported blocks in the database explorer >  Noise covariance > Compute from recordings.

<<TAG(Advanced)>>

== Data covariance matrix ==
The computation of a data covariance matrix is very similar to a noise covariance matrix, except that you need to target the segments of recordings of interest instead of the noise. In the case of an event-related study, you can consider all the recordings in a range of latencies after the stimulation corresponding to the effect you want to localize in the brain.

 * For '''run#01''', select '''all the trials''', right-click > '''Data covariance > Compute from recordings'''. <<BR>><<BR>> {{attachment:datacov_popup.gif||height="199",width="370"}}
 * We need to specify the time window of interest in these recordings. If want to image the activity during the primary response, we can for instance consider the segment '''[50,150]ms''' post-stimulus. <<BR>><<BR>> {{attachment:datacov_options.gif||height="356",width="363"}} {{attachment:datacov_timewindow.gif||height="105",width="238"}}
 * Repeat the operation for '''run#02'''. <<BR>><<BR>> {{attachment:datacov_files.gif||height="257",width="225"}}

<<TAG(Advanced)>>

== On the hard drive ==
Right-click on any noise covariance file > File > View file contents:

 . {{attachment:noisecov_contents.gif||height="175",width="487"}}

==== Structure of the noise/data covariance files ====
 * '''Comment''': String displayed in the database explorer to represent this file.
 * '''NoiseCov''':  [nChannels x nChannels] noise covariance: '''F * F' ./ (nSamples-1)'''<<BR>>Unknown values are set to zero.
 * '''FourthMoment''': [nChannels x nChannels] fourth order moments: '''F.<<HTML(^)>>2 * F'.<<HTML(^)>>2 ./ (nSamples-1) '''
 * '''nSamples''': [nChannels x nChannels] number of time samples that were used for each pair of sensors. This is not necessarily the same value everywhere, some channels can be bad only for a few trials.

'''Related functions'''

 * bst_noisecov.m
 * panel_noisecov.m

<<TAG(Advanced)>>

== Additional documentation ==
 * Forum: EEG reference: http://neuroimage.usc.edu/forums/showthread.php?1525#post6718

<<HTML(<!-- END-PAGE -->)>>

<<EmbedContent("http://neuroimage.usc.edu/bst/get_prevnext.php?prev=Tutorials/HeadModel&next=Tutorials/SourceEstimation")>>

<<EmbedContent(http://neuroimage.usc.edu/bst/get_feedback.php?Tutorials/NoiseCovariance)>>