Correlating with voxel time courses#

When we have a 4D image, we can think of the data in several ways. For example the data could be:

  • A series of 3D volumes (slicing over the last axis);

  • A collection of 1D voxel time courses (slicing over the first three axes).

# Our usual set-up
import numpy as np
import matplotlib.pyplot as plt
# Display array values to 4 digits of precision
np.set_printoptions(precision=4, suppress=True)
# Load the function to fetch the data file we need.
import nipraxis
# Fetch the data file.
data_fname = nipraxis.fetch_file('ds114_sub009_t2r1.nii')
# Show the file name of the fetched data.
data_fname
'/home/runner/.cache/nipraxis/0.5/ds114_sub009_t2r1.nii'

We load a 4D file:

import nibabel as nib
img = nib.load(data_fname)
img.shape
(64, 64, 30, 173)

We drop the first volume; as you remember, the first volume is very different from the rest of the volumes in the series:

# Drop the first volume
data = img.get_fdata()
data = data[..., 1:]
data.shape
(64, 64, 30, 172)

As you have seen in the 4D introduction, we can think of this 4D data as a series of 3D volumes. That is the way we have been thinking of the 4D data so far:

# This is slicing over the last (time) axis
vol0 = data[..., 0]
vol0.shape
(64, 64, 30)

In that page, we found a 3D coordinate for an interesting voxel. The 3D coordinate is just the indices in the first three dimensions — the dimensions representing space. We can write the coordinate as indices in a tuple like this: (42, 32, 19). As you saw in the 4D page, the first index of 42 refers to a position towards the left of the brain (> 31). The second index of 32 refers to a position almost in the center front to back. The last index of 19 refers to a position a little further towards the top of the brain – in this image.

Here is the position of that coordinate displayed on a plane of the 3D volume (in fact, the mean volume).

# Where is this in the brain?
mean_data = np.mean(data, axis=-1)
# Make a nice bright dot in the right place
mean_data[42, 32, 19] = np.max(mean_data)
plt.imshow(mean_data[:, :, 19], cmap='gray')
<matplotlib.image.AxesImage at 0x7f05a09ce0e0>
_images/80a349b5d54f29ec89029b4c5677c435850d72071108a53017eab26c42e9d9b5.png

If I slice into the data array with these coordinates, I will get a vector, with the image value at that position (43, 32, 19), for every point in time:

# This is slicing over all three of the space axes
voxel_time_course = data[42, 32, 19]
print(voxel_time_course.shape)
plt.plot(voxel_time_course)
(172,)
[<matplotlib.lines.Line2D at 0x7f05a0716290>]
_images/90535ab4123f1f6cba9af56df84c615a8a022dfd75e6ada05ab57d88d540591d.png

This is a “voxel time course”.

We might want to do ordinary statistical type things with this time course. For example, we might want to correlate this time course with a measure of whether the subject was doing the task or not.

This measure will have 1 for each volume (time point) where the subject was doing the task, and 0 for each volume where the subject was at rest.

We call this a “neural” time course, because we believe that the nerves in the relevant brain area will switch on when the task starts (value = 1) and then switch off when the task stops (value = 0).

To get this on-off measure, we will use our pre-packaged function for OpenFMRI data:

# Fetch the condition file
import nipraxis

cond_fname = nipraxis.fetch_file('ds114_sub009_t2r1_cond.txt')
cond_fname
'/home/runner/.cache/nipraxis/0.5/ds114_sub009_t2r1_cond.txt'
# Load the neural time course using pre-packaged function
from nipraxis.stimuli import events2neural
TR = 2.5  # time between volumes
n_trs = img.shape[-1]  # The original number of TRs
neural = events2neural(cond_fname, TR, n_trs)
plt.plot(neural)
[<matplotlib.lines.Line2D at 0x7f05a0748460>]
_images/817d7e55db1c759d5315ea95b4fe874bd3b443f8924d325c3ef5bcb50185c5d1.png

Here we plot the voxel time course against this neural prediction:

# Plot the neural prediction against the data
neural = neural[1:]
# Notice the 'o' to specify the "line marker"
plt.plot(neural, voxel_time_course, 'o')
# Set the axis limits to give space on left and right
axis = plt.gca()
axis.set_xlim(-0.1, 1.1)
(-0.1, 1.1)
_images/5939f6a279a110ebd0c1bc9695a0ee748edbac2d9d1ecb5a3b946cc498ae6e3a.png

We can look at the correlation between the on-off prediction and the voxel time course:

# Correlate the neural time course with the voxel time course
corr_array = np.corrcoef(neural, voxel_time_course)
corr_array
array([[1.    , 0.5429],
       [0.5429, 1.    ]])

Notice that Numpy has correlated neural with neural — to give 1 — and voxel_time_course, to give:

# Correlation of neural (row 0) with voxel_time_course (column 1)
correlation = corr_array[0, 1]
correlation
0.5428503395516837

In the same way it has correlated voxel_time_course with neural and voxel_time_course, to give a 2 by 2 array.

A reminder on correlation#

We are sure that correlation is familiar to you, but here we define it, as a reminder.

Correlation between two 1D arrays is the result of converting each array to z-scores, element-wise multiplying the z-score arrays, and taking the mean of the result.

Turning an array into a z-score (AKA standard score) is the operation of:

  • Subtracting the mean of the array then

  • Dividing by the standard deviation.

Let’s do that for the two arrays above:

# z-scores for voxel_time_course.
vtc_mean = np.mean(voxel_time_course)
vtc_std = np.std(voxel_time_course)
vtc_z_scores = (voxel_time_course - vtc_mean) / vtc_std
# Show the first 10 values.
vtc_z_scores[:10]
array([-0.0652, -0.0194, -0.1109, -1.163 ,  0.4837,  1.6273,  1.673 ,
        0.621 ,  0.4837,  0.621 ])
# z-scores for neural time course
neural_mean = np.mean(neural)
neural_std = np.std(neural)
neural_z_scores = (neural - neural_mean) / neural_std
neural_z_scores[:10]
array([-0.977 , -0.977 , -0.977 ,  1.0235,  1.0235,  1.0235,  1.0235,
        1.0235,  1.0235,  1.0235])

Now we multiply them together, using the usual element by element Numpy multiplication rules for arrays:

multiplied = vtc_z_scores * neural_z_scores
multiplied[:10]
array([ 0.0637,  0.019 ,  0.1083, -1.1903,  0.4951,  1.6656,  1.7124,
        0.6356,  0.4951,  0.6356])

To calculate the correlation, we take the mean:

correlation_again = np.mean(multiplied)
correlation_again
0.5428503395516836

This recalculation is very very close to the original. The difference is so tiny we can put it down to floating point error.

np.isclose(correlation, correlation_again)
True