ABIDE Preprocessed

Functional Preprocessing

There is no consensus on the best methods for preprocessing resting state fMRI data. Rather than being prescriptive and favoring a single processing strategy, we have preprocessed the data using four different preprocessing pipelines, each of which was implemented using the chosen parameters and settings of the pipeline developers.

The preprocessing steps implemented by the different pipelines are fairly similar. What varies most are the specific algorithms used for each of the steps, their software implementations, and the parameters used. The following sections provide an overview of the different preprocessing steps and how they vary across pipelines.

Basic Processing

Drop first "N" volumes 4 0 4 0
Slice timing correction Yes Yes Yes No
Motion realignment Yes Yes Yes Yes
Intensity normalization 4D Global mean = 1000 4D Global mean = 1000 No Non-uniformity correction
using median volume

Nuisance Signal Removal

Each pipeline implemented some form of nuisance variable regression1,2 to clean confounding variation due to physiological processes (heart beat and respiration), head motion, and low frequency scanner drifts, from the fMRI signal.

Motion 24-param 24-param 24-param scrubbing and 1st principal component of 6 motion parameters & their squares
Tissue signals mean WM and CSF signals CompCor
(5 PCs)
mean WM and CSF signals mean WM and CSF signals
Motion realignment Yes Yes Yes Yes
Low-frequency drifts linear and quadratic trends linear and quadratic trends linear and quadratic trends discrete cosine basis with a 0.01 Hz high-pass cut-off

Processing strategies

Each pipeline was used to calculate four different preprocessing strategies:

Strategy Band-Pass Filtering Global Signal Regression
filt_global Yes Yes
filt_noglobal Yes No
nofilt_global No Yes
nofilt_noglobal No No

For strategies that include global signal correction, the global mean signal was included with nuisance variable regression. Band-pass filtering (0.01 - 0.1 Hz) was applied after nuisance variable regression.


A transform from original to template (MNI152) space was calculated for each dataset from a combination of functional-to-anatomical and anatomical-to-template transforms. The anatomical-to-template transforms were calculated using a two step procedure that involves (one or more) linear transform that is later refined with a very high dimensional non-linear transform. When data are written into template space (typically after the calculation of derivatives, except for NIAK) all transforms are used simultaneously to avoid multiple interpolations.

Functional to Anatomical boundary-based rigid body (BBR) boundary-based rigid body (BBR) rigid body rigid body
Anatomical to Standard FLIRT + FNIRT ANTs DARTEL CIVET


Statistical derivatives (e.g., regional homogeneity) were generated from preprocessed functional data for each of the four processing strategies generated from each of the four processing pipelines. As mentioned earlier, these derivatives were all generated using CPAC. Although the calculation of the derivatives were the same for every pipeline, there were differences in each pipeline as to when each derivative was registered to standard space and when smoothing was applied. In every case the final resolution of the calculated derivatives is 3x3x3 mm3.

Approach 1

For CCS, CPAC, and DPARSF, the derivatives listed below were calculated in native space using unsmoothed functional data. The results were then written into template space (MNI152) and spatially smoothed with a 6-mm FWHM Gaussian kernel. The registration and smoothing were performed using steps specific to each pipeline.

In contrast, the derivatives listed below were calculated on the unsmoothed functional data in template (MNI152) space and then smoothed with a 6-mm FWHM Gaussian kernel.

Note as mentioned earlier VMHC was calculated on the functional data registered to the symmetric standard MNI152 brain.

Approach 2

In the NIAK pipeline, the functional data was written into template space and spatially smoothed with a 6-mm FWHM Gaussian kernel prior to calculating the statistical derivatives.

Regions of Interest

We also extracted mean time-series for several sets of regions-of-interests. In each case, the mean time-series was taken from functional data already registered in standard space for every pipeline. More specifically, time series were extracted for seven ROI atlases:

  • Automated Anatomical Labeling (AAL): The AAL atlas distributed with the AAL Toolbox was fractionated to functional resolution (3x3x3 mm3) using nearest-neighbor interpolation. [Atlas] [Labels]

  • Eickhoff-Zilles (EZ): The EZ atlas was derived from the max-propagation atlas distributed with the SPM Anatomy Toolbox. The atlas was transformed into template space using the Colin 27 template (also distributed with the toolbox) as an intermediary and fractionated into functional resolution using nearest-neighbor interpolation. [Atlas][Labels]

  • Harvard-Oxford (HO): The HO atlas distributed with FSL is split into cortical and subcortical probabilistic atlases. A 25% threshold was applied to each of these atlases and they were subsequently bisected into left and right hemispheres at the midline (x=0). ROIs representing left/right WM, left/right GM, left/right CSF and brainstem were removed from the subcortical atlas. The subcortical and cortical ROIs were combined and then fractionated into functional resolution using nearest-neighbor interpolation. [Atlas][Labels]

  • Talaraich and Tournoux (TT): The TT atlas distributed with AFNI was coregistered and warped into template space and subsequently fractionated into functional resolution using nearest neighbor interpolation. [Atlas][Labels]

  • Dosenbach 160: The Dosenbach 160 atlas distributed with DPARSF/DPABI includes 160 4.5-mm radius spheres placed at coordinates from Table S6 in Dosenbach et al., 20104. These regions were identified from meta-analyses of task-related fMRI studies. [Atlas][Labels]

  • Craddock 200 (CC200): Functional parcellation was accomplished using a two-stage spatially-constrained functional procedure applied to preprocessed and unfiltered resting state data corresponding to 41 individuals from an independent dataset (age: 18–55; mean 31.2; std. dev. 7.8; 19 females)5. A grey matter mask was constructed by averaging individual-level grey matter masks derived by automated segmentation. Individual-level connectivity graphs were constructed by treating each within-gm-mask voxel as a node and edges corresponding to super-threshold temporal correlations to the voxels’ 3D (27 voxel) neighborhood. Each graph was partitioned into 200 regions using normalized cut spectral clustering. Association matrices were constructed from the clustering results by setting the connectivity between voxels to 1 if they are in the same ROI and 0 otherwise. A group-level correspondence matrix was constructed by averaging the individual level association matrices and subsequently partitioned into 200 regions using normalized cut clustering. The resulting group-level analysis was fractionated into functional resolution using nearest-neighbor interpolation. Labels were generated for each of the resulting ROIs from their overlap with AAL, EZ, HO, and TT atlases using the cluster naming script distributed with the pyClusterROI toolbox. [Atlas][Labels]

  • Craddock 400 (CC200): The procedure described for the CC200 atlas was repeated for 400 regions to create the CC400 atlas. [Atlas][Labels]

Minimally Preprocessed Data

Minimally preprocessed data is only available for the C-PAC pipeline and was processed using only the following steps:

  1. Slice timing correction
  2. Realignment to correct for motion
  3. Written into template space at 3x3x3 mm3 isotropic resolution


  1. Lund 

  2. Fox 2005 

  3. Chai et al., 2012 

  4. Dosenbach, Nico U. F. et al. “Prediction of Individual Brain Maturity Using fMRI.” Science (New York, N.Y.) 329.5997 (2010): 1358–1361. PMC. Web. 21 Mar. 2015. doi: 10.1126/science 

  5. Craddock, R. C., James, G. A., Holtzheimer, P. E., Hu, X. P., & Mayberg, H. S. A whole brain fMRI atlas generated via spatially constrained spectral clustering, Human Brain Mapping, 2012, 33, 1914-1928 doi: 10.1002/hbm.21333.