This repository is an extension to the braindecode library.
It adds extraction of features as well as convenience functions to run feature-based decoding.
The purpose of this library is to facilitate the generation of strong feature-based baselines.
braindecode-features expects a working installation of braindecode.
Additionally, it requires to install packages listed in requirements.txt
. So download the reposiory and install:
git clone git://github.com/gemeinl/braindecode-features.git
cd braindecode-features
pip install -r requirements.txt
braindecode-features offers a way to get from preprocessed data in the form of braindecode datasets (n_channels x n_times)
to a feature representation (n_examples x n_features)
.
It implements several extraction routines suitable to extract features that cover different properties of EEG signals.
This includes univariate and bivariate time domain features, features based on Fourier, Hilbert, and wavelet transform, as well as cross-frequency features.
Basic usage of braindecode-features is straightforward:
# Assume we have data given as a braindecode BaseConcatDataset of BaseDatasets
# We define a number of frequency bands which we are interested in
frequency_bands = [(0, 4), (4, 8), (8, 13), (13, 30), (30, 50)]
# We extract the features
from braindecode_features import extract_ds_features
feature_df = extract_ds_features(
ds=data,
frequency_bands=frequency_bands,
)
The function will automatically extract all implemented features and return a FeatureDataFrame.
The FeatureDataFrame is the key data structure in this repository which looks as follows:
The DataFrame has an explicit 4-level MultiIndex. The levels correspond to feature domain, feature type, channel(s), and frequency band(s).
The first domain Description
is a special case and does not hold any features but information required for decoding, i.e.
Trial
– the trial the features were extracted from. It is equivalent to the number of datasets in the braindecode dataset,
Window
– the compute window the features were extracted from. In the simplest case, the number of windows and trials is identical, however, any number of sliding windows can be generated by setting the windowing parameters,
Target
– the decoding targets.
In the following columns, all extracted features are stored.
The domains differ in the way they filter the data, and also in the way they pass data to the feature extraction functions.
The time extraction routine filters the continuous data in time domain to the specified frequency bands (does not filter again, if another domain already filtered in time domain). Afterwards it cuts windows. Features are computed one dataset and frequency band at a time and the shape that is passed to the time feature extraction functions is (n_windows x n_channels x n_times)
.
Features implemented with this domain are:
- covariance
- energy
- higuchi_fractal_dimension
- interquartile_range
- kurtosis
- line_length
- maximum
- mean
- median
- minimum
- petrosian_fractal_dimension
- phase_locking_value
- root_mean_square
- skewness
- standard_deviation
- variance
- zero_crossings
- zero_crossings_derivative
The Fourier extraction routine takes the original unfiltered continuous signals. It cuts windows, transforms the data using the Fourier transform and picks frequency bins corresponding to the specified frequency bands. The number of bins depends on the width of the frequency band, such that the last dimension of the shape passed to the Fourier feature extraction functions is flexible. Features are computed one dataset and frequency band at a time and the shape that is passed to the Fourier feature extraction functions is (n_windows x n_channels x n_fourier_coefficients)
.
Features implemented with this domain are:
- maximum
- mean
- median
- minimum
- peak_frequency
- power
- standard_deviation
- value_range
- variance
The Wavelet extraction routine takes the original unfiltered continuous signals. It transforms the data using the continuous wavelet transform with a Morlet mother wavelet at scales corresponding to pseudo frequencies (with and approxiamte stepwidth of 1) within the specified frequency bands. The number of scales depends on the width of the frequency band, such that the first dimension of the shape passed to wavelet feature extraction functions is flexible. Afterwards it cuts windows. Features are computed one dataset and frequency band at a time and the shape that is passed to the wavelet feature extraction functions is (n_scales x n_windows x n_channels x n_times)
. Features are afterwards averaged over the dimension of scales.
Features implemented with this domain are:
- bounded_variation
- maximum
- mean
- median
- minimum
- power
- standard_deviation
- value_range
- variance
The cross-frequency extraction routine filters the continuous data in time domain to the specified frequency bands (does not filter again, if another domain already filtered in time domain). It transforms the data of all possible pairs of two different frequency bands using Hilbert transform and cuts windows. Features are computed one dataset at a time and the shape that is passed to the cross-frequency feature extraction functions is (2 x n_windows x n_channels x n_times)
.
Features implemented with this domain are:
- cross_frequency_coupling
Inspired by the cropped decoding of the neural networks implemented in braindecode signals are split into compute windows. In the FeatureDataFrame, they are listed as a separate column and there are multiple ways to handle them:
- Interpret every window as an independent example. This corresponds to the standard FeatureDataFrame. Note that this will require combination of window predictions in the end again to obtain trial predictions.
- Aggregate the features over all windows of a trial. Note that this eliminates all time-resolved information.
- Do none of the above and create giant feature vectors by appending all window features. Note that the window information is then moved from a single column to the MultiIndex. Also note that if there are a variable number of windows per trial, then the minimum number of windows of a trial has to be used for all trials to create feature vectors of the same length required for the decoders.
All three options can easily be used as demonstrated in the code cell below:
from braindecode_features import prepare_features
X, y, groups, feature_names = prepare_features(
df=feature_df,
agg_func=None, # np.mean, np.median, ...
windows_as_examples=False, # True
)
The function will transform the FeatureDataFrame into X, y, and groups that are commonly known in machine learning.
The feature names are generated during extraction of features and used as MultiIndex in the FeatureDataFrame. When preparing features for decoding, they are converted into a new DataFrame with the columns Domain, Feature, (Window,) Channel, and Frequency which looks as follows.
The feature names allow for detailed post-hoc analysis, since they hold all required information to backtrack a feature performing well during decoding to its origin.
braindecode-features offers a simple way to filter the potentially huge FeatureDataFrame to a subset of features. For example, we can filter for features that were extracted from the Fourier domain, for variance features, for features extracted at electrode locations that contain a 'C' in the sensor name (including bivariate features where one channel had a 'C' in its name), or features extracted at frequency band 8-13 Hz:
from braindecode_features import filter_df
subset_df = filter_df(
df=feature_df,
query='Fourier', # 'variance', 'C' , '8-13'
exact_match=False, # True , False, True
level_to_consider=None, # 1 , 2 , 3
)
In all cases, the filtering function preserves the entries in domain Description
and concatenates the selection appropriately, such that filter_df
can be called multiple times and that the resulting DataFrame is compatible with the further steps leading to decoding.
Running the code requires following libraries: