Configuration#
The preprocessing pipeline is configured through the preprocessing/config.py file.
The settings object defined there manages all configuration parameters, providing
built-in defaults and a flexible loading mechanism.
Loading Precedence#
The pipeline loads configuration settings with the following precedence:
1. Explicit path: Using --config_path in the CLI or settings.load(path) in the API.
2. Environment variable: MULTIPLEYE_CONFIG pointing to a YAML file.
3. Local default: multipleye_settings_preprocessing.yaml in the current working directory.
4. Legacy location: multipleye_settings_preprocessing.yaml in the repository root (deprecated).
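The precedence above can be sketched as a small lookup function. This is only an illustration of the ordering, not the code in preprocessing/config.py: the function name resolve_config_path and its cwd and repo_root parameters are hypothetical, and the sketch assumes that an explicit path or environment variable is used as given without checking that the file exists.

```python
import os
from pathlib import Path
from typing import Optional

# File name taken from the documentation above.
DEFAULT_NAME = "multipleye_settings_preprocessing.yaml"


def resolve_config_path(
    explicit_path: Optional[str] = None,
    cwd: Optional[Path] = None,
    repo_root: Optional[Path] = None,
) -> Optional[Path]:
    """Hypothetical helper: return the config file chosen by precedence."""
    # 1. An explicit path (CLI flag or API argument) always wins.
    if explicit_path:
        return Path(explicit_path)

    # 2. Environment variable pointing to a YAML file.
    env_path = os.environ.get("MULTIPLEYE_CONFIG")
    if env_path:
        return Path(env_path)

    # 3. Local default in the current working directory.
    local = (cwd or Path.cwd()) / DEFAULT_NAME
    if local.is_file():
        return local

    # 4. Legacy location in the repository root (deprecated).
    if repo_root is not None:
        legacy = repo_root / DEFAULT_NAME
        if legacy.is_file():
            return legacy

    # Nothing found: the caller would fall back to built-in defaults.
    return None
```

Returning None when no file is found keeps the fallback to built-in defaults in one place, at the call site.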
Configuration Settings#
The main settings include:
Data Collection Configuration#
DATA_COLLECTION_NAME: Identifier for your data collection (e.g., ME_EN_UK_LON_LAB1_2025). (Required)
INCLUDE_PILOTS: Whether to include pilot data (default: False).
EXCLUDE_SESSIONS: List of session IDs to exclude.
INCLUDE_SESSIONS: List of session IDs to include (if provided, only these will be processed).
EXPECTED_SAMPLING_RATE_HZ: The expected sampling rate of the eye tracker (default: 1000).
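How INCLUDE_SESSIONS and EXCLUDE_SESSIONS might interact can be shown with a minimal sketch. The function select_sessions and the session IDs are hypothetical, and the sketch assumes that exclusions are applied after the include list; the pipeline's actual behaviour when both are set may differ.

```python
from typing import Iterable, List, Optional


def select_sessions(
    all_sessions: Iterable[str],
    include_sessions: Optional[List[str]] = None,
    exclude_sessions: Optional[List[str]] = None,
) -> List[str]:
    """Hypothetical helper: pick the sessions to process."""
    # If an include list is provided, only those sessions are candidates.
    if include_sessions:
        wanted = set(include_sessions)
        candidates = [s for s in all_sessions if s in wanted]
    else:
        candidates = list(all_sessions)

    # Assumption: exclusions are applied after the include filter.
    excluded = set(exclude_sessions or [])
    return [s for s in candidates if s not in excluded]


# Made-up session IDs, for illustration only.
sessions = ["001", "002", "003", "004"]
selected = select_sessions(sessions, include_sessions=["001", "002"], exclude_sessions=["002"])
```

With these inputs, selected contains only "001": session "002" is included and then excluded, and the remaining sessions are not in the include list.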
Logging Configuration#
CONSOLE_LOG_LEVEL: Log level for the console output (default: INFO).
FILE_LOG_LEVEL: Log level for the file output (default: DEBUG).
LOG_LEVEL: Default log level for the package/Python (default: INFO).
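Separate console and file levels are commonly implemented with two handlers on one logger. The following sketch uses only the standard logging module; build_logger, its parameters, and the log file name are hypothetical and are not taken from the pipeline's own logging setup.

```python
import logging


def build_logger(
    name: str,
    console_level: str = "INFO",
    file_level: str = "DEBUG",
    log_file: str = "preprocessing.log",
) -> logging.Logger:
    """Hypothetical helper: one logger with separate console and file levels."""
    logger = logging.getLogger(name)
    # Let every record through; each handler applies its own threshold.
    logger.setLevel(logging.DEBUG)
    logger.handlers.clear()

    # Console output, filtered at the CONSOLE_LOG_LEVEL equivalent.
    console = logging.StreamHandler()
    console.setLevel(console_level)
    logger.addHandler(console)

    # File output, filtered at the FILE_LOG_LEVEL equivalent.
    file_handler = logging.FileHandler(log_file)
    file_handler.setLevel(file_level)
    logger.addHandler(file_handler)
    return logger
```

With the defaults shown, a DEBUG message is written to the file but not to the console.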
Psychometric Test Settings#
PSYCHOMETRIC_TESTS_DIR: Directory containing psychometric test sessions (configured per data collection)
Processing Parameters#
expected_sampling_rate_hz: Expected sampling rate of the eye tracker in Hz (default: 1000)
include_sessions: Optional list of specific session IDs to process
exclude_sessions: Optional list of session IDs to exclude from processing
include_pilots: Whether to include pilot sessions in the processing (default: True)
session_to_stimuli: Mapping for non-standard stimulus versions (not yet in use)
Programmatic Usage (Notebooks)#
In a Jupyter notebook, you can load your configuration explicitly:
from preprocessing import settings
settings.load_from_yaml("path/to/your_config.yaml")
CLI Usage#
When running the preprocessing script:
python -m preprocessing.scripts.run_multipleye_preprocessing --config_path your_config.yaml
Internal Constants#
The settings object also contains technical parameters that should generally not need
modification:
Standard data structure (RAW_DATA_FOLDER, FIXATIONS_FOLDER, etc.)
Sanity check acceptable thresholds
Eyetracker names and stimulus name mappings