Add BEETL data to MOABB for benchmarking #675

Samuel-Boehm · 2024-12-22T23:27:56Z

Hi MOABB team,

I'm a PhD student working on BCI benchmarking and would like to contribute code to integrate the BEETL dataset into MOABB. Before proceeding with the review, I want to be transparent that I am not the owner or license holder of the original BEETL dataset - I'm simply contributing code to make this publicly available dataset accessible through MOABB for benchmarking purposes.

The dataset is publicly available through Figshare and was published as part of a BCI competition. I've implemented the loader following MOABB's conventions and patterns as good as I can. I basically followed what is done for Stieger2021 as I saw that they also uploaded the data to Figshare.

Key implementation details:

Dataset split into leaderboard and final evaluation phases (This might be confusing first but I wanted to stick to the original split as it was during the competition.
Handles two different data formats (Dataset A and B) with different sampling rates and channel configurations
Proper handling of labels and trial information
After the competition was done a .txt with the hidden labels was released, this is also downloaded and added to the data here.

Important Implementation Note:
In the current implementation, I concatenate all runs/races for each subject into a single continuous Raw object. This differs from the original data structure where trials were stored separately. I made this choice to align with MOABB's typical usage patterns, but I'm open to discussion if this should be handled differently.

I would greatly appreciate:

Your review of the code quality and adherence to MOABB standards
Guidance on whether it's appropriate to include this dataset given I'm not the original data holder

Greetings from Germany, Merry Christmas and a Happy New Year!
Samuel

sylvchev · 2025-01-20T14:25:09Z

Thanks, that is a really nice addition!
For the dataset split, the solution you proposed is perfectly fine.

sylvchev

Thanks for this PR, I'm not sure about the downloader changes, could you explain?

sylvchev · 2025-01-20T14:27:37Z

moabb/datasets/download.py

@@ -206,30 +206,39 @@ def fs_issue_request(method, url, headers, data=None, binary=False):

 def fs_get_file_list(article_id, version=None):
    """List all the files associated with a given article.
-


Please avoid changing docstring formatting, it is done to generate automatically the doc on the website, you could revert those two line suppression.

sylvchev · 2025-01-20T14:35:27Z

moabb/datasets/download.py

+    all_files = []
+    page = 1
+
+    while True:


Why did you need to change the downloader function for this dataset? There is 2 items max for this dataset.
If you want to address a specific issue for another dataset, please open another PR.

Sorry, that is a fix I wrote for the Stieger2021 dataset and does not belong here! I branched my other developing branch, not knowing this change is still in there. It should be the original function, Ill change it back.

Accidentally copied fs_get_file_list from another branch. Reverted to default version.

Samuel-Boehm and others added 12 commits December 11, 2024 19:31

handle pagination - needed for Stieger2021 ds

a71aaa8

[pre-commit.ci] auto fixes from pre-commit.com hooks

b05d58a

Merge branch 'develop' into develop

18ea5d7

add beetl datasets

705ebef

Merge branch 'develop' of github.com:Samuel-Boehm/moabb into develop

3dd4f22

beetl dataset final eval does now contain labels

d342929

added some description to the datasets

98d6d7f

add more info to descirption

20c320d

[pre-commit.ci] auto fixes from pre-commit.com hooks

4e2d113

renamed returns to contain the phase information in the session names

fef6327

added integers back into the session names as they are mandatory

83eabc0

removed underscores from session names as they are also not allowed

642cc6e

sylvchev reviewed Jan 20, 2025

View reviewed changes

fix: Reset fs_get_file_list to default version

7524474

Accidentally copied fs_get_file_list from another branch. Reverted to default version.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add BEETL data to MOABB for benchmarking #675

Add BEETL data to MOABB for benchmarking #675

Samuel-Boehm commented Dec 22, 2024

sylvchev commented Jan 20, 2025 •

edited

Loading

sylvchev left a comment

sylvchev Jan 20, 2025

sylvchev Jan 20, 2025

Samuel-Boehm Jan 23, 2025

		@@ -206,30 +206,39 @@ def fs_issue_request(method, url, headers, data=None, binary=False):

		def fs_get_file_list(article_id, version=None):
		"""List all the files associated with a given article.

Add BEETL data to MOABB for benchmarking #675

Are you sure you want to change the base?

Add BEETL data to MOABB for benchmarking #675

Conversation

Samuel-Boehm commented Dec 22, 2024

sylvchev commented Jan 20, 2025 • edited Loading

sylvchev left a comment

Choose a reason for hiding this comment

sylvchev Jan 20, 2025

Choose a reason for hiding this comment

sylvchev Jan 20, 2025

Choose a reason for hiding this comment

Samuel-Boehm Jan 23, 2025

Choose a reason for hiding this comment

sylvchev commented Jan 20, 2025 •

edited

Loading