@neuromechanist
Member

@neuromechanist neuromechanist commented Dec 22, 2024

This PR addresses #153 by introducing specifications for handling and standardizing stimulus files and their annotations within the BIDS specifications. The changes focus on improving the organization, referencing, and metadata of stimulus files to enhance consistency, reusability, and efficiency.

Note

The issue: #153
Google Doc
Example PR: bids-standard/bids-examples#433

Known issues:

  • The validator fails because extensions in /src/schema/rules/files/raw require datatype. Stimuli might be a special data type that can only be present at the root of the dataset. So, the datatype field is missing for now.
  • The remark validators report some style errors.

cc: @bids-standard/bep044 and @monique2208

Implement the standardization of stimulus files and their annotations within the BIDS specifications.

* **Add new file `src/modality-specific-files/stimuli.md`**
  - Describe the specifications for the stimuli directory.
  - Include guidelines for storing stimulus files and their annotations.
  - Define what goes into `stimuli.tsv/json`, `annotations.tsv/json`, and `stim-<label>.json`.
  - Use the same style as other modality-specific docs to design the tables, variables, and examples.

* **Modify `src/modality-specific-files/task-events.md`**
  - Add a section detailing the standardization of stimulus files and their annotations within the BIDS specifications.
  - Include examples of how to use the `stim_file` and `stim_id` columns in `events.tsv` files.
  - Provide guidelines for storing stimulus files in the `/stimuli` directory.
  - Expand the definition of the `stim_file` column to include `stim_id`.

* **Modify `src/schema/objects/columns.yaml`**
  - Update the definition of the `stim_file` column to ensure consistency in stimulus file references.
  - Add the `stim_id` column definition for `events.tsv` files.

* **Modify `src/schema/rules/checks/events.yaml`**
  - Add a check for missing stimulus files declared in `events.tsv`.
  - Add a check for missing `stim_id` references in `events.tsv`.

* **Modify `src/schema/rules/sidecars/events.yaml`**
  - Specify the `StimulusPresentation` metadata field for `events.tsv` files.
  - Include the `stim_id` column in the metadata field specifications.

* **Modify `src/schema/objects/entities.yaml`**
  - Add entities described in the document with proper requirement levels and descriptions.

* **Modify `src/schema/objects/suffixes.yaml`**
  - Add suffixes for `{audio, image, video, audiovideo}`.
  - Include the file extensions and descriptions for each suffix.

* **Add new file `src/schema/rules/sidecars/stimulus.yaml`**
  - Define sidecar tables for `stimuli.tsv/json`, `annotations.tsv/json`, and `stim-<label>.json`.
  - Use the same style as other modality-specific docs to design the tables.
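The proposed check for missing `stim_id` references could be prototyped outside the schema roughly as follows. This is only a sketch under stated assumptions, not the validator's implementation: the helper names `read_tsv_column` and `find_missing_stim_ids` are hypothetical, the example identifiers are made up, and it assumes that `stimuli.tsv` lists its identifiers in a `stim_id` column, which the description above does not spell out.

```python
import csv
import io


def read_tsv_column(tsv_text, column):
    """Return the non-empty, non-'n/a' values of one column of a TSV document."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    values = []
    for row in reader:
        value = (row.get(column) or "").strip()
        if value and value != "n/a":
            values.append(value)
    return values


def find_missing_stim_ids(events_tsv, stimuli_tsv):
    """List stim_id values used in events.tsv but absent from stimuli.tsv.

    Assumption: stimuli.tsv carries a `stim_id` column (hypothetical layout).
    """
    known = set(read_tsv_column(stimuli_tsv, "stim_id"))
    missing = []
    for stim_id in read_tsv_column(events_tsv, "stim_id"):
        if stim_id not in known and stim_id not in missing:
            missing.append(stim_id)
    return missing


# Made-up example content, for illustration only.
EVENTS = (
    "onset\tduration\tstim_id\n"
    "0.0\t1.0\tstim-face01\n"
    "2.0\t1.0\tstim-house07\n"
    "4.0\t1.0\tn/a\n"
)
STIMULI = "stim_id\tdescription\nstim-face01\tneutral face\n"

print(find_missing_stim_ids(EVENTS, STIMULI))
```

Rows with `n/a` are skipped on the assumption that events without a stimulus should not trigger the check.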

---

For more details, open the [Copilot Workspace session](https://copilot-workspace.githubnext.com/neuromechanist/bids-specification?shareId=XXXX-XXXX-XXXX-XXXX).
@Remi-Gau
Collaborator

Remi-Gau commented Jan 9, 2025

will do a bit of clean up to get less red in CI and maybe see if we can get the HTML version of the BEP to render

@Remi-Gau
Collaborator

Remi-Gau commented Jan 9, 2025

HTML: stimuli page
https://bids-specification--2022.org.readthedocs.build/en/2022/modality-specific-files/stimuli.html

Collaborator

@yarikoptic yarikoptic left a comment

initial quick pass with little recommendations etc

- Implement make_root_filename_template for directories like /stimuli
- Register new macro in main.py following existing patterns
- Update stimuli.md to use new root template macro
- Addresses placement, prefix, and annotation files issues
- Remove incorrect datatype designation from stimuli rules
- Create schema-driven root filename template macro for path-based rules
- Update stimuli.md to use proper root template macro
- Fix understanding: stimuli is root-level directory, not subject datatype
- Create make_root_filename_template macro for root-level directories like /stimuli
- Update bidsschematools to skip path-based rules (without datatypes) in entity table and filename templates
- Remove stimuli from entity-table.md as it doesn't follow subject-scoped entity ordering
- Dynamically look up entity definitions from schema instead of hardcoding mappings
The test was failing because it tried to access datatypes attribute on all rules,
including the new stimuli rules that use path-based organization instead.
… stimuli tables

- Add explicit schema path loading to make_sidecar_table to avoid caching issues
- Temporarily comment out stimuli.Stimuli tables that need namespace fixes
- File tree generation from schema is now working for /stimuli directory
- Fix entity format labels in root template macro (use <label> instead of hardcoded)
- Add explicit schema loading to make_columns_table to avoid caching issues
- Re-enable all stimuli tables (sidecar and columns) that now work correctly
- File tree now properly shows stim-<label>[_part-<label>] format
…macro

Add generic root-level template macro for non-datatype directories
- Show .<extension> when more than 2 extensions exist for same suffix
- Keep stem files (stimuli.tsv, annotations.tsv) explicit for clarity
- Keep events files explicit (only 2 extensions)
- Consolidate audio/image/video/audiovideo files (5-6 extensions each)
The <extension> placeholder was being interpreted as an HTML tag and hidden.
Now using &lt;extension&gt; for HTML output while keeping <extension> for PDF.
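The placeholder fix described in the last two lines can be illustrated with Python's standard `html.escape`. This is a minimal sketch, not the macro code from this PR: the function name `render_placeholder` and the `output_format` argument are hypothetical.

```python
import html


def render_placeholder(text, output_format):
    """Escape angle brackets for HTML output; leave text untouched otherwise.

    `output_format` is a hypothetical switch ("html" or "pdf").
    """
    if output_format == "html":
        # quote=False escapes only &, < and >, which is enough to stop
        # <extension> from being parsed as an HTML tag.
        return html.escape(text, quote=False)
    return text


print(render_placeholder("stim-<label>.<extension>", "html"))
print(render_placeholder("stim-<label>.<extension>", "pdf"))
```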
@neuromechanist
Member Author

Update prior to making PR out of Draft:

  1. Moved Stimuli to Modality Agnostic section.
  2. Added MACROS to render root-level file tree (should hopefully help with other use cases down the road)
  3. Tests are passing for simple (no-annotation) example from [WIP] examples for BEP044, Stim-BIDS bids-examples#433
  4. X Validator should probably be extended to handle event files under stimuli 🤔
  5. Tests are passing

Member

@dorahermes dorahermes left a comment

lgtm

"raw",
path="stimuli")
}}

Collaborator

here it seems to not render entirely as desired - -<label> (and <index>? for part) is missing


### Using `stim_file`

Reference stimulus files directly using the `stim_file` column, where values represent the relative path to the stimulus file within the `/stimuli` directory:
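As a rough illustration of the rule quoted above, a `stim_file` value can be resolved against the dataset's `/stimuli` directory and tested for existence. This is a sketch only: the function name `missing_stim_files` and the example paths are hypothetical, and the real check lives in the schema rules rather than in Python.

```python
from pathlib import Path


def missing_stim_files(dataset_root, stim_files):
    """Return the stim_file values that do not resolve to a file under /stimuli.

    Each value is treated as a path relative to <dataset_root>/stimuli.
    """
    stimuli_dir = Path(dataset_root) / "stimuli"
    return [name for name in stim_files if not (stimuli_dir / name).is_file()]


# Hypothetical usage: values read from the stim_file column of an events.tsv.
# missing_stim_files("/data/my_dataset", ["images/face01.png", "images/house07.png"])
```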
Collaborator

@neuromechanist I do not want to derail this BEP, but it might as well be included here (or later).

I would like your feedback on our use case, where we will have a distinct video record for every MRI data file in the dataset (thus per subject/[session/]). I envisioned placing them near the data files:

and that is reflective somehow of @bendichter 's

where it is an audiovideo record of behavior, instead of an audiovideo record of stimuli.

One logical way could be to allow for a stimuli/sub-<label>/[ses-<label>/]... hierarchy, but that feels suboptimal: if the files are unique per sub/ses, they had better just go near the data.

I would say that we (with @vmdocua) would just add a stim- entity to the file associated with the data file, so that, for example, every _events.tsv level file would get a companion with an extra _stim-reprostim_audiovideo.{mkv,json}.

So in the long run, I would need to advocate allowing for such files in the hierarchy, and that will use stim entity then, with stimuli.tsv on top level providing description for what that reprostim means.

@neuromechanist
Member Author

Very nice @yarikoptic. Indeed recording the stimulus/subject individual presentations could unlock accounting for even more variabilities throughout the session.

  1. I agree that individual stimuli presentation files should go into the subject/session files

  2. This might be a side note, but I'd think that [ENH] allow for _stim.{mp[34],mkv,avi} to provide stimuli files for func data #750 and [ENH] Add audio/video recordings to behavioral experiments #2231 are basically two different things, and that (some form of) both should exist, somehow. We should identify whether the video file is a stimulus or behavior (or both?), perhaps through sidecar metadata.

Hopefully we should not overuse stim (especially if the recording captures both subject and stimulus, then it is not stimulus) and we could use something more common like recording:

sub-01_task-forrest_split-01_recording-reprostim_audiovideo.mp4
Ideally, we should allow:
sub-01_task-forrest_split-01_recording-reprostim_events.tsv
as the audiovideo.mp4 is a data file like any other bold or eeg data file that would deserve an events.tsv file ;D.

But in the case where only the presented stimuli are recorded, like presenting randomized NSD data, capturing the video feed, and sharing the presented video per participant (to make sure that event times are correctly captured?):
sub-01_task-nsd_stim.mp4 would be enough,
and then we have the usual sub-01_task-nsd_events.tsv to match it, as a stimulus file is "assumed" to accompany a data file (BOLD or EEG, for that matter).
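To make the naming options above easier to compare, here is a small sketch that splits such filenames into entities, suffix, and extension. It relies only on the general BIDS convention of underscore-separated `key-value` pairs followed by a suffix. The function name `parse_bids_filename` is hypothetical, and the `recording-reprostim` and `_stim` names are proposals from this discussion, not settled specification.

```python
def parse_bids_filename(filename):
    """Split a BIDS-style filename into (entities, suffix, extension)."""
    # Everything after the first dot is the extension (so .tsv.gz stays whole).
    stem, dot, extension = filename.partition(".")
    *entity_parts, suffix = stem.split("_")
    entities = {}
    for part in entity_parts:
        key, sep, value = part.partition("-")
        if not sep:
            raise ValueError(f"not a key-value entity: {part!r}")
        entities[key] = value
    return entities, suffix, (dot + extension)


print(parse_bids_filename(
    "sub-01_task-forrest_split-01_recording-reprostim_audiovideo.mp4"
))
print(parse_bids_filename("sub-01_task-nsd_stim.mp4"))
```

With this split, the two proposals differ only in whether `reprostim` shows up as an entity value or the file is identified purely by the `stim` suffix.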

I hope I managed to convey this without making it too convoluted. LMK.

Either way, neither should have a bearing on BEP044. One of the reasons not to push BEP044 out now is to make sure that the use cases are mature enough and that tools can make sufficient use of it. Hence the work on https://github.com/Annotation-Garden/HEDit. So, please bring on more use cases :D.

@effigies effigies removed their request for review December 18, 2025 19:21
@effigies effigies modified the milestones: 1.11.0, 1.12.0 Dec 18, 2025
@effigies effigies removed the copenhagen For discussion in Copenhagen label Dec 18, 2025