Source code and data for the PhD Thesis "Measuring the Contributions of Vision and Text Modalities in Multimodal Transformers"


This dataset contains the source code and data used in the PhD thesis "Measuring the Contributions of Vision and Text Modalities in Multimodal Transformers". The dataset is split into five repositories:

- Code and resources related to chapter 2 of the thesis (Section 2.2, method described in "Using Scene Graph Representations and Knowledge Bases").
- Code and resources related to chapter 3 of the thesis (VALSE dataset).
- Code and resources related to chapter 4 of the thesis: MM-SHAP measure and experiments code.
- Code and resources related to chapter 5 of the thesis: CCSHAP measure and experiments code related to large language models (LLMs).
- Code and resources related to the experiments with vision and language model decoders from chapters 3, 4, and 5.

Identifier
DOI	https://doi.org/10.11588/data/68HOOP
Metadata Access	https://heidata.uni-heidelberg.de/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.11588/data/68HOOP
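
The Metadata Access link above is an OAI-PMH GetRecord request. As a minimal sketch of how such a URL can be assembled from the DOI, the Python snippet below rebuilds it using only the standard library. The function name `build_getrecord_url` and the constant `OAI_ENDPOINT` are hypothetical names introduced for illustration; the endpoint, verb, metadata prefix, and identifier are taken from the record itself. The snippet does not contact the server.

```python
from urllib.parse import urlencode

# Endpoint as it appears in the "Metadata Access" field of this record.
OAI_ENDPOINT = "https://heidata.uni-heidelberg.de/oai"


def build_getrecord_url(doi, metadata_prefix="oai_datacite"):
    """Assemble an OAI-PMH GetRecord URL for a DOI (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"doi:{doi}",
    }
    # safe=":/" leaves the colon and slashes of the DOI unescaped,
    # matching the form of the link shown in the record.
    return f"{OAI_ENDPOINT}?{urlencode(params, safe=':/')}"


url = build_getrecord_url("10.11588/data/68HOOP")
print(url)
```

The resulting string could then be passed to an HTTP client such as `urllib.request.urlopen` to retrieve the DataCite XML, assuming the endpoint is reachable.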

Provenance
Creator	Parcalabescu, Letitia
Publisher	heiDATA
Contributor	Parcalabescu, Letitia; Frank, Anette; heiDATA: Heidelberg Research Data Repository
Publication Year	2024
Funding Reference	bwHPC and the German Research Foundation (DFG) INST 35/1597-1 FUGG
Rights	info:eu-repo/semantics/openAccess
OpenAccess	true
Contact	Parcalabescu, Letitia (Heidelberg University, Department of Computational Linguistics); Frank, Anette (Heidelberg University, Department of Computational Linguistics)

Representation
Resource Type	Dataset
Format	application/zip
Size	17206604; 854757425; 489773; 456409; 488208
Version	2.0
Discipline	Other