Welcome to the documentation for the MOMA-LRG dataset!#
The Multi-Object, Multi-Actor Dataset with Language Refined Graphs (MOMA-LRG) is an novel benchmark designed to develop highly general and interpretable video understanding models.
The dataset is a research project from the Stanford Vision and Learning Lab.
Note
The dataset and its API are currently under active development. For the source code, please navigate to this GitHub repository.
API Reference