University of Amsterdam / Amsterdam University of Applied Sciences
Browse
1/1
10 files

Dora WalkingTours Dataset (ICLR 2024)

dataset
posted on 2024-02-13, 10:09 authored by Shashanka Venkataramanan, Mamshad Nayeem Rizve, Joao Carreira, Y.M. AsanoY.M. Asano, Yannis Avrithis

Self-supervised learning has unlocked the potential of scaling up pretraining to billions of images, since annotation is unnecessary. But are we making the best use of data? How more economical can we be? In this work, we attempt to answer this question by making two contributions. First, we investigate first-person videos and introduce a "Walking Tours" dataset. These videos are high-resolution, hours-long, captured in a single uninterrupted take, depicting a large number of objects and actions with natural scene transitions. They are unlabeled and uncurated, thus realistic for self-supervision and comparable with human learning.

Second, we introduce a novel self-supervised image pretraining method tailored for learning from continuous videos.


Reference:

Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video. Shashanka Venkataramanan, Mamshad Nayeem Rizve, João Carreira, Yuki M. Asano, Yannis Avrithis. In: International Conference on Learning Representations 2024

History

Retention period

2099-01-01

Usage metrics

    University of Amsterdam / Amsterdam University of Applied Sciences

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC