RUDDER contains videos that describe the creation of scientific toys from waste material. Existing datasets so far pair videos with their relevant sentences/captions in English, whereas RUDDER provides videos, sentences/captions, and audio. Because these videos are instructional in nature, audio plays an important role. Moreover, this is probably the first truly authentic multi-lingual video dataset (unlike existing bi-lingual datasets).
RUDDER consists of 492 (to be updated) videos, with an average length of 80 seconds and around 7 sentences describing each video. Each video is available in multiple languages, namely Hindi, Tamil, Malayalam, Urdu, and Kannada. (Note: Not every video has audio in all of the languages listed above.)
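
As a rough illustration of the structure described above (one video, several caption sentences, and audio in only some of the five languages), a single entry could be modelled as in the sketch below. The class name `VideoRecord`, its field names, the file paths, and the sample values are all hypothetical and are not taken from the actual RUDDER release; only the list of languages comes from the description.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# The five languages named in the dataset description.
LANGUAGES = ("hindi", "tamil", "malayalam", "urdu", "kannada")


@dataclass
class VideoRecord:
    """Hypothetical record layout; NOT the dataset's actual schema."""

    video_id: str
    duration_sec: float
    captions: List[str]  # sentences describing the video
    # Maps a language name to an audio file path. A language is simply
    # absent when the video has no audio track in that language.
    audio_paths: Dict[str, str] = field(default_factory=dict)

    def available_languages(self) -> List[str]:
        """Return the languages with audio, in the order of LANGUAGES."""
        return [lang for lang in LANGUAGES if lang in self.audio_paths]

    def has_audio(self, language: str) -> bool:
        """Check (case-insensitively) whether audio exists for a language."""
        return language.lower() in self.audio_paths


# Placeholder entry with made-up values, for illustration only.
example = VideoRecord(
    video_id="example_0001",
    duration_sec=80.0,
    captions=["placeholder caption 1", "placeholder caption 2"],
    audio_paths={
        "hindi": "audio/example_0001_hindi.wav",
        "tamil": "audio/example_0001_tamil.wav",
    },
)

print(example.available_languages())
print(example.has_audio("urdu"))
```

Storing audio as a per-language mapping, rather than five fixed fields, keeps the "not every video has every language" case explicit: code that consumes the data has to check for a language before using it.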