Josh and Ollie met on their very first day at SOAS, University of London.
They were wrong. JollyVids pioneered ... Because users are in a low-cortisol state, they are 300% more likely to engage with ads that feature sensory experiences: candle-melting ASMR, the sound of rain on a tent, or brushes on ceramic pottery.

| Aspect | What the paper offers | How you can leverage it |
|--------|----------------------|------------------------|
| | Detailed statistics (category distribution, duration histograms, language coverage), collection pipeline, and quality‑control measures. | Quickly assess whether JollyVids matches your target domain or task. |
| Annotation schema | Multi‑level annotations (global caption, per‑segment actions, audio transcript, object bounding boxes for a 10 % subset). | Re‑use the schema for extending your own dataset or for fine‑grained evaluation. |
| Baseline models & code | End‑to‑end training scripts for CLIP‑style video‑text encoders, a transformer‑based captioner, and a retrieval system (all released under Apache‑2.0). | Jump‑start experiments without building the pipeline from scratch. |
| Benchmark results | Comparative tables on MSR‑VTT, ActivityNet Captions, and HowTo100M, showing absolute improvements of 4–12 % when pre‑training on JollyVids. | Cite concrete performance gains when arguing for JollyVids pre‑training in a paper or grant. |
| Ethical considerations | Discussion of bias analysis (demographic, geographic, and content‑type), licensing compliance, and a data‑usage policy. | Use the authors’ checklist to ensure responsible deployment of models trained on JollyVids. |
| Future directions | Suggestions for multimodal reasoning (e.g., video‑question answering), long‑form video extensions, and cross‑modal generation. | Identify open research problems you can target in your own work. |

If you want, I can expand any section into a full investor one-pager, product spec, or marketing plan.