What is AToMiC?
See our white paper for more technical details.
The Authoring Tools for Multimedia Content (AToMiC) Track at the Text Retrieval Conference (TREC) is intended to encourage research in multimedia search systems. We aim to provide evaluation resources and toolkits for researchers from the information retrieval, natural language processing, computer vision, and multimedia communities.
Motivation
Multimedia search remains a challenging task. The ever-growing production of text–image data has prompted many efforts from different research communities to address the problem of effectively accessing multimedia resources. However, the suitability of these efforts for real-world applications is still questionable. The de facto image–text datasets (Flickr30k and MS-COCO) are built around image–text associations based on high-level concepts (e.g., generic objects such as a mountain or a cat), while real-world multimedia search applications often deal with very specific semantic entities (e.g., Mount Fuji or Larry the Cat). Hence, we need a better proxy to evaluate multimodal retrieval systems.
Tasks
The TREC 2023 AToMiC track is organized as two tasks, both following the standard ad hoc retrieval setup. We assume a corpus C comprising a collection of documents {d1, d2, …, dn}. In response to a user’s information need, represented as a query q, the system’s goal is to return a top-k ranked list of documents that maximizes some metric of quality such as nDCG or MRR.
For TREC 2023, we provide two different collections: a collection of texts CT={t1,t2…tn} (t stands for text) and a collection of images CM={m1,m2…mn} (m stands for media).
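To make the two metrics named above concrete, here is a minimal sketch of how MRR's per-query reciprocal rank and nDCG@k can be computed for a single ranked list. The function names, document ids, and relevance judgments are hypothetical, and the nDCG variant shown (linear gain, log2 discount) is one common formulation, not necessarily the exact one used by the track's official evaluation.

```python
import math

def reciprocal_rank(ranked_ids, relevant_ids):
    """Return 1/rank of the first relevant item, or 0.0 if none is retrieved."""
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            return 1.0 / rank
    return 0.0

def ndcg_at_k(ranked_ids, gains, k):
    """nDCG@k with linear gains and a log2 rank discount.

    `gains` maps an item id to its graded relevance; unjudged items count as 0.
    """
    def dcg(values):
        # Position i (0-based) is discounted by log2(i + 2), so rank 1 has discount 1.
        return sum(g / math.log2(i + 2) for i, g in enumerate(values))

    actual = dcg([gains.get(doc_id, 0) for doc_id in ranked_ids[:k]])
    ideal = dcg(sorted(gains.values(), reverse=True)[:k])
    return actual / ideal if ideal > 0 else 0.0

# Hypothetical run for one query: the only relevant item appears at rank 2.
ranked = ["d3", "d1", "d7"]
qrels = {"d1": 1}

rr = reciprocal_rank(ranked, set(qrels))
ndcg = ndcg_at_k(ranked, qrels, k=3)
```

MRR is then the mean of the reciprocal ranks over all queries, and the reported nDCG@k is likewise averaged over queries.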
Image Suggestion
An information need (for convenience, a query), denoted q, is simply a text t drawn from CT, i.e., q∈CT. Concretely, a Wikipedia editor examines a specific section of an article and wishes to locate an appropriate image for it in the collection CM.
Image Promotion
This task is the inverse of the image suggestion task. The query q is an image drawn from CM and the collection to be searched is CT.
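The symmetry between the two tasks can be illustrated with a small sketch. It assumes, purely for illustration, that texts and images have already been embedded into a shared vector space and are scored by dot product; the track does not prescribe any particular model, and the vectors, ids, and the `top_k` helper below are hypothetical.

```python
def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def top_k(query_vec, collection, k):
    """Rank the items of `collection` (id -> vector) by dot product with the query."""
    scored = sorted(
        collection.items(),
        key=lambda item: dot(query_vec, item[1]),
        reverse=True,
    )
    return [item_id for item_id, _ in scored[:k]]

# Toy embeddings standing in for the text collection CT and the image collection CM.
texts = {"t1": [1.0, 0.0], "t2": [0.0, 1.0]}
images = {"m1": [0.9, 0.1], "m2": [0.2, 0.8], "m3": [0.5, 0.5]}

# Image suggestion: the query is a text, the searched collection is CM.
suggestion = top_k(texts["t1"], images, k=2)

# Image promotion: the query is an image, the searched collection is CT.
promotion = top_k(images["m2"], texts, k=1)
```

The same ranking routine serves both tasks; only the roles of query and collection are swapped.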
Organizers of AToMiC 2023
- Jheng-Hong (Matt) Yang, University of Waterloo
- Jimmy Lin, University of Waterloo
- Carlos Lassance, Naver Labs Europe
- Rafael S. Rezende, Naver Labs Europe
- Stéphane Clinchant, Naver Labs Europe
- Krishna Srinivasan, Google Research
- Miriam Redi, Wikimedia Foundation