LongEval 2024 Test Collection

The collection consists of queries and documents provided by the Qwant search engine (https://www.qwant.com). The queries, which were issued by Qwant users, are based on selected trending topics. The documents in the collection were selected with respect to these queries using the Qwant click model. In addition to the documents selected using this model, the collection contains randomly selected documents from the Qwant index. All the data was collected during June 2023 and August 2023. In total, the collection contains 1,925 test queries. The document set consists of 4,321,642 downloaded, cleaned and filtered web pages. In addition to the original French versions, the collection contains English translations of the web pages and queries. The collection serves as the official test collection for the 2024 LongEval Information Retrieval Lab (https://clef-longeval.github.io/) organised at CLEF.

The data is released under the Qwant LongEval Attribution-NonCommercial-ShareAlike License.

This version includes the topics (questions) used in the LongEval 2024 Lab, together with their relevance judgments (qrels).

Identifier
DOI https://doi.org/10.48436/ym4jf-rp602
Related Identifier IsVersionOf https://doi.org/10.48436/wm79f-88x06
Related Identifier IsRequiredBy https://doi.org/10.1007/978-3-031-71908-0_10
Related Identifier IsVersionOf https://doi.org/10.48436/p026v-96e13
Metadata Access https://researchdata.tuwien.ac.at/oai2d?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:researchdata.tuwien.ac.at:ym4jf-rp602
Provenance
Creator Fink, Tobias; Piroi, Florina; Galuščáková, Petra; Devaud, Romain; Gonzalez-Saez, Gabriela; Iommi, David; Mulhem, Philippe; Goeuriot, Lorraine; Popel, Martin; El-Ebshihy, Alaa
Publisher TU Wien
Publication Year 2025
Funding Reference FWF Austrian Science Fund (ROR 013tf3c58); I4471-N; Kodicare
Rights Qwant LongEval Attribution-NonCommercial-ShareAlike License; https://lindat.mff.cuni.cz/repository/xmlui/page/Qwant_LongEval_BY-NC-SA_License
OpenAccess true
Contact tudata(at)tuwien.ac.at
Representation
Language French
Resource Type Dataset
Discipline Other
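
The Metadata Access address listed above is an OAI-PMH GetRecord request. As a minimal sketch, the same address can be assembled from the record identifier with the Python standard library. The names oai_record_url and fetch_record_xml are hypothetical helpers introduced only for illustration, and reusing the pattern for other records assumes that their identifiers follow the same form as this one.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the Metadata Access field of this record.
OAI_ENDPOINT = "https://researchdata.tuwien.ac.at/oai2d"


def oai_record_url(record_id: str) -> str:
    """Build the OAI-PMH GetRecord URL for a record (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": "oai_datacite",
        "identifier": f"oai:researchdata.tuwien.ac.at:{record_id}",
    }
    # safe=":" leaves the colons in the identifier unescaped,
    # matching the address as it is listed in this record.
    return f"{OAI_ENDPOINT}?{urlencode(params, safe=':')}"


def fetch_record_xml(url: str, timeout: float = 30.0) -> str:
    """Download the metadata XML as text (hypothetical helper, needs network)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


url = oai_record_url("ym4jf-rp602")
print(url)
```

Calling fetch_record_xml(url) should return the DataCite-formatted metadata as XML text, provided the endpoint is reachable. The sketch does not parse the response.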