This dataset comprises three files:
File 1 contains 60,400 food products described in French with 4 textual variables: name of product, denomination, precision (flavor...), method of conservation. There are the 4 textual variables selected by the OQALI experts to preserve categorisation during evolutions of OQALI thesaurus. The set of 60,400 food products is also categorized into the sectors and families of the OQALI thesaurus.
File 2 contains the OQALI thesaurus (32 sectors divided into 643 families). By example, in OQALI, “Dairy products and fresh desserts” is one of the largest sectors, which contains several families such as “Classic yogurts and sweetened fermented milks”, “Classic sweet fresh cheeses”.
File 3 explains how brands have been anomized in File 1.